Brave Search

Capture groups with Regular Expression (Python)

stackoverflow.com › questions › 48719537 › capture-groups-with-regular-expression-python

You need the first captured group:

a.group(1)
b.group(1)
...

without any captured group specification as argument to group(), it will show the full match, like what you're getting now.

Here's an example:

In [8]: string_one = 'file_record_transcript.pdf'

In [9]: re.search(r'^(file.*)\.pdf$', string_one).group()
Out[9]: 'file_record_transcript.pdf'

In [10]: re.search(r'^(file.*)\.pdf$', string_one).group(1)
Out[10]: 'file_record_transcript'

Answer from heemayl on Stack Overflow

Python documentation

docs.python.org › 3 › library › re.html

re — Regular expression operations

4 days ago - The regex matching flags. This is a combination of the flags given to compile(), any (?...) inline flags in the pattern, and implicit flags such as UNICODE if the pattern is a Unicode string. ... The number of capturing groups in the pattern.

Stack Overflow

stackoverflow.com › questions › 48719537 › capture-groups-with-regular-expression-python

regex - Capture groups with Regular Expression (Python) - Stack Overflow

Top answer

1 of 2

116

You need the first captured group:

a.group(1)
b.group(1)
...

without any captured group specification as argument to group(), it will show the full match, like what you're getting now.

Here's an example:

In [8]: string_one = 'file_record_transcript.pdf'

In [9]: re.search(r'^(file.*)\.pdf$', string_one).group()
Out[9]: 'file_record_transcript.pdf'

In [10]: re.search(r'^(file.*)\.pdf$', string_one).group(1)
Out[10]: 'file_record_transcript'

2 of 2

5

you can also use match[index]

a[0] => Full match (file_record_transcript.pdf)
a[1] => First group (file_record_transcript)
a[2] => Second group (if any)

Discussions

(Regex | Regular expression) match.group(1) = 'None' due to using appended patterns

I'm not sure if reddit messed up your code or something, but there seems some missing escapes in the regex (e.g. d+ rather than \d+). Secondly, there's something strange, 1x300g does not match the pattern you provided, since it requires at least one whitespace character after the 'x'. The groups when there are alternatives are not independent. For example, in the regex a(\d)|b(\d) there are two groups, group(1) would be the digit after an 'a', and group(2) would be the digit after a 'b'. group(0) is always the entire match. e.g. >>> re.search("a(\d)|b(\d)", "a1").groups() ('1', None) >>> re.search("a(\d)|b(\d)", "b1").groups() (None, '1') The built-in 're' module does not support overlapping groups, but you can name them: >>> re.search("a(?P\d)|b(?P\d)", "a1").groupdict() {'adigit': '1', 'bdigit': None} >>> re.search("a(?P\d)|b(?P\d)", "b1").groupdict() {'adigit': None, 'bdigit': '1'} So you can first figure out whether you are in the 'a' or 'b' case, then access the correct group. The third-party 'regex' package does support overlapping groups, which means you can make code a bit more robust: >>> import regex >>> regex.search("a(?P\d)|b(?P\d)", "a1").groupdict() {'digit': '1'} >>> regex.search("a(?P\d)|b(?P\d)", "b1").groupdict() {'digit': '1'} More on reddit.com

r/learnpython

6

8

December 19, 2024

Accessing a "symbolic group name" in Python regex

You use them with Match objects. findall doesn't return those, it returns strings only. For what you want, you would need to use finditer which does return Match objects. However, this is a generator so you couldn't index it, but you can iterate it: for match in found: print(f'Data Content:\t{match["value"]}') More on reddit.com

r/learnpython

6

1

November 28, 2022

regex - Match groups in Python - Stack Overflow

Is there a way in Python to access match groups without explicitly creating a match object (or another way to beautify the example below)? Here is an example to clarify my motivation for the quest... More on stackoverflow.com

stackoverflow.com

Python Conditional Regex to Print Decimal Number

re.findall will return the result captured in your capture group. Just use a non-capture group instead >>> (?: More on reddit.com

r/regex

11

3

February 18, 2021

Videos