python regex findall groups

Stack Overflow

stackoverflow.com › questions › 6018340 › capturing-group-with-findall

python - Capturing group with findall? - Stack Overflow

Top answer (from Eli Bendersky)

1 of 4

125

findall just returns the captured groups:

>>> re.findall('abc(de)fg(123)', 'abcdefg123 and again abcdefg123')
[('de', '123'), ('de', '123')]

Relevant doc excerpt:

Return all non-overlapping matches of pattern in string, as a list of strings. The string is scanned left-to-right, and matches are returned in the order found. If one or more groups are present in the pattern, return a list of groups; this will be a list of tuples if the pattern has more than one group. Empty matches are included in the result unless they touch the beginning of another match.
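A minimal sketch of the three return shapes described in the excerpt, reusing the sample string from the answer. The variable names are illustrative only.

```python
import re

text = 'abcdefg123 and again abcdefg123'

# No capturing group: each item is the whole match.
no_groups = re.findall(r'abcdefg123', text)

# One capturing group: each item is that group's text.
one_group = re.findall(r'abc(de)fg123', text)

# Two capturing groups: each item is a tuple of group texts.
two_groups = re.findall(r'abc(de)fg(123)', text)

print(no_groups)   # ['abcdefg123', 'abcdefg123']
print(one_group)   # ['de', 'de']
print(two_groups)  # [('de', '123'), ('de', '123')]
```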

2 of 4

Use groups freely. The matches will be returned as a list of group-tuples:

>>> re.findall('(1(23))45', '12345')
[('123', '23')]

If you want the full match to be included, just enclose the entire regex in a group:

>>> re.findall('(1(23)45)', '12345')
[('12345', '23')]
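The ordering inside each tuple follows the position of each group's opening parenthesis. The sketch below, which assumes nothing beyond the standard re module, shows this and also how a Match object gives the full match without an extra wrapping group.

```python
import re

# Groups are numbered by the position of their opening parenthesis,
# so the outer group comes first in each tuple.
nested = re.findall(r'(1(23))45', '12345')

# Wrapping the whole pattern in a group puts the full match first.
wrapped = re.findall(r'(1(23)45)', '12345')

# A Match object exposes the full match as group(0) with no extra parentheses.
m = re.search(r'1(23)45', '12345')
full, inner = m.group(0), m.group(1)

print(nested)       # [('123', '23')]
print(wrapped)      # [('12345', '23')]
print(full, inner)  # 12345 23
```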

reddit.com › r/learnpython › regex findall() method show entire match with capture groups?

r/learnpython on Reddit: Regex findall() method show entire match with capture groups?

October 7, 2019

hey all, curious what would need to be done to have a regex with a capture group show all of the matches.

import re

var = 'Agent Alice and Agent Bob'
ourRegex = re.compile(r'Agent (\w)\w*')
print(ourRegex.findall(var))

This will output "['A', 'B']" and not "Agent Alice" and "Agent Bob"

Top answer

1 of 4

re.finditer() will return match objects, instead of groups like re.findall() does.
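A short sketch of that suggestion applied to the question's own pattern. The name our_regex is just a renamed copy of the poster's variable.

```python
import re

var = 'Agent Alice and Agent Bob'
our_regex = re.compile(r'Agent (\w)\w*')

# Each Match object carries both the full match and the captured group.
pairs = [(m.group(0), m.group(1)) for m in our_regex.finditer(var)]
print(pairs)  # [('Agent Alice', 'A'), ('Agent Bob', 'B')]
```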

2 of 4

Could you explain why you need parentheses at all for the given example?

>>> ourRegex = re.compile(r'Agent \w\w*')
>>> print(ourRegex.findall(var))
['Agent Alice', 'Agent Bob']
>>> ourRegex = re.compile(r'Agent \w+')
>>> print(ourRegex.findall(var))
['Agent Alice', 'Agent Bob']

If you do need the capture group but want the entire match as well, then you can use finditer:

>>> words = 'effort flee facade oddball rat tool'
>>> # whole words containing at least one consecutive repeated character
>>> repeat_char = re.compile(r'\b\w*(\w)\1\w*\b')
>>> # () in findall will only return text matched by capture groups
>>> repeat_char.findall(words)
['f', 'e', 'l', 'o']
>>> # finditer to the rescue
>>> m_iter = repeat_char.finditer(words)
>>> [m[0] for m in m_iter]
['effort', 'flee', 'oddball', 'tool']

Google

developers.google.com › google for education › python › python regular expressions

Python Regular Expressions | Python Education | Google for Developers

If the pattern includes 2 or more parentheses groups, then instead of returning a list of strings, findall() returns a list of *tuples*. Each tuple represents one match of the pattern, and inside the tuple is the group(1), group(2) .. data. So if 2 parentheses groups are added to the email pattern, then findall() returns a list of tuples, each length 2 containing the username and host, e.g.
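A minimal sketch of that behavior, assuming a simple email pattern and made-up addresses (neither is taken from the tutorial):

```python
import re

# Hypothetical sample text and a deliberately simple email pattern.
text = 'contact alice@example.com or bob@example.org today'

# Group 1 captures the username, group 2 captures the host.
pairs = re.findall(r'([\w.-]+)@([\w.-]+)', text)

for username, host in pairs:
    print(username, host)
# alice example.com
# bob example.org
```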

Blogger

how2itsec.blogspot.com › 2020 › 10 › python-regex-findall-groups.html

how2itsec: Python regex findall groups

>>> re.findall('ab(cde)fg(0123)', 'abcdefg0123 and again abcdefg0123')
[('cde', '0123'), ('cde', '0123')]

... Return all non-overlapping matches of pattern in string, as a list of strings. The string is scanned left-to-right, and matches are returned in the order found.

Python documentation

docs.python.org › 3 › library › re.html

re — Regular expression operations — Python 3.14.3 ...

The regex matching flags. This is a combination of the flags given to compile(), any (?...) inline flags in the pattern, and implicit flags such as UNICODE if the pattern is a Unicode string. ... The number of capturing groups in the pattern.
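As a rough illustration of those Pattern attributes, the sketch below compiles a hypothetical two-group pattern and inspects flags, groups, and the related groupindex mapping:

```python
import re

# Hypothetical pattern with two named capturing groups.
pattern = re.compile(r'(?P<user>[\w.-]+)@(?P<host>[\w.-]+)', re.IGNORECASE)

# Number of capturing groups in the pattern.
group_count = pattern.groups

# Mapping of group names to group numbers.
name_to_index = dict(pattern.groupindex)

# flags combines the compile() flags with implicit ones such as UNICODE.
ignores_case = bool(pattern.flags & re.IGNORECASE)

print(group_count)    # 2
print(name_to_index)  # {'user': 1, 'host': 2}
print(ignores_case)   # True
```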

LearnByExample

learnbyexample.github.io › py_regular_expressions › working-with-matched-portions.html

Working with matched portions - Understanding Python re(gex)?

To get all the matches instead of just the first match, you can use re.findall() (which gives a list of strings as output) and re.finditer() (which gives an iterator of re.Match objects).

Stack Overflow

stackoverflow.com › questions › 25628973 › capturing-named-groups-in-regex-with-re-findall

python - Capturing named groups in regex with re.findall - Stack Overflow

Top answer

1 of 3

Take 3, based on a further clarification of the OP's intent in this comment.

Ashwin is correct that findall does not preserve named capture groups (e.g. (?P<name>regex)). finditer to the rescue! It returns the individual match objects one-by-one. Simple example:

import re

data = """34% passed 23% failed 46% deferred"""
# A raw string keeps \w and \s from being treated as string escapes
for m in re.finditer(r'(?P<percentage>\w+)%\s(?P<word>\w+)', data):
    print(m.group('percentage'), m.group('word'))

2 of 3

As you've identified in your second example, re.findall returns the groups in the original order.

The problem is that, before Python 3.7, the standard dict type did not guarantee any key order. The manual for Python 2.x makes this explicit: https://docs.python.org/2/library/stdtypes.html#dict.items. Since Python 3.7, dict preserves insertion order as part of the language.

On older versions, what you should use instead is collections.OrderedDict:

import re
from collections import OrderedDict as odict

data = """34% passed 23% failed 46% deferred"""
# findall yields (percentage, word) tuples; swap them so the word is the key
result = odict((key, value) for value, key in re.findall(r'(\w+)%\s(\w+)', data))
print(result)
# OrderedDict([('passed', '34'), ('failed', '23'), ('deferred', '46')])

Notice that you must use the pairwise constructor form (OrderedDict((k, v) for k, v in ...)) rather than a dict comprehension ({k: v for k, v in ...}). That's because the latter constructs an instance of the plain dict type, which on Python versions before 3.7 cannot be converted to OrderedDict without losing the order of the keys... which is of course what you are trying to preserve in the first place.
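On Python 3.7 and later, where a plain dict is guaranteed to keep insertion order, the same result can be sketched without OrderedDict. This is an alternative under that version assumption, not the answer's original approach.

```python
import re

data = "34% passed 23% failed 46% deferred"

# findall yields (percentage, word) tuples in the order found;
# a plain dict keeps that insertion order on Python 3.7+.
result = {word: pct for pct, word in re.findall(r'(\w+)%\s(\w+)', data)}
print(result)  # {'passed': '34', 'failed': '23', 'deferred': '46'}
```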

Python Tutorial

pythontutorial.net › home › python regex › python regex findall()

Python Regex findall() Function By Practical Examples

December 10, 2021 - If the pattern has one capturing group, the findall() function returns a list of strings that match the group. If the pattern has multiple capturing groups, the findall() function returns the tuples of strings that match the groups.

Stack Overflow

stackoverflow.com › questions › 11103856 › re-findall-which-returns-a-dict-of-named-capturing-groups

python - re.findall which returns a dict of named capturing groups? - Stack Overflow

Top answer

1 of 4

165

Using Pattern.finditer() then Match.groupdict():

>>> import re
>>> s = "bob sue jon richard harry"
>>> r = re.compile(r'(?P<name>[a-z]+)\s+(?P<name2>[a-z]+)')
>>> [m.groupdict() for m in r.finditer(s)]
[{'name': 'bob', 'name2': 'sue'}, {'name': 'jon', 'name2': 'richard'}]

2 of 4

you could switch to finditer

>>> import re
>>> text = "bob sue jon richard harry"
>>> pat = re.compile(r'(?P<name>[a-z]+)\s+(?P<name2>[a-z]+)')
>>> for m in pat.finditer(text):
...     print(m.groupdict())
...
{'name': 'bob', 'name2': 'sue'}
{'name': 'jon', 'name2': 'richard'}

PYnative

pynative.com › home › python › regex › python regex capturing groups

Python Regex Capturing Groups – PYnative

April 12, 2021 - If you try to apply it to the findall method, you will get AttributeError: 'list' object has no attribute 'groups'. So always use finditer if you want to capture all matches to the group. ...

import re

target_string = "The price of ice-creams PINEAPPLE 20 MANGO 30 CHOCOLATE 40"

# two groups enclosed in separate ( and ) brackets
# group 1: find all uppercase letters
# group 2: find all numbers
# you can compile a pattern or pass it directly to the finditer() method
pattern = re.compile(r"(\b[A-Z]+\b).(\b\d+\b)")

# find all matches to groups
for match in pattern.finditer(target_string):
    # extract words
    print(match.group(1))
    # extract numbers
    print(match.group(2))

Python documentation

docs.python.org › 3 › howto › regex.html

Regular Expression HOWTO — Python 3.14.3 documentation

Author: A.M. Kuchling. Abstract: This document is an introductory tutorial to using regular expressions in Python with the re module. It provides a gentler introduction than th...

Finxter

blog.finxter.com › home › learn python blog › python re groups

Python Re Groups - Be on the Right Side of Change

May 5, 2023 - Each group flag has its own meaning. For example, to ignore capitalization, use the i flag as follows:

>>> re.findall('(?i:PYTHON)', 'python is great')
['python']

You can also ignore capitalization for the whole regex with the “global ...
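To make the scoped-versus-whole-pattern distinction concrete, here is a small sketch that assumes the whole-pattern variant is the flags argument re.IGNORECASE. The sample patterns are illustrative.

```python
import re

text = 'python is great'

# Scoped inline flag: case is ignored only inside (?i:...).
scoped = re.findall(r'(?i:PYTHON) is', text)

# The uppercase "IS" sits outside the scoped group, so it must match exactly.
outside_scope = re.findall(r'(?i:PYTHON) IS', text)

# Flag argument: case is ignored for the whole pattern.
whole = re.findall(r'PYTHON IS', text, flags=re.IGNORECASE)

# (?i:...) does not capture, so findall returns full matches here.
print(scoped)         # ['python is']
print(outside_scope)  # []
print(whole)          # ['python is']
```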

GeeksforGeeks

geeksforgeeks.org › python › python-regex-re-search-vs-re-findall

Python Regex: re.search() VS re.findall() - GeeksforGeeks

July 12, 2025 - If the pattern has more than one capturing group, it returns a list of tuples. ...

import re

s = "My favorite fruits are apple, banana, and mango."
res = re.findall(r'\b\w*a\b', s)
print(res)

Python for Network Engineers

pyneng.readthedocs.io › en › latest › book › 15_module_re › findall.html

Findall function - Python for network engineers

findall searches the entire string for matches, but when the pattern contains groups it returns a result similar to the groups method of a Match object. If there are several groups, findall will return a list of tuples:

RegexOne

regexone.com › references › python

RegexOne - Learn Regular Expressions - Python

import re

# Let's create a pattern and extract some information with it
regex = re.compile(r"(\w+) World")
result = regex.search("Hello World is the easiest")
if result:
    # This will print:
    #   0 11
    # for the start and end of the match
    print(result.start(), result.end())

# This will print:
#   Hello
#   Bonjour
# for each of the captured groups that matched
for result in regex.findall("Hello World, Bonjour World"):
    print(result)

# This will substitute "World" with "Earth" and print:
#   Hello Earth
print(regex.sub(r"\1 Earth", "Hello World"))

Stack Overflow

stackoverflow.com › questions › 40157550 › python-regexp-groups-how-do-i-get-all-groups

regex - Python regexp groups: how do I get all groups? - Stack Overflow

Top answer

1 of 2

With re.findall()

Example:

s = "-ab-cde-fghi-jkl-mn"
re.findall(r'[a-z]+', s)

Output:

['ab', 'cde', 'fghi', 'jkl', 'mn']

2 of 2

It works like you want by default in .NET.

Python does not support this though. The closest behavior you could get in Python, would be to repeat the match on the captured substring:

>>> match = re.match(r"(?P<all>(?:-(?P<one>\w+))*)","-ab-cde-fghi-jkl-mn")
>>> re.findall(r"-(?P<one>\w+)", match.group("all"))
['ab', 'cde', 'fghi', 'jkl', 'mn']

It could get complicated if the inner pattern is not extremely simple.
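The limitation can be seen directly in a short sketch: a repeated capturing group keeps only its final iteration, while running findall with the inner pattern recovers every piece. Unnamed groups are used here for brevity.

```python
import re

s = "-ab-cde-fghi-jkl-mn"

# A repeated capturing group only remembers its final iteration.
m = re.match(r'(?:-(\w+))*', s)
last_only = m.group(1)

# Matching the inner pattern on its own with findall recovers every piece.
all_pieces = re.findall(r'-(\w+)', s)

print(last_only)   # mn
print(all_pieces)  # ['ab', 'cde', 'fghi', 'jkl', 'mn']
```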

Stack Overflow

stackoverflow.com › questions › 41593016 › python-re-findall-to-get-all-matched-groups › 41593404

regex - Python re.findall() to get all matched groups - Stack Overflow

Top answer

1 of 2

You seem to be asking whether you can use a variable number of regex groups. Based on a quick Google search, the answer appears to be no: the regex will match the full pattern, but only the last value will be recorded for repeated matches of the same group.

Consider simply doing s.split('|') and then whatever checks are necessary on each of the substrings instead.
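A minimal sketch of the split-based approach, assuming line-oriented input with '|'-separated fields. The sample string here is illustrative.

```python
s = '''aaa
bbb|30s
ccc|500ms|1s'''

# One list per line, with as many fields as the line actually contains.
rows = [line.split('|') for line in s.splitlines()]
print(rows)  # [['aaa'], ['bbb', '30s'], ['ccc', '500ms', '1s']]
```

Unlike a fixed set of optional groups, this produces no empty-string placeholders for missing fields.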

2 of 2

import re 

s = '''aaa
bbb|30s
ccc|500ms|1s'''

print(re.findall(r'(\w+)\|?(\w+)?\|?(\w+)?', s))

Output:

[('aaa', '', ''), ('bbb', '30s', ''), ('ccc', '500ms', '1s')]

Medium

medium.com › @yeukhon › non-capturing-group-in-pythons-regular-expression-75c4a828a9eb

Non-capturing group in Python’s regular expression | by Facing Security | Medium

August 29, 2014 - Why? It turns out that when re.findall sees a group in a regular expression pattern, the findall method will return the matches for the group.
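A brief sketch of the difference, using made-up URLs: with a capturing group findall returns only the group's text, while the non-capturing form (?:...) keeps the grouping but returns the full match.

```python
import re

# Hypothetical sample text.
text = 'http://a.example https://b.example'

# Capturing group: findall returns only the group's text.
captured = re.findall(r'(https|http)://\S+', text)

# Non-capturing group: same grouping, but findall returns full matches.
full = re.findall(r'(?:https|http)://\S+', text)

print(captured)  # ['http', 'https']
print(full)      # ['http://a.example', 'https://b.example']
```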

Safjan

safjan.com › home › note › python regex named groups

Python Regex Named Groups

July 11, 2023 - In Python regex, match.groupdict() is a method that returns a dictionary containing all the named groups of a regular expression match.