Join the list on the pipe character |, which represents different options in regex.
import re

string_lst = ['fun', 'dum', 'sun', 'gum']
x = "I love to have fun."
print(re.findall(r"(?=(" + '|'.join(string_lst) + r"))", x))
Output: ['fun']
You cannot use match, as it only matches from the start of the string.
Using search you will get only the first match, so use findall instead.
Also, use a lookahead if you have overlapping matches that do not start at the same position.
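To see why the lookahead matters for overlapping matches, here is a small sketch (the word list dum/um is made up for illustration): plain alternation consumes the characters it matches, while the zero-width lookahead does not.

```python
import re

# Plain alternation consumes 'dum', so the overlapping 'um' is skipped.
print(re.findall(r"dum|um", "dumdum"))
# ['dum', 'dum']

# The zero-width lookahead consumes nothing, so overlapping hits are reported.
print(re.findall(r"(?=(dum|um))", "dumdum"))
# ['dum', 'um', 'dum', 'um']
```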
Answer from vks on Stack Overflow
The regex module has named lists (sets, actually):
#!/usr/bin/env python
import regex as re # $ pip install regex
p = re.compile(r"\L<words>", words=['fun', 'dum', 'sun', 'gum'])
if p.search("I love to have fun."):
    print('matched')
Here words is just a name; you can use anything you like instead.
The .search() method is used instead of putting .* before/after the named list.
To emulate named lists using stdlib's re module:
#!/usr/bin/env python
import re
words = ['fun', 'dum', 'sun', 'gum']
longest_first = sorted(words, key=len, reverse=True)
p = re.compile(r'(?:{})'.format('|'.join(map(re.escape, longest_first))))
if p.search("I love to have fun."):
    print('matched')
re.escape() is used to escape regex meta-characters such as .*? inside individual words (to match the words literally).
sorted() puts the longest words first among the alternatives, emulating the regex module's longest-match behavior; compare:
>>> import re
>>> re.findall("(funny|fun)", "it is funny")
['funny']
>>> re.findall("(fun|funny)", "it is funny")
['fun']
>>> import regex
>>> regex.findall(r"\L<words>", "it is funny", words=['fun', 'funny'])
['funny']
>>> regex.findall(r"\L<words>", "it is funny", words=['funny', 'fun'])
['funny']
So I spent a few days figuring out my beautiful regex, found here, to parse WhatsApp messages. It goes through the text and puts it into groups, but now... I just don't know how to use it to pull out the data.
I am in Google Colab, pulling my WhatsApp raw text into a list like this:
def read_file(file):
    '''Reads a WhatsApp text file into a list of strings'''
    x = open(file, 'r', encoding='utf-8')  # Opens the text file; the contents cannot be explored yet
    y = x.read()  # By now it is one huge chunk of string that we need to separate line by line
    content = y.splitlines()  # The splitlines method converts the chunk of string into a list of strings
    return content

chat = read_file('18042020_cut.txt')
Cool. so like...what do I do now?
I tried:
content = re.search('[(?P<date>\d{2}/\d{2}/\d{4}),\s(?P<time>\d{1,2}:\d{2}:\d{2}.{3})]\s(?P<sender>[:]*):\s(?P<message>.+|\n+(?!)[\d{2}/\d{2}/\d{4})', chat).group('Content')
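Not the asker's exact pattern, but a simplified sketch (with a made-up sample line) of how named groups are pulled out of WhatsApp-style lines once search() returns a match object:

```python
import re

# Hypothetical WhatsApp-style line for illustration.
line = "[18/04/2020, 21:05:12] Alice: hello there"

# Simplified pattern: square brackets are escaped, and the sender is
# 'everything up to the next colon'.
pattern = re.compile(
    r"\[(?P<date>\d{2}/\d{2}/\d{4}),\s(?P<time>\d{1,2}:\d{2}:\d{2})\]\s"
    r"(?P<sender>[^:]*):\s(?P<message>.+)"
)

m = pattern.search(line)
if m:
    print(m.group('date'))     # 18/04/2020
    print(m.group('sender'))   # Alice
    print(m.group('message'))  # hello there
```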
I am working on a project where I have an input list of filenames, and I want to compute a regular expression that is as precise as possible and validates against all elements of that list. Is there a regular expression library or method to solve this? I've tried looking online, but I only find results about validating a list of string inputs against an already-defined regular expression, which is the opposite of what I am trying to do.
I have a DB with 2 tables that is being used to correlate Excel tables/ranges to specific files. One DB table has the data source name and the file name pattern of the file that data source is saved in. The other table has all the Excel tables/ranges that can be found in each data source. By joining these two tables, I can get a list of all data sources, the Excel ranges each data source includes, and the file name pattern they come from.
I want to fill the database's file patterns using existing XML files which store actual file names that had been used in the past to store the data. Processing these XML files, I can determine that, for example, data source A has in the past had file names of BAB.xls, CAB.xls, and DAB.xls.
I want to try and create a program that can take the list ['BAB.xls', 'CAB.xls', 'DAB.xls'] and return a regular expression like /.AB\.xls/.
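There is no standard-library routine for this, but for same-length names like these a minimal sketch can generalize column by column: characters shared by all names stay literal, and differing columns become a character class (infer_pattern is a made-up helper for illustration, not a library function).

```python
import re

def infer_pattern(names):
    """Generalize equal-length names: shared characters stay literal,
    differing columns become a character class."""
    if len(set(len(n) for n in names)) != 1:
        raise ValueError("this sketch only handles names of equal length")
    parts = []
    for chars in zip(*names):          # walk the names column by column
        uniq = sorted(set(chars))
        if len(uniq) == 1:
            parts.append(re.escape(uniq[0]))
        else:
            parts.append('[' + ''.join(map(re.escape, uniq)) + ']')
    return ''.join(parts)

pattern = infer_pattern(['BAB.xls', 'CAB.xls', 'DAB.xls'])
print(pattern)  # [BCD]AB\.xls
```
Real regex-inference is a harder problem (variable lengths, avoiding over-generalization), but this covers the fixed-length case in the example.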
Full Example (Python 3):
For Python 2.x look into Note below
import re
mylist = ["dog", "cat", "wildcat", "thundercat", "cow", "hooo"]
r = re.compile(".*cat")
newlist = list(filter(r.match, mylist)) # Read Note below
print(newlist)
Prints:
['cat', 'wildcat', 'thundercat']
Note:
For Python 2.x developers, filter() already returns a list. In Python 3.x, filter() was changed to return an iterator, so it has to be converted to a list (in order to see it printed out nicely).
You can create an iterator in Python 3.x, or a list in Python 2.x, by using:
filter(r.match, mylist)
To convert the Python 3.x iterator to a list, simply wrap it: list(filter(..)).
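A related sketch, reusing the list above: r.match anchors at the beginning of each string, which is why the answer's pattern needs the leading .* - with re.search you can drop it and still find 'cat' anywhere.

```python
import re

mylist = ["dog", "cat", "wildcat", "thundercat", "cow", "hooo"]

# search() scans the whole string, so no leading .* is needed.
print(list(filter(re.compile("cat").search, mylist)))
# ['cat', 'wildcat', 'thundercat']

# match() only succeeds at the start of the string.
print(list(filter(re.compile("cat").match, mylist)))
# ['cat']
```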
Firstly, your regex does not seem to work properly. The Key field should allow values that include f, right? So its group should not be ([0-9A-Ea-e]+) but instead ([0-9A-Fa-f]+). Also, it is a good - actually, a wonderful - practice to prefix regex strings with r, because it avoids problems with \ escaping. (If you do not understand why, look up raw strings.)
Now, my approach to the problem. First, I would create a regex without pipes:
>>> regex = r"(Key):[\s]*([0-9A-Fa-f]+)[\s]*" \
... r"(Index):[\s]*([0-9]+)[\s]*" \
... r"(Field 1):[\s]*([0-9]+)[\s]*" \
... r"(Field 2):[\s]*([0-9 A-Za-z]+)[\s]*" \
... r"(Field 3):[\s]*([-+]?[0-9]+)[\s]*"
With this change, the findall() will return only one tuple of found groups for an entire line. In this tuple, each key is followed by its value:
>>> re.findall(regex, line)
[('Key', 'af12d9', 'Index', '0', 'Field 1', '1234', 'Field 2', '1234 Ring ', 'Field 3', '-10')]
So I get the tuple...
>>> found = re.findall(regex, line)[0]
>>> found
('Key', 'af12d9', 'Index', '0', 'Field 1', '1234', 'Field 2', '1234 Ring ', 'Field 3', '-10')
...and using slices I get only the keys...
>>> found[::2]
('Key', 'Index', 'Field 1', 'Field 2', 'Field 3')
...and also only the values:
>>> found[1::2]
('af12d9', '0', '1234', '1234 Ring ', '-10')
Then I create a list of tuples, each containing a key and its corresponding value, with the zip() function:
>>> zip(found[::2], found[1::2])
[('Key', 'af12d9'), ('Index', '0'), ('Field 1', '1234'), ('Field 2', '1234 Ring '), ('Field 3', '-10')]
(In Python 3, zip() returns an iterator; wrap it in list() to see the pairs.)
The grand finale is to pass the list of tuples to the dict() constructor:
>>> dict(zip(found[::2], found[1::2]))
{'Field 3': '-10', 'Index': '0', 'Field 1': '1234', 'Key': 'af12d9', 'Field 2': '1234 Ring '}
I find this solution the best, but it is indeed a subjective question in some sense. HTH anyway :)
OK, with help of brandizzi, I have found THE answer to this question.
Solution:
listconfig = []
for line in list_of_strings:
    matched = re.search(r"Key:[\s]*(?P<key>[0-9A-Fa-f]+)[\s]*"
                        r"(Index:[\s]*(?P<index>[0-9]+)[\s]*)?"
                        r"(Field 1:[\s]*(?P<field_1>[0-9]+)[\s]*)?"
                        r"(Field 2:[\s]*(?P<field_2>[0-9 A-Za-z]+)[\s]*)?"
                        r"(Field 3:[\s]*(?P<field_3>[-+]?[0-9]+)[\s]*)?", line)
    if matched:
        print(matched.groupdict())
        listconfig.append(matched.groupdict())
You can use the builtin any():
r = re.compile('.*search.*')
if any(r.match(line) for line in output):
    do_stuff()
Passing a lazy generator to any() allows it to exit on the first match, without having to check any further into the iterable.
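The short-circuit is easy to observe with a generator that records which lines it actually handed to any() (the line list here is made up):

```python
import re

r = re.compile('.*search.*')
output = ['first line', 'searching here', 'never examined', 'nor this one']

checked = []
def lines():
    for line in output:
        checked.append(line)   # record every line any() actually pulls
        yield line

found = any(r.match(line) for line in lines())
print(found)    # True
print(checked)  # ['first line', 'searching here'] - iteration stopped at the match
```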
Starting with Python 3.8 and the introduction of assignment expressions (PEP 572, the := operator), we can also capture a witness of an any() expression when a match is found and use it directly:
# pattern = re.compile('.*search.*')
# items = ['hello', 'searched', 'world', 'still', 'searching']
if any((match := pattern.match(x)) for x in items):
print(match.group(0))
# 'searched'
For each item, this:
- Applies the regex match (pattern.match(x))
- Assigns the result to a match variable (either None or a re.Match object)
- Uses the truth value of match as part of the any expression (None -> False, Match -> True)
- If match is None, the any search loop continues
- If match has captured a group, we exit the any expression, which is considered True, and the match variable can be used within the condition's body
I'd like to be able to match strings which meet these criteria:
['foo' OR 'bar' OR 'Python'] AND ['me', OR 'you' OR 'we']
Use lookaheads. ^(?=.*foo|.*bar|.*Python)(?=.*me|.*you|.*we)
Add \b around the words (e.g. \bfoo\b) if you want them matched as isolated words; otherwise you get matches like fool.
https://regex101.com/r/dnqSjr/1
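Applied in Python (sample strings invented for illustration), with the \b boundaries included:

```python
import re

# One lookahead per AND-group; \b keeps 'fool' from matching 'foo'.
pattern = re.compile(r'^(?=.*\b(?:foo|bar|Python)\b)(?=.*\b(?:me|you|we)\b)')

print(bool(pattern.search("Python works for me")))   # True  (Python + me)
print(bool(pattern.search("foo alone")))             # False (no word from group 2)
print(bool(pattern.search("the fool and me")))       # False ('fool' is not 'foo')
```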
If you want to use regex, you have to construct the regex string in your code from the lists.
You have to string all the words together using the regex alternation operator | - and that might not be the most efficient solution, depending on the length of the word lists.
But let's start with the base regex:
\bAWORD\b
This will match "AWORD". \b means word boundary, meaning we don't match partial words. Instead of AWORD we can use a list here: (word1|word2|...etc).
You can construct this list with Python, like so:
import re

word_list1 = ['foo', 'bar', 'Python']
word_list2 = ['me', 'you', 'we']
words1 = '|'.join(word_list1)
words2 = '|'.join(word_list2)
regex = r'\b(?:{})\b'
test_str = "foo is a me word"
print((re.search(regex.format(words1), test_str) and
       re.search(regex.format(words2), test_str)) is not None)
.format() just inserts the '|'-separated words into the regex in place of '{}'. I am sure there is a more "pythonic" way of doing this, but this is the regex way. :)