Brave Search

How to test if a string contains one of the substrings in a list, in pandas?

stackoverflow.com › questions › 26577516 › how-to-test-if-a-string-contains-one-of-the-substrings-in-a-list-in-pandas

One option is just to use the regex | character to try to match each of the substrings in the words in your Series s (still using str.contains).

You can construct the regex by joining the words in searchfor with |:

>>> searchfor = ['og', 'at']
>>> s[s.str.contains('|'.join(searchfor))]
0    cat
1    hat
2    dog
3    fog
dtype: object

As @AndyHayden noted in the comments below, take care if your substrings have special characters such as $ and ^ which you want to match literally. These characters have specific meanings in the context of regular expressions and will affect the matching.

You can make your list of substrings safer by escaping non-alphanumeric characters with re.escape:

>>> import re
>>> matches = ['$money', 'x^y']
>>> safe_matches = [re.escape(m) for m in matches]
>>> safe_matches
['\\$money', 'x\\^y']

The strings with in this new list will match each character literally when used with str.contains.

Answer from Alex Riley on Stack Overflow

PHP

php.net › manual › en › function.str-contains.php

PHP: str_contains - Manual

A couple of functions for checking if a string contains any of the strings in an array, or all of the strings in an array: <?php function str_contains_any(string $haystack, array $needles): bool { return array_reduce($needles, fn($a, $n) => $a || str_contains($haystack, $n), false); } function str_contains_all(string $haystack, array $needles): bool { return array_reduce($needles, fn($a, $n) => $a && str_contains($haystack, $n), true); } ?> str_contains_all() will return true if $needles is an empty array.

Pandas

pandas.pydata.org › docs › reference › api › pandas.Series.str.contains.html

pandas.Series.str.contains — pandas 3.0.3 documentation

If False, treats the pat as a literal string. ... A Series or Index of boolean values indicating whether the given pattern is contained within the string of each element of the Series or Index.

Discussions

python - How to test if a string contains one of the substrings in a list, in pandas? - Stack Overflow

The strings with in this new list will match each character literally when used with str.contains. ... Sign up to request clarification or add additional context in comments. More on stackoverflow.com

stackoverflow.com

python - How to use str.contains() with multiple expressions in pandas dataframes - Stack Overflow

I'm wondering if there is a more efficient way to use the str.contains() function in Pandas, to search for two partial strings at once. I want to search a given column in a dataframe for data that contains either "nt" or "nv". More on stackoverflow.com

stackoverflow.com

python - pandas dataframe str.contains() AND operation - Stack Overflow

I'd like to grab strings that contains 10-20 different words (grape, watermelon, berry, orange, ..., etc.) More on stackoverflow.com

stackoverflow.com

How to use str.contains to get exact matches and not partial ones?

If you're looking for exact matches, str.contains may not be the function you should be using. The output looks correct to me in that all of the strings in the output do contain your keyword. More on reddit.com

r/learnpython

November 10, 2021

Videos