Answer from Andy Hayden on Stack Overflow:

They should be one regular expression, and should be in one string:
"nt|nv" # rather than "nt" | " nv"
f_recs[f_recs['Behavior'].str.contains("nt|nv", na=False)]
Python doesn't let you use the or (|) operator on strings:
In [1]: "nt" | "nv"
TypeError: unsupported operand type(s) for |: 'str' and 'str'
If you have the patterns in a list, then it might be convenient if you join them by a pipe (|) and pass it to str.contains. Return False for NaNs by na=False and turn off case sensitivity by case=False.
lst = ['nt', 'nv', 'nf']
df['Behavior'].str.contains('|'.join(lst), na=False)
Otherwise, it might be cleaner to group the alternations. For the example in the OP, that is:
df['Behavior'].str.contains(r'n[tvf]')  # inside a character class, '|' is literal, so use [tvf]
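As a runnable sketch (the sample data here is made up for illustration), both forms produce the same boolean mask; note that inside a character class the pipe is matched literally, so the grouped form is written n[tvf]:

```python
import pandas as pd

# Hypothetical data to illustrate both approaches.
df = pd.DataFrame({'Behavior': ['nt', 'nv', 'nf', 'xx', None]})

lst = ['nt', 'nv', 'nf']
by_join = df['Behavior'].str.contains('|'.join(lst), na=False)
by_class = df['Behavior'].str.contains(r'n[tvf]', na=False)

print(by_join.tolist())   # [True, True, True, False, False]
print(by_class.tolist())  # [True, True, True, False, False]
```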
You can do that as follows:
df[(df['col_name'].str.contains('apple')) & (df['col_name'].str.contains('banana'))]
You can also do it in regex expression style:
df[df['col_name'].str.contains(r'^(?=.*apple)(?=.*banana)')]
You can then, build your list of words into a regex string like so:
base = r'^{}'
expr = '(?=.*{})'
words = ['apple', 'banana', 'cat'] # example
base.format(''.join(expr.format(w) for w in words))
will render:
'^(?=.*apple)(?=.*banana)(?=.*cat)'
Then you can do your stuff dynamically.
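Putting the pieces together (with hypothetical sample data), the dynamically built lookahead pattern can be passed straight to str.contains to require that every word appears:

```python
import pandas as pd

# Build the pattern dynamically from a list of required words.
base = r'^{}'
expr = '(?=.*{})'
words = ['apple', 'banana']  # example list
pattern = base.format(''.join(expr.format(w) for w in words))
# pattern is now '^(?=.*apple)(?=.*banana)'

# Hypothetical data: only rows containing ALL the words match.
df = pd.DataFrame({'col_name': ['apple banana', 'apple pie', 'banana split']})
mask = df['col_name'].str.contains(pattern)
print(mask.tolist())  # [True, False, False]
```

Lookaheads are non-capturing, so pandas will not emit its "match groups" warning here.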
You need to set the regex flag (to interpret your search as a regular expression):
whatIwant = df['Column_with_text'].str.contains('value1|value2|value3',
case=False, regex=True)
df['New_Column'] = np.where(whatIwant, df['Column_with_text'], np.nan)  # np.where needs a value for non-matches
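Note that np.where requires a third argument giving the value for non-matching rows (calling it with only two arguments raises a ValueError). A self-contained sketch, with made-up data and np.nan as an assumed fallback:

```python
import numpy as np
import pandas as pd

# Hypothetical data for illustration.
df = pd.DataFrame({'Column_with_text': ['has value1', 'nothing', 'VALUE2 here']})

whatIwant = df['Column_with_text'].str.contains('value1|value2|value3',
                                                case=False, regex=True)
# Third argument: fallback for rows where the pattern does not match.
df['New_Column'] = np.where(whatIwant, df['Column_with_text'], np.nan)
print(df['New_Column'].tolist())  # ['has value1', nan, 'VALUE2 here']
```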
------ Edit ------
Based on the updated problem statement, here is an updated answer:
You need to define a capture group in the regular expression using parentheses and use the extract() function to return the values found within the capture group. The lower() function deals with any upper-case letters.
df['MatchedValues'] = df['Text'].str.lower().str.extract( '('+pattern+')', expand=False)
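For a self-contained sketch, here pattern is filled in with a hypothetical value ('apples|bananas') purely for illustration; in the original question it would be built from your own list of search terms:

```python
import pandas as pd

# Hypothetical data and pattern for illustration.
df = pd.DataFrame({'Text': ['I like Apples', 'no fruit here', 'Bananas rot']})
pattern = 'apples|bananas'  # assumed example pattern

# Parentheses form the capture group; extract() returns what it captured.
df['MatchedValues'] = df['Text'].str.lower().str.extract('(' + pattern + ')',
                                                         expand=False)
print(df['MatchedValues'].tolist())  # ['apples', nan, 'bananas']
```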
Here is one way:
import numpy as np

foods = ['apples', 'oranges', 'grapes', 'blueberries']

def matcher(x):
    for i in foods:
        if i.lower() in x.lower():
            return i
    return np.nan

df['Match'] = df['Text'].apply(matcher)
# Text Match
# 0 I want to buy some apples. apples
# 1 Oranges are good for the health. oranges
# 2 John is eating some grapes. grapes
# 3 This line does not contain any fruit names. NaN
# 4 I bought 2 blueberries yesterday. blueberries
One option is just to use the regex | character to try to match each of the substrings against the words in your Series s (still using str.contains).
You can construct the regex by joining the words in searchfor with |:
>>> searchfor = ['og', 'at']
>>> s[s.str.contains('|'.join(searchfor))]
0 cat
1 hat
2 dog
3 fog
dtype: object
As @AndyHayden noted in the comments below, take care if your substrings have special characters such as $ and ^ which you want to match literally. These characters have specific meanings in the context of regular expressions and will affect the matching.
You can make your list of substrings safer by escaping non-alphanumeric characters with re.escape:
>>> import re
>>> matches = ['$money', 'x^y']
>>> safe_matches = [re.escape(m) for m in matches]
>>> safe_matches
['\\$money', 'x\\^y']
The strings in this new list will match each character literally when used with str.contains.
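A brief runnable sketch (with invented sample strings) tying re.escape together with str.contains:

```python
import re
import pandas as pd

# Hypothetical data containing regex metacharacters.
s = pd.Series(['$money talks', 'x^y = z', 'no match'])
matches = ['$money', 'x^y']

# Escape metacharacters so '$' and '^' are matched literally.
safe_matches = [re.escape(m) for m in matches]
mask = s.str.contains('|'.join(safe_matches))
print(mask.tolist())  # [True, True, False]
```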
You can use str.contains alone with a regex pattern using OR (|):
s[s.str.contains('og|at')]
Or you could add the series to a dataframe then use str.contains:
df = pd.DataFrame(s)
df[s.str.contains('og|at')]
Output:
0 cat
1 hat
2 dog
3 fog
You could either use .str again to get access to the string methods, or (better, IMHO) use case=False to guarantee case insensitivity:
>>> df = pd.DataFrame({"body": ["ball", "red BALL", "round sphere"]})
>>> df[df["body"].str.contains("ball")]
body
0 ball
>>> df[df["body"].str.lower().str.contains("ball")]
body
0 ball
1 red BALL
>>> df[df["body"].str.contains("ball", case=False)]
body
0 ball
1 red BALL
>>> df[df["body"].str.contains("ball", case=True)]
body
0 ball
(Note that if you're going to be doing assignments, it's a better habit to use df.loc, to avoid the dreaded SettingWithCopyWarning, but if we're just selecting here it doesn't matter.)
You can also use contains inside query:
In [2]: df = pd.DataFrame({'body': ['Ball', 'cUbE', 'bAll'], 'color': ['red', 'green', 'blue']})
In [3]: df
Out[3]:
body color
0 Ball red
1 cUbE green
2 bAll blue
In [4]: df.query('body.str.contains("ball", case=False).values')
Out[4]:
body color
0 Ball red
2 bAll blue
If you try to match multiple patterns use |:
In [5]: df.query('body.str.contains("ball|cube", case=False).values')
Out[5]:
body color
0 Ball red
1 cUbE green
2 bAll blue