pandas drop rows with condition

How to delete rows from a pandas DataFrame based on a conditional expression [duplicate]

stackoverflow.com › questions › 13851535 › how-to-delete-rows-from-a-pandas-dataframe-based-on-a-conditional-expression

To directly answer this question's original title "How to delete rows from a pandas DataFrame based on a conditional expression" (which I understand is not necessarily the OP's problem but could help other users coming across this question) one way to do this is to use the drop method:

df = df.drop(some labels)
df = df.drop(df[<some boolean condition>].index)

Example

To remove all rows where column 'score' is < 50:

df = df.drop(df[df.score < 50].index)

In place version (as pointed out in comments)

df.drop(df[df.score < 50].index, inplace=True)

Multiple conditions

(see Boolean Indexing)

The operators are: | for or, & for and, and ~ for not. These must be grouped by using parentheses.

To remove all rows where column 'score' is < 50 and > 20

df = df.drop(df[(df.score < 50) & (df.score > 20)].index)

Answer from User on Stack Overflow

Stack Overflow

stackoverflow.com › questions › 13851535 › how-to-delete-rows-from-a-pandas-dataframe-based-on-a-conditional-expression

python - How to delete rows from a pandas DataFrame based on a conditional expression - Stack Overflow

Top answer

1 of 6

1590

df = df.drop(some labels)
df = df.drop(df[<some boolean condition>].index)

Example

To remove all rows where column 'score' is < 50:

df = df.drop(df[df.score < 50].index)

In place version (as pointed out in comments)

df.drop(df[df.score < 50].index, inplace=True)

Multiple conditions

(see Boolean Indexing)

The operators are: | for or, & for and, and ~ for not. These must be grouped by using parentheses.

To remove all rows where column 'score' is < 50 and > 20

df = df.drop(df[(df.score < 50) & (df.score > 20)].index)

2 of 6

280

When you do len(df['column name']) you are just getting one number, namely the number of rows in the DataFrame (i.e., the length of the column itself). If you want to apply len to each element in the column, use df['column name'].map(len). So try

df[df['column name'].map(len) < 2]

GeeksforGeeks

geeksforgeeks.org › pandas › drop-rows-from-the-dataframe-based-on-certain-condition-applied-on-a-column

Drop Rows from Dataframe based on certain Condition applied on a Column - Pandas - GeeksforGeeks

October 3, 2025 - Rows not satisfying the condition are automatically dropped. In this article, we have used nba.csv dataset to download CSV used click here. Example: This code drops all NBA players whose Age is greater than and equal to 25. Python · import pandas as pd df = pd.read_csv('nba.csv') b = df[df['Age'] >= 25] print(b.head(15)) Output ·

Discussions

How to remove rows from a dataframe based on conditions across two rows?

you can make true false arrays using conditions, so you can do for example: exclude = df[Day] > 30 and df[Month] == 2 Which would give exclude == [False True False] for your example values. More on reddit.com

r/learnpython

February 3, 2022

how to select rows based on some conditions [pandas]

yeah don't iterate over the rows, just filter the dataframe using a conditional. examples: df = df[df['score'] >= 90.0] df = df[df['username'].isin(AUTHORIZED_USERS)] df = df[df['name'].str.contains('good')] More on reddit.com

r/learnpython

October 29, 2021

Dropping rows based on values in columns?

filtered_df = df[df[Wage] > 0] filtered_df = filtered_df[filtered_df[Club].notna()] Hope this helps :) More on reddit.com

r/learnpython

September 20, 2022

Delete duplicates with a condition

This is a bit of a 'hack' (it's not a formula-based approach that can be instantly copy/pasted across files), but it would no doubt be your quickest option. By default, the Remove Duplicates feature retains the first matching result, and deletes all subsequent results. So the solution is to simply sort your table of data by date, with the newest records at the top, and oldest dates at the bottom. To do this: Ensure your data is formatted as a table (Ctrl+T). Ensure the dates are correctly formatted using the Date data type (not just "text"... this won't work) You will then have the option to sort the column by Newest to Oldest. Once the sort is applied, you can then use the standard Remove Duplicates feature (DATA tab > Remove Duplicates), based on columns A,B,C. Voila. More on reddit.com

r/excel

September 22, 2022

Videos