python lambda if else dataframe

stackoverflow.com › questions › 44991438 › lambda-including-if-elif-else

Nest if .. elses:

lambda x: x*10 if x<2 else (x**2 if x<4 else x+10)

Answer from Uriel on Stack Overflow

Stack Overflow

stackoverflow.com › questions › 44991438 › lambda-including-if-elif-else

python - Lambda including if...elif...else - Stack Overflow

Top answer

1 of 4

225

Nest if .. elses:

lambda x: x*10 if x<2 else (x**2 if x<4 else x+10)
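
A minimal sketch of how the nested conditional expression behaves, assuming a small sample frame with columns `one` and `two` like the one used in the answers on this page (the name `f` is only for illustration):

```python
import pandas as pd

# Sample data assumed for illustration.
df = pd.DataFrame({"one": [1, 2, 3, 4, 5], "two": [6, 7, 8, 9, 10]})

# The inner conditional is only evaluated when the outer test (x < 2) fails,
# so this reads as: if x < 2 ... elif x < 4 ... else ...
f = lambda x: x * 10 if x < 2 else (x ** 2 if x < 4 else x + 10)

df["three"] = df["one"].apply(f)
print(df["three"].tolist())  # [10, 4, 9, 14, 15]
```

The parentheses around the inner conditional are optional, but they make the elif-like structure easier to see.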

2 of 4

I do not recommend the use of apply here: it should be avoided if there are better alternatives.

For example, if you are performing the following operation on a Series:

if cond1:
    exp1
elif cond2:
    exp2
else:
    exp3

This is usually a good use case for np.where or np.select.

`numpy.where`

The if else chain above can be written using

np.where(cond1, exp1, np.where(cond2, exp2, ...))

np.where allows nesting. With one level of nesting, your problem can be solved with,

df['three'] = (
    np.where(
        df['one'] < 2,
        df['one'] * 10,
        np.where(df['one'] < 4, df['one'] ** 2, df['one'] + 10)))
df

   one  two  three
0    1    6     10
1    2    7      4
2    3    8      9
3    4    9     14
4    5   10     15

`numpy.select`

Allows for flexible syntax and is easily extensible. It follows the form,

np.select([cond1, cond2, ...], [exp1, exp2, ...])

Or, in this case,

np.select([cond1, cond2], [exp1, exp2], default=exp3)

df['three'] = (
    np.select(
        condlist=[df['one'] < 2, df['one'] < 4], 
        choicelist=[df['one'] * 10, df['one'] ** 2], 
        default=df['one'] + 10))
df

   one  two  three
0    1    6     10
1    2    7      4
2    3    8      9
3    4    9     14
4    5   10     15
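
As a self-contained check, the following sketch builds the sample frame (the data is assumed from the printed output) and confirms that nested `np.where`, `np.select` and the `apply` version produce the same values. Note that `np.select` takes the first condition that matches, which is why `df['one'] < 4` can serve as the "elif" branch without excluding `df['one'] < 2`.

```python
import numpy as np
import pandas as pd

# Sample data assumed from the output shown above.
df = pd.DataFrame({"one": [1, 2, 3, 4, 5], "two": [6, 7, 8, 9, 10]})
one = df["one"]

# if / elif / else as nested np.where calls.
nested_where = np.where(one < 2, one * 10, np.where(one < 4, one ** 2, one + 10))

# The same logic with np.select: conditions are tested in order, first match wins.
selected = np.select([one < 2, one < 4], [one * 10, one ** 2], default=one + 10)

# Row-by-row reference implementation.
via_apply = one.apply(lambda x: x * 10 if x < 2 else (x ** 2 if x < 4 else x + 10))

print(selected.tolist())  # [10, 4, 9, 14, 15]
```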

`and`/`or` (similar to the `if`/`else`)

Similar to if-else, requires the lambda:

df['three'] = df["one"].apply(
    lambda x: (x < 2 and x * 10) or (x < 4 and x ** 2) or x + 10) 

df
   one  two  three
0    1    6     10
1    2    7      4
2    3    8      9
3    4    9     14
4    5   10     15
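
One caveat with the `and`/`or` form: it falls through whenever a selected branch evaluates to something falsy such as 0. A short sketch comparing it with the conditional-expression form (the names `and_or` and `if_else` are only for illustration):

```python
and_or = lambda x: (x < 2 and x * 10) or (x < 4 and x ** 2) or x + 10
if_else = lambda x: x * 10 if x < 2 else (x ** 2 if x < 4 else x + 10)

# For the sample values 1..5 every branch result is non-zero, so both agree.
same_on_sample = all(and_or(x) == if_else(x) for x in [1, 2, 3, 4, 5])

# For x = 0 the first branch yields 0, which is falsy, so `or` keeps going
# until it reaches the final fallback.
print(and_or(0))   # 10
print(if_else(0))  # 0
```

For that reason the conditional expression is usually the safer choice when a branch can legitimately return 0, an empty string, or `False`.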

List Comprehension

Loopy solution that is still faster than apply.

df['three'] = [x*10 if x<2 else (x**2 if x<4 else x+10) for x in df['one']]
# df['three'] = [
#    (x < 2 and x * 10) or (x < 4 and x ** 2) or x + 10 for x in df['one']
# ]
df
   one  two  three
0    1    6     10
1    2    7      4
2    3    8      9
3    4    9     14
4    5   10     15

Medium

medium.com › @whyamit101 › using-pandas-lambda-if-else-0d8368b70459

Using pandas lambda if else. The biggest lie in data science? That… | by why amit | Medium

April 12, 2025 - Can I use multiple conditions in a pandas lambda if else statement? Absolutely! You can nest if statements within your lambda function to handle multiple conditions. How do I apply a lambda function to an entire DataFrame?

Discussions

python - Using lambda if condition on different columns in Pandas dataframe - Stack Overflow

2 Using if else statements on lambda expressions on a pandas data frame based on column names · 0 DataFrame: IF-condition in lambda asks about Series, but single element is needed to be checked More on stackoverflow.com

stackoverflow.com

Implementing if-else in python dataframe using lambda when there are multiple variables - Stack Overflow

I am trying to implement if-elif or if-else logic in python while working on a dataframe. I am struggling when working with more than one column. ... If my if-else logic is based on only one column - I know how to do it. df['one'] = df["one"].apply(lambda x: x*10 if x<2 else (x**2 if x<4 else x+10)) More on stackoverflow.com

stackoverflow.com

pandas - Lambda function with if else clause with Python - Stack Overflow

Why not just check if a number ... < 0? You wouldn't even need a lambda function. You could replace all the negative values in the Dataframe with None and set rows whose sum is < 0 to None as well. ... This isn't a problem with if/else or lambda, but of how a series of values ... More on stackoverflow.com

stackoverflow.com

python - lambda row function with if else statement - Stack Overflow

I have a pandas dataframe df and an array of datetimes holidays df.head() date hour count Relative Humidity Temperature Precipitation dow 0 2019-07-01 0 672 57.64 71.8 0.0 Mo... More on stackoverflow.com

stackoverflow.com

Top answer

1 of 5

In [1]: df
Out[1]:
   data
0     1
1     2
2     3
3     4

You want to apply a function that conditionally returns a value based on the selected dataframe column.

In [2]: df['data'].apply(lambda x: 'true' if x <= 2.5 else 'false')
Out[2]:
0     true
1     true
2    false
3    false
Name: data

You can then assign that returned column to a new column in your dataframe:

In [3]: df['desired_output'] = df['data'].apply(lambda x: 'true' if x <= 2.5 else 'false')

In [4]: df
Out[4]:
   data desired_output
0     1           true
1     2           true
2     3          false
3     4          false

2 of 5

Just compare the column with that value:

In [9]: df = pandas.DataFrame([1,2,3,4], columns=["data"])

In [10]: df
Out[10]: 
   data
0     1
1     2
2     3
3     4

In [11]: df["desired"] = df["data"] > 2.5
In [12]: df
Out[12]: 
   data desired
0     1   False
1     2   False
2     3    True
3     4    True
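
Note that the comparison above yields booleans for `> 2.5`, whereas the first answer yields the strings 'true'/'false' for `<= 2.5`. If the string labels are wanted without `apply`, one possible sketch uses `np.where` on the comparison (the column name `desired_output` is taken from the first answer):

```python
import numpy as np
import pandas as pd

df = pd.DataFrame([1, 2, 3, 4], columns=["data"])

# Vectorised equivalent of: lambda x: 'true' if x <= 2.5 else 'false'
df["desired_output"] = np.where(df["data"] <= 2.5, "true", "false")

print(df["desired_output"].tolist())  # ['true', 'true', 'false', 'false']
```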

Stack Overflow

stackoverflow.com › questions › 37443082 › using-lambda-if-condition-on-different-columns-in-pandas-dataframe

python - Using lambda if condition on different columns in Pandas dataframe - Stack Overflow

Top answer

1 of 3

is that what you want?

In [300]: frame[['b','c']].apply(lambda x: x['c'] if x['c']>0 else x['b'], axis=1)
Out[300]:
0   -1.099891
1    0.582815
2    0.901591
3    0.900856
dtype: float64

2 of 3

Solution

use a vectorized approach

frame['d'] = frame.b + (frame.c > 0) * (frame.c - frame.b)

Explanation

This is derived from the sum of

(frame.c > 0) * frame.c  # frame.c if positive

Plus

(frame.c <= 0) * frame.b  # frame.b if c is not positive

However

(frame.c <= 0)

is equivalent to

1 - (frame.c > 0)

and when combined you get

frame['d'] = frame.b + (frame.c > 0) * (frame.c - frame.b)
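
To make the arithmetic concrete, here is a small sketch with made-up values for `b` and `c` (the original `frame` is not shown, so this data is an assumption). It compares the arithmetic form with an explicit `np.where`:

```python
import numpy as np
import pandas as pd

# Hypothetical data; only the column names b and c come from the answer.
frame = pd.DataFrame({"b": [-1.0, 0.5, 2.0, -3.0], "c": [-0.5, 0.25, 0.0, 4.0]})

# Arithmetic form: the boolean mask acts as 0 or 1.
arithmetic = frame.b + (frame.c > 0) * (frame.c - frame.b)

# Explicit form: take c where it is positive, otherwise b.
explicit = np.where(frame.c > 0, frame.c, frame.b)

print(explicit.tolist())  # [-1.0, 0.25, 2.0, 4.0]
```

The two forms agree for finite values. If `b` can contain NaN, the arithmetic form propagates it even in rows where `c` is selected, so `np.where` is the more robust spelling.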

Data to Fish

datatofish.com › if-condition-in-pandas-dataframe

Two Ways to Apply an If-Condition on a pandas DataFrame

import pandas as pd data = {'fish': ['salmon', 'pufferfish', 'shark'], 'caught_count': [100, 5, 0] } df = pd.DataFrame(data) df['caught_count'] = df['fish'].apply(lambda x: 10 if x == "pufferfish") df['ge_100'] = df['caught_count'].apply(lambda x: True if x >= 100 else False) That's it!
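
As quoted in the snippet, `lambda x: 10 if x == "pufferfish"` has no `else` branch, and a Python conditional expression requires one. The sketch below checks that and shows one way to complete it. The `else` value is an assumption made for illustration (rows that are not pufferfish keep their count); the snippet does not say what the intended fallback is.

```python
import numpy as np
import pandas as pd

data = {"fish": ["salmon", "pufferfish", "shark"], "caught_count": [100, 5, 0]}
df = pd.DataFrame(data)

# A conditional expression without `else` does not compile.
try:
    compile('lambda x: 10 if x == "pufferfish"', "<snippet>", "eval")
    needs_else = False
except SyntaxError:
    needs_else = True

# Assumed fallback: keep the existing count for every other fish.
df["caught_count"] = np.where(df["fish"] == "pufferfish", 10, df["caught_count"])

# `True if cond else False` is just the comparison itself.
df["ge_100"] = df["caught_count"] >= 100

print(df["caught_count"].tolist())  # [100, 10, 0]
```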

Stack Overflow

stackoverflow.com › questions › 50730408 › implementing-if-else-in-python-dataframe-using-lambda-when-there-are-multiple-va

Implementing if-else in python dataframe using lambda when there are multiple variables - Stack Overflow

Top answer

1 of 2

Apply across columns

Use pd.DataFrame.apply instead of pd.Series.apply and specify axis=1:

df['one'] = df.apply(lambda row: row['one']*100 if row['two']>8 else \
                     (row['one']*1 if row['two']<8 else row['one']**2), axis=1)

Unreadable? Yes, I agree. Let's try again but this time rewrite as a named function.

Using a function

Note lambda is just an anonymous function. We can define a function explicitly and use it with pd.DataFrame.apply:

def calc(row):
    if row['two'] > 8:
        return row['one'] * 100
    elif row['two'] < 8:
        return row['one']
    else:
        return row['one']**2

df['one'] = df.apply(calc, axis=1)

Readable? Yes. But this isn't vectorised. We're looping through each row one at a time. We might as well have used a list. Pandas isn't just for clever table formatting; you can use it for vectorised calculations using arrays in contiguous memory blocks. So let's try one more time.

Vectorised calculations

Using numpy.where:

df['one'] = np.where(df['two'] > 8, df['one'] * 100,
                     np.where(df['two'] < 8, df['one'],
                              df['one']**2))

There we go. Readable and efficient. We have effectively vectorised our if / else statements. Does this mean that we are doing more calculations than necessary? Yes! But this is more than offset by the way in which we are performing the calculations, i.e. with well-defined blocks of memory rather than pointers. You will find an order of magnitude performance improvement.
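
The same two-column logic can also be written with `np.select`, which avoids the nesting. This is a sketch on a made-up three-row frame (the question's data is not shown here), checked against the row-wise `calc` function:

```python
import numpy as np
import pandas as pd

# Hypothetical data covering all three branches (two < 8, two == 8, two > 8).
df = pd.DataFrame({"one": [1, 2, 3], "two": [7, 8, 9]})

def calc(row):
    if row["two"] > 8:
        return row["one"] * 100
    elif row["two"] < 8:
        return row["one"]
    else:
        return row["one"] ** 2

# Row-by-row reference result.
expected = df.apply(calc, axis=1)

# Vectorised version: conditions are checked in order, first match wins.
vectorised = np.select(
    [df["two"] > 8, df["two"] < 8],
    [df["one"] * 100, df["one"]],
    default=df["one"] ** 2,
)

print(vectorised.tolist())  # [1, 4, 300]
```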

Another example

Well, we can just use numpy.where again.

df['one'] = np.where(df['name'].isin(['a', 'b']), 100, df['two'])

2 of 2

you can do

df.apply(lambda x: x["one"] + x["two"], axis=1)

But I don't think that such a long lambda as lambda x: x["one"]*100 if x["two"]>8 else (x["one"]*1 if x["two"]<8 else x["one"]**2) is very Pythonic. apply takes any callback:

def my_callback(x):
    if x["two"] > 8:
        return x["one"]*100
    elif x["two"] < 8:
        return x["one"]
    else:
        return x["one"]**2

df.apply(my_callback, axis=1)

Towards Data Science

towardsdatascience.com › home › latest › 5 ways to apply if-else conditional statements in pandas

5 Ways to Apply If-Else Conditional Statements in Pandas | Towards Data Science

January 28, 2025 - df['new column name'] = df['column name'].apply(lambda x: 'value if condition is true' if x condition else 'value if condition is false')

sqlpey

sqlpey.com › python › top-methods-to-use-lambda-with-if-else-in-pandas

Top Methods to Use Lambda with If-Else in Pandas - sqlpey

November 23, 2024 - Learn how to effectively apply a lambda function with if-elif-else logic in a Pandas DataFrame, alongside other efficient methods.

GeeksforGeeks

geeksforgeeks.org › using-apply-in-pandas-lambda-functions-with-multiple-if-statements

Using Apply in Pandas Lambda functions with multiple if statements - GeeksforGeeks

April 20, 2022 - We normally use lambda functions to apply any condition on a dataframe, ... An anonymous function which we can pass in instantly without defining a name or any thing like a full traditional function.

ListenData

listendata.com › home › python

Python Lambda Function with Examples

Example 4 : Multiple or Nested IF-ELSE Statement Suppose you want to create a flag wherein it is yes when value of a variable is greater than or equal to 1 but less than or equal to 5. Else it is no if value is equal to 7. Otherwise missing. mydf = pd.DataFrame({'Names': np.arange(1,10,2)}) mydf["flag"] = mydf["Names"].apply(lambda x: "yes" if x>=1 and x<=5 else "no" if x==7 else np.nan)

Stack Overflow

stackoverflow.com › questions › 51789766 › lambda-function-with-if-else-clause-with-python

pandas - Lambda function with if else clause with Python - Stack Overflow

I have a dataframe that looks like: A B C D SUM 2 5 -4 12 15 I try and run: df.apply((lambda x: x / x.sum() if x/x.sum() >= 0 else None), axis=1).fillna(0) to get, if cell is s...

Saturn Cloud

saturncloud.io › blog › how-to-use-ifelse-function-in-pandas-dataframe

How to Use If-Else Function in Pandas DataFrame | Saturn Cloud Blog

November 10, 2023 - To use the if-else function in Pandas DataFrame, you can use the apply() function along with a lambda function. The apply() function applies a function along an axis of the DataFrame.

Statology

statology.org › home › pandas: how to use apply & lambda together

Pandas: How to Use Apply & Lambda Together

June 23, 2022 - You can use the following basic syntax to apply a lambda function to a pandas DataFrame: df['col'] = df['col'].apply(lambda x: 'value1' if x < 20 else 'value2')

Spark By {Examples}

sparkbyexamples.com › home › python › python lambda using if else

Python Lambda using if else - Spark By {Examples}

May 31, 2024 - The lambda should have only one expression so here, it should be if-else. The if returns the body when the condition is satisfied, and the else body is returned when the condition is not satisfied.

Stack Overflow

stackoverflow.com › questions › 65192684 › lambda-row-function-with-if-else-statement

python - lambda row function with if else statement - Stack Overflow

Top answer

1 of 1

By default, df.apply(...) applies on columns. To apply your lambda on each row, specify:

df.apply(..., axis=1)

Aside from that, this looks very inefficient and can be made much faster without any lambda. A more efficient method is to vectorize your logic:

cond_wkend = df['dow'].isin({'Saturday', 'Sunday'})
cond_holdy = pd.to_datetime(df['date']).isin(holidays)

df['is_workday'] = ~(cond_wkend | cond_holdy)
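
A self-contained sketch of the vectorised version, using hypothetical rows and a hypothetical `holidays` array (the question's data is truncated, so the day names and the holiday date below are assumptions):

```python
import pandas as pd

# Hypothetical sample: a Monday, a Thursday that is a holiday, and a Saturday.
df = pd.DataFrame(
    {
        "date": ["2019-07-01", "2019-07-04", "2019-07-06"],
        "dow": ["Monday", "Thursday", "Saturday"],
    }
)
holidays = pd.to_datetime(["2019-07-04"])

# One boolean per row for each condition; no lambda needed.
cond_wkend = df["dow"].isin(["Saturday", "Sunday"])
cond_holdy = pd.to_datetime(df["date"]).isin(holidays)

# A workday is any day that is neither a weekend nor a holiday.
df["is_workday"] = ~(cond_wkend | cond_holdy)

print(df["is_workday"].tolist())  # [True, False, False]
```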

Stack Overflow

stackoverflow.com › questions › 28195028 › python-using-lambda-function-on-pandas-series-if-else

if statement - Python: using lambda function on Pandas Series, if.. else - Stack Overflow

Top answer

1 of 1

Your

testdata.map(lambda x: x if (x < 30 or x > 60) else 0)

already returns what you want.
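
If a version without a lambda is preferred, `Series.where` expresses the same rule. A sketch with made-up values for `testdata`:

```python
import pandas as pd

# Hypothetical values, including both boundary cases 30 and 60.
testdata = pd.Series([10, 30, 45, 60, 75])

# Element-wise lambda, as in the answer.
mapped = testdata.map(lambda x: x if (x < 30 or x > 60) else 0)

# Vectorised: keep values where the condition holds, replace the rest with 0.
vectorised = testdata.where((testdata < 30) | (testdata > 60), 0)

print(vectorised.tolist())  # [10, 0, 0, 0, 75]
```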

Stack Overflow

stackoverflow.com › questions › 77030534 › python-assign-a-column-using-lambda-with-if-else

pandas - Python : assign a column using lambda with if else - Stack Overflow

Top answer

1 of 3

The easiest way to do this is:

df["pk_day"] = df["Date"].dt.weekday.lt(5)

Now, for why your second statement does not work. You're using:

df[(df['Date'].dt.strftime('%a')=='Sat')|(df['Date'].dt.strftime('%a')=='Sun')]

This returns the rows in the DataFrame where the day is a weekend. Hence, it is not boolean. You could use:

lambda pkday:False if ((df['Date'].dt.strftime('%a')=='Sat')|(df['Date'].dt.strftime('%a')=='Sun')) else True

Your last statement works without any errors.

2 of 3

Keep it simple, the following will do the job.

pk_day = lambda pkday : df['Date'].dt.strftime('%a') in ('Sat','Sun')
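
A caution on the lambda forms above: Python's `if ... else` and `in` both need a single True/False, so applying them to a whole Series raises the "truth value of a Series is ambiguous" error. The sketch below uses hypothetical dates to show that, together with two element-wise alternatives (here True means a weekday, matching `dt.weekday.lt(5)`):

```python
import pandas as pd

# Hypothetical sample dates: Friday, Saturday, Sunday, Monday.
df = pd.DataFrame(
    {"Date": pd.to_datetime(["2023-09-01", "2023-09-02", "2023-09-03", "2023-09-04"])}
)

# `in` compares the whole Series to each tuple item and then asks for one bool.
try:
    df["Date"].dt.strftime("%a") in ("Sat", "Sun")
    in_operator_failed = False
except ValueError:
    in_operator_failed = True

# Element-wise alternatives that return one boolean per row.
weekday_by_number = df["Date"].dt.weekday.lt(5)  # Monday=0 ... Sunday=6
weekday_by_name = ~df["Date"].dt.day_name().isin(["Saturday", "Sunday"])

print(weekday_by_number.tolist())  # [True, False, False, True]
```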

GeeksforGeeks

geeksforgeeks.org › python › ways-to-apply-an-if-condition-in-pandas-dataframe

How to apply if condition in Pandas DataFrame - GeeksforGeeks

July 15, 2025 - Python · import pandas as pd # Sample DataFrame data = {'Name': ['John', 'Sophia', 'Daniel', 'Emma'], 'Experience': [5, 8, 3, 10]} df = pd.DataFrame(data) print("Original Dataset") display(df) # Apply if condition using lambda function df['Category'] = df['Experience'].apply(lambda x: 'Senior' if x >= 5 else 'Junior') print("Dataset with 'Senior'and 'Junior' Category") display(df) Output: Applying 'if condition' to classify the 'Experience' column into 'Senior' and 'Junior' categories ·

Stack Overflow

stackoverflow.com › questions › 50952129 › pandas-apply-lambda-if-else-incorrect

python 2.7 - pandas apply lambda if else incorrect - Stack Overflow

Top answer

1 of 2

The issue was that you were first filling the NaN and then using .str.split(), so the equality should be with a list, not the element of the list. You can see this by first checking what x is in your lambda function.

dfs['freq'].str.split(',')
#0                                       [text1]
#1                         [text1, text2, text1]
#2                         [text1, text2, text3]
#3                                       [text1]
#4           [text1, text2, text3, text4, text5]
#5                                    [no_guide]
#6    [text1, text2, text3, text4, text5, text6]

The correct equality to check is whether x is a list whose only element is 'no_guide':

lambda x: 0 if x == ['no_guide'] else len(set(x))

Since len(set(x)) returns a number, you may also want to return 0 and not the string '0'.
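
A short sketch of the corrected check on a few hypothetical rows (the column name `freq` and the `no_guide` filler come from the answer; the sample values are assumed):

```python
import pandas as pd

# Hypothetical sample, including a missing value that becomes 'no_guide'.
dfs = pd.DataFrame({"freq": ["text1", "text1,text2,text1", None]})

# After fillna + split, every element is a list of strings.
split = dfs["freq"].fillna("no_guide").str.split(",")

# Compare against the one-element list, then count distinct entries otherwise.
dfs["counts"] = split.apply(lambda x: 0 if x == ["no_guide"] else len(set(x)))

print(dfs["counts"].tolist())  # [1, 2, 0]
```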

2 of 2

You could use this:

df['freq'].fillna('no_guide', inplace=True)
df['counts'] = df['freq'].str.split(',', expand=True)\
                         .apply(lambda x: x.str.contains('text')).sum(1)

df

Output:

  guide                                 freq  counts
0    g1                                text1     1.0
1    g2                    text1,text2,text1     3.0
2    g3                    text1,text2,text3     3.0
3    g4                                text1     1.0
4    g5        text1,text2,text3,text4,text5     5.0
5    g6                             no_guide     0.0
6    g7  text1,text2,text3,text4,text5,text6     6.0

reddit.com › r/learnpython › if-else condition in assign statement in pipe chain with pandas?

r/learnpython on Reddit: If-else condition in assign statement in pipe chain with pandas?

December 30, 2022 -

Hello everyone,

I can't figure this one out, unfortunately: How do I use an if-else statement in the assign function from pandas to create a new column called 'id'? I know that I could do it alternatively with np.where at the beginning, like in the commented out line, but for future applications, I would like to know how to do it properly in the pipe chain. I tried a lambda function, but it throws the following error: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().

Help to solve this is much appreciated!

(data
    # .assign(country_indicator = np.where(data.country == "Switzerland", "CH", "EU"))
    .query("continent == 'Europe'")
    .reset_index(drop=True)
    .assign(id = lambda df: "CH" if df.country == "Switzerland" else "EU")
)

The data is the gapminder dataset and the relevant columns look like this:

index | country | continent
12	Albania	Europe
13	Albania	Europe
14	Albania	Europe
15	Albania	Europe
16	Albania	Europe
...	...	...
1603	United Kingdom	Europe
1604	United Kingdom	Europe
1605	United Kingdom	Europe
1606	United Kingdom	Europe
1607	United Kingdom	Europe

Top answer

1 of 2

You can still use np.where() inside the lambda:

>>> df.assign(some="value").assign(id=lambda df: np.where(df["country"] == "Switzerland", "CH", "EU"))
       country   some  id
0  Switzerland  value  CH
1       Poland  value  EU
2      Germany  value  EU

If you modify the country column in the chain, you can see it's working on the current "piped" version of the data:

>>> df.assign(country="should not match").assign(id=lambda df: np.where(df["country"] == "Switzerland", "CH", "EU"))
            country  id
0  should not match  EU
1  should not match  EU
2  should not match  EU

Although you may want to choose a different variable name for the lambda - it can get confusing.

There's also .map() which can be passed a dictionary - it's useful for multiple options:

>>> df.assign(some="value").assign(id=lambda df: df["country"].map(dict(Switzerland="CH", Poland="PL")).fillna("EU"))
       country   some  id
0  Switzerland  value  CH
1       Poland  value  PL
2      Germany  value  EU

2 of 2

The problem is a conflict between numpy and python - Who takes precedence when comparing an entire array to a constant? If you search the internet you can find more details. As for specific problem, you should be able to do that more easily using .map() on the relevant column, e.g. df['continent'].map(). Just pay attention that your lambda doesn't take df as argument, but a generic variable, say c, and then compare it to the desired continent. Used in this way, the comparison is element wise, and not column wise.
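
The element-wise approach can be sketched inside the original pipe chain as follows. The three-row frame is a stand-in for the gapminder data, and the lambda argument names `d` and `c` are only for illustration. The error in the question arises because `"CH" if df.country == "Switzerland" else "EU"` asks for one True/False from a whole column, whereas `.map()` hands the inner lambda one value at a time.

```python
import pandas as pd

# Hypothetical stand-in for the gapminder columns used in the question.
data = pd.DataFrame(
    {
        "country": ["Switzerland", "Poland", "Germany"],
        "continent": ["Europe", "Europe", "Europe"],
    }
)

result = (
    data
    .query("continent == 'Europe'")
    .reset_index(drop=True)
    # Outer lambda receives the piped DataFrame; inner lambda receives one country.
    .assign(id=lambda d: d["country"].map(lambda c: "CH" if c == "Switzerland" else "EU"))
)

print(result["id"].tolist())  # ['CH', 'EU', 'EU']
```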
