pandas modify column values lambda

how to update a pandas dataframe column value, when a specific string appears in another column?

reddit.com › r › learnpython › comments › 1dngi2b › how_to_update_a_pandas_dataframe_column_value

It's not something you'd really use .apply for. You would use boolean indexing, e.g. df['A'].str.contains('foo') would give you a Series of True/False values. You can then use .loc to set column(s) to a particular value for the True rows: df.loc[df['A'].str.contains('foo'), 'B'] = 'bar' Answer from commandlineluser on reddit.com
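The boolean-indexing approach from this answer can be sketched end to end as follows. The sample data is made up for illustration; `na=False` and `regex=False` are optional arguments added here on the assumption that column A may contain missing values and that 'foo' should be matched literally.

```python
import pandas as pd

# Hypothetical sample data; the column names "A" and "B" follow the answer above.
df = pd.DataFrame({
    "A": ["foo one", "two", "a foo b", None],
    "B": ["x", "y", "z", "w"],
})

# Boolean mask: True where A contains the literal substring "foo".
# na=False makes missing values count as "no match" instead of NaN.
mask = df["A"].str.contains("foo", regex=False, na=False)

# Overwrite B only on the matching rows; every other row keeps its value.
df.loc[mask, "B"] = "bar"

print(df)
```

Because the mask is computed for the whole column at once, no row-by-row lambda is needed.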

r/learnpython on Reddit: how to update a pandas dataframe column value, when a specific string appears in another column?

June 24, 2024

So, I've figured out how to use the pandas apply method to update/change the values of a column, row-wise, based on multiple comparisons like this:

# for each row, if the values of both 'columns to check' are 'SOME STRING', change to 'NEW STRING'
# otherwise leave it as is
# (each column needs its own == comparison; `a and b == 'SOME STRING'` only tests b)
my_df['column_to_change'] = my_df.apply(
    lambda row: 'NEW STRING'
    if row['column_to_check_1'] == 'SOME STRING' and row['column_to_check_2'] == 'SOME STRING'
    else row['column_to_change'],
    axis=1,
)

Now, I can't figure out how to expand that beyond simple comparison operators. The specific example I'm trying to solve is:

" for each row, if the string value in COLUMN A contains 'foo', change the value in COLUMN B to 'bar', otherwise leave it as is"

I think this is all right, except the ##parts between the hashmarks##

my_df ['columb_b'] = df.apply(lambda row: 'bar' if ##column A contains 'foo'## else row['columb_b'], axis=1)

Top answer

1 of 3

2 of 3

Are you asking how to check if a string contains 'foo'?
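If the goal is only to fill in the placeholder between the hashmarks, Python's `in` operator is one way to test for a substring inside the lambda. This is a minimal sketch with made-up data and the hypothetical column names `column_a` and `column_b`; it assumes column A holds only strings (a missing value would raise a TypeError).

```python
import pandas as pd

# Hypothetical sample data standing in for the asker's DataFrame.
my_df = pd.DataFrame({
    "column_a": ["has foo inside", "nothing here", "foo"],
    "column_b": ["x", "y", "z"],
})

# Row-wise version: "foo" in <string> is True when the substring is present.
my_df["column_b"] = my_df.apply(
    lambda row: "bar" if "foo" in row["column_a"] else row["column_b"],
    axis=1,
)

print(my_df)
```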

Stack Overflow

stackoverflow.com › questions › 57614324 › replace-values-in-dataframe-column-when-they-start-with-string-using-lambda

pandas - Replace values in DataFrame column when they start with string using lambda - Stack Overflow

Top answer

1 of 3

use the apply method

In [80]: x = {'Value': ['Test', 'XXX123', 'XXX456', 'Test']}
In [81]: df = pd.DataFrame(x)
In [82]: df.Value.apply(lambda x: np.nan if x.startswith('XXX') else x)
Out[82]:
0    Test
1     NaN
2     NaN
3    Test
Name: Value, dtype: object

Performance comparison of apply, where, loc

2 of 3

np.where() performs way better here:

df.Value = np.where(df.Value.str.startswith('XXX'), np.nan, df.Value)

Performance vs apply on larger dfs:
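One rough way to reproduce such a comparison locally is a small `timeit` sketch like the one below. The DataFrame size and repeat count are arbitrary assumptions, and the timings will vary by machine and pandas version, so no numbers are claimed here.

```python
import timeit

import numpy as np
import pandas as pd

# Arbitrary test data: the four sample values from above repeated to 100,000 rows.
df = pd.DataFrame({"Value": ["Test", "XXX123", "XXX456", "Test"] * 25_000})


def with_apply():
    # Row-by-row Python function call for every value.
    return df["Value"].apply(lambda x: np.nan if x.startswith("XXX") else x)


def with_where():
    # Vectorized mask plus a single np.where call.
    starts = df["Value"].str.startswith("XXX")
    return pd.Series(np.where(starts, np.nan, df["Value"]), index=df.index)


apply_seconds = timeit.timeit(with_apply, number=3)
where_seconds = timeit.timeit(with_where, number=3)
print(f"apply:    {apply_seconds:.3f} s for 3 runs")
print(f"np.where: {where_seconds:.3f} s for 3 runs")
```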

Discussions

python - Using Lambda Function Pandas to Set Column Values - Stack Overflow

Could anyone suggest a way to answer the same question (see link) but by using a lambda function: Update a dataframe in pandas while iterating row by row

python - Update a pandas data frame column using Apply,Lambda and Group by Functions - Data Science Stack Exchange

I have a data frame in the format mentioned in the screenshot below. Column 'Candidate Won' has only 'loss' as the column value for all the rows. I want to update the Column 'Candidate Won' to a va...

June 6, 2020

pandas - use .apply() function to change values to a column of the dataframe - Data Science Stack Exchange

I have a dataframe which is the following: and I would like to consider only the column of instructions and keep just the values push, test, mov, test ,....., so just the first word of each string ...

October 31, 2019

python - Pandas change column value based on other column with lambda function - Stack Overflow

Trying to replicate a simple Excel function in pandas, with no success. Haven't tried np.where() yet, as I want to learn lambda functions and rely less on imports where possible. Function to replic...

Saturn Cloud

saturncloud.io › blog › using-lambda-function-pandas-to-set-column-values

Using Lambda Function Pandas to Set Column Values | Saturn Cloud Blog

January 25, 2024 - We can use a lambda function inside the .apply() function to set column values based on certain conditions. Let’s take a look at an example. Suppose we have a DataFrame df with columns A, B, and C. We want to set the values in column C based ...

Stack Overflow

stackoverflow.com › questions › 44752968 › using-lambda-function-pandas-to-set-column-values

python - Using Lambda Function Pandas to Set Column Values - Stack Overflow

Top answer

1 of 1

You'll want to use apply with the parameter axis=1 to ensure the function passed to apply is applied to each row.

The referenced question has an answer that uses this loop.

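for i, row in df.iterrows():
    # `something`, `x`, and `y` are placeholders from the original question
    if something:
        value = x
    else:
        value = y

    # write back through df.loc: `row` is a copy, and df.ix was removed in pandas 1.0
    df.loc[i, 'ifor'] = value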

To use a lambda with the same logic

df['ifor'] = df.apply(lambda row: x if something else y, axis=1)
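To make the placeholders concrete, here is a small sketch in which `something`, `x`, and `y` are replaced with a hypothetical rule (a `score` column with a pass mark of 50). The column and labels are invented for illustration; only the `ifor` column name comes from the answer. An `np.where` version is included as a vectorized equivalent.

```python
import numpy as np
import pandas as pd

# Hypothetical data: "score" stands in for whatever the condition inspects.
df = pd.DataFrame({"score": [35, 80, 55]})

# Row-wise lambda, mirroring the pattern in the answer above.
df["ifor"] = df.apply(
    lambda row: "pass" if row["score"] >= 50 else "fail",
    axis=1,
)

# Vectorized equivalent of the same rule, shown for comparison.
df["ifor_vectorized"] = np.where(df["score"] >= 50, "pass", "fail")

print(df)
```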

Statology

statology.org › home › pandas: how to use apply & lambda together

Pandas: How to Use Apply & Lambda Together

June 23, 2022 - You can use the following basic syntax to apply a lambda function to a pandas DataFrame: df['col'] = df['col'].apply(lambda x: 'value1' if x < 20 else 'value2')

YouTube

youtube.com › watch

How to Change Values in a DataFrame Column Using Lambda without Overwriting Existing Data - YouTube

Discover a simple method to use lambda functions with pandas to modify column values conditionally, while preserving existing data in a DataFrame.---This vid...

Published April 16, 2025

Stack Exchange

datascience.stackexchange.com › questions › 75556 › update-a-pandas-data-frame-column-using-apply-lambda-and-group-by-functions

python - Update a pandas data frame column using Apply,Lambda and Group by Functions - Data Science Stack Exchange

June 6, 2020

df_andhrapradesh['Candidate Won']=df_andhrapradesh['% of Votes'].apply(lambda x:"Won" if x==df_andhrapradesh.groupby('Constituency')['% of Votes'].max() else "Loss")

... Srujan K.N. ...

You want to change the column "Candidate Won" value to won if the '% of votes' column is maximum in each group where grouping based on 'Constituency' column, right?

...

I used 'Apply' function to every row in the pandas data frame and created a custom function to return the value for the 'Candidate Won' Column using data frame,row-level 'Constituency','% of Votes'
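The attempted lambda compares a single value against the whole Series returned by `groupby(...).max()`, which cannot be reduced to one True/False. One alternative, swapped in here and not taken from the thread, is `groupby(...).transform('max')`, which broadcasts each group's maximum back to every row so the comparison becomes element-wise. The sample data below is made up; the column names follow the question.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data using the question's column names.
df = pd.DataFrame({
    "Constituency": ["A", "A", "B", "B", "B"],
    "% of Votes": [40.0, 60.0, 30.0, 50.0, 20.0],
    "Candidate Won": ["Loss"] * 5,
})

# transform("max") returns a Series aligned with df: each row gets its group's maximum.
group_max = df.groupby("Constituency")["% of Votes"].transform("max")

# Element-wise comparison; rows equal to their group's maximum become "Won".
df["Candidate Won"] = np.where(df["% of Votes"] == group_max, "Won", "Loss")

print(df)
```

Note that tied maxima within a constituency would all be marked "Won" under this rule.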

Saturn Cloud

saturncloud.io › blog › how-to-update-column-values-in-pandas-based-on-criteria-from-another-column

How to Update Column Values in Pandas Based on Criteria From Another Column | Saturn Cloud Blog

January 18, 2024 - In some cases, you may need to ... column. You can do this by modifying the lambda function to return a tuple of updated values, and then assigning the tuple to the corresponding columns....
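A sketch of the tuple-returning idea might look like the following. The columns (`qty`, `price`, `status`) and the half-price rule are invented for illustration; `result_type="expand"` is a real `DataFrame.apply` option that turns each returned tuple into separate columns.

```python
import pandas as pd

# Hypothetical data: halve the price and flag the row when qty is at least 10.
df = pd.DataFrame({
    "qty": [0, 5, 12],
    "price": [20.0, 20.0, 20.0],
    "status": ["", "", ""],
})

# The lambda returns a (price, status) tuple for each row.
updated = df.apply(
    lambda row: (row["price"] * 0.5, "discounted")
    if row["qty"] >= 10
    else (row["price"], "regular"),
    axis=1,
    result_type="expand",  # one output column per tuple element
)

# Name the expanded columns explicitly, then assign both columns at once.
updated.columns = ["price", "status"]
df[["price", "status"]] = updated

print(df)
```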

CopyProgramming

copyprogramming.com › howto › pandas-change-values-in-column-based-on-condition-lambda

Python: Modifying column values in pandas using a lambda function based on a condition

September 15, 2023 - My goal is to assign a value of 1 to every cell that matches the highest value found in the other columns of the same row. ... df_ref['max'] = df_ref.max(axis=1) df_ref['col1'] = df_ref.col1.apply(lambda x:1 if (x==df_ref['max']) else 0) ... You are close to the solution.
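The lambda in the snippet compares a single cell `x` with the entire `df_ref['max']` column, which is ambiguous as a condition. Assuming the intent is "1 where a cell equals its row's maximum, else 0", one vectorized sketch uses `DataFrame.eq` with `axis=0` so that each row is compared against its own maximum. The sample values are made up.

```python
import pandas as pd

# Hypothetical data with the snippet's naming style.
df_ref = pd.DataFrame({
    "col1": [1, 9, 3],
    "col2": [5, 2, 3],
    "col3": [4, 7, 1],
})

# Maximum of each row across all columns.
row_max = df_ref.max(axis=1)

# Compare every cell with its row's maximum (axis=0 aligns row_max on the index),
# then convert True/False to 1/0.
flags = df_ref.eq(row_max, axis=0).astype(int)

print(flags)
```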

TutorialsPoint

tutorialspoint.com › article › how-to-replace-values-in-columns-based-on-condition-in-pandas

How to Replace Values in Columns Based on Condition in Pandas

July 10, 2023

import pandas as pd

data = {
    'name': ['Alice', 'Bob', 'Charlie', 'David', 'Emily'],
    'age': [25, 35, 45, 55, 65],
    'gender': ['F', 'M', 'M', 'F', 'F']
}
df = pd.DataFrame(data)

# Replace gender with 'F' where name starts with 'A'
df['gender'] = df.apply(lambda x: 'F' if x['name'].startswith('A') else x['gender'], axis=1)
print(df)

      name  age gender
0    Alice   25      F
1      Bob   35      M
2  Charlie   45      M
3    David   55      F
4    Emily   65      F

NumPy's where function provides a vectorized approach for conditional replacement:

import pandas as pd
import numpy as np

data = {
    'name': ['Alice', 'Bob', 'Charlie', 'David', 'Emily'],
    'age': [25, 35, 45, 55, 65],
    'gender': ['F', 'M', 'M', 'F', 'F']
}
df = pd.DataFrame(data)

# Replace age with 0 where gender is 'M', keep original age otherwise
df['age'] = np.where(df['gender'] == 'M', 0, df['age'])
print(df)

Stack Exchange

datascience.stackexchange.com › questions › 62452 › use-apply-function-to-change-values-to-a-column-of-the-dataframe

pandas - use .apply() function to change values to a column of the dataframe - Data Science Stack Exchange

Top answer

1 of 2

Given that your DataFrame is data, use the apply() calls below.

For a column where each cell is a list of strings:

data['New_instructions'] = data['instructions'].apply(lambda x: [i.split()[0].strip() for i in x])

For a column where each cell is a single string:

data['New_instructions'] = data['instructions'].apply(lambda x: x.split()[0].strip())

2 of 2

Use a lambda function as follows:

dataFrame['opcodes'] = dataFrame['instructions'].apply(lambda x:[i.split()[0] for i in x])

Stack Overflow

stackoverflow.com › questions › 70262762 › pandas-change-column-value-based-on-other-column-with-lambda-function

python - Pandas change column value based on other column with lambda function - Stack Overflow

Top answer

1 of 1

I think you need to add axis=1 to your apply() call, to make the lambda function be executed for each row, instead of for each column, which is the default:

test["Structured Pos"] = test.apply(
    lambda x: "Freeform" if x["Coupa Type"] == "freeform" else "Structured PO",
    axis=1,
)

(You also need to use x["Coupa Type"] instead of test["Coupa Type"] in the Lambda function, as I've done above.)

A more efficient solution for this case, though, would be to do something like this:

test["Structured Pos"] = test["Coupa Type"].map({"freeform": "Freeform"}).fillna("Structured PO")

...because map replaces every value in the Series that is a key in the dictionary with the corresponding dictionary value, and replaces every value that is not a key with NaN, so you can use fillna to supply the default.
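The two steps can be seen separately in a short sketch. The values "catalog" and "punchout" are made-up stand-ins for any non-freeform type; the column names follow the answer.

```python
import pandas as pd

# Hypothetical sample data; only "freeform" is a key in the mapping dictionary.
test = pd.DataFrame({"Coupa Type": ["freeform", "catalog", "freeform", "punchout"]})

# Step 1: map() translates dictionary keys and leaves NaN for everything else.
mapped = test["Coupa Type"].map({"freeform": "Freeform"})

# Step 2: fillna() supplies the default for the unmatched rows.
test["Structured Pos"] = mapped.fillna("Structured PO")

print(test)
```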

Towards Data Science

towardsdatascience.com › home › latest › manipulating values in pandas dataframes

Manipulating Values in Pandas DataFrames | Towards Data Science

January 23, 2025 - Each element in the selected dataframe will be passed as argument into the lambda function. The values in the specified columns are now rounded to 2 decimal places: I hope this article has made it clear for you to decide when you should use the map(), apply(), or applymap() functions. In summary: If you want to modify a single column in a dataframe, use map()

GeeksforGeeks

geeksforgeeks.org › applying-lambda-functions-to-pandas-dataframe

Applying Lambda functions to Pandas Dataframe - GeeksforGeeks

August 9, 2024 - In this example, we will apply the lambda function Dataframe.assign() to a single column. The function is applied to the 'Total_Marks' column, and a new column 'Percentage' is formed with its help.
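A minimal sketch of that pattern is shown below. The sample rows and the assumption that marks are out of 500 are invented for illustration; only the `Total_Marks` and `Percentage` column names come from the snippet. Note that a lambda passed to `assign()` receives the whole DataFrame, not a single value.

```python
import pandas as pd

# Hypothetical data; the maximum of 500 marks is an assumption for this sketch.
df = pd.DataFrame({
    "Name": ["A", "B"],
    "Total_Marks": [400, 450],
})

# assign() returns a new DataFrame with the extra column;
# the lambda's argument `d` is the DataFrame itself.
df = df.assign(Percentage=lambda d: d["Total_Marks"] * 100 / 500)

print(df)
```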

GeeksforGeeks

geeksforgeeks.org › how-to-replace-values-in-column-based-on-condition-in-pandas

How to Replace Values in Column Based on Condition in Pandas? - GeeksforGeeks

November 15, 2024

import pandas as pd

# Data
Student = {
    'Name': ['John', 'Jay', 'sachin', 'Geetha', 'Amutha', 'ganesh'],
    'gender': ['male', 'male', 'male', 'female', 'female', 'male'],
    'math score': [50, 100, 70, 80, 75, 40],
    'test preparation': ['none', 'completed', 'none', 'completed', 'completed', 'none'],
}

# Creating a DataFrame object
df = pd.DataFrame(Student)

# Replacing 'female' with 0 using apply and lambda
df['gender'] = df['gender'].apply(lambda x: 0 if x == 'female' else x)
print(df)

...

     Name gender  math score test preparation
0    John   male          50             none
1     Jay   male         100        completed
2  sachin   male          70             none
3  Geetha      0          80        completed
4  Amutha      0          75        completed
5  ganesh   male          40             none

In this article, we’ve explored four effective methods to replace values in a Pandas DataFrame column based on conditions: using loc[], np.where(), masking, and apply() with a lambda function.

Stack Overflow

stackoverflow.com › questions › 46029107 › python-pandas-lambda-function-to-change-all-the-values-of-a-column

Python Pandas lambda function to change all the values of a column? - Stack Overflow

Top answer

1 of 1

You can use str.split

data['TYPE'] = data['AIRCRAFT'].str.split().str[0]

You get

    AIRCRAFT    TYPE
0   B738 (C-GKWJ)   B738
1   A321 (C-FJNX)   A321

You can also use str.extract though split is ideal

data['TYPE'] = data['AIRCRAFT'].str.extract(r'(\w+)\s\(', expand=False)

Stack Overflow

stackoverflow.com › questions › 12604909 › pandas-how-to-change-all-the-values-of-a-column

python - Pandas: how to change all the values of a column? - Stack Overflow

Top answer

1 of 3

178

As @DSM points out, you can do this more directly using the vectorised string methods:

df['Date'].str[-4:].astype(int)

Or using extract (assuming there is only one set of digits of length 4 somewhere in each string):

df['Date'].str.extract(r'(?P<year>\d{4})').astype(int)

An alternative slightly more flexible way, might be to use apply (or equivalently map) to do this:

df['Date'] = df['Date'].apply(lambda x: int(str(x)[-4:]))
             #  converts the last 4 characters of the string to an integer

The lambda function takes each value from the Date column and converts it to a year.
You could (and perhaps should) write this more verbosely as:

def convert_to_year(date_in_some_format):
    date_as_string = str(date_in_some_format)  # cast to string
    year_as_string = date_as_string[-4:]  # last four characters
    return int(year_as_string)

df['Date'] = df['Date'].apply(convert_to_year)

Perhaps 'Year' is a better name for this column...

2 of 3

You can do a column transformation by using apply

Define a clean function to remove the dollar and commas and convert your data to float.

def clean(x):
    x = x.replace("$", "").replace(",", "").replace(" ", "")
    return float(x)

Next, call it on your column like this.

data['Revenue'] = data['Revenue'].apply(clean)
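The same cleaning can also be sketched with the vectorized string methods instead of `apply`. This is an alternative swapped in here, not part of the answer; the sample values are made up, and the character class simply lists the three characters that `clean` removes.

```python
import pandas as pd

# Hypothetical revenue strings with dollar signs, commas, and spaces.
data = pd.DataFrame({"Revenue": ["$1,200", "$ 350", "$12,345.50"]})

# Remove "$", "," and " " in one pass, then convert the column to float.
data["Revenue"] = (
    data["Revenue"]
    .str.replace(r"[$, ]", "", regex=True)
    .astype(float)
)

print(data)
```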

Codecademy

codecademy.com › learn › decp-data-processing-pandas › modules › decp-modifying-data-frames-with-pandas › cheatsheet

Python Pandas for Data Engineers: Modifying DataFrames with Pandas Cheatsheet | Codecademy

# Apply this function to double every value in a specified column
df.column1 = df.column1.apply(double)

# Lambda functions can also be supplied to `apply()`
df.column2 = df.column2.apply(lambda x: 3*x)

# Applying to a row requires it to be called on the entire DataFrame
df['newColumn'] = df.apply(lambda row: row['column1'] * 1.5 + row['column2'], axis=1)

Pandas DataFrames allow for the addition of columns after the DataFrame has already been created, by using the format df['newColumn'] and setting it equal to the new column’s value.

Stack Overflow

stackoverflow.com › questions › 70954320 › how-to-use-lambda-to-change-the-values-in-dataframe

python - How to use lambda to change the values in dataframe? - Stack Overflow

Top answer

1 of 2

You have to assign the result with the = operator. apply() returns a new Series rather than modifying the column in place, so you have to store the result back in the column: Energy['Energy Supply'] = ....

Energy['Energy Supply'] = Energy['Energy Supply'].apply(lambda x: x*(10**6))
Energy.head()

2 of 2

You don't need to use apply, just use compound assignment operators:

Energy['Energy Supply'] *= 1e6

Stack Overflow

stackoverflow.com › questions › 34962104 › how-can-i-use-the-apply-function-for-a-single-column

python - How can I use the apply() function for a single column? - Stack Overflow

Top answer

1 of 8

711

Given a sample dataframe df as:

what you want is:

df['a'] = df['a'].apply(lambda x: x + 1)

that returns:

2 of 8

162

For a single column it is better to use map(), like this:

df = pd.DataFrame([{'a': 15, 'b': 15, 'c': 5}, {'a': 20, 'b': 10, 'c': 7}, {'a': 25, 'b': 30, 'c': 9}])

    a   b  c
0  15  15  5
1  20  10  7
2  25  30  9



df['a'] = df['a'].map(lambda a: a / 2.)

      a   b  c
0   7.5  15  5
1  10.0  10  7
2  12.5  30  9