polars attributeerror: 'dataframe' object has no attribute 'apply'

stackoverflow.com › questions › 78753820 › using-polars-with-python-and-being-thrown-the-following-exception-attributeerro

dataframe - Using Polars with Python and being thrown the following exception: AttributeError: 'Expr' object has no attribute 'apply' - Stack Overflow

1 of 1

apply was renamed to .map_elements() some time ago.

Previous versions printed a deprecation warning, but it was eventually removed after a grace period.

You're likely looking at the docs for an older version of Polars, but there is a "version switcher" on the docs site:

As for the actual task, you can also do it natively using .dt.to_string()

import datetime
import polars as pl

pl.select(
   pl.lit(str(datetime.datetime.now()))
     .str.to_datetime()
     .dt.to_string("%A")
)

shape: (1, 1)
┌─────────┐
│ literal │
│ ---     │
│ str     │
╞═════════╡
│ Tuesday │
└─────────┘

github.com › pola-rs › polars › issues › 24781

Polars.Exp.apply has no attribute apply · Issue #24781 · pola-rs/polars

October 6, 2025 - I have confirmed this bug exists on the latest version of Polars. df = pl.DataFrame( { "a": [1, 2, 3, 1], "b": ["a", "b", "c", "c"], } ) df.with_columns( pl.col("a").apply(lambda x: x * 2).alias("a_times_2"), ) **AttributeError: 'Expr' object has no attribute 'apply'** I wanted to use the function apply in my code, but I realized that I receive an error nonethelss .

Author pola-rs

Discussions

python - Using apply in polars - Stack Overflow

8 Using Polars with Python and being thrown the following exception: AttributeError: 'Expr' object has no attribute 'apply' More on stackoverflow.com

stackoverflow.com

ColumnTransformer.fit() fails on polars.DataFrame: AttributeError: 'DataFrame' object has no attribute 'size'

Describe the bug Fitting a sklearn.compose.ColumnTransformer with more than one transformer on a polars.DataFrame yields the error: AttributeError: 'DataFrame' object has no attribute '... More on github.com

github.com

September 11, 2025

AttributeError: 'DataFrame' object has no attribute 'get'

Couldn't load subscription status. Retry · There was an error while loading. Please reload this page More on github.com

github.com

June 6, 2023

'Expr' object has no attribute 'apply'

Description pl.col('fild1').apply( base64解密的代码 ) 会报错'Expr' object has no attribute 'apply'是为啥 Link No response More on github.com

github.com

December 23, 2024

github.com › pola-rs › polars › issues › 10744

Rename all `apply` functions to `map_*` · Issue #10744 · pola-rs/polars

August 26, 2023 - In #10678, Ritchie mentions the reason for moving away from the apply naming: Pandas apply will have opposite behavior of polars. We chose apply because people were so familiar with it in pandas. N...

Author pola-rs

JetBrains

youtrack.jetbrains.com › issue › PY-63683 › Polars-DataFrames-cant-be-viewed-as-DataFrame-in-PyCharm-Community

Polars DataFrames can't be viewed 'as ...

{{ (>_<) }} This version of your browser is not supported. Try upgrading to the latest stable version. Something went seriously wrong

docs.pola.rs › api › python › version › 0.18 › reference › expressions › api › polars.Expr.apply.html

polars.Expr.apply — Polars documentation

Apply a custom/user-defined function (UDF) in a GroupBy or Projection context · This method is much slower than the native expressions API. Only use it if you cannot implement your logic otherwise

stackoverflow.com › questions › 78813399 › using-apply-in-polars

python - Using apply in polars - Stack Overflow

1 of 2

pl.Expr.apply was deprecated in favour of pl.Expr.map_elements in Polars release 0.19.0. Recently, pl.Expr.apply was removed in the release of Polars 1.0.0.

You can adapt your code to the new version as follows.

Copydf.with_columns(
    pl.col("AH_PROC_REALIZADO")
    .map_elements(get_procedure_description, return_dtype=pl.String)
    .alias("proced_descr")
)

2 of 2

If you really want to apply python function then you can use map_elements(). However, using native polars expression is always preferrable.

In your case I'd suggest to look at replace() or replace_strict().

If you would want to just search by AH_PROC_REALIZADO column you could use simple replace_strict():

Copydf = pl.DataFrame({
    "AH_PROC_REALIZADO": ["30408", "410010065", "410010111", "XXXX"]
})

┌───────────────────┐
│ AH_PROC_REALIZADO │
│ ---               │
│ str               │
╞═══════════════════╡
│ 30408             │
│ 410010065         │
│ 410010111         │
│ XXXX              │
└───────────────────┘

df.with_columns(
    pl.col("AH_PROC_REALIZADO")
    .replace_strict(proceds, default=None)
    .alias("proced_descr")
)

┌───────────────────┬────────────────────────────────┐
│ AH_PROC_REALIZADO ┆ proced_descr                   │
│ ---               ┆ ---                            │
│ str               ┆ str                            │
╞═══════════════════╪════════════════════════════════╡
│ 30408             ┆ QUIMIOTERAPIA                  │
│ 410010065         ┆ MASTECTOMIA SIMPLES            │
│ 410010111         ┆ SETORECTOMIA / QUADRANTECTOMIA │
│ XXXX              ┆ null                           │
└───────────────────┴────────────────────────────────┘

The problem with your use case is that, as far as I understand, you want to search by prefix of the strings in AH_PROC_REALIZADO column. In that case you could probably adjust the solution to:

itertools.groupby() to transform proceds dictionary into dictionary of dictionaries where high level keys are length of the key.
replace_strict() to search for product description.
coalesce() to combine results into final column.

Copyfrom itertools import groupby

mappings = {k: dict(g) for k, g in groupby(proceds.items(), lambda x: len(x[0]))}

df = pl.DataFrame({
    "AH_PROC_REALIZADO": ["30408_____", "410010065_____", "410010111____", "XXXX"]
})

┌───────────────────┐
│ AH_PROC_REALIZADO │
│ ---               │
│ str               │
╞═══════════════════╡
│ 30408_____        │
│ 410010065_____    │
│ 410010111____     │
│ XXXX              │
└───────────────────┘

df.with_columns(
    pl.coalesce(
        pl.col("AH_PROC_REALIZADO").str.head(k).replace_strict(m, default=None) for k, m in mappings.items()
    )
    .alias("proced_descr")
)

┌───────────────────┬────────────────────────────────┐
│ AH_PROC_REALIZADO ┆ proced_descr                   │
│ ---               ┆ ---                            │
│ str               ┆ str                            │
╞═══════════════════╪════════════════════════════════╡
│ 30408_____        ┆ QUIMIOTERAPIA                  │
│ 410010065_____    ┆ MASTECTOMIA SIMPLES            │
│ 410010111____     ┆ SETORECTOMIA / QUADRANTECTOMIA │
│ XXXX              ┆ null                           │
└───────────────────┴────────────────────────────────┘

github.com › scikit-learn › scikit-learn › issues › 32155

ColumnTransformer.fit() fails on polars.DataFrame: AttributeError: 'DataFrame' object has no attribute 'size' · Issue #32155 · scikit-learn/scikit-learn

September 11, 2025 - Describe the bug Fitting a sklearn.compose.ColumnTransformer with more than one transformer on a polars.DataFrame yields the error: AttributeError: 'DataFrame' object has no attribute '...

Author scikit-learn

JetBrains

youtrack.jetbrains.com › issue › DS-5668 › Polars-tables-are-broken-in-data-vision-with-polars-0.16.13

Polars tables are broken in data vision with polars 0.16.13

{{ (>_<) }} This version of your browser is not supported. Try upgrading to the latest stable version. Something went seriously wrong

Find elsewhere

Google Bing Mojeek

github.com › mwaskom › seaborn › issues › 3379

AttributeError: 'DataFrame' object has no attribute 'get' · Issue #3379 · mwaskom/seaborn

June 6, 2023 - AttributeError: 'DataFrame' object has no attribute 'get'#3379 · Copy link · nick-youngblut · opened · on Jun 6, 2023 · Issue body actions · It appears that seaborn somewhat supports polars.DataFrame objects, such as: iris = pl.read_csv("data/iris.csv") p = sns.displot( data=iris, x="sepal_width", hue="species", col="species", height=3, aspect=1, alpha=1 ) ...but not fully: p = sns.catplot( data=iris, x="species", y="sepal_width", kind="box", height=3, aspect=1.3 ) The error: 3185 p = _CategoricalPlotter() 3186 p.require_numeric = plotter_class.require_numeric -> 3187 p.establish_variables(x_, y_, hue, data, orient, order, hue_order) 3188 if ( 3189 order is not None 3190 or (sharex and p.orient == "v") 3191 or (sharey and p.orient == "h") 3192 ): 3193 # Sync categorical axis between facets to have the same categories 3194 order = p.group_names ...

Author mwaskom

docs.pola.rs › api › python › version › 0.18 › reference › dataframe › api › polars.DataFrame.apply.html

polars.DataFrame.apply — Polars documentation

Apply a custom/user-defined function (UDF) over the rows of the DataFrame · This method is much slower than the native expressions API. Only use it if you cannot implement your logic otherwise

docs.pola.rs › api › python › version › 0.18 › reference › expressions › api › polars.Expr.map.html

polars.Expr.map — Polars documentation

Apply a custom python function to a Series or sequence of Series · The output of this custom function must be a Series. If you want to apply a custom function elementwise over single values, see apply(). A use case for map is when you want to transform an expression with a third-party library

github.com › pola-rs › polars › issues › 20410

'Expr' object has no attribute 'apply' · Issue #20410 · pola-rs/polars

December 23, 2024 - Description pl.col('fild1').apply( base64解密的代码 ) 会报错'Expr' object has no attribute 'apply'是为啥 Link No response

Author pola-rs

github.com › pola-rs › polars › issues › 6117

Rationalize with_column & with_columns · Issue #6117 · pola-rs/polars

January 8, 2023 - Problem description Initial discussion was on Discord, moving here so everyone can join. The DataFrame methods with_column and with_columns tend to be used quite often, as these are, together with select, the primary way to run expressio...

Author pola-rs

stackoverflow.com › questions › 38134643 › how-to-resolve-attributeerror-dataframe-object-has-no-attribute

python - How to resolve AttributeError: 'DataFrame' object has no attribute - Stack Overflow

1 of 7

Check your DataFrame with data.columns

It should print something like this

Index([u'regiment', u'company',  u'name',u'postTestScore'], dtype='object')

Check for hidden white spaces..Then you can rename with

data = data.rename(columns={'Number ': 'Number'})

2 of 7

I think the column name that contains "Number" is something like " Number" or "Number ". I'm assuming you might have a residual space in the column name. Please run print "<{}>".format(data.columns[1]) and see what you get. If it's something like < Number>, it can be fixed with:

data.columns = data.columns.str.strip()

See pandas.Series.str.strip

In general, AttributeError: 'DataFrame' object has no attribute '...', where ... is some column name, is caused because . notation has been used to reference a nonexistent column name or pandas method.

pandas methods are accessed with a .. pandas columns can also be accessed with a . (e.g. data.col) or with brackets (e.g. ['col'] or [['col1', 'col2']]).

data.columns = data.columns.str.strip() is a fast way to quickly remove leading and trailing spaces from all column names. Otherwise verify the column or attribute is correctly spelled.

github.com › pola-rs › polars › issues › 3481

Lazy join fails with `AttributeError: _ldf` · Issue #3481 · pola-rs/polars

May 23, 2022 - A join between two dataframes works in eager mode, but fails in lazy mode with an AttributeError: _ldf. import polars as pl df_left = pl.DataFrame({ "Id": [1, 2, 3, 4], "Names": ["A", "B", "C", "D"], }) df_right = pl.DataFrame({ "Id": [1, 3], "Tags": ["xxx", "yyy"] }) print(df_left.join(df_right, on="Id")) # Works print(df_left.lazy().join(df_right, on="Id")) # Fails!

Author pola-rs

github.com › pola-rs › polars › issues › 4740

set_with_columns_kwargs() does not exist, but recommended · Issue #4740 · pola-rs/polars

September 6, 2022 - Polars version checks I have checked that this issue has not already been reported. I have confirmed this bug exists on the latest version of polars. Issue Description .with_columns(**named_kwargs) throws an error RuntimeError: **kwargs ...

Author pola-rs

stackoverflow.com › questions › 77361799 › attributeerror-dataframe-object-has-no-attribute-group-by › 77361877

python - AttributeError: 'DataFrame' object has no attribute 'group_by' - Stack Overflow

1 of 1

On checking I found my polars version :

pl.__version__

0.17.3

https://pola-rs.github.io/polars/py-polars/html/reference/dataframe/api/polars.DataFrame.groupby.html

I need to do:

df.groupby("a").agg(pl.col("b").sum())  # there is no underscore in groupby

#output

shape: (3, 2)
a   b
str i64
"a" 2
"c" 3
"b" 5

and the document says :

Deprecated since version 0.19.0: This method has been renamed to DataFrame.group_by().

This is the new document for polars version 0.19

https://pola-rs.github.io/polars/py-polars/html/reference/dataframe/api/polars.DataFrame.group_by.html#polars-dataframe-group-by

github.com › pola-rs › polars › issues › 6080

Unreliable schema for datetime columns and error in .glimpse() · Issue #6080 · pola-rs/polars

January 6, 2023 - Polars does not seem to handle python datetime and pd.datetime objects reliably. ... import pandas as pd import numpy as np pd.DataFrame({'Date':['2023-01-01']}).astype({'Date':'datetime64[ns]'}).pipe(pl.from_pandas).glimpse()

Author pola-rs

reddit.com › r/learnpython › "'dataframe' object has no attribute" issue

r/learnpython on Reddit: "'DataFrame' object has no attribute" Issue

October 30, 2020 -

I am in university and am taking a special topics class regarding AI. I have zero knowledge about Python, how it works, or what anything means.

A project for the class involves manipulating Bayesian networks to predict how many and which individuals die upon the sinking of a ship. This is the code I am supposed to manipulate:

##EDIT VARIABLES TO THE VARIABLES OF INTEREST
train_var = train.loc[:,['Survived','Sex']]  
test_var = test.loc[:,['Sex']]  
BayesNet = BayesianModel([('Sex','Survived')])

I am supposed to add another variable, 'Pclass,' to the mix, paying attention to the order for causation. I have added that variable to every line of this code in every way imaginable and consistently get an error from this line:

predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})
predictions

For example, the error I get for this version of the code:

train_var = train.loc[:,['Survived','Pclass','Sex']]  
test_var = test.loc[:,['Pclass']]  
BayesNet = BayesianModel([('Sex','Pclass','Survived')])

is this:

AttributeError                            Traceback (most recent call last)
<ipython-input-98-16d9eb9451f7> in <module>
----> 1 predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})
      2 predictions

/opt/conda/lib/python3.7/site-packages/pandas/core/generic.py in __getattr__(self, name)
   5137             if self._info_axis._can_hold_identifiers_and_holds_name(name):
   5138                 return self[name]
-> 5139             return object.__getattribute__(self, name)
   5140 
   5141     def __setattr__(self, name: str, value) -> None:

AttributeError: 'DataFrame' object has no attribute 'Survived'

Honestly, I have no idea wtf any of this means. I have tried googling this issue and have come up with nothing.

Any help would be greatly appreciated. I know it's a lot.

1 of 2

Double check if there's a space in the column name. 'Survived ' vs 'Survived' It happens more often than you'd think especially with CSV data source.

2 of 2

It's an issue with how you're calling the data and if it's actually there.

train.loc[:,['Survived','Sex']]

tells me that there's a DataFrame (which is from pandas, hence the error) called train and this line is trying to access parts of that dataframe (it's just a type of an array). Specifically, it's trying to access columns named Survived and Sex.

Similarly, this line tells me there's another dataframe (df) known as test with a column named Sex and this is access that data.

test.loc[:,['Sex']]

The error code also informs me of some things

predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})

There's another df called predictions that's of dict type which is trying to access information from the another hypothesis df. The attribute it's tryin to access in the second key of the dict is

hypothesis.Survived.tolist()

which is a way of calling a column from that df. That is, when the predictions line is executed, it's trying to pull all the values from the Survived column of the hypothesis df.

The error is that the df doesn't actually have a column named Survived. So either there's missing data, or you're calling it wrong, or there's a missing reference.

Without knowing more about your code and your question, I can't really extrapolate much more.