dataframe' object has no attribute column

How to resolve AttributeError: 'DataFrame' object has no attribute

stackoverflow.com › questions › 38134643 › how-to-resolve-attributeerror-dataframe-object-has-no-attribute

Check your DataFrame with data.columns

It should print something like this

Index([u'regiment', u'company',  u'name',u'postTestScore'], dtype='object')

Check for hidden white spaces..Then you can rename with

data = data.rename(columns={'Number ': 'Number'})

Answer from Merlin on Stack Overflow

reddit.com › r/learnpython › "'dataframe' object has no attribute" issue

r/learnpython on Reddit: "'DataFrame' object has no attribute" Issue

September 15, 2019 -

I am in university and am taking a special topics class regarding AI. I have zero knowledge about Python, how it works, or what anything means.

A project for the class involves manipulating Bayesian networks to predict how many and which individuals die upon the sinking of a ship. This is the code I am supposed to manipulate:

##EDIT VARIABLES TO THE VARIABLES OF INTEREST
train_var = train.loc[:,['Survived','Sex']]  
test_var = test.loc[:,['Sex']]  
BayesNet = BayesianModel([('Sex','Survived')])

I am supposed to add another variable, 'Pclass,' to the mix, paying attention to the order for causation. I have added that variable to every line of this code in every way imaginable and consistently get an error from this line:

predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})
predictions

For example, the error I get for this version of the code:

train_var = train.loc[:,['Survived','Pclass','Sex']]  
test_var = test.loc[:,['Pclass']]  
BayesNet = BayesianModel([('Sex','Pclass','Survived')])

is this:

AttributeError                            Traceback (most recent call last)
<ipython-input-98-16d9eb9451f7> in <module>
----> 1 predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})
      2 predictions

/opt/conda/lib/python3.7/site-packages/pandas/core/generic.py in __getattr__(self, name)
   5137             if self._info_axis._can_hold_identifiers_and_holds_name(name):
   5138                 return self[name]
-> 5139             return object.__getattribute__(self, name)
   5140 
   5141     def __setattr__(self, name: str, value) -> None:

AttributeError: 'DataFrame' object has no attribute 'Survived'

Honestly, I have no idea wtf any of this means. I have tried googling this issue and have come up with nothing.

Any help would be greatly appreciated. I know it's a lot.

Top answer

1 of 2

Double check if there's a space in the column name. 'Survived ' vs 'Survived' It happens more often than you'd think especially with CSV data source.

2 of 2

It's an issue with how you're calling the data and if it's actually there.

train.loc[:,['Survived','Sex']]

tells me that there's a DataFrame (which is from pandas, hence the error) called train and this line is trying to access parts of that dataframe (it's just a type of an array). Specifically, it's trying to access columns named Survived and Sex.

Similarly, this line tells me there's another dataframe (df) known as test with a column named Sex and this is access that data.

test.loc[:,['Sex']]

The error code also informs me of some things

predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})

There's another df called predictions that's of dict type which is trying to access information from the another hypothesis df. The attribute it's tryin to access in the second key of the dict is

hypothesis.Survived.tolist()

which is a way of calling a column from that df. That is, when the predictions line is executed, it's trying to pull all the values from the Survived column of the hypothesis df.

The error is that the df doesn't actually have a column named Survived. So either there's missing data, or you're calling it wrong, or there's a missing reference.

Without knowing more about your code and your question, I can't really extrapolate much more.

Stack Overflow

stackoverflow.com › questions › 38134643 › how-to-resolve-attributeerror-dataframe-object-has-no-attribute

python - How to resolve AttributeError: 'DataFrame' object has no attribute - Stack Overflow

Top answer

1 of 7

Check your DataFrame with data.columns

It should print something like this

Index([u'regiment', u'company',  u'name',u'postTestScore'], dtype='object')

Check for hidden white spaces..Then you can rename with

data = data.rename(columns={'Number ': 'Number'})

2 of 7

I think the column name that contains "Number" is something like " Number" or "Number ". I'm assuming you might have a residual space in the column name. Please run print "<{}>".format(data.columns[1]) and see what you get. If it's something like < Number>, it can be fixed with:

data.columns = data.columns.str.strip()

See pandas.Series.str.strip

In general, AttributeError: 'DataFrame' object has no attribute '...', where ... is some column name, is caused because . notation has been used to reference a nonexistent column name or pandas method.

pandas methods are accessed with a .. pandas columns can also be accessed with a . (e.g. data.col) or with brackets (e.g. ['col'] or [['col1', 'col2']]).

data.columns = data.columns.str.strip() is a fast way to quickly remove leading and trailing spaces from all column names. Otherwise verify the column or attribute is correctly spelled.

Discussions

AttributeError: 'DataFrame' object has no attribute 'name'; Various stack overflow / github suggested fixes not working

AttributeError: 'DataFrame' object has no attribute 'name'; Various stack overflow / github suggested fixes not working#8624 ... I perform a pipeline of transformations on a Dask dataframe originating from dd.read_sql_table() from a view in an oracle DB. In one stage that follows many successful stages, I try to apply a simple class method to a column ... More on github.com

github.com

January 26, 2022

python - 'DataFrame' object has no attribute 'withColumn' - Stack Overflow

1 Pyspark, TypeError: 'Column' object is not callable · 4 Pyspark - withColumn is not working while calling on empty dataframe More on stackoverflow.com

stackoverflow.com

python - 'DataFrame' object has no attribute 'column' - Stack Overflow

I'm trying to retrieve the column names of a dataframe, all in lower case, but that's giving me the following Attribute Error: 'DataFrame' object has no attribute 'column'. import pandas as pd df ... More on stackoverflow.com

stackoverflow.com

python - AttributeError: 'DataFrame' object has no attribute - Stack Overflow

I keep getting different attribute errors when trying to run this file in ipython...beginner with pandas so maybe I'm missing something Code: from pandas import Series, DataFrame import pandas a... More on stackoverflow.com

stackoverflow.com

Stack Exchange

datascience.stackexchange.com › questions › 37435 › i-got-the-following-error-dataframe-object-has-no-attribute-data

python - I got the following error : 'DataFrame' object has no attribute 'data' - Data Science Stack Exchange

Top answer

1 of 5

"sklearn.datasets" is a scikit package, where it contains a method load_iris().

load_iris(), by default return an object which holds data, target and other members in it. In order to get actual values you have to read the data and target content itself.

Whereas 'iris.csv', holds feature and target together.

FYI: If you set return_X_y as True in load_iris(), then you will directly get features and target.

from sklearn import datasets
data,target = datasets.load_iris(return_X_y=True)

2 of 5

The Iris Dataset from Sklearn is in Sklearn's Bunch format:

print(type(iris))
print(iris.keys())

output:

<class 'sklearn.utils.Bunch'>
dict_keys(['data', 'target', 'target_names', 'DESCR', 'feature_names', 'filename'])

So, that's why you can access it as:

x=iris.data
y=iris.target

But when you read the CSV file as DataFrame as mentioned by you:

iris = pd.read_csv('iris.csv',header=None).iloc[:,2:4]
iris.head()

output is:

    2   3
0   petal_length    petal_width
1   1.4 0.2
2   1.4 0.2
3   1.3 0.2
4   1.5 0.2

Here the column names are '1' and '2'.

First of all you should read the CSV file as:

df = pd.read_csv('iris.csv')

you should not include header=None as your csv file includes the column names i.e. the headers.

So, now what you can do is something like this:

X = df.iloc[:, [2, 3]] # Will give you columns 2 and 3 i.e 'petal_length' and 'petal_width'
y = df.iloc[:, 4] # Label column i.e 'species'

or if you want to use the column names then:

X = df[['petal_length', 'petal_width']]
y = df.iloc['species']

Also, if you want to convert labels from string to numerical format use sklearn LabelEncoder

from sklearn import preprocessing
le = preprocessing.LabelEncoder()
y = le.fit_transform(y)

GitHub

github.com › dask › dask › issues › 8624

AttributeError: 'DataFrame' object has no attribute 'name'; Various stack overflow / github suggested fixes not working · Issue #8624 · dask/dask

January 26, 2022 - { File "[redacted]/pandas/core/generic.py", line 5487, in __getattr__ return object.__getattribute__(self, name) AttributeError: 'DataFrame' object has no attribute 'name'. Did you mean: 'rename'? } When I tried to inspect the dataframe object having this column on which the .apply() was not working, to see if there was a defect in this dataframe object as an artifact of upstream operations, I get this error (secondary issue):

Author david-thrower

Saturn Cloud

saturncloud.io › blog › solving-the-dataframe-object-has-no-attribute-name-error-in-pandas

Solving the 'DataFrame Object Has No Attribute 'name' Error in Pandas | Saturn Cloud Blog

November 3, 2023 - This is because a DataFrame as a whole does not have a 'name' attribute. In Pandas, the name attribute is associated with Series objects, not DataFrame objects. A Series object represents a single column or row in a DataFrame, and its name attribute ...

Stack Overflow

stackoverflow.com › questions › 56988316 › dataframe-object-has-no-attribute-withcolumn

python - 'DataFrame' object has no attribute 'withColumn' - Stack Overflow

Top answer

1 of 3

You mixed up pandas dataframe and Spark dataframe.

The issue is pandas df doesn't have spark function withColumn.

2 of 3

I figured it out. Thanks for the help.

def res(df):
    if df['data_type_x'] == df['data_type_y']:
        return 'no change'
    elif pd.isnull(df['data_type_x']):
        return 'new attribute'
    elif pd.isnull(df['data_type_y']):
        return 'deleted attribute'
    elif df['data_type_x'] != df['data_type_y'] and not pd.isnull(df['data_type_x']) and not pd.isnull(df['data_type_y']):
        return 'datatype change'

pd_merge['result'] = pd_merge.apply(res, axis = 1)

Databricks Community

community.databricks.com › t5 › data-engineering › attributeerror-dataframe-object-has-no-attribute-rename › td-p › 28109

Solved: AttributeError: 'DataFrame' object has no attribut... - Databricks Community - 28109

January 2, 2024 - https://stackoverflow.com/questions/38134643/data-frame-object-has-no-attribute ... If df_boston is a DataFrame, but you still face issues, try an alternative syntax: df_boston = df_boston.rename(columns={'zn': 'Zoning'}).

Brainly

brainly.com › computers and technology › high school › how to resolve attributeerror: 'dataframe' object has no attribute?

[FREE] How to resolve AttributeError: 'DataFrame' object has no attribute? - brainly.com

The AttributeError: 'DataFrame' object has no attribute is an error message that indicates that the code is trying to access an attribute or method that does not exist on the DataFrame object.

Find elsewhere

Google Bing Mojeek

Databricks Community

community.databricks.com › t5 › data-engineering › attributeerror-dataframe-object-has-no-attribute › td-p › 61132

AttributeError: 'DataFrame' object has no attribut... - Databricks Community - 61132

February 19, 2024 - Hello, I have some trouble deduplicating rows on the "id" column, with the method "dropDuplicatesWithinWatermark" in a pipeline. When I run this pipeline, I get the error message: "AttributeError: 'DataFrame' object has no attribute 'dropDuplicatesWithinWatermark'" Here is part of the code: @dl...

Stack Overflow

stackoverflow.com › questions › 75053578 › dataframe-object-has-no-attribute-column

python - 'DataFrame' object has no attribute 'column' - Stack Overflow

Top answer

1 of 1

IIUC, replace "column" by "columns" and use the str.lower accessor:

col_names = df.columns.str.lower().tolist()

Stack Overflow

stackoverflow.com › questions › 19392226 › attributeerror-dataframe-object-has-no-attribute

python - AttributeError: 'DataFrame' object has no attribute - Stack Overflow

Top answer

1 of 6

value_counts is a Series method rather than a DataFrame method (and you are trying to use it on a DataFrame, clean). You need to perform this on a specific column:

clean[column_name].value_counts()

It doesn't usually make sense to perform value_counts on a DataFrame, though I suppose you could apply it to every entry by flattening the underlying values array:

pd.value_counts(df.values.flatten())

2 of 6

To get all the counts for all the columns in a dataframe, it's just df.count()

Edureka Community

edureka.co › home › community › categories › python › python pandas error attributeerror dataframe ...

Python Pandas error AttributeError DataFrame object has no attribute rows | Edureka Community

March 28, 2019 - I am trying to print each entry of the dataframe separately. The dataframe is created by reading ... : 'DataFrame' object has no attribute 'rows'

Stack Overflow

stackoverflow.com › questions › 46169022 › dataframe-object-has-no-attribute-col-name

python - 'DataFrame' object has no attribute 'col_name' - Stack Overflow

Top answer

1 of 3

I think read_table have default separator tab, so is necessary define separator parameter:

x = pd.read_table('path to csv', sep=',')

Or use read_csv with default separator ,, so sep: can be omit.

x = pd.read_csv('path to csv')

2 of 3

Try to strip the potential whitespaces around the column name with this:

x.columns = [col.strip() for col in x.columns.tolist()]

Or as suggested in the documenation here and highlighted in @jezrael's answer:

x.columns = x.columns.str.strip()

Then, you will be able to access columns with x.col1..x.coln. Also be aware that column names are case sensitive.

Example:

>>> import pandas as pd 
>>> df = pd.DataFrame([[1,2],[3,4]], columns=[' col1', 'col2 '])
>>> df
    col1  col2 
0      1      2
1      3      4
>>> df.col1
Traceback (most recent call last):
..    return object.__getattribute__(self, name)
AttributeError: 'DataFrame' object has no attribute 'col1'
>>> df.col2 
Traceback (most recent call last):
...    return object.__getattribute__(self, name)
AttributeError: 'DataFrame' object has no attribute 'col2'
>>> df.columns = [col.strip() for col in df.columns.tolist()]
>>> df.col1
0    1
1    3
Name: col1, dtype: int64
>>> df.col2 
0    2
1    4
Name: col2, dtype: int64
>>>

reddit.com › r/learnpython › attributeerror: 'dataframe' object has no attribute 'data'

r/learnpython on Reddit: AttributeError: 'DataFrame' object has no attribute 'data'

September 29, 2021 -

wine = pd.read_csv("combined.csv", header=0).iloc[:-1]
df = pd.DataFrame(wine)
df
dataset = pd.DataFrame(df.data, columns =df.feature_names)
dataset['target']=df.target
dataset

ERROR:

<ipython-input-27-64122078da92> in <module>
----> 1 dataset = pd.DataFrame(df.data, columns =df.feature_names)
      2 dataset['target']=df.target
      3 dataset

D:\Anaconda\lib\site-packages\pandas\core\generic.py in __getattr__(self, name)
   5463             if self._info_axis._can_hold_identifiers_and_holds_name(name):
   5464                 return self[name]
-> 5465             return object.__getattribute__(self, name)
   5466 
   5467     def __setattr__(self, name: str, value) -> None:

AttributeError: 'DataFrame' object has no attribute 'data'

I'm trying to set up a target to proceed with my Multi Linear Regression Project, but I can't even do that. I've already downloaded the CSV file and have it uploaded on a Jupyter Notebook. What I'm I doing wrong?

Top answer

1 of 3

I would recommend you start off by reading the 10min pandas Quick Guide . This should get you on the right track (at the very least you will be able to load data into pandas). Afterwards, you should also be able to tell why your current code doesn't make much sense.

2 of 3

Is there a column in your file called data? you already have a df from the first line. Maybe you just want to rename the columns. df.rename(columns={})

Stack Overflow

stackoverflow.com › questions › 59804985 › attributeerror-dataframe-object-has-no-attribute-column

python - AttributeError: 'DataFrame' object has no attribute 'column' - Stack Overflow

Top answer

1 of 1

that is just because you wrote column instead of columns in the third rename:

mapping = {merged_df.columns[0]:'Index', merged_df.columns[1]: 'application_date', merged_df.column[2]:'Nifty50returns'}

Stack Overflow

stackoverflow.com › questions › 28163439 › attributeerror-dataframe-object-has-no-attribute-height › 28163504

python 2.7 - AttributeError: 'DataFrame' object has no attribute 'Height' - Stack Overflow

Top answer

1 of 3

I have run into a similar issue before when reading from csv. Assuming it is the same:

col_name =df.columns[0]
df=df.rename(columns = {col_name:'new_name'})

The error in my case was caused by (I think) by a byte order marker in the csv or some other non-printing character being added to the first column label. df.columns returns an array of the column names. df.columns[0] gets the first one. Try printing it and seeing if something is odd with the results.

2 of 3

PS On above answer by JAB - if there is clearly spaces in your column names use skipinitialspace=True in read_csv e.g.

df = pd.read_csv('/path../NavieBayes.csv',skipinitialspace=True)

GitHub

github.com › pandas-profiling › pandas-profiling › issues › 183

AttributeError: 'DataFrame' object has no attribute 'profile_report' · Issue #183 · ydataai/ydata-profiling

March 22, 2019 - AttributeError: 'DataFrame' object has no attribute 'profile_report'#183 · Copy link · Labels · bug 🐛Something isn't workingSomething isn't working · bdch1234 · opened · on Jun 22, 2019 · Issue body actions · Describe the bug Running the example in readme generates an error. To Reproduce Running: import numpy as np import pandas as pd import pandas_profiling df = pd.DataFrame( np.random.rand(100, 5), columns=['a', 'b', 'c', 'd', 'e'] ) df.profile_report() in a Jupyter notebook gives: --------------------------------------------------------------------------- AttributeError Traceback

Author bdch1234

Kaggle

kaggle.com › general › 108926

AttributeError: 'DataFrame' object has no attribute 'dtype' ...

Click here if you are not automatically redirected after 5 seconds.

Kaggle

kaggle.com › questions-and-answers › 266588

'DataFrame' object has no attribute 'target' | Kaggle

'DataFrame' object has no attribute 'target'

Easy Tweaks

easytweaks.com › fix-attributeerror-dataframe-object-pandas

Fix attributeerror 'dataframe' object has no attribute errors ...

January 9, 2023 - Verifying that you are not a robot