The type of your dataframe is pyspark.sql.DataFrame that doesn't have .to_json function. What you need is Pandas DataFrame object. You can use .toPandas() function (df1.toPandas().to_json...) to convert from PySpark's DataFrame to Pandas DataFrame, but it will work if the size of your data will fit into memory of the driver.

Answer from Alex Ott on Stack Overflow
Top answer
1 of 5
2

"sklearn.datasets" is a scikit package, where it contains a method load_iris().

load_iris(), by default return an object which holds data, target and other members in it. In order to get actual values you have to read the data and target content itself.

Whereas 'iris.csv', holds feature and target together.

FYI: If you set return_X_y as True in load_iris(), then you will directly get features and target.

from sklearn import datasets
data,target = datasets.load_iris(return_X_y=True)
2 of 5
1

The Iris Dataset from Sklearn is in Sklearn's Bunch format:

print(type(iris))
print(iris.keys())

output:

<class 'sklearn.utils.Bunch'>
dict_keys(['data', 'target', 'target_names', 'DESCR', 'feature_names', 'filename'])

So, that's why you can access it as:

x=iris.data
y=iris.target

But when you read the CSV file as DataFrame as mentioned by you:

iris = pd.read_csv('iris.csv',header=None).iloc[:,2:4]
iris.head()

output is:

    2   3
0   petal_length    petal_width
1   1.4 0.2
2   1.4 0.2
3   1.3 0.2
4   1.5 0.2

Here the column names are '1' and '2'.

First of all you should read the CSV file as:

df = pd.read_csv('iris.csv')

you should not include header=None as your csv file includes the column names i.e. the headers.

So, now what you can do is something like this:

X = df.iloc[:, [2, 3]] # Will give you columns 2 and 3 i.e 'petal_length' and 'petal_width'
y = df.iloc[:, 4] # Label column i.e 'species'

or if you want to use the column names then:

X = df[['petal_length', 'petal_width']]
y = df.iloc['species']

Also, if you want to convert labels from string to numerical format use sklearn LabelEncoder

from sklearn import preprocessing
le = preprocessing.LabelEncoder()
y = le.fit_transform(y)
Discussions

python - How to resolve AttributeError: 'DataFrame' object has no attribute - Stack Overflow
I know that this kind of question was asked before and I've checked all the answers and I have tried several times to find a solution but in vain. In fact I call a Dataframe using Pandas. I've upl... More on stackoverflow.com
🌐 stackoverflow.com
BUG: read_json -> 'DataFrame' object has no attribute 'dtype'
Came across this when creating #4376. Not sure what's going on as I was under the impression that DataFrame always has a dtype attribute. The error only seems to occur when the columns are not ... More on github.com
🌐 github.com
4
July 27, 2013
"'DataFrame' object has no attribute" Issue

Double check if there's a space in the column name. 'Survived ' vs 'Survived' It happens more often than you'd think especially with CSV data source.

More on reddit.com
🌐 r/learnpython
8
1
October 30, 2020
AttributeError: 'DataFrame' object has no attribute 'data'
I would recommend you start off by reading the 10min pandas Quick Guide . This should get you on the right track (at the very least you will be able to load data into pandas). Afterwards, you should also be able to tell why your current code doesn't make much sense. More on reddit.com
🌐 r/learnpython
4
0
September 29, 2021
🌐
GitHub
github.com › dask › dask › issues › 8624
AttributeError: 'DataFrame' object has no attribute 'name'; Various stack overflow / github suggested fixes not working · Issue #8624 · dask/dask
January 26, 2022 - The method being applied should return a Dask dataframe of 3 np.int64 columns: state_id, city_id, district_id. inspect.getmembers(my_dask_df) was expected to return a list of object names and values to introspect the dask dataframe object.
Author   david-thrower
🌐
GitHub
github.com › pandas-dev › pandas › issues › 4377
BUG: read_json -> 'DataFrame' object has no attribute 'dtype' · Issue #4377 · pandas-dev/pandas
July 27, 2013 - Came across this when creating #4376. Not sure what's going on as I was under the impression that DataFrame always has a dtype attribute. The error only seems to occur when the columns are not ...
Author   Komnomnomnom
🌐
Databricks Community
community.databricks.com › t5 › data-engineering › attributeerror-dataframe-object-has-no-attribute › td-p › 61132
AttributeError: 'DataFrame' object has no attribut... - Databricks Community - 61132
February 19, 2024 - Hello, I have some trouble deduplicating rows on the "id" column, with the method "dropDuplicatesWithinWatermark" in a pipeline. When I run this pipeline, I get the error message: "AttributeError: 'DataFrame' object has no attribute 'dropDuplicatesWithinWatermark'" Here is part of the code: @dl...
🌐
Reddit
reddit.com › r/learnpython › "'dataframe' object has no attribute" issue
r/learnpython on Reddit: "'DataFrame' object has no attribute" Issue
October 30, 2020 -

I am in university and am taking a special topics class regarding AI. I have zero knowledge about Python, how it works, or what anything means.

A project for the class involves manipulating Bayesian networks to predict how many and which individuals die upon the sinking of a ship. This is the code I am supposed to manipulate:

##EDIT VARIABLES TO THE VARIABLES OF INTEREST
train_var = train.loc[:,['Survived','Sex']]  
test_var = test.loc[:,['Sex']]  
BayesNet = BayesianModel([('Sex','Survived')])

I am supposed to add another variable, 'Pclass,' to the mix, paying attention to the order for causation. I have added that variable to every line of this code in every way imaginable and consistently get an error from this line:

predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})
predictions

For example, the error I get for this version of the code:

train_var = train.loc[:,['Survived','Pclass','Sex']]  
test_var = test.loc[:,['Pclass']]  
BayesNet = BayesianModel([('Sex','Pclass','Survived')])

is this:

AttributeError                            Traceback (most recent call last)
<ipython-input-98-16d9eb9451f7> in <module>
----> 1 predictions = pandas.DataFrame({'PassengerId': test.PassengerId,'Survived': hypothesis.Survived.tolist()})
      2 predictions

/opt/conda/lib/python3.7/site-packages/pandas/core/generic.py in __getattr__(self, name)
   5137             if self._info_axis._can_hold_identifiers_and_holds_name(name):
   5138                 return self[name]
-> 5139             return object.__getattribute__(self, name)
   5140 
   5141     def __setattr__(self, name: str, value) -> None:

AttributeError: 'DataFrame' object has no attribute 'Survived'

Honestly, I have no idea wtf any of this means. I have tried googling this issue and have come up with nothing.

Any help would be greatly appreciated. I know it's a lot.

🌐
Saturn Cloud
saturncloud.io › blog › solving-the-dataframe-object-has-no-attribute-name-error-in-pandas
Solving the 'DataFrame Object Has No Attribute 'name' Error in Pandas | Saturn Cloud Blog
November 2, 2023 - The 'DataFrame' object has no attribute 'name' error typically occurs when you try to access a DataFrame’s 'name' attribute, which doesn’t exist. import pandas as pd # Create a dataframe with 2 columns named 'A' and 'B' df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]}) print(df.name) Running this code will result in an AttributeError: 'DataFrame' object has no attribute 'name'.
Find elsewhere
🌐
Reddit
reddit.com › r/learnpython › attributeerror: 'dataframe' object has no attribute 'data'
r/learnpython on Reddit: AttributeError: 'DataFrame' object has no attribute 'data'
September 29, 2021 -
wine = pd.read_csv("combined.csv", header=0).iloc[:-1]
df = pd.DataFrame(wine)
df
dataset = pd.DataFrame(df.data, columns =df.feature_names)
dataset['target']=df.target
dataset

ERROR:

<ipython-input-27-64122078da92> in <module>
----> 1 dataset = pd.DataFrame(df.data, columns =df.feature_names)
      2 dataset['target']=df.target
      3 dataset

D:\Anaconda\lib\site-packages\pandas\core\generic.py in __getattr__(self, name)
   5463             if self._info_axis._can_hold_identifiers_and_holds_name(name):
   5464                 return self[name]
-> 5465             return object.__getattribute__(self, name)
   5466 
   5467     def __setattr__(self, name: str, value) -> None:

AttributeError: 'DataFrame' object has no attribute 'data'

I'm trying to set up a target to proceed with my Multi Linear Regression Project, but I can't even do that. I've already downloaded the CSV file and have it uploaded on a Jupyter Notebook. What I'm I doing wrong?

🌐
Edureka Community
edureka.co › home › community › categories › python › attributeerror dataframe object has no...
AttributeError DataFrame object has no attribute is impossible | Edureka Community
March 10, 2020 - from collections import Counter import re import numpy as np import pandas as pd from nltk. ... 'DataFrame' object has no attribute 'is_impossible'
🌐
Databricks Community
community.databricks.com › t5 › data-engineering › attributeerror-dataframe-object-has-no-attribute-rename › td-p › 28109
Solved: AttributeError: 'DataFrame' object has no attribut... - Databricks Community - 28109
January 2, 2024 - Hello, I am doing the Data Science and Machine Learning course. The Boston housing has unintuitive column names. I want to rename them, e.g. so 'zn' becomes 'Zoning'. When I run this command: df_bostonLegible = df_boston.rename({'zn':'Zoning'}, axis='columns') Then I get the error "AttributeError: '...
🌐
GitHub
github.com › pydata › pandas-datareader › issues › 991
MOEX: AttributeError: 'DataFrame' object has no attribute 'append' · Issue #991 · pydata/pandas-datareader
May 27, 2024 - AttributeError Traceback (most recent call last) ~\AppData\Local\Temp\ipykernel_18664\2461240623.py in ?() 1 import pandas_datareader as pdr 2 import pandas_montecarlo 3 ----> 4 df = pdr.get_data_moex('SBER', start='2024-05-26') 5 df.head() · AttributeError: 'DataFrame' object has no attribute ...
Published   May 27, 2024
Author   RJSDevel
🌐
Statology
statology.org › home › how to fix: module ‘pandas’ has no attribute ‘dataframe’
How to Fix: module 'pandas' has no attribute 'dataframe'
October 27, 2021 - This tutorial explains how to fix the following error in Python: module 'pandas' has no attribute 'dataframe'.
🌐
Python.org
discuss.python.org › python help
Dict cannot be saved as json no matter what - Python Help - Discussions on Python.org
November 15, 2021 - Hi I am trying to save a dict to json but no matter what I try I get either errors that Object of type DataFrame is not JSON serializable OR Dict object has no attribute to json I follow many tutorials on how to do that eg json = json.dumps(pdData) f = open("dict.json","w") # write json object to file f.write(json) # close file f.close() But still I get errors like the aboves.
🌐
Brainly
brainly.com › computers and technology › high school › how to resolve attributeerror: 'dataframe' object has no attribute?
[FREE] How to resolve AttributeError: 'DataFrame' object has no attribute? - brainly.com
The 'AttributeError: 'DataFrame' object has no attribute' suggests that a nonexistent attribute is being accessed on a DataFrame. Check the DataFrame's structure and ensure columns are correctly typed.
🌐
Quora
quora.com › How-do-you-fix-pandas-that-have-no-attribute-dataframe
How to fix pandas that have no attribute dataframe - Quora
Answer (1 of 2): There are three possibilities here : * The filename could be pandas.py * The filename could be whatever you have imported as, for example, pd. You could have imported using import pandas as pd * There is another file with the name pandas.py or pd.py in the current directory W...
🌐
GeeksforGeeks
geeksforgeeks.org › python › how-to-fix-module-pandas-has-no-attribute-dataframe
How to Fix: module ‘pandas’ has no attribute ‘dataframe’ - GeeksforGeeks
December 19, 2021 - To create dataframe we need to use DataFrame(). If we use dataframe it will throw an error because there is no dataframe attribute in pandas. The method is DataFrame(). We need to pass any dictionary as an argument.
🌐
Codecademy Forums
discuss.codecademy.com › data science
AttributeError: 'DataFrame' object has no attribute 'fit' - Data Science - Codecademy Forums
May 6, 2024 - I’m working on the final project for Machine Learning Engineering. I’m using jupyter notebook to fit the data. When I try to fit the data, I keep getting an error message saying: " AttributeError: ‘DataFrame’ object h…