pandas convert string to float multiple columns

Converting strings to floats in a DataFrame [duplicate]

stackoverflow.com › questions › 16729483 › converting-strings-to-floats-in-a-dataframe

NOTE: pd.convert_objects has now been deprecated. You should use pd.Series.astype(float) or pd.to_numeric as described in other answers.

This is available in 0.11. Forces conversion (or set's to nan) This will work even when astype will fail; its also series by series so it won't convert say a complete string column

CopyIn [10]: df = DataFrame(dict(A = Series(['1.0','1']), B = Series(['1.0','foo'])))

In [11]: df
Out[11]: 
     A    B
0  1.0  1.0
1    1  foo

In [12]: df.dtypes
Out[12]: 
A    object
B    object
dtype: object

In [13]: df.convert_objects(convert_numeric=True)
Out[13]: 
   A   B
0  1   1
1  1 NaN

In [14]: df.convert_objects(convert_numeric=True).dtypes
Out[14]: 
A    float64
B    float64
dtype: object

Answer from Jeff on Stack Overflow

Spark By {Examples}

sparkbyexamples.com › home › pandas › pandas convert column to float in dataframe

Pandas Convert Column to Float in DataFrame - Spark By {Examples}

October 14, 2024 - By using pandas DataFrame.astype() and pandas.to_numeric() methods you can convert a column from string/int type to float. In this article, I will explain

Stack Overflow

stackoverflow.com › questions › 16729483 › converting-strings-to-floats-in-a-dataframe

python - Converting strings to floats in a DataFrame - Stack Overflow

Top answer

1 of 7

83

NOTE: pd.convert_objects has now been deprecated. You should use pd.Series.astype(float) or pd.to_numeric as described in other answers.

This is available in 0.11. Forces conversion (or set's to nan) This will work even when astype will fail; its also series by series so it won't convert say a complete string column

CopyIn [10]: df = DataFrame(dict(A = Series(['1.0','1']), B = Series(['1.0','foo'])))

In [11]: df
Out[11]: 
     A    B
0  1.0  1.0
1    1  foo

In [12]: df.dtypes
Out[12]: 
A    object
B    object
dtype: object

In [13]: df.convert_objects(convert_numeric=True)
Out[13]: 
   A   B
0  1   1
1  1 NaN

In [14]: df.convert_objects(convert_numeric=True).dtypes
Out[14]: 
A    float64
B    float64
dtype: object

2 of 7

79

You can try df.column_name = df.column_name.astype(float). As for the NaN values, you need to specify how they should be converted, but you can use the .fillna method to do it.

Example:

CopyIn [12]: df
Out[12]: 
     a    b
0  0.1  0.2
1  NaN  0.3
2  0.4  0.5

In [13]: df.a.values
Out[13]: array(['0.1', nan, '0.4'], dtype=object)

In [14]: df.a = df.a.astype(float).fillna(0.0)

In [15]: df
Out[15]: 
     a    b
0  0.1  0.2
1  0.0  0.3
2  0.4  0.5

In [16]: df.a.values
Out[16]: array([ 0.1,  0. ,  0.4])

Discussions

python - pandas convert strings to float for multiple columns in dataframe - Stack Overflow

I'm new to pandas and trying to figure out how to convert multiple columns which are formatted as strings to float64's. Currently I'm doing the below, but it seems like apply() or applymap() shoul... More on stackoverflow.com

stackoverflow.com

How can I parse values to float in a pandas dataframe column that contains both floats and strings?

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.to_numeric.html df_distns['Parameter 3'] = pd.to_numeric(df_distns['Parameter 3'], errors='ignore') More on reddit.com

r/learnpython

7

0

June 22, 2021

python - pandas how to convert all the string value to float - Stack Overflow

I want to convert all the string value in Pandas DataFrame into float, and I can define a short function to do this, but it's not a Pythonic way to do that. My DataFrame looks like this: >>&... More on stackoverflow.com

stackoverflow.com

Pandas DataFrame tries to convert a string into a float, while adding it to a column

Well you're using np.nan which is a float. Interestingly though, it only raises the error after the first item has been added. >>> df = pd.DataFrame() >>> df['foo'] = [np.nan] * len(df) >>> df Empty DataFrame Columns: [foo] Index: [] Note the type is float64 >>> df.dtypes foo float64 dtype: object However, pandas converts it. >>> df.at['file_path', 'foo'] = 'file1' >>> df.dtypes foo object dtype: object Second time round, it raises the error - not sure why that is exactly. >>> df['bar'] = [np.nan] * len(df) >>> df.dtypes foo object bar float64 dtype: object >>> df.at['file_path', 'bar'] = 'file1' Traceback (most recent call last): You can use an empty string '' instead of np.nan - or you could initialize your dataframe with values. You may also want to check out pathlib from pathlib import Path file_paths = ... df = pd.DataFrame({ Path(p).stem: [p] for p in file_paths }) .stem from pathlib is the filename without the extension. More on reddit.com

r/learnpython

4

2

December 13, 2021

Videos

03:17

YouTube

How to Convert String Column to Float in Python While Handling ...

January 13, 2025

26