stackoverflow.com › questions › 15891038 › change-column-type-in-pandas

You have four main options for converting types in pandas:

to_numeric() - provides functionality to safely convert non-numeric types (e.g. strings) to a suitable numeric type. (See also to_datetime() and to_timedelta().)
astype() - convert (almost) any type to (almost) any other type (even if it's not necessarily sensible to do so). Also allows you to convert to categorial types (very useful).
infer_objects() - a utility method to convert object columns holding Python objects to a pandas type if possible.
convert_dtypes() - convert DataFrame columns to the "best possible" dtype that supports pd.NA (pandas' object to indicate a missing value).

Read on for more detailed explanations and usage of each of these methods.

1. `to_numeric()`

The best way to convert one or more columns of a DataFrame to numeric values is to use pandas.to_numeric().

This function will try to change non-numeric objects (such as strings) into integers or floating-point numbers as appropriate.

Basic usage

The input to to_numeric() is a Series or a single column of a DataFrame.

>>> s = pd.Series(["8", 6, "7.5", 3, "0.9"]) # mixed string and numeric values
>>> s
0      8
1      6
2    7.5
3      3
4    0.9
dtype: object

>>> pd.to_numeric(s) # convert everything to float values
0    8.0
1    6.0
2    7.5
3    3.0
4    0.9
dtype: float64

As you can see, a new Series is returned. Remember to assign this output to a variable or column name to continue using it:

# convert Series
my_series = pd.to_numeric(my_series)

# convert column "a" of a DataFrame
df["a"] = pd.to_numeric(df["a"])

You can also use it to convert multiple columns of a DataFrame via the apply() method:

# convert all columns of DataFrame
df = df.apply(pd.to_numeric) # convert all columns of DataFrame

# convert just columns "a" and "b"
df[["a", "b"]] = df[["a", "b"]].apply(pd.to_numeric)

As long as your values can all be converted, that's probably all you need.

Error handling

But what if some values can't be converted to a numeric type?

to_numeric() also takes an errors keyword argument that allows you to force non-numeric values to be NaN, or simply ignore columns containing these values.

Here's an example using a Series of strings s which has the object dtype:

>>> s = pd.Series(['1', '2', '4.7', 'pandas', '10'])
>>> s
0         1
1         2
2       4.7
3    pandas
4        10
dtype: object

The default behaviour is to raise if it can't convert a value. In this case, it can't cope with the string 'pandas':

>>> pd.to_numeric(s) # or pd.to_numeric(s, errors='raise')
ValueError: Unable to parse string

Rather than fail, we might want 'pandas' to be considered a missing/bad numeric value. We can coerce invalid values to NaN as follows using the errors keyword argument:

>>> pd.to_numeric(s, errors='coerce')
0     1.0
1     2.0
2     4.7
3     NaN
4    10.0
dtype: float64

The third option for errors is just to ignore the operation if an invalid value is encountered:

>>> pd.to_numeric(s, errors='ignore')
# the original Series is returned untouched

This last option is particularly useful for converting your entire DataFrame, but don't know which of our columns can be converted reliably to a numeric type. In that case, just write:

df.apply(pd.to_numeric, errors='ignore')

The function will be applied to each column of the DataFrame. Columns that can be converted to a numeric type will be converted, while columns that cannot (e.g. they contain non-digit strings or dates) will be left alone.

Downcasting

By default, conversion with to_numeric() will give you either an int64 or float64 dtype (or whatever integer width is native to your platform).

That's usually what you want, but what if you wanted to save some memory and use a more compact dtype, like float32, or int8?

to_numeric() gives you the option to downcast to either 'integer', 'signed', 'unsigned', 'float'. Here's an example for a simple series s of integer type:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

Downcasting to 'integer' uses the smallest possible integer that can hold the values:

>>> pd.to_numeric(s, downcast='integer')
0    1
1    2
2   -7
dtype: int8

Downcasting to 'float' similarly picks a smaller than normal floating type:

>>> pd.to_numeric(s, downcast='float')
0    1.0
1    2.0
2   -7.0
dtype: float32

2. `astype()`

The astype() method enables you to be explicit about the dtype you want your DataFrame or Series to have. It's very versatile in that you can try and go from one type to any other.

Basic usage

Just pick a type: you can use a NumPy dtype (e.g. np.int16), some Python types (e.g. bool), or pandas-specific types (like the categorical dtype).

Call the method on the object you want to convert and astype() will try and convert it for you:

# convert all DataFrame columns to the int64 dtype
df = df.astype(int)

# convert column "a" to int64 dtype and "b" to complex type
df = df.astype({"a": int, "b": complex})

# convert Series to float16 type
s = s.astype(np.float16)

# convert Series to Python strings
s = s.astype(str)

# convert Series to categorical type - see docs for more details
s = s.astype('category')

Notice I said "try" - if astype() does not know how to convert a value in the Series or DataFrame, it will raise an error. For example, if you have a NaN or inf value you'll get an error trying to convert it to an integer.

As of pandas 0.20.0, this error can be suppressed by passing errors='ignore'. Your original object will be returned untouched.

Be careful

astype() is powerful, but it will sometimes convert values "incorrectly". For example:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

These are small integers, so how about converting to an unsigned 8-bit type to save memory?

>>> s.astype(np.uint8)
0      1
1      2
2    249
dtype: uint8

The conversion worked, but the -7 was wrapped round to become 249 (i.e. 2⁸ - 7)!

Trying to downcast using pd.to_numeric(s, downcast='unsigned') instead could help prevent this error.

3. `infer_objects()`

Version 0.21.0 of pandas introduced the method infer_objects() for converting columns of a DataFrame that have an object datatype to a more specific type (soft conversions).

For example, here's a DataFrame with two columns of object type. One holds actual integers and the other holds strings representing integers:

>>> df = pd.DataFrame({'a': [7, 1, 5], 'b': ['3','2','1']}, dtype='object')
>>> df.dtypes
a    object
b    object
dtype: object

Using infer_objects(), you can change the type of column 'a' to int64:

>>> df = df.infer_objects()
>>> df.dtypes
a     int64
b    object
dtype: object

Column 'b' has been left alone since its values were strings, not integers. If you wanted to force both columns to an integer type, you could use df.astype(int) instead.

4. `convert_dtypes()`

Version 1.0 and above includes a method convert_dtypes() to convert Series and DataFrame columns to the best possible dtype that supports the pd.NA missing value.

Here "best possible" means the type most suited to hold the values. For example, this a pandas integer type, if all of the values are integers (or missing values): an object column of Python integer objects are converted to Int64, a column of NumPy int32 values, will become the pandas dtype Int32.

With our object DataFrame df, we get the following result:

>>> df.convert_dtypes().dtypes                                             
a     Int64
b    string
dtype: object

Since column 'a' held integer values, it was converted to the Int64 type (which is capable of holding missing values, unlike int64).

Column 'b' contained string objects, so was changed to pandas' string dtype.

By default, this method will infer the type from object values in each column. We can change this by passing infer_objects=False:

>>> df.convert_dtypes(infer_objects=False).dtypes                          
a    object
b    string
dtype: object

Now column 'a' remained an object column: pandas knows it can be described as an 'integer' column (internally it ran infer_dtype) but didn't infer exactly what dtype of integer it should have so did not convert it. Column 'b' was again converted to 'string' dtype as it was recognised as holding 'string' values.

Answer from Alex Riley on Stack Overflow

Stack Overflow

stackoverflow.com › questions › 15891038 › change-column-type-in-pandas

python - Change column type in pandas - Stack Overflow

1. `to_numeric()`

The best way to convert one or more columns of a DataFrame to numeric values is to use pandas.to_numeric().

This function will try to change non-numeric objects (such as strings) into integers or floating-point numbers as appropriate.

Basic usage

The input to to_numeric() is a Series or a single column of a DataFrame.

>>> s = pd.Series(["8", 6, "7.5", 3, "0.9"]) # mixed string and numeric values
>>> s
0      8
1      6
2    7.5
3      3
4    0.9
dtype: object

>>> pd.to_numeric(s) # convert everything to float values
0    8.0
1    6.0
2    7.5
3    3.0
4    0.9
dtype: float64

As you can see, a new Series is returned. Remember to assign this output to a variable or column name to continue using it:

# convert Series
my_series = pd.to_numeric(my_series)

# convert column "a" of a DataFrame
df["a"] = pd.to_numeric(df["a"])

You can also use it to convert multiple columns of a DataFrame via the apply() method:

# convert all columns of DataFrame
df = df.apply(pd.to_numeric) # convert all columns of DataFrame

# convert just columns "a" and "b"
df[["a", "b"]] = df[["a", "b"]].apply(pd.to_numeric)

As long as your values can all be converted, that's probably all you need.

Error handling

But what if some values can't be converted to a numeric type?

to_numeric() also takes an errors keyword argument that allows you to force non-numeric values to be NaN, or simply ignore columns containing these values.

Here's an example using a Series of strings s which has the object dtype:

>>> s = pd.Series(['1', '2', '4.7', 'pandas', '10'])
>>> s
0         1
1         2
2       4.7
3    pandas
4        10
dtype: object

The default behaviour is to raise if it can't convert a value. In this case, it can't cope with the string 'pandas':

>>> pd.to_numeric(s) # or pd.to_numeric(s, errors='raise')
ValueError: Unable to parse string

Rather than fail, we might want 'pandas' to be considered a missing/bad numeric value. We can coerce invalid values to NaN as follows using the errors keyword argument:

>>> pd.to_numeric(s, errors='coerce')
0     1.0
1     2.0
2     4.7
3     NaN
4    10.0
dtype: float64

The third option for errors is just to ignore the operation if an invalid value is encountered:

>>> pd.to_numeric(s, errors='ignore')
# the original Series is returned untouched

This last option is particularly useful for converting your entire DataFrame, but don't know which of our columns can be converted reliably to a numeric type. In that case, just write:

df.apply(pd.to_numeric, errors='ignore')

Downcasting

By default, conversion with to_numeric() will give you either an int64 or float64 dtype (or whatever integer width is native to your platform).

That's usually what you want, but what if you wanted to save some memory and use a more compact dtype, like float32, or int8?

to_numeric() gives you the option to downcast to either 'integer', 'signed', 'unsigned', 'float'. Here's an example for a simple series s of integer type:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

Downcasting to 'integer' uses the smallest possible integer that can hold the values:

>>> pd.to_numeric(s, downcast='integer')
0    1
1    2
2   -7
dtype: int8

Downcasting to 'float' similarly picks a smaller than normal floating type:

>>> pd.to_numeric(s, downcast='float')
0    1.0
1    2.0
2   -7.0
dtype: float32

2. `astype()`

The astype() method enables you to be explicit about the dtype you want your DataFrame or Series to have. It's very versatile in that you can try and go from one type to any other.

Basic usage

Just pick a type: you can use a NumPy dtype (e.g. np.int16), some Python types (e.g. bool), or pandas-specific types (like the categorical dtype).

Call the method on the object you want to convert and astype() will try and convert it for you:

# convert all DataFrame columns to the int64 dtype
df = df.astype(int)

# convert column "a" to int64 dtype and "b" to complex type
df = df.astype({"a": int, "b": complex})

# convert Series to float16 type
s = s.astype(np.float16)

# convert Series to Python strings
s = s.astype(str)

# convert Series to categorical type - see docs for more details
s = s.astype('category')

As of pandas 0.20.0, this error can be suppressed by passing errors='ignore'. Your original object will be returned untouched.

Be careful

astype() is powerful, but it will sometimes convert values "incorrectly". For example:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

These are small integers, so how about converting to an unsigned 8-bit type to save memory?

>>> s.astype(np.uint8)
0      1
1      2
2    249
dtype: uint8

The conversion worked, but the -7 was wrapped round to become 249 (i.e. 2⁸ - 7)!

Trying to downcast using pd.to_numeric(s, downcast='unsigned') instead could help prevent this error.

3. `infer_objects()`

Version 0.21.0 of pandas introduced the method infer_objects() for converting columns of a DataFrame that have an object datatype to a more specific type (soft conversions).

For example, here's a DataFrame with two columns of object type. One holds actual integers and the other holds strings representing integers:

>>> df = pd.DataFrame({'a': [7, 1, 5], 'b': ['3','2','1']}, dtype='object')
>>> df.dtypes
a    object
b    object
dtype: object

Using infer_objects(), you can change the type of column 'a' to int64:

>>> df = df.infer_objects()
>>> df.dtypes
a     int64
b    object
dtype: object

Column 'b' has been left alone since its values were strings, not integers. If you wanted to force both columns to an integer type, you could use df.astype(int) instead.

4. `convert_dtypes()`

Version 1.0 and above includes a method convert_dtypes() to convert Series and DataFrame columns to the best possible dtype that supports the pd.NA missing value.

With our object DataFrame df, we get the following result:

>>> df.convert_dtypes().dtypes                                             
a     Int64
b    string
dtype: object

Since column 'a' held integer values, it was converted to the Int64 type (which is capable of holding missing values, unlike int64).

Column 'b' contained string objects, so was changed to pandas' string dtype.

By default, this method will infer the type from object values in each column. We can change this by passing infer_objects=False:

>>> df.convert_dtypes(infer_objects=False).dtypes                          
a    object
b    string
dtype: object

2 of 16

553

Use this:

a = [['a', '1.2', '4.2'], ['b', '70', '0.03'], ['x', '5', '0']]
df = pd.DataFrame(a, columns=['one', 'two', 'three'])
df

Out[16]:
  one  two three
0   a  1.2   4.2
1   b   70  0.03
2   x    5     0

df.dtypes

Out[17]:
one      object
two      object
three    object

df[['two', 'three']] = df[['two', 'three']].astype(float)

df.dtypes

Out[19]:
one       object
two      float64
three    float64

W3docs

w3docs.com › python

Change column type in pandas

In pandas, you can change the data type of a column using the astype() function. You can pass a dictionary to the astype() function where the keys are the column names and the values are the new data types.

Discussions

Pandas dataframe columns won't convert to float

could you try this , dp03_cleaned[columns] = dp03_cleaned[columns].apply(pd.to_numeric, errors='coerce') More on reddit.com

r/learnpython

May 14, 2022

python - Trying to change pandas column dtype from str to float - Data Science Stack Exchange

I am trying to convert a pandas column from a str to a float. Before I convert the strings to floats using astype(float), I need to remove characters from the string which cannot be converted into ... More on datascience.stackexchange.com

datascience.stackexchange.com

Converting object to a float

I mean, converting the string "15-20" to a float doesn't really make much sense.

What do you mean by "hasn't worked?" What are you trying to do with your data frame that requires you to change this column's data type?

Videos

youtube.com

Change Column Data Type From Float To Int Python Pandas ...

13:40

YouTube

CHANGE COLUMN DTYPE | How to change the datatype of a column in ...

September 23, 2020

03:04

YouTube

Pandas Trick. Convert Strings to Float in Pandas DataFrame (parsing ...

How do I change the data type of a pandas Series? - YouTube

Convert datatypes using Python Pandas - Float and String to integer ...

April 17, 2020

youtube.com

Change column with string of percent to float pandas dataframe

View all

Better Stack

betterstack.com › community › questions › change-column-type-in-pandas

Change Column Type in Pandas | Better Stack Community

June 19, 2024 - import pandas as pd # Sample DataFrame data = {'A': [1, 2, 3], 'B': [4.0, 5.0, 6.0]} df = pd.DataFrame(data) # Original DataFrame print("Original DataFrame:") print(df) # Change column type of column 'A' to float df['A'] = df['A'].astype(float) # Updated DataFrame print("\\nDataFrame with column 'A' converted to float:") print(df)

Spark By {Examples}

sparkbyexamples.com › home › pandas › pandas convert column to float in dataframe

Pandas Convert Column to Float in DataFrame - Spark By {Examples}

October 14, 2024 - By using pandas DataFrame.astype() and pandas.to_numeric() methods you can convert a column from string/int type to float. In this article, I will explain

GeeksforGeeks

geeksforgeeks.org › pandas › convert-pandas-dataframe-column-to-float

Convert Pandas Dataframe Column To Float - GeeksforGeeks

July 23, 2025 - As you observe the output the data type of the string column is changed from object to float after using to_numeric() function. We can handle Non-convertible values, Missing values, and NaN values by using errors='coerce' parameter in ...

Skytowner

skytowner.com › explore › converting_column_type_to_float_in_pandas_dataframe

Converting column type to float in Pandas DataFrame

To convert the column type to float in Pandas DataFrame, either use the Series' astype() method, or use Pandas' to_numeric() method.

Pandas

pandas.pydata.org › docs › reference › api › pandas.DataFrame.astype.html

pandas.DataFrame.astype — pandas 3.0.3 documentation

Use a str, numpy.dtype, pandas.ExtensionDtype or Python type to cast entire pandas object to the same type. Alternatively, use a mapping, e.g. {col: dtype, …}, where col is a column label and dtype is a numpy.dtype or Python type to cast one or more of the DataFrame’s columns to column-specific types. ... This keyword is now ignored; changing its value will have no impact on the method.

Find elsewhere

Google Bing Mojeek

datagy

datagy.io › home › pandas tutorials › data analysis in pandas › converting pandas dataframe column from object to float

Converting Pandas DataFrame Column from Object to Float • datagy

May 12, 2023 - We can see that the Price column’s data type is actually an object. Let’s now dive into how we can convert the column to a floating point value. To convert a Pandas column’s data type from object to float you can use the to_numeric function.

Saturn Cloud

saturncloud.io › blog › how-to-convert-a-column-in-pandas-dataframe-from-string-to-float

How to Convert a Column in Pandas DataFrame from String to Float | Saturn Cloud Blog

June 19, 2023 - If we have done everything correctly, ... DataFrame from a string to a float is a simple task that can be accomplished using the astype() method....

Sentry

sentry.io › sentry answers › python › change a column type in a dataframe in python pandas

Change a column type in a DataFrame in Python Pandas | Sentry

If we want to convert a column to a sensible numeric data type (integer or float), we should use the to_numeric function. If we want Pandas to decide which data types to use for each column, we should use the convert_dtypes method.

GoLinuxCloud

golinuxcloud.com › home › databases › convert pandas dataframe column to float (astype, to_numeric & practical examples)

Convert pandas DataFrame Column to Float (astype, to_numeric & Practical Examples) | GoLinuxCloud

March 9, 2026 - The most straightforward way to convert a pandas column to float is using the astype() method. This method explicitly casts the column data type to float.

Squash

squash.io › how-to-change-column-type-in-pandas

How to Change Column Type in Pandas - Squash Labs

October 14, 2023 - We then used the astype() method to change the data type of the Age column to float. Finally, we displayed the modified DataFrame. Another way to change the data type of a column in Pandas is by using the to_numeric() function.

GeeksforGeeks

geeksforgeeks.org › how-to-convert-integers-to-floats-in-pandas-dataframe

How to Convert Integers to Floats in Pandas DataFrame? | GeeksforGeeks

August 25, 2020 - In the above example, we change the data type of column 'Weight' from 'int64' to 'float64'. Example 2: Converting more than one column from int to float using DataFrame.astype() ... # importing pandas library import pandas as pd # Initializing the nested list with Data set player_list = [['M.S.Dhoni', 36, 75, 5428000, 176], ['A.B.D Villers', 38, 74, 3428000, 175], ['V.Kholi', 31, 70, 8428000, 172], ['S.Smith', 34, 80, 4428000, 180], ['C.Gayle', 40, 100, 4528000, 200], ['J.Root', 33, 72, 7028000, 170], ['K.Peterson', 42, 85, 2528000, 190]] # creating a pandas dataframe df = pd.DataFrame(player_list, columns=[ 'Name', 'Age', 'Weight', 'Salary', 'Strike_rate']) # lets find out the data type of 'Age' # and 'Strike_rate' columns print(df.dtypes)

Statology

statology.org › home › how to convert object to float in pandas (with examples)

How to Convert Object to Float in Pandas (With Examples)

May 11, 2022 - #convert points column from object to float df['points'] = df['points'].astype(float) #view updated DataFrame print(df) team points assists 0 A 18.0 5 1 B 22.2 7 2 C 19.1 7 3 D 14.0 9 4 E 14.0 12 5 F 11.5 9 6 G 20.0 9 7 H 28.0 4 #view updated data types print(df.dtypes) team object points float64 assists int64 dtype: object

reddit.com › r/learnpython › pandas dataframe columns won't convert to float

r/learnpython on Reddit: Pandas dataframe columns won't convert to float

May 14, 2022 -

I'm creating a pandas dataframe using US census data for a project I'm working on, and I'm having a problem converting the datatype of my columns to a numeric value. I've tried a couple different ways and keep running into the same issue.

dp03_emp = pd.read_csv('DP03_employment.csv',header=1).iloc[1:, :]

dp03_cleaned = dp03_emp[['Percent!!EMPLOYMENT STATUS!!Percent Unemployed',
                        'Estimate!!INCOME AND BENEFITS (IN 2011 INFLATION-ADJUSTED DOLLARS)!!Median household income (dollars)',
                        'Estimate!!INCOME AND BENEFITS (IN 2011 INFLATION-ADJUSTED DOLLARS)!!Mean household income (dollars)',
                        'Percent!!PERCENTAGE OF FAMILIES AND PEOPLE WHOSE INCOME IN THE PAST 12 MONTHS IS BELOW THE POVERTY LEVEL!!All people',
                        'Geographic Area Name',
                        'Survey Year']]

columns = ['Percent Unemployed','Median Household Income','Mean Household Income', 'Percent below poverty line']
dp03_cleaned[columns]=pd.to_numeric(columns,errors='coerce')

I know that using the 'coerce' argument will assign everything that is not recognized as a number either null or NaN, which is what happens to every value.

dp03_cleaned.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 20110 entries, 1 to 20110
Data columns (total 6 columns):
 #   Column                      Non-Null Count  Dtype  
---  ------                      --------------  -----  
 0   Percent Unemployed          0 non-null      float64
 1   Median Household Income     0 non-null      float64
 2   Mean Household Income       0 non-null      float64
 3   Percent below poverty line  0 non-null      float64
 4   Geographic Area Name        20109 non-null  object 
 5   Survey Year                 20109 non-null  float64
dtypes: float64(5), object(1)
memory usage: 942.8+ KB

I've also tried changing the datatype at the point of import in the read_csv dtype option, but it gives an error as well.

Sorry if this is a simple question, I've spent a fair bit of time trying to Google it and haven't had any luck. Thanks in advance for the help.

1. to_numeric()

Basic usage

Error handling

Downcasting

2. astype()

Basic usage

Be careful

3. infer_objects()

4. convert_dtypes()

1. to_numeric()

Basic usage

Error handling

Downcasting

2. astype()

Basic usage

Be careful

3. infer_objects()

4. convert_dtypes()

Videos

1. `to_numeric()`

2. `astype()`

3. `infer_objects()`

4. `convert_dtypes()`

1. `to_numeric()`

2. `astype()`

3. `infer_objects()`

4. `convert_dtypes()`