stackoverflow.com › questions › 15891038 › change-column-type-in-pandas

You have four main options for converting types in pandas:

to_numeric() - provides functionality to safely convert non-numeric types (e.g. strings) to a suitable numeric type. (See also to_datetime() and to_timedelta().)
astype() - convert (almost) any type to (almost) any other type (even if it's not necessarily sensible to do so). Also allows you to convert to categorial types (very useful).
infer_objects() - a utility method to convert object columns holding Python objects to a pandas type if possible.
convert_dtypes() - convert DataFrame columns to the "best possible" dtype that supports pd.NA (pandas' object to indicate a missing value).

Read on for more detailed explanations and usage of each of these methods.

1. `to_numeric()`

The best way to convert one or more columns of a DataFrame to numeric values is to use pandas.to_numeric().

This function will try to change non-numeric objects (such as strings) into integers or floating-point numbers as appropriate.

Basic usage

The input to to_numeric() is a Series or a single column of a DataFrame.

>>> s = pd.Series(["8", 6, "7.5", 3, "0.9"]) # mixed string and numeric values
>>> s
0      8
1      6
2    7.5
3      3
4    0.9
dtype: object

>>> pd.to_numeric(s) # convert everything to float values
0    8.0
1    6.0
2    7.5
3    3.0
4    0.9
dtype: float64

As you can see, a new Series is returned. Remember to assign this output to a variable or column name to continue using it:

# convert Series
my_series = pd.to_numeric(my_series)

# convert column "a" of a DataFrame
df["a"] = pd.to_numeric(df["a"])

You can also use it to convert multiple columns of a DataFrame via the apply() method:

# convert all columns of DataFrame
df = df.apply(pd.to_numeric) # convert all columns of DataFrame

# convert just columns "a" and "b"
df[["a", "b"]] = df[["a", "b"]].apply(pd.to_numeric)

As long as your values can all be converted, that's probably all you need.

Error handling

But what if some values can't be converted to a numeric type?

to_numeric() also takes an errors keyword argument that allows you to force non-numeric values to be NaN, or simply ignore columns containing these values.

Here's an example using a Series of strings s which has the object dtype:

>>> s = pd.Series(['1', '2', '4.7', 'pandas', '10'])
>>> s
0         1
1         2
2       4.7
3    pandas
4        10
dtype: object

The default behaviour is to raise if it can't convert a value. In this case, it can't cope with the string 'pandas':

>>> pd.to_numeric(s) # or pd.to_numeric(s, errors='raise')
ValueError: Unable to parse string

Rather than fail, we might want 'pandas' to be considered a missing/bad numeric value. We can coerce invalid values to NaN as follows using the errors keyword argument:

>>> pd.to_numeric(s, errors='coerce')
0     1.0
1     2.0
2     4.7
3     NaN
4    10.0
dtype: float64

The third option for errors is just to ignore the operation if an invalid value is encountered:

>>> pd.to_numeric(s, errors='ignore')
# the original Series is returned untouched

This last option is particularly useful for converting your entire DataFrame, but don't know which of our columns can be converted reliably to a numeric type. In that case, just write:

df.apply(pd.to_numeric, errors='ignore')

The function will be applied to each column of the DataFrame. Columns that can be converted to a numeric type will be converted, while columns that cannot (e.g. they contain non-digit strings or dates) will be left alone.

Downcasting

By default, conversion with to_numeric() will give you either an int64 or float64 dtype (or whatever integer width is native to your platform).

That's usually what you want, but what if you wanted to save some memory and use a more compact dtype, like float32, or int8?

to_numeric() gives you the option to downcast to either 'integer', 'signed', 'unsigned', 'float'. Here's an example for a simple series s of integer type:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

Downcasting to 'integer' uses the smallest possible integer that can hold the values:

>>> pd.to_numeric(s, downcast='integer')
0    1
1    2
2   -7
dtype: int8

Downcasting to 'float' similarly picks a smaller than normal floating type:

>>> pd.to_numeric(s, downcast='float')
0    1.0
1    2.0
2   -7.0
dtype: float32

2. `astype()`

The astype() method enables you to be explicit about the dtype you want your DataFrame or Series to have. It's very versatile in that you can try and go from one type to any other.

Basic usage

Just pick a type: you can use a NumPy dtype (e.g. np.int16), some Python types (e.g. bool), or pandas-specific types (like the categorical dtype).

Call the method on the object you want to convert and astype() will try and convert it for you:

# convert all DataFrame columns to the int64 dtype
df = df.astype(int)

# convert column "a" to int64 dtype and "b" to complex type
df = df.astype({"a": int, "b": complex})

# convert Series to float16 type
s = s.astype(np.float16)

# convert Series to Python strings
s = s.astype(str)

# convert Series to categorical type - see docs for more details
s = s.astype('category')

Notice I said "try" - if astype() does not know how to convert a value in the Series or DataFrame, it will raise an error. For example, if you have a NaN or inf value you'll get an error trying to convert it to an integer.

As of pandas 0.20.0, this error can be suppressed by passing errors='ignore'. Your original object will be returned untouched.

Be careful

astype() is powerful, but it will sometimes convert values "incorrectly". For example:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

These are small integers, so how about converting to an unsigned 8-bit type to save memory?

>>> s.astype(np.uint8)
0      1
1      2
2    249
dtype: uint8

The conversion worked, but the -7 was wrapped round to become 249 (i.e. 2⁸ - 7)!

Trying to downcast using pd.to_numeric(s, downcast='unsigned') instead could help prevent this error.

3. `infer_objects()`

Version 0.21.0 of pandas introduced the method infer_objects() for converting columns of a DataFrame that have an object datatype to a more specific type (soft conversions).

For example, here's a DataFrame with two columns of object type. One holds actual integers and the other holds strings representing integers:

>>> df = pd.DataFrame({'a': [7, 1, 5], 'b': ['3','2','1']}, dtype='object')
>>> df.dtypes
a    object
b    object
dtype: object

Using infer_objects(), you can change the type of column 'a' to int64:

>>> df = df.infer_objects()
>>> df.dtypes
a     int64
b    object
dtype: object

Column 'b' has been left alone since its values were strings, not integers. If you wanted to force both columns to an integer type, you could use df.astype(int) instead.

4. `convert_dtypes()`

Version 1.0 and above includes a method convert_dtypes() to convert Series and DataFrame columns to the best possible dtype that supports the pd.NA missing value.

Here "best possible" means the type most suited to hold the values. For example, this a pandas integer type, if all of the values are integers (or missing values): an object column of Python integer objects are converted to Int64, a column of NumPy int32 values, will become the pandas dtype Int32.

With our object DataFrame df, we get the following result:

>>> df.convert_dtypes().dtypes                                             
a     Int64
b    string
dtype: object

Since column 'a' held integer values, it was converted to the Int64 type (which is capable of holding missing values, unlike int64).

Column 'b' contained string objects, so was changed to pandas' string dtype.

By default, this method will infer the type from object values in each column. We can change this by passing infer_objects=False:

>>> df.convert_dtypes(infer_objects=False).dtypes                          
a    object
b    string
dtype: object

Now column 'a' remained an object column: pandas knows it can be described as an 'integer' column (internally it ran infer_dtype) but didn't infer exactly what dtype of integer it should have so did not convert it. Column 'b' was again converted to 'string' dtype as it was recognised as holding 'string' values.

Answer from Alex Riley on Stack Overflow

Pandas

pandas.pydata.org › docs › reference › api › pandas.DataFrame.astype.html

pandas.DataFrame.astype — pandas 3.0.2 documentation

Alternatively, use a mapping, e.g. {col: dtype, …}, where col is a column label and dtype is a numpy.dtype or Python type to cast one or more of the DataFrame’s columns to column-specific types. ... This keyword is now ignored; changing its value will have no impact on the method.

Spark By {Examples}

sparkbyexamples.com › home › pandas › pandas convert column to string type

Pandas Convert Column to String Type - Spark By {Examples}

July 3, 2025 - In this article, I will explain how to convert single column or multiple columns to string type in pandas DataFrame, here, I will demonstrate using

Videos

08:18

YouTube

How to Change Column Types in Pandas - YouTube

Update pandas data types // Change data types of multiple columns ...

Change a column Data Type in a dataframe in Pandas | Python Pandas ...

December 28, 2024

00:58

YouTube

Change Column Datatype in Pandas | Python Tutorial - YouTube

December 31, 2023

13:40

YouTube

CHANGE COLUMN DTYPE | How to change the datatype of a column in ...

September 23, 2020

View all

Pandas

pandas.pydata.org › docs › reference › frame.html

DataFrame — pandas 3.0.2 documentation

Flags refer to attributes of the pandas object. Properties of the dataset (like the date is was recorded, the URL it was accessed from, etc.) should be stored in DataFrame.attrs. DataFrame.attrs is a dictionary for storing global metadata for this DataFrame. ... DataFrame.attrs is considered experimental and may change without warning. DataFrame.plot is both a callable method and a namespace attribute for specific plotting methods of the form DataFrame.plot.<kind>. Sparse-dtype specific methods and attributes are provided under the DataFrame.sparse accessor.

Pandas

pandas.pydata.org › docs › user_guide › indexing.html

Indexing and selecting data — pandas 3.0.2 documentation

Destructuring tuple keys into row (and column) indexes occurs before callables are applied, so you cannot return a tuple from a callable to index both rows and columns. Getting values from an object with multi-axes selection uses the following notation (using .loc as an example, but the following applies to .iloc as well). Any of the axes accessors may be the null slice :. Axes left out of the specification are assumed to be :, e.g. p.loc['a'] is equivalent to p.loc['a', :]. In [1]: ser = pd.Series(range(5), index=list("abcde")) In [2]: ser.loc[["a", "c", "e"]] Out[2]: a 0 c 2 e 4 dtype: int64 In [3]: df = pd.DataFrame(np.arange(25).reshape(5, 5), index=list("abcde"), columns=list("abcde")) In [4]: df.loc[["a", "c", "e"], ["b", "d"]] Out[4]: b d a 1 3 c 11 13 e 21 23

Stack Overflow

stackoverflow.com › questions › 15891038 › change-column-type-in-pandas

python - Change column type in pandas - Stack Overflow

1. `to_numeric()`

The best way to convert one or more columns of a DataFrame to numeric values is to use pandas.to_numeric().

This function will try to change non-numeric objects (such as strings) into integers or floating-point numbers as appropriate.

Basic usage

The input to to_numeric() is a Series or a single column of a DataFrame.

>>> s = pd.Series(["8", 6, "7.5", 3, "0.9"]) # mixed string and numeric values
>>> s
0      8
1      6
2    7.5
3      3
4    0.9
dtype: object

>>> pd.to_numeric(s) # convert everything to float values
0    8.0
1    6.0
2    7.5
3    3.0
4    0.9
dtype: float64

As you can see, a new Series is returned. Remember to assign this output to a variable or column name to continue using it:

# convert Series
my_series = pd.to_numeric(my_series)

# convert column "a" of a DataFrame
df["a"] = pd.to_numeric(df["a"])

You can also use it to convert multiple columns of a DataFrame via the apply() method:

# convert all columns of DataFrame
df = df.apply(pd.to_numeric) # convert all columns of DataFrame

# convert just columns "a" and "b"
df[["a", "b"]] = df[["a", "b"]].apply(pd.to_numeric)

As long as your values can all be converted, that's probably all you need.

Error handling

But what if some values can't be converted to a numeric type?

to_numeric() also takes an errors keyword argument that allows you to force non-numeric values to be NaN, or simply ignore columns containing these values.

Here's an example using a Series of strings s which has the object dtype:

>>> s = pd.Series(['1', '2', '4.7', 'pandas', '10'])
>>> s
0         1
1         2
2       4.7
3    pandas
4        10
dtype: object

The default behaviour is to raise if it can't convert a value. In this case, it can't cope with the string 'pandas':

>>> pd.to_numeric(s) # or pd.to_numeric(s, errors='raise')
ValueError: Unable to parse string

Rather than fail, we might want 'pandas' to be considered a missing/bad numeric value. We can coerce invalid values to NaN as follows using the errors keyword argument:

>>> pd.to_numeric(s, errors='coerce')
0     1.0
1     2.0
2     4.7
3     NaN
4    10.0
dtype: float64

The third option for errors is just to ignore the operation if an invalid value is encountered:

>>> pd.to_numeric(s, errors='ignore')
# the original Series is returned untouched

This last option is particularly useful for converting your entire DataFrame, but don't know which of our columns can be converted reliably to a numeric type. In that case, just write:

df.apply(pd.to_numeric, errors='ignore')

Downcasting

By default, conversion with to_numeric() will give you either an int64 or float64 dtype (or whatever integer width is native to your platform).

That's usually what you want, but what if you wanted to save some memory and use a more compact dtype, like float32, or int8?

to_numeric() gives you the option to downcast to either 'integer', 'signed', 'unsigned', 'float'. Here's an example for a simple series s of integer type:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

Downcasting to 'integer' uses the smallest possible integer that can hold the values:

>>> pd.to_numeric(s, downcast='integer')
0    1
1    2
2   -7
dtype: int8

Downcasting to 'float' similarly picks a smaller than normal floating type:

>>> pd.to_numeric(s, downcast='float')
0    1.0
1    2.0
2   -7.0
dtype: float32

2. `astype()`

The astype() method enables you to be explicit about the dtype you want your DataFrame or Series to have. It's very versatile in that you can try and go from one type to any other.

Basic usage

Just pick a type: you can use a NumPy dtype (e.g. np.int16), some Python types (e.g. bool), or pandas-specific types (like the categorical dtype).

Call the method on the object you want to convert and astype() will try and convert it for you:

# convert all DataFrame columns to the int64 dtype
df = df.astype(int)

# convert column "a" to int64 dtype and "b" to complex type
df = df.astype({"a": int, "b": complex})

# convert Series to float16 type
s = s.astype(np.float16)

# convert Series to Python strings
s = s.astype(str)

# convert Series to categorical type - see docs for more details
s = s.astype('category')

As of pandas 0.20.0, this error can be suppressed by passing errors='ignore'. Your original object will be returned untouched.

Be careful

astype() is powerful, but it will sometimes convert values "incorrectly". For example:

>>> s = pd.Series([1, 2, -7])
>>> s
0    1
1    2
2   -7
dtype: int64

These are small integers, so how about converting to an unsigned 8-bit type to save memory?

>>> s.astype(np.uint8)
0      1
1      2
2    249
dtype: uint8

The conversion worked, but the -7 was wrapped round to become 249 (i.e. 2⁸ - 7)!

Trying to downcast using pd.to_numeric(s, downcast='unsigned') instead could help prevent this error.

3. `infer_objects()`

Version 0.21.0 of pandas introduced the method infer_objects() for converting columns of a DataFrame that have an object datatype to a more specific type (soft conversions).

For example, here's a DataFrame with two columns of object type. One holds actual integers and the other holds strings representing integers:

>>> df = pd.DataFrame({'a': [7, 1, 5], 'b': ['3','2','1']}, dtype='object')
>>> df.dtypes
a    object
b    object
dtype: object

Using infer_objects(), you can change the type of column 'a' to int64:

>>> df = df.infer_objects()
>>> df.dtypes
a     int64
b    object
dtype: object

Column 'b' has been left alone since its values were strings, not integers. If you wanted to force both columns to an integer type, you could use df.astype(int) instead.

4. `convert_dtypes()`

Version 1.0 and above includes a method convert_dtypes() to convert Series and DataFrame columns to the best possible dtype that supports the pd.NA missing value.

With our object DataFrame df, we get the following result:

>>> df.convert_dtypes().dtypes                                             
a     Int64
b    string
dtype: object

Since column 'a' held integer values, it was converted to the Int64 type (which is capable of holding missing values, unlike int64).

Column 'b' contained string objects, so was changed to pandas' string dtype.

By default, this method will infer the type from object values in each column. We can change this by passing infer_objects=False:

>>> df.convert_dtypes(infer_objects=False).dtypes                          
a    object
b    string
dtype: object

2 of 16

552

Use this:

a = [['a', '1.2', '4.2'], ['b', '70', '0.03'], ['x', '5', '0']]
df = pd.DataFrame(a, columns=['one', 'two', 'three'])
df

Out[16]:
  one  two three
0   a  1.2   4.2
1   b   70  0.03
2   x    5     0

df.dtypes

Out[17]:
one      object
two      object
three    object

df[['two', 'three']] = df[['two', 'three']].astype(float)

df.dtypes

Out[19]:
one       object
two      float64
three    float64

GeeksforGeeks

geeksforgeeks.org › pandas › convert-the-column-type-from-string-to-datetime-format-in-pandas-dataframe

Convert Column Type from String to Datetime Format in Pandas Dataframe - GeeksforGeeks

09:28

import pandas as pd df = pd.DataFrame({ 'Date': ['11/08/2011', '04/23/2008', '10/02/2019'], 'Event': ['Music', 'Poetry', 'Theatre'] }) print("Before Conversion:") print(df.dtypes) df['Date'] = df['Date'].astype('datetime64[ns]') print("\nAfter Conversion:") print(df.dtypes) ... Explanation: astype('datetime64[ns]') explicitly converts the “Date” column to datetime type. Change Data Type for one or more columns in Pandas Dataframe

Published November 1, 2025

GeeksforGeeks

geeksforgeeks.org › pandas › change-data-type-for-one-or-more-columns-in-pandas-dataframe

Change Data Type for one or more columns in Pandas Dataframe - GeeksforGeeks

July 11, 2025 - You can also change the data type of specific columns using a dictionary where the keys are column names and the values are the desired data types. ... import pandas as pd df = pd.DataFrame({ 'A': [1, 2, 3, 4, 5], 'B': ['a', 'b', 'c', 'd', 'e'], ...

Find elsewhere

Google Bing Mojeek

Cbseacademic

cbseacademic.nic.in › web_material › Curriculum26 › publication › srsec › AI_Teacher_HandbookXII.pdf pdf

ARTIFICIAL ARTIFICIAL ARTIFICIAL INTELLIGENCE INTELLIGENCE INTELLIGENCE

In NumPy, the number of dimensions ... the rank of the array ... Matplotlib, a popular plotting library in Python. Pandas provides powerful data manipulation and aggregation functionalities, making · it easy for us to perform complex analyses and generate insightful visualizations. This · capability is invaluable in AI and data-driven decision-making processes, allowing · businesses to gain actionable insights from their data. ... DataFRame.loc[] method can also be used to change the data values ...

Kaggle

kaggle.com › code › ysthehurricane › english-to-hindi-translator

English to Hindi Translator

Checking your browser before accessing www.kaggle.com · Click here if you are not automatically redirected after 5 seconds

Sentry

sentry.io › sentry answers › python › change a column type in a dataframe in python pandas

Change a column type in a DataFrame in Python Pandas | Sentry

import pandas as pd # Create and print DataFrame df = pd.DataFrame({ 'A': ['1', '2', '3'], 'B': ['4', '5', '6'], 'C': ['7', '8', '9'] }) print(df) # Print data types of each column in DataFrame print("\n") print(df.dtypes) # Change column A's values to floats df['A'] = df['A'].astype(float) # Change column B and C's values to integers df = df.astype({'B': int, 'C': int}) print("\nConverted:\n") # Print altered DataFrame print(df) # Print data types of each column in DataFrame print("\n") print(df.dtypes)

Pandas

pandas.pydata.org › docs › reference › api › pandas.get_dummies.html

pandas.get_dummies — pandas 3.0.2 documentation

Alternatively, prefix_sep can be a list with length equal to the number of columns, or a dictionary mapping column names to separators. ... If True, a NaN indicator column will be added even if no NaN values are present. If False, NA values are encoded as all zero. ... Column names in the DataFrame to be encoded. If columns is None then all the columns with object, string, or category dtype will be converted.

Wikipedia

en.wikipedia.org › wiki › R_(programming_language)

R (programming language) - Wikipedia

1 week ago - X Y Z A 2 6 12 B 20 30 42 > new_df$Z # Output the Z column [1] 12 42 > new_df$Z == new_df['Z'] && new_df[3] == new_df$Z # The dataframe column Z can be accessed using the syntax $Z, ['Z'], or [3], and the values are the same. [1] TRUE > attributes(new_df) # Print information about attributes of the new_df dataframe $names [1] "X" "Y" "Z" $row.names [1] "A" "B" $class [1] "data.frame" > attributes(new_df)$row.names <- c("one", "two") # Access and then change the row.names attribute; this can also be done using the rownames() function > new_df X Y Z one 2 6 12 two 20 30 42

History Packages Community Examples Version names Interfaces Implementations Commercial support Further reading

Pandas

pandas.pydata.org › docs › reference › api › pandas.read_excel.html

pandas.read_excel — pandas 3.0.2 documentation

Returns a subset of the columns according to behavior above. dtypeType name or dict of column -> type, default None

Python

docs.python.org › 3 › library › csv.html

csv — CSV File Reading and Writing

February 23, 2026 - Source code: Lib/csv.py The so-called CSV (Comma Separated Values) format is the most common import and export format for spreadsheets and databases. CSV format was used for many years prior to att...

Pandas

pandas.pydata.org › docs › reference › api › pandas.merge.html

pandas.merge — pandas 3.0.1 documentation

Since pandas 3.0, this method always returns a new object using a lazy copy mechanism that defers copies until necessary (Copy-on-Write). See the user guide on Copy-on-Write for more details. ... If True, adds a column to the output DataFrame called “_merge” with information on the source of each row. The column can be given a different name by providing a string argument.

Pandas

pandas.pydata.org › docs › reference › api › pandas.DataFrame.index.html

pandas.DataFrame.index — pandas 3.0.2 documentation

The column labels of the DataFrame. ... Convert the DataFrame to a NumPy array. ... >>> df = pd.DataFrame({'Name': ['Alice', 'Bob', 'Aritra'], ... 'Age': [25, 30, 35], ... 'Location': ['Seattle', 'New York', 'Kona']}, ... index=([10, 20, 30])) >>> df.index Index([10, 20, 30], dtype='int64')

Medium

medium.com › @filip.sekan › 3-ways-how-to-update-data-type-of-columns-in-pandas-97ddb5f32ae4

3 ways how to update data type of columns in Pandas | by Filip Sekan | Medium

February 3, 2023 - In this example, the data type of both the ‘age’ and ‘salary’ columns has been changed to ‘float64’. The pd.to_numeric() method is part of the Pandas library and is used to convert a column in a Pandas data frame from one data type ...

Favtutor

favtutor.com › articles › change-column-type-pandas

Change Column Type in Pandas (4 Methods with code)

December 6, 2023 - Original DataFrame: A B C 0 1 a 1.1 1 2 b 2.1 2 3 c 3.0 3 4 d 4.1 4 5 e 5.1 Type of original DataFrame: A object B object C object dtype: object Type of new DataFrame: A int64 B object C float64 dtype: object · The DataFrame.convert_dtypes() method ...

GeeksforGeeks

geeksforgeeks.org › pandas › convert-pandas-dataframe-column-to-float

Convert Pandas Dataframe Column To Float - GeeksforGeeks

July 23, 2025 - Data types before conversion: ... the data type of the string column is changed from object to float after using to_numeric() function....

scikit-learn

scikit-learn.org › stable › modules › generated › sklearn.tree.DecisionTreeClassifier.html

DecisionTreeClassifier — scikit-learn 1.8.0 documentation

Internally, it will be converted to dtype=np.float32 and if a sparse matrix is provided to a sparse csr_matrix. ... Allow to bypass several input checking. Don’t use this parameter unless you know what you’re doing. ... The predicted classes, or the predict values. ... Predict class log-probabilities of the input samples X.

1. to_numeric()

Basic usage

Error handling

Downcasting

2. astype()

Basic usage

Be careful

3. infer_objects()

4. convert_dtypes()

Videos

1. to_numeric()

Basic usage

Error handling

Downcasting

2. astype()

Basic usage

Be careful

3. infer_objects()

4. convert_dtypes()

1. `to_numeric()`

2. `astype()`

3. `infer_objects()`

4. `convert_dtypes()`

1. `to_numeric()`

2. `astype()`

3. `infer_objects()`

4. `convert_dtypes()`