pandas to_sql dtype example

stackoverflow.com › questions › 34383000 › pandas-to-sql-all-columns-as-nvarchar

To use dtype, pass a dictionary keyed to each data frame column with corresponding sqlalchemy types. Change keys to actual data frame column names:

import sqlalchemy
import pandas as pd
...

column_errors.to_sql('load_errors',push_conn, 
                      if_exists = 'append', 
                      index = False, 
                      dtype={'datefld': sqlalchemy.DateTime(), 
                             'intfld':  sqlalchemy.types.INTEGER(),
                             'strfld': sqlalchemy.types.NVARCHAR(length=255)
                             'floatfld': sqlalchemy.types.Float(precision=3, asdecimal=True)
                             'booleanfld': sqlalchemy.types.Boolean})

You may even be able to dynamically create this dtype dictionary given you do not know column names or types beforehand:

def sqlcol(dfparam):    
    
    dtypedict = {}
    for i,j in zip(dfparam.columns, dfparam.dtypes):
        if "object" in str(j):
            dtypedict.update({i: sqlalchemy.types.NVARCHAR(length=255)})
                                 
        if "datetime" in str(j):
            dtypedict.update({i: sqlalchemy.types.DateTime()})

        if "float" in str(j):
            dtypedict.update({i: sqlalchemy.types.Float(precision=3, asdecimal=True)})

        if "int" in str(j):
            dtypedict.update({i: sqlalchemy.types.INT()})

    return dtypedict

outputdict = sqlcol(df)    
column_errors.to_sql('load_errors', 
                     push_conn, 
                     if_exists = 'append', 
                     index = False, 
                     dtype = outputdict)

Answer from Parfait on Stack Overflow

Pandas

pandas.pydata.org › docs › reference › api › pandas.DataFrame.to_sql.html

pandas.DataFrame.to_sql — pandas 3.0.1 documentation

) ... stmt = stmt.on_duplicate_key_update(b=stmt.inserted.b, c=stmt.inserted.c) ... result = conn.execute(stmt) ... return result.rowcount >>> df_conflict.to_sql(name="conflict_table", con=conn, if_exists="append", # noqa: F821 ... method=insert_on_conflict_update) 2 · Specify the dtype (especially useful for integers with missing values). Notice that while pandas is forced to store the data as floating point, the database supports nullable integers.

Stack Overflow

stackoverflow.com › questions › 34383000 › pandas-to-sql-all-columns-as-nvarchar

python - pandas to_sql all columns as nvarchar - Stack Overflow

Top answer

1 of 4

To use dtype, pass a dictionary keyed to each data frame column with corresponding sqlalchemy types. Change keys to actual data frame column names:

import sqlalchemy
import pandas as pd
...

column_errors.to_sql('load_errors',push_conn, 
                      if_exists = 'append', 
                      index = False, 
                      dtype={'datefld': sqlalchemy.DateTime(), 
                             'intfld':  sqlalchemy.types.INTEGER(),
                             'strfld': sqlalchemy.types.NVARCHAR(length=255)
                             'floatfld': sqlalchemy.types.Float(precision=3, asdecimal=True)
                             'booleanfld': sqlalchemy.types.Boolean})

You may even be able to dynamically create this dtype dictionary given you do not know column names or types beforehand:

def sqlcol(dfparam):    
    
    dtypedict = {}
    for i,j in zip(dfparam.columns, dfparam.dtypes):
        if "object" in str(j):
            dtypedict.update({i: sqlalchemy.types.NVARCHAR(length=255)})
                                 
        if "datetime" in str(j):
            dtypedict.update({i: sqlalchemy.types.DateTime()})

        if "float" in str(j):
            dtypedict.update({i: sqlalchemy.types.Float(precision=3, asdecimal=True)})

        if "int" in str(j):
            dtypedict.update({i: sqlalchemy.types.INT()})

    return dtypedict

outputdict = sqlcol(df)    
column_errors.to_sql('load_errors', 
                     push_conn, 
                     if_exists = 'append', 
                     index = False, 
                     dtype = outputdict)

2 of 4

You can create this dict dynamically if you do not know the column names in advance:

from sqlalchemy.types import NVARCHAR
df.to_sql(...., dtype={col_name: NVARCHAR for col_name in df})

Note that you have to pass the sqlalchemy type object itself (or an instance to specify parameters like NVARCHAR(length=10)) and not a string as in your example.

HackerNoon

hackernoon.com › python-setting-data-types-when-using-to_sql

Python: Setting Data Types When Using 'to_sql' | HackerNoon

January 17, 2024 - The following is a basic code snippet to save a DataFrame to an Oracle database using SQLAlchemy and pandas.

Medium

medium.com › @erkansirin › worst-way-to-write-pandas-dataframe-to-database-445ec62025e0

Worst Way to Write Pandas Dataframe to Database | by Erkan Şirin | Medium

December 27, 2022 - By specifying the schema as we write. Now let’s create a schema. This schema is given as the dtype argument to the to_sql method and this argument type is a dictionary. In this dictionary, keys represent column names and values represent data types. So what about data types? mysql or pandas?

AskPython

askpython.com › home › pandas to_sql(): write records from a dataframe to a sql database

Pandas to_sql(): Write records from a DataFrame to a SQL Database - AskPython

January 31, 2023 - In this tutorial, you learned about the Pandas to_sql() function that enables you to write records from a data frame to a SQL database. You saw the syntax of the function and also a step-by-step example of its implementation.

Pandas

pandas.pydata.org › pandas-docs › version › 1.1 › reference › api › pandas.DataFrame.to_sql.html

pandas.DataFrame.to_sql — pandas 1.1.5 documentation

>>> from sqlalchemy.types import Integer >>> df.to_sql('integers', con=engine, index=False, ... dtype={"A": Integer()}) >>> engine.execute("SELECT * FROM integers").fetchall() [(1,), (None,), (2,)] pandas.DataFrame.to_records pandas.DataFrame.to_stata

GitHub

github.com › pandas-dev › pandas › issues › 35347

BUG/FEATURE: to_sql data types not reflecting types accurately · Issue #35347 · pandas-dev/pandas

July 19, 2020 - Bigger tables. Presumably longer insert times. Potentially conversion issues (I haven't tried feeding to_sql() a uint64 .... ) If my understanding is correct that Pandas is relying on SQLAlchemy to do the type conversions, I guess this could be seen as either a Pandas issue or an SQLAlchemy issue:

Author danieldjewell

Benchanczh

benchanczh.github.io › post › pandas_to_sql

Writing pandas data frames to database using SQLAlchemy · Benjamin's Blog

A function that maps pandas data types to SQL data types will save us a lot of work. from sqlalchemy.types import VARCHAR, Float, INTEGER def auto_dtype(df): # SQLAlchemy data types mapping data_types_mapping = {} for k, v in zip(df.columns, df.dtypes): if v == 'object': data_types_mapping[k] = VARCHAR(int(df[k].str.len().max())) elif v == 'float64': data_types_mapping[k] = Float() elif v == 'int64': data_types_mapping[k] = INTEGER() return data_types_mapping df.to_sql( name='your_table_name', con=engine, if_exist='replace', # can be 'append' or 'fail' chuncksize=your_chunk_size, index=False, dtype=auto_dtype(df) )

Vultr Docs

docs.vultr.com › python › third-party › pandas › DataFrame › to_sql

Python Pandas DataFrame to_sql() - Save Data to SQL Table | Vultr Docs

December 30, 2024 - df.to_sql('daily_sales', con=engine, if_exists='append', dtype={'Date': sqlalchemy.types.DATE, 'Product_ID': sqlalchemy.types.Integer(), 'Sold_Units': sqlalchemy.types.Integer()}) Explain Code · Specifying the column data types directly in the to_sql() function helps ensure compatibility with SQL data types, avoiding common pitfalls like type mismatches. The to_sql() function in pandas is an essential tool for developers and analysts dealing with data interplay between Python and SQL databases.

Find elsewhere

Google Bing Mojeek

CopyProgramming

copyprogramming.com › howto › pandas-to-sql-set-column-type

Python: Modifying column types when using Pandas to_sql

March 26, 2023 - This can be accomplished by utilizing the dtype argument, which allows for the specification of a column type (using sqlalchemy types) when inserting data into the database. The following code illustrates this: import pandas as pd from sqlalchemy import create_engine from sqlalchemy.types import Float # note this import to use sqlalchemy Float type engine = create_engine('postgresql://{}:{}@{}:5432/{}'.format(USER, DB_PW, HOST, DB)) df = pd.DataFrame({'String2Number': ['0,2', '', '0,0000001']}) # Replacing ',' to '.' df['String2Number'] = df['String2Number'].apply(lambda x: str(x).replace(',', '.')) # Set column type as SQLAlchemy Float df.to_sql( name='TABLE_NAME', con=engine, index=False, dtype={'String2Number': Float()} )

Spark Code Hub

sparkcodehub.com › pandas-dataframe-to-sql

Exporting Pandas DataFrame to SQL: A Comprehensive Guide

Learn how to export a Pandas DataFrame to SQL databases with this detailed guide Explore the tosql method configure data types handle special cases and build ETL pipelines with practical examples

CopyProgramming

copyprogramming.com › howto › python-pandas-to-sql-how-to-create-a-table-with-a-primary-key

Python: Creating a table with a primary key using Pandas to_sql in Python

June 1, 2023 - define function analoguous to pandas.io.SQLDatabase.to_sql() but with additional *kwargs argument which is passed to pandas.io.SQLTable object created inside it (i've just copied original to_sql() method and added *kwargs ): def to_sql_k(self, frame, name, if_exists='fail', index=True, index_label=None, schema=None, chunksize=None, dtype=None, **kwargs): if dtype is not None: from sqlalchemy.types import to_instance, TypeEngine for col, my_type in dtype.items(): if not isinstance(to_instance(my_type), TypeEngine): raise ValueError('The type of %s is not a SQLAlchemy ' 'type ' % col) table = pd.io.sql.SQLTable(name, self, frame=frame, index=index, if_exists=if_exists, index_label=index_label, schema=schema, dtype=dtype, **kwargs) table.create() table.insert(chunksize)

Spec-zone

spec-zone.ru › pandas~0.25 › reference › api › pandas.dataframe.to_sql

pandas 0.25 / DataFrame.to_sql() - Spec-Zone.ru

Для работы Spec-Zone.ru требуется JavaScript, включите в настройках вашего браузера

Tutorialink

python.tutorialink.com › pandas-to_sql-all-columns-as-nvarchar

pandas to_sql all columns as nvarchar – Python