convert xml to table in python

stackoverflow.com › questions › 45755513 › parse-xml-to-table-in-python

I've got the needed outcome using following script.

XML File:

<?xml version="1.0" encoding="UTF-8"?>
<base>
  <element1>element 1</element1>
  <element2>element 2</element2>
  <element3>
    <subElement3>subElement 3</subElement3>
  </element3>
</base>

Python code:

import pandas as pd
from lxml import etree

data = "C:/Path/test.xml"

tree = etree.parse(data)

lstKey = []
lstValue = []
for p in tree.iter() :
    lstKey.append(tree.getpath(p).replace("/",".")[1:])
    lstValue.append(p.text)

df = pd.DataFrame({'key' : lstKey, 'value' : lstValue})
df.sort_values('key')

Result:

Answer from Michal Hruška on Stack Overflow

Stack Overflow

stackoverflow.com › questions › 45755513 › parse-xml-to-table-in-python

Parse XML to Table in Python - Stack Overflow

Top answer

1 of 1

I've got the needed outcome using following script.

XML File:

<?xml version="1.0" encoding="UTF-8"?>
<base>
  <element1>element 1</element1>
  <element2>element 2</element2>
  <element3>
    <subElement3>subElement 3</subElement3>
  </element3>
</base>

Python code:

import pandas as pd
from lxml import etree

data = "C:/Path/test.xml"

tree = etree.parse(data)

lstKey = []
lstValue = []
for p in tree.iter() :
    lstKey.append(tree.getpath(p).replace("/",".")[1:])
    lstValue.append(p.text)

df = pd.DataFrame({'key' : lstKey, 'value' : lstValue})
df.sort_values('key')

Result:

Medium

medium.com › @robertopreste › from-xml-to-pandas-dataframes-9292980b1c1c

From XML to Pandas dataframes. How to parse XML files to obtain proper… | by Roberto Preste | Medium

August 25, 2019 - We can think of this structure as a pandas DataFrame in which each student represents an observation, with its name attribute being the main identifier and the sub-elements being other features of the observation. A tabular representation of these data would be like this: So we want to find a way to convert XML-structured data to a more functional table. Given the structure of XML files, we can represent them as a tree, and this is the approach used by the xml.etree.ElementTree Python module.

Discussions

python - How to convert XML to table? - Stack Overflow

How can I do the same thing using Python? TIA. ... I think you're looking for the Pandas library. See this answer: https://stackoverflow.com/a/45815394/10358386 ... That's somewhat helpful, but I would like to load some items, but not all, into a data frame. I'm really looking for something that gives me a little more control over which elements I'm loading into a data frame. Thanks. ... I know excel works perfect to convert xml to table ... More on stackoverflow.com

stackoverflow.com

April 11, 2025

python - read xml file, convert it to table (dataframe) - Stack Overflow

this the first time I am dealing with xml file, so I am very lost. I would appreciate any help. All I want is to read the file and convert it to regular table (dataframe). I have file with this str... More on stackoverflow.com

stackoverflow.com

Convert xml to excel/csv

Please help me in converting XML file into excel/csv. Thank you in advance. More on discuss.python.org

discuss.python.org

October 15, 2022

Parse xml and store into mysql table in python - Code Review Stack Exchange

I have written the code that retrieves the data from XML file using xml.etree. Is there any best way to store the data into database by parsing an XML file. My code: from xml.etree import Elemen... More on codereview.stackexchange.com

codereview.stackexchange.com

April 26, 2024

Videos

14:51

YouTube

Python Tutorial : Read/Extract data from XML - YouTube

How to convert an XML file to python pandas dataframe - reading ...

April 20, 2019

04:40

YouTube

XML formatter: XML to CSV Python - YouTube

saturncloud.io › blog › converting-xml-to-python-dataframe-a-comprehensive-guide

Converting XML to Python DataFrame: A Guide | Saturn Cloud Blog

November 15, 2023 - Converting XML to a Python DataFrame can be a bit tricky, but with the right approach, it becomes a straightforward task. This guide has shown you how to parse an XML file, extract the necessary data, and convert it into a DataFrame using pandas.

Stack Overflow

stackoverflow.com › questions › 53541596 › how-to-convert-xml-to-table › 53543695

python - How to convert XML to table? - Stack Overflow

Top answer

1 of 1

Sould print this:

# importing csv module 
import csv 

# csv file name 
filename = "aapl.csv"

# initializing the titles and rows list 
fields = [] 
rows = [] 

# reading csv file 
with open(filename, 'r') as csvfile: 
    # creating a csv reader object 
    csvreader = csv.reader(csvfile) 

    # extracting field names through first row 
    fields = csvreader.next() 

    # extracting each data row one by one 
    for row in csvreader: 
        rows.append(row) 

    # get total number of rows 
    print("Total no. of rows: %d"%(csvreader.line_num)) 

# printing the field names 
print('Field names are:' + ', '.join(field for field in fields)) 

# printing first 5 rows 
print('\nFirst 5 rows are:\n') 
for row in rows[:5]: 
    # parsing each column of a row 
    for col in row: 
        print("%10s"%col), 
    print('\n')

Source: https://www.geeksforgeeks.org/working-csv-files-python/

Python Forum

python-forum.io › thread-23650.html

Convert XML into DB table

January 10, 2020 - Hi, I am new to Python and also new to this forum. Please forgive and correct me if you see any posting rules are violated in this post. I am using Python version 3.8 I have a big XML file which contains the information about patient, service prov...

GeeksforGeeks

geeksforgeeks.org › python › convert-xml-structure-to-dataframe-using-beautifulsoup-python

Convert XML structure to DataFrame using BeautifulSoup - Python - GeeksforGeeks

March 21, 2024 - Now we have extracted the data from the XML file using the BeautifulSoup into the DataFrame and it is stored as ‘df’. To see the DataFrame we use the print statement to print it. ... # Python program to convert xml # structure into dataframes using beautifulsoup # Import libraries from bs4 import BeautifulSoup import pandas as pd # Open XML file file = open("gfg.xml", 'r') # Read the contents of that file contents = file.read() soup = BeautifulSoup(contents, 'xml') # Extracting the data authors = soup.find_all('author') titles = soup.find_all('title') prices = soup.find_all('price') pubdat

Stack Overflow

stackoverflow.com › questions › 73377561 › read-xml-file-convert-it-to-table-dataframe

python - read xml file, convert it to table (dataframe) - Stack Overflow

Top answer

1 of 4

Given the two levels of nodes that cover the Coluna attributes, consider XSLT, the special-purpose language designed to transform or style original XML files. Python's lxml can run XSLT 1.0 scripts and being the default parse to pandas.read_xml can transform your raw XML into a flatter version to parse to DataFrame.

XSLT (save as .xsl file, a special .xml file)

<xsl:stylesheet version="1.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
                              xmlns:pace='http://www.ms.com/pace'>
    <xsl:output method="xml" omit-xml-declaration="no" indent="yes"/>
    <xsl:strip-space elements="*"/>

    <!-- REDESIGN XML TO ONLY RETURN AnaliseDiaria NODES -->
    <xsl:template match="/*">
     <xsl:copy>
       <xsl:apply-templates select="descendant::pace:AnaliseDiaria"/>
     </xsl:copy>
    </xsl:template>
    
    <!-- REDESIGN AnaliseDiaria NODES -->
    <xsl:template match="pace:AnaliseDiaria">
     <xsl:copy>
       <!-- BRING DOWN Produto ATTRIBUTES WITH CURRENT ATTRIBUTES -->
       <xsl:copy-of select="ancestor::pace:Produto/@*|@*"/>
     </xsl:copy>
    </xsl:template>
    
</xsl:stylesheet>

Online Demo

Python

analise_diaria_df = pd.read("input.xml", stylesheet="style.xsl")

analise_diaria_df 
#        Coluna1   Coluna2  Coluna3  ...    Coluna14  Coluna15   Coluna16
# 0    21-851611  CAMIO VO      NaN  ...         NaN       NaN        NaN
# 1   21-3667984    SCA4X2     -1.0  ...         NaN       NaN        NaN
# 2   21-3667994    SCA963     -1.0  ...         NaN       NaN        NaN
# 3   21-3676543    SCA713     -1.0  ...         NaN       NaN        NaN
# 4   21-3676601     SCA97     -1.0  ...         NaN       NaN        NaN
# 5   21-3814014    CAMIX2      NaN  ...         NaN       NaN        NaN
# 6   21-3814087     SCA56      NaN  ...         NaN       NaN        NaN
# 7   21-3814087     SCA56      NaN  ...  195.000,00       NF9  10203910A
# 8   21-3814087     SCA56      NaN  ...  195.090,00       NaN        NaN
# 9   21-3814087     SCA56      NaN  ...  195.270,00       NaN        NaN
# 10  21-3814087     SCA56      NaN  ...  195.482,60       NaN        NaN
# 11  21-3814087     SCA56      NaN  ...  195.627,80       NaN        NaN
# 12  21-3814087     SCA56      NaN  ...  204.529,82       NaN        NaN
# 13  21-3814087     SCA56      NaN  ...         NaN       NaN     158PES

2 of 4

Fortunately, in the case of your xml in the question, you can use the pandas read_xml() method, although you'll have to skirt around the namespaces issue:

import pandas as pd
pd.read_xml(file.xml,xpath='//*[local-name()="Linha"]//*[local-name()="Produto"]')

Output:

    Coluna1        Coluna2    Coluna3     Coluna4   Coluna5     {http://www.ms.com/pace}AnaliseDiaria
0   21-851611   CAMIO VO    NaN     NaN     NaN     NaN
1   21-3667984  SCA4X2  -1.0    NaN     NaN     NaN
2   21-3667994  SCA963  -1.0    NaN     NaN     NaN

etc. If you are not interested in one column or anothter, you can simply drop() it.

Find elsewhere

Google Bing Mojeek

Refitsmarthomes

refitsmarthomes.org › wp-content › uploads › 2017 › 06 › ConvertToTables.html

ConvertToTables

Foreign key fields are used to relate an object to its parent. The tables are saved in csv format. ... def create_tables(element,tables,FK_column,FK_value): """ A recursive function which travels through 'element' and its children. During the recursion, the 'tables' dictionary is populated with DataFrame tables - one for each element type """ name=element.tag.split('}')[1] if not name in tables.keys(): tables[name]=pd.DataFrame()#columns=attributes) d={} if not FK_column is None: d[FK_column]=FK_value for a in element.attrib.keys(): d[a]=element.get(a) tables[name]=tables[name].append(d,ignore_index=True) tables[name].index.name=name+'_PK' for child in element: create_tables(child,tables,name+'FK',tables[name].index.max()) tables={} create_tables(root,tables,None,None) for name in tables.keys(): tables[name].to_csv(name+'.csv')

CopyProgramming

copyprogramming.com › howto › parse-xml-to-table-in-python

Python: Python Code to Convert XML into a Table

August 2, 2023 - Converting XML into a Table: A Guide, Converting XML into Excel tables, Python Element Tree: Use XML to Generate Sub Tables in Excel Sheet

Python.org

discuss.python.org › python help

Convert xml to excel/csv - Python Help - Discussions on Python.org

October 15, 2022 - Please help me in converting XML file into excel/csv. Thank you in advance.

GitHub

github.com › knadh › xmlutils.py

GitHub - knadh/xmlutils.py: Python scripts for processing XML documents and converting to SQL, CSV, and JSON [UNMAINTAINED]

January 7, 2020 - Convert an XML document to an SQL file. xml2sql --input "samples/fruits.xml" --output "samples/fruits.sql" --tag "item" --table "myfruits"

Starred by 255 users

Forked by 141 users

Languages Python 100.0% | Python 100.0%

PyPI

pypi.org › project › sql-xml-table

sql-xml-table

January 6, 2026 - JavaScript is disabled in your browser · Please enable JavaScript to proceed · A required part of this site couldn’t load. This may be due to a browser extension, network issues, or browser settings. Please check your connection, disable any ad blockers, or try using a different browser

Sonra

sonra.io › home › xml › xml conversion using python in 2025

XML Conversion Using Python in 2026 - Sonra

January 14, 2025 - Basic usage includes importing Pandas and using the read_xml function to convert XML into a structured DataFrame. ... Pros: Integrates XML data seamlessly into data analysis workflows, easy conversion to tabular format.

Table Convert

tableconvert.com › home › convert xml to pandas dataframe online

Convert XML to Pandas DataFrame Online - Table Convert

November 30, 2018 - Generated code can be directly executed in Python environment for data analysis and processing. Extract tables from any website with one click. Convert to 30+ formats including Excel, CSV, JSON instantly - no copy-pasting required. Converting XML to PandasDataFrame?

Stack Exchange

codereview.stackexchange.com › questions › 193628 › parse-xml-and-store-into-mysql-table-in-python

Parse xml and store into mysql table in python - Code Review Stack Exchange

April 26, 2024 - Just convert the result of zip to a list · Here's a much shorter version of your code. If you add more tags, you just have one item to add in the tag list & in the query now: from xml.etree import ElementTree import mysql.connector dom = ElementTree.parse('profile.xml') # compose the argument list in one line, drop the big copied/pasted block args_list = ([t.text for t in dom.iter(tag)] for tag in ['TICKER','NAME','ADDRESS','PHONE','WEBSITE','SECTOR','INDUSTRY','FULL_TIME','BUS_SUMM']) # db connection code is unchanged db = mysql.connector.Connect(host = 'localhost', user = 'root', password =

GeeksforGeeks

geeksforgeeks.org › parsing-tables-and-xml-with-beautifulsoup

Parsing tables and XML with BeautifulSoup - GeeksforGeeks

January 12, 2024 - Scraping is a very essential skill that everybody should learn, It helps us to scrap data from a website or a file that can be used in another beautiful manner by the programmer. In this article, we will learn how to extract tables with beautiful soup and XML from a file. Here, we will scrap data using the Beautiful Soup Python Module.

GeeksforGeeks

geeksforgeeks.org › python › convert-xml-to-csv-in-python

Convert XML to CSV in Python - GeeksforGeeks

July 23, 2025 - We used ElementTree to parse and navigate through the XML structure. Data from each record was collected into a list of dictionaries. Finally, we used pandas to create a CSV file from that structured data. To learn about the pandas module in depth, refer to: Python Pandas Tutorial

Stack Overflow

stackoverflow.com › questions › 59635922 › how-to-convert-table-into-a-specific-xml-format-using-python

How to convert table into a specific XML format using Python? - Stack Overflow

Please take a look at the python standard library xml: https://docs.python.org/3/library/xml.etree.elementtree.html#xml.etree.ElementTree.ElementTree ... Sign up to request clarification or add additional context in comments.

Stack Overflow

stackoverflow.com › questions › 28259301 › how-to-convert-an-xml-file-to-nice-pandas-dataframe

python - How to convert an XML file to nice pandas dataframe? - Stack Overflow

Top answer

1 of 5

You can easily use xml (from the Python standard library) to convert to a pandas.DataFrame. Here's what I would do (when reading from a file replace xml_data with the name of your file or file object):

import pandas as pd
import xml.etree.ElementTree as ET
import io

def iter_docs(author):
    author_attr = author.attrib
    for doc in author.iter('document'):
        doc_dict = author_attr.copy()
        doc_dict.update(doc.attrib)
        doc_dict['data'] = doc.text
        yield doc_dict

xml_data = io.StringIO(u'''YOUR XML STRING HERE''')

etree = ET.parse(xml_data) #create an ElementTree object 
doc_df = pd.DataFrame(list(iter_docs(etree.getroot())))

If there are multiple authors in your original document or the root of your XML is not an author, then I would add the following generator:

def iter_author(etree):
    for author in etree.iter('author'):
        for row in iter_docs(author):
            yield row

and change doc_df = pd.DataFrame(list(iter_docs(etree.getroot()))) to doc_df = pd.DataFrame(list(iter_author(etree)))

Have a look at the ElementTree tutorial provided in the xml library documentation.

2 of 5

As of v1.3, you can simply use:

pandas.read_xml(path_or_file)

Python

docs.python.org › 3 › library › xml.etree.elementtree.html

xml.etree.ElementTree — The ElementTree XML API

January 29, 2026 - Source code: Lib/xml/etree/ElementTree.py The xml.etree.ElementTree module implements a simple and efficient API for parsing and creating XML data. Tutorial: This is a short tutorial for using xml....