I would recommend pandas' read_xml() and to_csv() functions, a 3-liner:
Compare the documentation: to_csv, read_xml
import pandas as pd
df = pd.read_xml('employee.xml')
df.to_csv('out.csv', index=False)
Output -> (CSV-file):
id,name,age,salary,division
303,varma,20,120000,3
304,Cyril,20,900000,3
305,Yojith,20,900000,3
Answer from Hermann12 on Stack Overflow
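If the records sit deeper in the tree than the root's children, read_xml's xpath parameter can select them. A minimal sketch with an inline document (the nesting and tag names here are hypothetical, not from the question):

```python
from io import StringIO

import pandas as pd

# Hypothetical document where <employee> records are nested under <staff>
xml = """<company>
  <staff>
    <employee><id>303</id><name>varma</name><age>20</age></employee>
    <employee><id>304</id><name>Cyril</name><age>20</age></employee>
  </staff>
</company>"""

# xpath selects the repeating record elements; parser="etree" sticks to the
# standard-library backend (the default "lxml" backend works the same way)
df = pd.read_xml(StringIO(xml), xpath=".//employee", parser="etree")
df.to_csv("out.csv", index=False)
```

Each matched `<employee>` becomes one row, with its child tags as columns.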
I recommend just using libraries because they're usually very optimised; I'll talk about that later. For now, here's a way that uses the xml.dom.minidom module, which is part of the Python standard library, so no additional libraries are required.
Edit: rewrote the last part using the standard csv library instead of manually writing the file, as suggested by a comment. That makes two Python built-in modules, not one. The original CSV-writing code is at the end of the reply, if you're interested.
from xml.dom import minidom
from csv import DictWriter
# Step 1: Read and parse the XML file
# Write it as a string, or open the file and read it
with open('employees.xml', 'r') as xml_file:
    xml_data = xml_file.read()
dom = minidom.parseString(xml_data)
employees = dom.getElementsByTagName('employee')

# Step 2: Extract the required information
data = []
for employee in employees:
    emp_data = {}
    for child in employee.childNodes:
        if child.nodeType == minidom.Node.ELEMENT_NODE:
            emp_data[child.tagName] = child.firstChild.data
    data.append(emp_data)

# Step 3: Write the extracted information to a CSV file
with open('output.csv', 'w', newline='') as csv_file:
    fieldnames = ['id', 'name', 'age', 'salary', 'division']
    writer = DictWriter(csv_file, fieldnames=fieldnames)
    writer.writeheader()
    for emp_data in data:
        writer.writerow(emp_data)
Don't reinvent the wheel, just realign it.
— Anthony J. D'Angelo, I think
I recommend NOT using this code in practice, though. You should really just use lxml: it's extremely simple and easy to use, and it can handle complex XML structures with nested elements and attributes. Let me know how everything goes!
Original CSV write code without CSV library
# Step 3: Write the extracted information to a CSV file
with open('output.csv', 'w') as f:
    f.write('id,name,age,salary,division\n')
    for emp_data in data:
        f.write(f"{emp_data['id']},{emp_data['name']},{emp_data['age']},{emp_data['salary']},{emp_data['division']}\n")
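To illustrate the lxml suggestion above: the stdlib xml.etree.ElementTree API largely mirrors lxml's etree, so the same extraction can be sketched as below (swap the import for `from lxml import etree as ET` to use lxml; the inline string stands in for employees.xml):

```python
import csv
import xml.etree.ElementTree as ET

# Inline stand-in for employees.xml, with the layout assumed above
xml = """<employees>
  <employee><id>303</id><name>varma</name><age>20</age><salary>120000</salary><division>3</division></employee>
  <employee><id>304</id><name>Cyril</name><age>20</age><salary>900000</salary><division>3</division></employee>
</employees>"""

root = ET.fromstring(xml)
# One dict per <employee>, keyed by child tag name
data = [{child.tag: child.text for child in emp} for emp in root.iter("employee")]

with open("output.csv", "w", newline="") as csv_file:
    writer = csv.DictWriter(csv_file, fieldnames=["id", "name", "age", "salary", "division"])
    writer.writeheader()
    writer.writerows(data)
```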
Using pandas and BeautifulSoup you can achieve your expected output easily:
Code:
import pandas as pd
import itertools
from bs4 import BeautifulSoup as b
with open("file.xml", "r") as f:  # opening xml file
    content = f.read()

soup = b(content, "lxml")
pkgeid = [values.text for values in soup.findAll("pkgeid")]
pkgname = [values.text for values in soup.findAll("pkgname")]
time = [values.text for values in soup.findAll("time")]
oper = [values.text for values in soup.findAll("oper")]

# For Python 3.x use the `zip_longest` method
# For Python 2.x use the `izip_longest` method
data = [item for item in itertools.zip_longest(time, oper, pkgeid, pkgname)]
df = pd.DataFrame(data=data)
df.to_csv("sample.csv", index=False, header=None)
Output in the `sample.csv` file will be as follows:
2015-09-16T04:13:20Z,Create_Product,10,BBCWRL
2015-09-16T04:13:20Z,Create_Product,18,CNNINT
2018-04-01T03:30:28Z,Deactivate_Dhct,,
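The zip_longest call above pads the shorter lists with None, which is why the last CSV row above ends in empty fields. A quick illustration:

```python
from itertools import zip_longest

time = ["t1", "t2", "t3"]
oper = ["Create", "Create"]  # one element shorter than time

rows = list(zip_longest(time, oper))
print(rows)  # [('t1', 'Create'), ('t2', 'Create'), ('t3', None)]
```

pandas then writes those None entries as empty strings when saving with to_csv.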
Using pandas, parsing all XML fields:
import xml.etree.ElementTree as ET
import pandas as pd
tree = ET.parse("file.xml")
root = tree.getroot()
get_range = lambda col: range(len(col))
l = [{r[i].tag:r[i].text for i in get_range(r)} for r in root]
df = pd.DataFrame.from_dict(l)
df.to_csv('file.csv')
This is a namespaced XML document. Therefore you need to address the nodes using their respective namespaces.
The namespaces used in the document are defined at the top:
xmlns:tc2="http://www.garmin.com/xmlschemas/TrainingCenterDatabase/v2"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xmlns:tp1="http://www.garmin.com/xmlschemas/TrackPointExtension/v1"
xmlns="http://www.topografix.com/GPX/1/1"
So the first namespace is mapped to the short form tc2, and would be used in an element like <tc2:foobar/>. The last one, which doesn't have a short form after the xmlns, is called the default namespace, and it applies to all elements in the document that don't explicitly use a namespace - so it applies to your <trkpt /> elements as well.
Therefore you would need to write root.iter('{http://www.topografix.com/GPX/1/1}trkpt') to select these elements.
In order to also get time and elevation, you can use trkpt.find() to access these elements below the trkpt node, and then element.text to retrieve those elements' text content (as opposed to attributes like lat and lon). Also, because the time and ele elements also use the default namespace you'll have to use the {namespace}element syntax again to select those nodes.
So you could use something like this:
import csv
import lxml.etree

NS = 'http://www.topografix.com/GPX/1/1'
header = ('lat', 'lon', 'ele', 'time')

with open('output.csv', 'w', newline='') as f:
    writer = csv.writer(f)
    writer.writerow(header)
    root = lxml.etree.fromstring(x)  # x is the GPX document as a string/bytes
    for trkpt in root.iter('{%s}trkpt' % NS):
        lat = trkpt.get('lat')
        lon = trkpt.get('lon')
        ele = trkpt.find('{%s}ele' % NS).text
        time = trkpt.find('{%s}time' % NS).text
        row = lat, lon, ele, time
        writer.writerow(row)
For more information on XML namespaces, see the Namespaces section in the lxml tutorial and the Wikipedia article on XML Namespaces. Also see GPS eXchange Format for some details on the .gpx format.
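If you use the stdlib xml.etree.ElementTree instead, find() and findall() accept a namespaces mapping, so you can write a prefix rather than spelling out the full URI each time. A sketch with a minimal GPX-like fragment (trimmed to a single track point):

```python
import xml.etree.ElementTree as ET

gpx = """<gpx xmlns="http://www.topografix.com/GPX/1/1">
  <trk><trkseg>
    <trkpt lat="45.4852855" lon="-122.6347885">
      <ele>0.0</ele><time>2013-12-03T21:08:56Z</time>
    </trkpt>
  </trkseg></trk>
</gpx>"""

# Map a prefix of our choosing to the document's default namespace URI
ns = {"gpx": "http://www.topografix.com/GPX/1/1"}
root = ET.fromstring(gpx)
points = [(pt.get("lat"), pt.get("lon"), pt.find("gpx:ele", ns).text)
          for pt in root.findall(".//gpx:trkpt", ns)]
print(points)  # [('45.4852855', '-122.6347885', '0.0')]
```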
Apologies for using already-made tools here, but this did the job with your data:
- Convert XML to JSON : http://convertjson.com/xml-to-json.htm
- Take that JSON and convert JSON to CSV : https://konklone.io/json/
It worked like a charm with your data.
ele,time,_lat,_lon
0.0000000,2013-12-03T21:08:56Z,45.4852855,-122.6347885
0.0000000,2013-12-03T21:09:00Z,45.4852961,-122.6347926
0.2000000,2013-12-03T21:09:01Z,45.4852982,-122.6347897
So for coding, I reckon XML > JSON > CSV may be a good approach. You may find the relevant scripts pointed to in those links.
Use csv.DictWriter, get values from the node.attrib dictionary.
Your elements named TrdCaptRpt have attributes; for such a node, its node.attrib holds a dictionary with a key/value pair for each attribute.
csv.DictWriter allows writing data taken from a dictionary.
First some imports (I always use lxml as it is very fast and provides extra features):
from lxml import etree
import csv
Configure file names and fields to use in each record:
xml_fname = "data.xml"
csv_fname = "data.csv"
fields = [
    "RptID", "TrdTyp", "TrdSubTyp", "ExecID", "TrdDt", "BizDt", "MLegRptTyp",
    "MtchStat", "MsgEvtSrc", "TrdID", "LastQty", "LastPx", "TxnTm", "SettlCcy",
    "SettlDt", "PxSubTyp", "VenueTyp", "VenuTyp", "OfstInst"]
Read the XML:
xml = etree.parse(xml_fname)
Iterate over elements "TrdCapRpt", write attribute values to CSV file:
with open(csv_fname, "w", newline="") as f:
    writer = csv.DictWriter(f, fields, delimiter=";", extrasaction="ignore")
    writer.writeheader()
    for node in xml.iter("TrdCaptRpt"):
        writer.writerow(node.attrib)
If you prefer using the stdlib xml.etree.ElementTree, you should manage easily as you do now, because node.attrib is present there too.
Reading from multiple element names
In your comments you noted that you want to export attributes from more element names. This is also possible. To do this, I will modify the example to use XPath (which will probably work only with lxml) and add an extra column "elm_name" to track which element each record was created from:
fields = [
    "elm_name",
    "RptID", "TrdTyp", "TrdSubTyp", "ExecID", "TrdDt", "BizDt", "MLegRptTyp",
    "MtchStat", "MsgEvtSrc", "TrdID", "LastQty", "LastPx", "TxnTm", "SettlCcy",
    "SettlDt", "PxSubTyp", "VenueTyp", "VenuTyp", "OfstInst",
    "Typ", "Amt", "Ccy"
]
xml = etree.parse(xml_fname)
with open(csv_fname, "w", newline="") as f:
    writer = csv.DictWriter(f, fields, delimiter=";", extrasaction="ignore")
    writer.writeheader()
    for node in xml.xpath("//*[self::TrdCaptRpt or self::PosRpt or self::Amt]"):
        atts = dict(node.attrib)
        atts["elm_name"] = node.tag
        writer.writerow(atts)
The modifications are:
- fields got an extra "elm_name" field, plus fields from the other elements (feel free to remove the ones you are not interested in).
- Elements are iterated using xml.xpath. The XPath expression is more complex, so I am not sure whether stdlib ElementTree supports it.
- Before writing each record, the name of the element is added to the atts dictionary.
Warning: the element Amt is nested inside PosRpt, and this tree structure cannot be represented in CSV. The records are written, but they do not carry information about where they come from (apart from following the record for the parent element).
You should first append each row, with all your tags, to a list.
for node in tree.iter('TrdCaptRpt'):
    .....
    my_list.append([RptID, TrdTyp, TrdSubTyp, TrdDt, BizDt,
                    MLegRptTyp, MtchStat, MsgEvtSrc, TrdID,
                    LastQty, LastPx, TxnTm, SettlCcy, SettlDt,
                    PxSubTyp, VenueTyp, VenuTyp, OfstInst])
Then write each line to the file:
with open('/Users/anantsangar/Desktop/output.csv', 'w') as csvfile:
    spamwriter = csv.writer(csvfile, delimiter=' ', quotechar='|', quoting=csv.QUOTE_MINIMAL)
    for row in my_list:
        spamwriter.writerow(row)
You probably don't need to go through ElementTree; you can feed the XML directly to pandas. If I understand you correctly, this should do it:
df = pd.read_xml(path_to_file, "//*[local-name()='MainVIP']")
df = df.iloc[:, :4]
df
Output from your xml above:
Date RegisteredDate Type TypeDescription
0 20210616 20210216 YMBA TYPE OF ENQUIRY
Without any external library, the code below generates a CSV file.
The idea is to collect the required element data from each MainVIP node and store it in a list of dicts, then loop over the list and write the data into a file.
import xml.etree.ElementTree as ET
xml = ''' <soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xmlns:xsd="http://www.w3.org/2001/XMLSchema">
<soap:Body>
<Level2 xmlns="https://xxxxxxxxxx/xxxxxxx">
<Level3>
<ResponseStatus>Success</ResponseStatus>
<ErrorMessage/>
<Message>20 alert(s) generated for this period</Message>
<ProcessingTimeSecs>0.88217689999999993</ProcessingTimeSecs>
<Something1>1</Something1>
<Something2/>
<Something3/>
<Something4/>
<VIP>
<MainVIP>
<Date>20210616</Date>
<RegisteredDate>20210216</RegisteredDate>
<Type>YMBA</Type>
<TypeDescription>TYPE OF ENQUIRY</TypeDescription>
<BusinessName>COMPANY NAME</BusinessName>
<ITNumber>987654321</ITNumber>
<RegistrationNumber>123456789</RegistrationNumber>
<SubscriberNumber>55889977</SubscriberNumber>
<SubscriberReference/>
<TicketNumber>1122336655</TicketNumber>
<SubscriberName>COMPANY NAME 2 </SubscriberName>
<CompletedDate>20210615</CompletedDate>
</MainVIP>
</VIP>
<Something5/>
<Something6/>
<Something7/>
<Something8/>
<Something9/>
<PrincipalSomething10/>
<PrincipalSomething11/>
<PrincipalSomething12/>
<PrincipalSomething13/>
<Something14/>
<Something15/>
<Something16/>
<Something17/>
<Something18/>
<PrincipalSomething19/>
<PrincipalSomething20/>
</Level3>
</Level2>
</soap:Body>
</soap:Envelope>'''
cols = ['Date', 'RegisteredDate', 'Type', 'TypeDescription']
rows = []
NS = '{https://xxxxxxxxxx/xxxxxxx}'
root = ET.fromstring(xml)
for vip in root.findall(f'.//{NS}MainVIP'):
    rows.append({c: vip.find(NS + c).text for c in cols})

with open('out.csv', 'w') as f:
    f.write(','.join(cols) + '\n')
    for row in rows:
        f.write(','.join(row[c] for c in cols) + '\n')
out.csv
Date,RegisteredDate,Type,TypeDescription
20210616,20210216,YMBA,TYPE OF ENQUIRY
The lxml library is capable of very powerful XML parsing, and can be used to iterate over an XML tree to search for specific elements.
from lxml import etree

with open(r'path/to/xml', 'r') as xml:
    text = xml.read()

tree = etree.fromstring(text)
row = ['', '']
for item in tree.iter('hw', 'def'):
    if item.tag == 'hw':
        row[0] = item.text
    elif item.tag == 'def':
        row[1] = item.text
        line = ','.join(row)
        with open(r'path/to/csv', 'a') as csv:
            csv.write(line + '\n')
How you build the CSV file is largely based upon preference, but I have provided a trivial example above. If there are multiple <dps-data> tags, you could extract those elements first (which can be done with the same tree.iter method shown above), and then apply the above logic to each of them.
EDIT: I should point out that this particular implementation reads the entire XML file into memory. If you are working with a single 150 MB file at a time, this should not be a problem, but it's just something to be aware of.
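For files too large to hold in memory, both lxml and the stdlib provide iterparse, which yields elements as they finish parsing and lets you discard them immediately. A sketch with xml.etree.ElementTree, using an in-memory stand-in for the big file (the <entry>/<hw>/<def> layout here is assumed, not taken from the question):

```python
import io
import xml.etree.ElementTree as ET

xml = b"""<entries>
  <entry><hw>alpha</hw><def>first letter</def></entry>
  <entry><hw>beta</hw><def>second letter</def></entry>
</entries>"""  # stand-in for a large file opened in binary mode

rows = []
# "end" events fire once an element (and all its children) is fully parsed
for event, elem in ET.iterparse(io.BytesIO(xml), events=("end",)):
    if elem.tag == "entry":
        rows.append((elem.findtext("hw"), elem.findtext("def")))
        elem.clear()  # free the element's children to keep memory flat
```

lxml's etree.iterparse behaves the same way, with the same event names.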
How about this:
from xml.dom import minidom
xmldoc = minidom.parse('your.xml')
hw_lst = xmldoc.getElementsByTagName('hw')
defu_lst = xmldoc.getElementsByTagName('def')
with open('your.csv', 'a') as out_file:
    for i in range(len(hw_lst)):
        out_file.write('{0}, {1}\n'.format(hw_lst[i].firstChild.data, defu_lst[i].firstChild.data))
There are various technologies for streamed processing of XML. One of them is XSLT 3.0, where you would write
<xsl:mode streamable="yes"/>
<xsl:output method="text"/>
<xsl:template match="row">
<xsl:value-of select="@Id, @UserId, @Name, @Class, @TagBased"
separator=","/>
<xsl:text>
</xsl:text>
</xsl:template>
I tried MySQL: I imported the XML data-set files into the database, then exported them to CSV format, and processed 82.2 GB of files in just 3 hours.

Use the code provided by Nk03 to convert the XML you're loading to a Python dictionary.
import xmltodict
d = xmltodict.parse("""
<D1>
<RECORD>
<ELEC>EL-13</ELEC>
<VAL>10</VAL>
<POWER>Max</POWER>
<WIRING>2.3</WIRING>
<ENABLED>Yes</ENABLED>
</RECORD>
<RECORD>
<ELEC>EL-14</ELEC>
<VAL>30</VAL>
<POWER>Max</POWER>
<WIRING>1.1</WIRING>
<ENABLED>Yes</ENABLED>
</RECORD>
</D1>
""")
From there, you can generate a list of keys to use as the column names for the DataFrame:
cols = []
for key in d['D1']['RECORD'][0].keys():  # keys of one record: ELEC, VAL, ...
    cols.append(key)
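Because xmltodict turns the repeated RECORD elements into a list of dicts, that list can also go straight into a DataFrame. A sketch where the literal dict stands in for xmltodict.parse's output on the document above:

```python
import pandas as pd

# Shape that xmltodict.parse produces for the <D1>/<RECORD> document above
d = {"D1": {"RECORD": [
    {"ELEC": "EL-13", "VAL": "10", "POWER": "Max", "WIRING": "2.3", "ENABLED": "Yes"},
    {"ELEC": "EL-14", "VAL": "30", "POWER": "Max", "WIRING": "1.1", "ENABLED": "Yes"},
]}}

df = pd.DataFrame(d["D1"]["RECORD"])  # one row per RECORD, columns from its keys
df.to_csv("out.csv", index=False)
```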
Here’s one way:
import xmltodict
d = xmltodict.parse("""
<D1>
<RECORD>
<ELEC>EL-13</ELEC>
<VAL>10</VAL>
<POWER>Max</POWER>
<WIRING>2.3</WIRING>
<ENABLED>Yes</ENABLED>
</RECORD>
<RECORD>
<ELEC>EL-14</ELEC>
<VAL>30</VAL>
<POWER>Max</POWER>
<WIRING>1.1</WIRING>
<ENABLED>Yes</ENABLED>
</RECORD>
</D1>
""")
pd.DataFrame(d).iloc[:,0].explode().apply(pd.Series).reset_index(drop=True).to_csv('out.csv')
# Alternative:
pd.json_normalize(d).stack().explode().apply(pd.Series)
Explanation:
- Convert the XML to a dict.
- Load the result into a DataFrame.
- Use explode to extract the values from the list of dicts into multiple rows.
- Apply pd.Series to generate the required columns from the dict.
- Save the output to CSV.
Updated Answer:
df1 = pd.json_normalize(d).stack().explode().apply(pd.Series)
pd.concat([df1.pop('DATA').apply(pd.Series), df1], axis=1)