python xml get specific element

How to find specific elements in XML using ElementTree

stackoverflow.com › questions › 16419673 › how-to-find-specific-elements-in-xml-using-elementtree

The namespace of an XML document is significant. ElementTree requires tags to be fully qualified to find the right element. Here's an example of three elements with the same tag in different namespaces:

data = '''\
<root xmlns="xyz" xmlns:name="abc">
  <object name="one" />
  <name:object name="two" />
  <object xmlns="def" name="three" />
</root>
'''

Here are the elements that ElementTree sees:

>>> from xml.etree import ElementTree as et
>>> tree = et.fromstring(data)
>>> print(tree.findall('.//*'))
[<Element '{xyz}object' at 0x0000000003B07BD8>,
 <Element '{abc}object' at 0x0000000003B07C28>,
 <Element '{def}object' at 0x0000000003B07C78>]

So you have it right. Given the default namespace definition of:

<entry xmlns='http://www.w3.org/2005/Atom' ...

To access the 'title' tag, which uses the default namespace:

media['title'] = e.findall('{http://www.w3.org/2005/Atom}title')

To access the 'media:group' tag, refer to the media namespace definition:

<entry ... xmlns:media='http://search.yahoo.com/mrss/' ...

And use:

e.findall('{http://search.yahoo.com/mrss/}group')
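The fully qualified form above gets verbose quickly. As an alternative, `find` and `findall` accept a prefix-to-URI mapping as their second argument. The following is a minimal sketch; the sample entry and the `ns` dictionary are made up for illustration and are not part of the original feed.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed-down Atom entry used only for illustration.
entry_xml = """\
<entry xmlns="http://www.w3.org/2005/Atom"
       xmlns:media="http://search.yahoo.com/mrss/">
  <title>Sample video</title>
  <media:group>
    <media:title>Sample video (media)</media:title>
  </media:group>
</entry>
"""

e = ET.fromstring(entry_xml)

# The prefixes are local to this dict; they do not have to match
# the prefixes used inside the document.
ns = {
    "atom": "http://www.w3.org/2005/Atom",
    "media": "http://search.yahoo.com/mrss/",
}

title = e.find("atom:title", ns)
group = e.find("media:group", ns)
media_title = group.find("media:title", ns)

print(title.text)
print(media_title.text)
```

Only the URI matters for matching, so the `atom` prefix here finds elements that the document declares through its default namespace.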

Note the different ways a namespace can be specified:

<root xmlns="xyz" xmlns:name="abc">   # default namespace and
                                      # 'abc' namespace with id 'name'.
  <object name="one" />               # Uses default namespace 'xyz'.
  <name:object name="two" />          # uses 'abc' namespace (specified by id).
  <object xmlns="def" name="three" /> # change the default namespace to 'def'.
</root>

To read a specific tag from a specific namespace:

>>> print(tree.find('{abc}object').attrib['name'])
two
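When the namespace is not known in advance, a wildcard can stand in for it. This is a small sketch that assumes Python 3.8 or later, where ElementTree accepts `{*}tag` and `{namespace}*` in search paths; it reuses the same sample data as above.

```python
import xml.etree.ElementTree as ET

data = """\
<root xmlns="xyz" xmlns:name="abc">
  <object name="one" />
  <name:object name="two" />
  <object xmlns="def" name="three" />
</root>
"""
tree = ET.fromstring(data)

# '{*}object' matches 'object' elements in any namespace (Python 3.8+).
all_objects = [el.get("name") for el in tree.findall("{*}object")]

# '{abc}*' matches any tag that lives in the 'abc' namespace.
abc_only = [el.get("name") for el in tree.findall("{abc}*")]

print(all_objects)
print(abc_only)
```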

Note the namespace IDs are just shortcuts. Here's what happens when you dump the parsed XML tree. ElementTree doesn't bother to save the original namespace IDs and generates its own in the format ns#:

>>> et.dump(tree)
<ns0:root xmlns:ns0="xyz" xmlns:ns1="abc" xmlns:ns2="def">
  <ns0:object name="one" />
  <ns1:object name="two" />
  <ns2:object name="three" />
</ns0:root>

If you want specific shortcuts defined, use `register_namespace`:

>>> et.register_namespace('','xyz') # default namespace
>>> et.register_namespace('name','abc')
>>> et.register_namespace('custom','def')
>>> et.dump(tree)
<root xmlns="xyz" xmlns:custom="def" xmlns:name="abc">
  <object name="one" />
  <name:object name="two" />
  <custom:object name="three" />
</root>

Answer from Mark Tolonen on Stack Overflow

Stack Overflow

stackoverflow.com › questions › 16419673 › how-to-find-specific-elements-in-xml-using-elementtree

python - How to find specific elements in XML using ElementTree - Stack Overflow

Top answer

1 of 2

2 of 2

I tried the following approach using xml.dom.minidom, in case it helps.

#!/usr/bin/python3

from xml.dom.minidom import parseString
from urllib.request import urlopen
import re

def get_video_id(video_url):
    # Everything after "watch?v=" is treated as the video ID.
    return re.search(r'watch\?v=(.*)', video_url).group(1)

def get_video_feed(video_url):
    # Feed endpoint as given in the original answer.
    video_feed = "http://gdata.youtube.com/feeds/api/videos/" + get_video_id(video_url)
    print(video_feed)
    return urlopen(video_feed).read()

def get_media_info(video_url):
    content = get_video_feed(video_url)
    dom = parseString(content)
    media = {}

    # Text of the first <title> element in the document.
    media['title'] = dom.getElementsByTagName('title')[0].firstChild.nodeValue
    return media

def main():
    video_url = 'http://youtube.com/watch?v=q5sOLzEerwA'

    print(get_media_info(video_url))

if __name__ == '__main__':
    main()
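For a namespaced feed, minidom can also look elements up by namespace URI rather than by raw tag name. The sketch below runs offline against a made-up entry (the `feed` string is an assumption, not real feed output) and uses `getElementsByTagNameNS`.

```python
from xml.dom.minidom import parseString

# Hypothetical feed fragment; not taken from a real response.
feed = """\
<entry xmlns="http://www.w3.org/2005/Atom"
       xmlns:media="http://search.yahoo.com/mrss/">
  <title>Atom title</title>
  <media:group>
    <media:title>Media title</media:title>
  </media:group>
</entry>
"""

dom = parseString(feed)

# Look up by (namespace URI, local name) instead of the prefixed tag name.
atom_titles = dom.getElementsByTagNameNS("http://www.w3.org/2005/Atom", "title")
media_titles = dom.getElementsByTagNameNS("http://search.yahoo.com/mrss/", "title")

atom_title = atom_titles[0].firstChild.nodeValue
media_title = media_titles[0].firstChild.nodeValue

print(atom_title)
print(media_title)
```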

TutorialsPoint

tutorialspoint.com › How-to-get-specific-nodes-in-xml-file-in-Python

How to get specific nodes in xml file in Python?

In some XML files, elements contain additional information stored as attributes. These attributes can be used to identify and filter the elements we want, using the findall() method in Python. The following example gets a book by its id from ...
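A minimal sketch of that idea follows. The catalog XML, the ids and the variable names are invented for illustration and are not the tutorial's own example.

```python
import xml.etree.ElementTree as ET

# Hypothetical catalog; structure and ids are assumptions.
catalog_xml = """\
<catalog>
  <book id="b1"><title>First</title></book>
  <book id="b2"><title>Second</title></book>
</catalog>
"""

root = ET.fromstring(catalog_xml)

# [@id='b2'] keeps only <book> elements whose id attribute equals 'b2'.
matches = root.findall("./book[@id='b2']")
titles = [book.findtext("title") for book in matches]

print(titles)
```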

Stack Overflow

stackoverflow.com › questions › 58622395 › get-element-value-from-xml-file-using-python

Get element value from xml file using Python - Stack Overflow

import xml.etree.ElementTree as ET

tree = ET.parse('csr1kv_file.xml')
root = tree.getroot()
ET.register_namespace("", "http://www.test.com/esc/esc")
for subnet in root.iter('address'):
    print(subnet)

Stack Overflow

stackoverflow.com › questions › 39091827 › python-getting-element-value-for-specific-element

xml - Python getting element value for specific element - Stack Overflow

Top answer

1 of 1

There is no Item_No_1 among the elements found by doc.findall('md/mi/ovalue/').

I think what you may be trying to do is get both lists:

items = [e.text for e in doc.findall('md/mi/it')]
values = [e.text for e in doc.findall('md/mi/ovalue/v')]

Then find the index of the string 'Item_No_1' from items, and then index into values with that number.

Alternatively, zip the two lists together and check when you find one element.

for item,value in zip(doc.findall('md/mi/it'), doc.findall('md/mi/ovalue/v')):
    if item.text == 'Item_No_1':
        print(value.text)

There might be a better way, but those are the first ways that come to mind
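One possible variation on the zip approach is to build a dictionary once and look items up by name. This is only a sketch: the XML below is a guess at the shape implied by the paths 'md/mi/it' and 'md/mi/ovalue/v', and the values are placeholders.

```python
import xml.etree.ElementTree as ET

# Hypothetical document matching the paths used in the answer.
xml_text = """\
<root>
  <md>
    <mi>
      <it>Item_No_1</it>
      <it>Item_No_2</it>
      <ovalue>
        <v>1111111111</v>
        <v>2222222222</v>
      </ovalue>
    </mi>
  </md>
</root>
"""

doc = ET.fromstring(xml_text)

# Pair each item name with the value at the same position.
lookup = {
    item.text: value.text
    for item, value in zip(doc.findall("md/mi/it"), doc.findall("md/mi/ovalue/v"))
}

print(lookup["Item_No_2"])
```

This relies on the names and values appearing in the same order, which is the same assumption the zip loop makes.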

freeCodeCamp

forum.freecodecamp.org › programming › python

Navigating and extracting XML in Python - Python - The freeCodeCamp Forum

February 8, 2023 - Hi, I’m trying to find a way to extract certain elements from the note data in this xml document: F 1 4 3 1 half I now need the data from the tied element, and whilst extracting the data from pitch, I had no problem, as I simply iterated through pitch as follows: for pitch in myroot.ite...

Rdegges

rdegges.com › 2013 › quickly-extract-xml-data-with-python

Randall Degges - Quickly Extract XML Data with Python

Load our XML document into memory, and construct an XML ElementTree object. We then use the find method, passing in an XPath selector, which allows us to specify what element we’re trying to extract.

Python

docs.python.org › 3 › library › xml.etree.elementtree.html

xml.etree.ElementTree — The ElementTree XML API

Their values are usually strings but may be any application-specific object. If the element is created from an XML file, the text attribute holds either the text between the element’s start tag and its first child or end tag, or None, and the tail attribute holds either the text between the element’s end tag and the next tag, or None.
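The difference between `text` and `tail` is easier to see on a tiny mixed-content element. A short sketch, using a made-up `<p>` fragment:

```python
import xml.etree.ElementTree as ET

# Mixed content: text, a child, more text, then an empty child.
p = ET.fromstring("<p>Hello <b>bold</b> world<i/></p>")
b = p.find("b")
i = p.find("i")

print(repr(p.text))  # text before the first child of <p>
print(repr(b.text))  # text inside <b>
print(repr(b.tail))  # text after </b>, up to the next tag
print(repr(i.text))  # empty element: no text
print(repr(i.tail))  # nothing between <i/> and </p>

# itertext() walks text and tail values in document order.
full_text = "".join(p.itertext())
print(full_text)
```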

Stack Overflow

stackoverflow.com › questions › 38309395 › xml-find-element-by-tag-using-python

Xml - Find Element By tag using Python - Stack Overflow

Top answer

1 of 1

Yes, the xml.etree package in the standard library provides functions for working with XML (it is also available in Python 2).

The one specifically you are looking for is findall.

For example:

import xml.etree.ElementTree as ET
tree = ET.fromstring(some_xml_data)
all_name_elements = tree.findall('.//name')

With:

In [1]: some_xml_data = "<help><person><name>dean</name></person></help>"

I get the following:

In [10]: tree.findall(".//name")
Out[10]: [<Element 'name' at 0x7ff921edd390>]
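For comparison, here is a small sketch of how `find`, `findall` and `iter` behave on the same kind of input. The second `<person>` and the `nickname` tag are additions made up for the example.

```python
import xml.etree.ElementTree as ET

some_xml_data = (
    "<help>"
    "<person><name>dean</name></person>"
    "<person><name>sam</name></person>"
    "</help>"
)
tree = ET.fromstring(some_xml_data)

# find() returns the first match, or None when nothing matches.
first = tree.find(".//name")
missing = tree.find(".//nickname")

# findall() returns a list of every match for the path.
all_names = [el.text for el in tree.findall(".//name")]

# iter() walks the whole subtree and filters by tag name.
via_iter = [el.text for el in tree.iter("name")]

print(first.text, missing, all_names, via_iter)
```

Checking the result of `find` against `None` before touching `.text` avoids an AttributeError when files differ in structure.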

AskPython

askpython.com › home › python xml parser

Python XML Parser - AskPython

July 7, 2022 - The syntax is similar to our xml module, so we’re still getting the attribute names using value = tag['attribute_name'] and text = tag.text. Exactly the same as before! ... We’ve now parsed this using bs4 too! If your source XML file is badly formatted, this method is the way to go since BeautifulSoup has different rules for handling such files. Hopefully, you’ve now got a good grasp on how to build a Python XML parser easily.

Stack Overflow

stackoverflow.com › questions › 25230674 › parse-a-specific-element-in-a-xml-file-using-python

parse a specific element in a xml file using Python - Stack Overflow

Top answer

1 of 1

The first thing you should test is what happens if you simplify your XPath:

>>> print(root.find(".//field"))
None

So, what's going on? You don't have any elements of type field. You've got an explicit namespace, which means you have elements of type '{http://www.librarything.com/}field'. You can see this pretty easily:

>>> print(list(root))
[<Element '{http://www.librarything.com/}item' at 0x1047580e8>]
>>> print(root.find(".//{http://www.librarything.com/}field"))
<Element '{http://www.librarything.com/}field' at 0x1047582c8>
>>> print(root.find(".//{http://www.librarything.com/}field[@type='5']"))
<Element '{http://www.librarything.com/}field' at 0x104758688>

If you want to know more, there are multiple questions on this site about how ETree deals with namespaces (from a quick search, 1 and 2 look relevant), and detailed information in the documentation; trying to explain it all in yet another answer would just lead to an inferior answer to the existing ones.

Zyte

zyte.com › home › resources › learn › a practical guide to python xml parsing

A Practical Guide to Python XML Parsing

May 15, 2025 - While basic parsing and modification techniques cover most use cases, more complex scenarios may require advanced techniques like XPath queries, handling XML namespaces, and mapping XML data to custom Python objects. XPath is a powerful query language for selecting nodes in an XML document based on various criteria. Here’s how to use XPath to find specific elements in an XML document:
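The guide's own example is not reproduced here. As a stand-in, the sketch below shows a few predicates from the XPath subset that the standard library's ElementTree supports; the `<shop>` document is invented for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical data; tag names are assumptions.
root = ET.fromstring(
    "<shop>"
    "<item><name>pen</name><price>2</price></item>"
    "<item><name>pad</name></item>"
    "<item><name>ink</name><price>5</price></item>"
    "</shop>"
)

# [2] selects by position (1-based) among matching siblings.
second = root.find("./item[2]/name").text

# [last()] selects the final matching sibling.
last = root.find("./item[last()]/name").text

# [price] keeps only items that have a <price> child.
priced = [item.findtext("name") for item in root.findall("./item[price]")]

# [name='ink'] keeps items whose <name> child has that exact text.
ink_price = root.find(".//item[name='ink']/price").text

print(second, last, priced, ink_price)
```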

Python.org

discuss.python.org › python help

Extract information from an xml file - Python Help - Discussions on Python.org

June 12, 2023 - I have an xml file looks like this <report> #root element <bindingsite id="1" has_interactions="True"> <interactions> #child elements <hydrophobic_interactions/> #the sub-child elements I want to get name …

reddit.com › r/learnprogramming › [python] elementtree - how to select certain attribute from one selected root.

r/learnprogramming on Reddit: [Python] Elementtree - how to select certain attribute from one selected root.

November 25, 2013 -

I am trying to understand how to use XML with Python, so I've created a structure to test: http://pastebin.com/x0cvRA8V, and I can't grasp how to get, let's say, the value of mindmg for the 'Mace' object.

I've started to read the documentation, but I can't work out how to list or get those attributes. I can list that the mace and the short spear have a name, but I cannot list that they have mindmg and maxdmg. Also, how do I select a root by a criterion?

Top answer

1 of 1

The simplest route would probably be to use the XPath support. Example:

import xml.etree.ElementTree as ET
root = ET.fromstring(xml_text)
print(root.find("item[@name='Mace']/attribute[@mindmg]").get('mindmg'))

The XPath query "item[@name='Mace']/attribute[@mindmg]" means: find an attribute element that has a mindmg attribute and is a child of an item element whose name attribute has the value 'Mace'. That returns an Element object, which has a get method for retrieving the attribute value.

Side note: it's very strange to have elements named attribute. That's technically fine, but usually when you say attribute you mean attribute, i.e.
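The pastebin structure is not shown, so the following is a self-contained sketch under an assumed layout: `<item name="...">` elements, each with an `<attribute>` child carrying `mindmg` and `maxdmg`. All names and numbers are placeholders.

```python
import xml.etree.ElementTree as ET

# Assumed structure; the real pastebin document may differ.
xml_text = """\
<items>
  <item name="Mace">
    <attribute mindmg="3" maxdmg="8" />
  </item>
  <item name="Short spear">
    <attribute mindmg="2" maxdmg="6" />
  </item>
</items>
"""

root = ET.fromstring(xml_text)

# Select the <attribute> child of the item named 'Mace'.
mace = root.find("item[@name='Mace']/attribute[@mindmg]")
mace_mindmg = mace.get("mindmg") if mace is not None else None

# .attrib lists every attribute on an element as a dict.
all_stats = {
    item.get("name"): dict(item.find("attribute").attrib)
    for item in root.findall("item")
}

print(mace_mindmg)
print(all_stats)
```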

Stack Overflow

stackoverflow.com › questions › 9797274 › find-xml-element-based-on-its-attribute-and-change-its-value

python - find xml element based on its attribute and change its value - Stack Overflow

Top answer

1 of 4

You can access the attribute value as this:

from xml.etree.ElementTree import XML, SubElement, tostring

text = """
<root>
    <phoneNumbers>
        <number topic="sys/phoneNumber/1" update="none" />
        <number topic="sys/phoneNumber/2" update="none" />
        <number topic="sys/phoneNumber/3" update="none" />
    </phoneNumbers>

    <gfenSMSnumbers>
        <number topic="sys2/SMSnumber/1" update="none" />
        <number topic="sys2/SMSnumber/2" update="none" />
    </gfenSMSnumbers>
</root>
"""

elem = XML(text)
for node in elem.find('phoneNumbers'):
    print(node.attrib['topic'])
    # Create sub elements
    if node.attrib['topic']=="sys/phoneNumber/1":
        tag = SubElement(node,'TagName')
        tag.attrib['attr'] = 'AttribValue'

# encoding='unicode' returns a str instead of bytes
print(tostring(elem, encoding='unicode'))

I forgot to mention: if your ElementTree version is 1.3 or later, you can use XPath:

elem.find('.//number[@topic="sys/phoneNumber/1"]')

http://effbot.org/zone/element-xpath.htm

or you can use this simple one:

for node in elem.findall('.//number'):
    if node.attrib['topic']=="sys/phoneNumber/1":
        tag = SubElement(node,'TagName')
        tag.attrib['attr'] = 'AttribValue'

2 of 4

For me, this ElementTree snippet worked to find an element by attribute:

import xml.etree.ElementTree as ET
tree = ET.parse('file.xml')
root = tree.getroot()


topic=root.find(".//*[@topic='sys/phoneNumber/1']").text
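Since the question also asks about changing the value, here is a brief sketch that combines the attribute lookup with `Element.set`. The new value 'done' is an arbitrary placeholder, and the XML is a shortened copy of the sample above.

```python
import xml.etree.ElementTree as ET

text = """\
<root>
    <phoneNumbers>
        <number topic="sys/phoneNumber/1" update="none" />
        <number topic="sys/phoneNumber/2" update="none" />
    </phoneNumbers>
</root>
"""

root = ET.fromstring(text)

# Locate the element by its 'topic' attribute, then overwrite 'update'.
node = root.find(".//number[@topic='sys/phoneNumber/1']")
if node is not None:
    node.set("update", "done")

updates = [n.get("update") for n in root.iter("number")]
serialized = ET.tostring(root, encoding="unicode")

print(updates)
print(serialized)
```

To persist the change to a file parsed with `ET.parse`, the tree's `write` method can be called afterwards.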

reddit.com › r/learnpython › find element whose attribute contains a value in xml.etree?

r/learnpython on Reddit: Find element whose attribute contains a value in xml.etree?

May 23, 2020 -

How would I go about finding all elements whose attribute 'id' contain a certain value?

e.g., the id's values vary and can be e.g. 'foo 1' 'foo 23' 'foo 9'

How do I go about getting all the elements whose id contains the word foo disregarding whichever number it is followed by?

Top answer

1 of 2

In full XPath you could do this with the contains() function (or starts-with() if the value must be at the start), but xml.etree has only "basic XPath support" and doesn't support these functions.

https://docs.python.org/3/library/xml.etree.elementtree.html#supported-xpath-syntax

It does support the [@attrib] syntax, though, which lets you select elements that have a specific attribute.

>>> print(open('id.xml').read(), end='')
I'm note 1 I'm note 2 I'm note 3
>>> import xml.etree.ElementTree as ET
>>> doc = ET.parse('id.xml')
>>> doc.findall('.//note[@id]')
[, , ]

So .//note[@id] means find all note tags that have an id attribute (regardless of its value).

Attributes are accessed from the .attrib dictionary.

>>> for tag in doc.findall('.//note[@id]'):
...     tag.attrib
...
{'id': 'foo 1'}
{'id': 'bar 1'}
{'id': 'foo 2'}

You could then use Python to grab the ones you want, e.g. using in or startswith on the id value.

Alternatively, you could install lxml and use lxml.etree, which supports all the XPath functions needed.
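The Python-side filtering step might look like the sketch below. The `<notes>` wrapper and the inline XML string are assumptions standing in for id.xml; the ids and note texts follow the session above.

```python
import xml.etree.ElementTree as ET

# Hypothetical stand-in for id.xml; the root tag name is an assumption.
xml_text = (
    "<notes>"
    '<note id="foo 1">I\'m note 1</note>'
    '<note id="bar 1">I\'m note 2</note>'
    '<note id="foo 2">I\'m note 3</note>'
    "</notes>"
)
doc = ET.fromstring(xml_text)

# XPath narrows to notes that have an id; Python checks the value.
foo_notes = [
    note for note in doc.findall(".//note[@id]")
    if note.get("id").startswith("foo")
]
foo_ids = [note.get("id") for note in foo_notes]

print(foo_ids)
```

Replacing `startswith("foo")` with `"foo" in note.get("id")` gives the contains-style match instead.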

2 of 2

In the meanwhile I found a work-around, but I'd still be interested to know... if anyone does have the answer.

Stack Overflow

stackoverflow.com › questions › 24612582 › how-to-get-value-from-xml-tag-in-python

How to get value from XML Tag in Python? - Stack Overflow

Top answer

1 of 4

The previous posters have the right of it. The etree documentation can be found here:

https://docs.python.org/2/library/xml.etree.elementtree.html#module-xml.etree.ElementTree

And can help you out. Here's a code sample that might do the trick (partially taken from the above link):

import xml.etree.ElementTree as ET
tree = ET.parse('your_file.xml')
root = tree.getroot()

for group in root.findall('group'):
  title = group.find('title')
  titlephrase = title.find('phrase').text
  for doc in group.findall('document'):
    refid = doc.get('refid')

Or if you want the ID stored in the group tag, you'd use id = group.get('id') instead of searching for all the refids.
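The loop above raises an AttributeError if a group has no title. A slightly more defensive sketch uses `findtext` with a default; the sample document, the ids and the '(untitled)' fallback are all made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical document; the second group deliberately has no <title>.
groups_xml = """\
<root>
  <group id="g1">
    <title><phrase>First group</phrase></title>
    <document refid="d1" />
    <document refid="d2" />
  </group>
  <group id="g2">
    <document refid="d3" />
  </group>
</root>
"""

root = ET.fromstring(groups_xml)

summary = {}
for group in root.findall("group"):
    # findtext returns the default instead of failing on a missing path.
    phrase = group.findtext("title/phrase", default="(untitled)")
    refids = [doc.get("refid") for doc in group.findall("document")]
    summary[group.get("id")] = (phrase, refids)

print(summary)
```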

2 of 4

Did you have a look at Python's XML etree parser? There are plenty of examples on the web.

DataCamp

datacamp.com › tutorial › python-xml-elementtree

Python XML Tutorial: Element Tree Parse & Read | DataCamp

December 10, 2024 - Parse and read XML data with Element Tree Python package. Learn how to use xml.etree.elementtree and explore your data through XML today!

Stack Overflow

stackoverflow.com › questions › 11351183 › how-to-get-xml-tag-value-in-python

How to get XML tag value in Python - Stack Overflow

Top answer

1 of 2

With lxml:

import lxml.etree
# xmlstr is your xml in a string
root = lxml.etree.fromstring(xmlstr)
textelem = root.find('result/field/value/text')
print(textelem.text)

Edit: But I imagine there could be more than one result...

import lxml.etree
# xmlstr is your xml in a string
root = lxml.etree.fromstring(xmlstr)
results = root.findall('result')
textnumbers = [r.find('field/value/text').text for r in results]

2 of 2

BeautifulSoup is the simplest way to parse XML as far as I know.

Assuming you have read the introduction, simply use:

from bs4 import BeautifulSoup

soup = BeautifulSoup('your_XML_string')
print(soup.find('text').string)