how to compare two xml files having same data in different lines?

unix.stackexchange.com › questions › 64188 › how-to-compare-two-xml-files-having-same-data-in-different-lines

You can achieve what you want with the help of a small Python script (you'll need Python installed, as well as the lxml toolkit).

tagsort.py:

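#!/usr/bin/env python3

import sys
from lxml import etree

filename, tag = sys.argv[1:]

# Drop whitespace-only text nodes so pretty_print can re-indent cleanly.
doc = etree.parse(filename, etree.XMLParser(remove_blank_text=True))
root = doc.getroot()
# Reorder the root's children by the text of their <tag> child;
# elements without that child get an empty-string key and sort first.
root[:] = sorted(root, key=lambda el: el.findtext(tag) or "")
# encoding="unicode" returns str rather than bytes under Python 3.
print(etree.tostring(doc, pretty_print=True, encoding="unicode"))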

This script sorts the first-level elements under the XML document root by the content of a second-level element, sending the result to stdout. It's called like this:

$ python tagsort.py filename tag

Once you've got that, you can use process substitution to get a diff based on its output (I've added one element and changed another in your example files to show a non-empty result):

$ diff <(python tagsort.py file1 Id) <(python tagsort.py file2 Id)
4a5
>     <AddedTag>Something</AddedTag>
17c18
<     <Role>X</Role>
---
>     <Role>S</Role>

Answer from user27282 on Stack Exchange
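
For readers without lxml, the same sort-then-diff idea can be sketched with only the Python standard library. This is an illustrative variant, not the answer's script: the function name sorted_records and the two sample documents are assumptions made up for the example, and the sort key is compared as a string (so "10" sorts before "2").

```python
import difflib
import xml.etree.ElementTree as ET


def sorted_records(xml_text, tag):
    """Serialize the root's children, one string each, sorted by the text of child `tag`."""
    root = ET.fromstring(xml_text)
    records = sorted(root, key=lambda el: el.findtext(tag) or "")
    lines = []
    for record in records:
        # Drop whitespace-only text/tail so indentation cannot cause differences.
        for node in record.iter():
            if node.text is not None and not node.text.strip():
                node.text = None
            if node.tail is not None and not node.tail.strip():
                node.tail = None
        lines.append(ET.tostring(record, encoding="unicode"))
    return lines


# Hypothetical sample data: same records, different order, one changed <Role>.
file1 = "<Users><User><Id>2</Id><Role>X</Role></User><User><Id>1</Id><Role>A</Role></User></Users>"
file2 = """<Users>
  <User><Id>1</Id><Role>A</Role></User>
  <User><Id>2</Id><Role>S</Role></User>
</Users>"""

diff = list(difflib.unified_diff(sorted_records(file1, "Id"),
                                 sorted_records(file2, "Id"), lineterm=""))
print("\n".join(diff))
```

Because each first-level element is serialized onto a single line, the diff reports whole records rather than individual tags; pretty-printing each record first would give finer-grained output.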

SemanticDiff

semanticdiff.com › online-diff › xml

SemanticDiff - Compare XML Online

Use our free online XML diff to perform a semantic comparison between two XML documents. Get a glimpse of what SemanticDiff can do. ... If the XML data is difficult to read (e.g. contains no line breaks), click the Prettify buttons

Stack Exchange

unix.stackexchange.com › questions › 64188 › how-to-compare-two-xml-files-having-same-data-in-different-lines

bash - how to compare two xml files having same data in different lines? - Unix & Linux Stack Exchange

2 of 4

I had a similar problem and I eventually found: https://superuser.com/questions/79920/how-can-i-diff-two-xml-files

That post suggests doing a canonical XML sort and then doing a diff. The following should work for you if you are on Linux, Mac, or Windows with something like Cygwin installed:

$ xmllint --c14n File1.xml > 1.xml
$ xmllint --c14n File2.xml > 2.xml
$ diff 1.xml 2.xml

Discussions

diff - How to compare XML files - Stack Overflow

I have two XML files (XSD) which are generated by some tool. The tool doesn't preserve the order of elements so although the content is equal comparing it as text will result as the files are diffe... More on stackoverflow.com

stackoverflow.com

XML files comparison / diff?

Beyond compare software is perfect for this More on reddit.com

r/sysadmin

November 25, 2020

Best way to compare 2 XML documents in Java - Stack Overflow

I'm trying to write an automated test of an application that basically translates a custom message format into an XML message and sends it out the other end. I've got a good set of input/output me... More on stackoverflow.com

stackoverflow.com

Comparing xml files

xmldiff https://pypi.org/project/xmldiff/ If you need a good visual diff, you can download a free oXygen trial maybe. That's the best visual tool I've used. More on reddit.com

r/xml

April 15, 2024

Baeldung

baeldung.com › home › files › how to compare two files with the same content but on different lines

How to Compare Two Files With the Same Content but on Different Lines | Baeldung on Linux

December 12, 2023 - For the XML files, we used xsltproc with XSLT templates to sort the XML attributes, elements, and their child elements. Before comparing the sorted data with diff, we used xmllint to remove the white spaces and adjust the indentations.

Stack Overflow

stackoverflow.com › questions › 12176239 › how-to-compare-xml-files

diff - How to compare XML files - Stack Overflow

Top answer

1 of 2

I had a similar problem and I eventually found: http://superuser.com/questions/79920/how-can-i-diff-two-xml-files

That post suggests doing a canonical XML sort then doing a diff. The following should work for you if you are on Linux, Mac, or if you have Windows with something like Cygwin installed:

$ xmllint --c14n FileA.xml > 1.xml
$ xmllint --c14n FileB.xml > 2.xml
$ diff 1.xml 2.xml

2 of 2

For what it's worth, I have created a Java tool (or Kotlin, actually) for efficient and configurable canonicalization of XML files.

It will always:

Sort nodes and attributes by name.
Remove namespaces (yes - it could - hypothetically - be a problem).
Prettyprint the result.

In addition you can tell it to:

Remove a given list of node names - maybe you do not want to know that the value of a piece of metadata - say <RequestReceivedTimestamp> has changed.
Sort a given list of collections in the context of the parent - maybe you do not care that the order of <Contact> entries in <ListOfFavourites> has changed.

It uses XSLT and does all the above efficiently using chaining.
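
The behaviour listed above can be approximated in a few lines of Python. This is a rough sketch of the same idea, not the XMLNormalize tool itself (which uses XSLT): the names local and normalize and the drop parameter are hypothetical, and unlike the tool this sketch sorts every child list rather than only a configured set of collections.

```python
import xml.etree.ElementTree as ET


def local(name):
    """Drop a '{namespace}' prefix from a tag or attribute name."""
    return name.rsplit("}", 1)[-1]


def normalize(elem, drop=()):
    """Return a copy of elem without namespaces or dropped tags, with sorted attributes and children."""
    out = ET.Element(local(elem.tag))
    for key in sorted(elem.attrib, key=local):
        out.set(local(key), elem.attrib[key])
    out.text = (elem.text or "").strip() or None
    # Children are normalized first, so inner lists are sorted before outer ones.
    kids = [normalize(c, drop) for c in elem if local(c.tag) not in drop]
    kids.sort(key=lambda k: ET.tostring(k, encoding="unicode"))
    out.extend(kids)
    return out


# Hypothetical documents: different namespace, element order, and timestamp.
doc_a = ET.fromstring('<r xmlns="urn:x"><L><C>b</C><C>a</C></L><Ts>1</Ts></r>')
doc_b = ET.fromstring('<r><Ts>2</Ts><L><C>a</C><C>b</C></L></r>')

norm_a = ET.tostring(normalize(doc_a, drop=("Ts",)), encoding="unicode")
norm_b = ET.tostring(normalize(doc_b, drop=("Ts",)), encoding="unicode")
print(norm_a == norm_b)
```

Sorting siblings by their serialized form is a simple way to get a deterministic order, but it discards the original order everywhere, which is only appropriate when order carries no meaning in the documents being compared.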

Limitations

It does support sorting nested lists - sorting innermost lists before outer. But it cannot reliably sort arbitrary levels of recursively nested lists.

If you have such needs, you can, after running this tool, compare the sorted byte arrays of the results. They will be equal if only list-sorting issues remain.

Where to get it

You can get it here: XMLNormalize

Super User

superuser.com › questions › 79920 › how-can-i-diff-two-xml-files

linux - How can I diff two XML files? - Super User

Top answer

1 of 10

132

One approach would be to first turn both XML files into Canonical XML, and compare the results using diff. For example, xmllint can be used to canonicalize XML.

$ xmllint --c14n one.xml > 1.xml
$ xmllint --c14n two.xml > 2.xml
$ diff 1.xml 2.xml

Or as a one-liner.

$ diff <(xmllint --c14n one.xml) <(xmllint --c14n two.xml)
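
A comparable check can be sketched in Python with the standard library's xml.etree.ElementTree.canonicalize (Python 3.8+). Note the assumptions: that function implements C14N 2.0 rather than the version xmllint --c14n emits, strip_text=True goes beyond plain canonicalization, and the helper name canonical_equal is made up for this example.

```python
import xml.etree.ElementTree as ET


def canonical_equal(xml_a, xml_b):
    """Compare two XML strings after canonicalization."""
    # strip_text=True also removes whitespace around text content, which
    # hides indentation differences between the two inputs.
    return (ET.canonicalize(xml_a, strip_text=True)
            == ET.canonicalize(xml_b, strip_text=True))


# Attribute order and indentation are normalized away...
print(canonical_equal('<r><x b="1" a="2"/></r>', '<r>\n  <x a="2" b="1" />\n</r>'))
# ...but element order is not, so reordered siblings still compare as different.
print(canonical_equal('<r><a/><b/></r>', '<r><b/><a/></r>'))
```

The second call illustrates a limit of canonicalization for this use case: it does not reorder elements, so documents that differ only in element order need a separate sorting step before the diff.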

2 of 10

Jukka's answer did not work for me, but it did point to Canonical XML. Neither --c14n nor --c14n11 sorted the attributes, but I did find that the --exc-c14n switch sorted them. --exc-c14n is not listed in the man page, but is described on the command line as "W3C exclusive canonical format".

$ xmllint --exc-c14n one.xml > 1.xml
$ xmllint --exc-c14n two.xml > 2.xml
$ diff 1.xml 2.xml

$ xmllint | grep c14
    --c14n : save in W3C canonical format v1.0 (with comments)
    --c14n11 : save in W3C canonical format v1.1 (with comments)
    --exc-c14n : save in W3C exclusive canonical format (with comments)

$ rpm -qf /usr/bin/xmllint
libxml2-2.7.6-14.el6.x86_64
libxml2-2.7.6-14.el6.i686

$ cat /etc/system-release
CentOS release 6.5 (Final)

Warning: --exc-c14n strips out the XML header, whereas --c14n prepends the XML header if it is not there.

reddit.com › r/sysadmin › xml files comparison / diff?

r/sysadmin on Reddit: XML files comparison / diff?

November 25, 2020 -

I've got a few XML files that are a couple thousand lines long I need to compare and see which values in file A are missing from file B.

Does anyone have an efficient way to diff these? They're not formatted the same at all, so normal text diff programs won't work. I just need to compare which "<value203>...</value203>" aren't in the one file.
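
One way to answer "which values in file A are missing from file B" regardless of formatting is to parse both files and compare sets of leaf values. The following is a minimal sketch using the Python standard library; the function names and the sample documents are hypothetical, and it treats a value as a (tag, text) pair while ignoring where in the tree it occurs.

```python
import xml.etree.ElementTree as ET


def leaf_values(xml_text):
    """Collect (tag, text) pairs for every element that has no child elements."""
    root = ET.fromstring(xml_text)
    return {(el.tag, (el.text or "").strip()) for el in root.iter() if len(el) == 0}


def missing_from(a_text, b_text):
    """Return the leaf values present in document A but absent from document B."""
    return sorted(leaf_values(a_text) - leaf_values(b_text))


# Hypothetical sample data, formatted differently on purpose.
file_a = "<config><value203>x</value203><value7>y</value7></config>"
file_b = """<config>
    <value7>y</value7>
</config>"""

missing = missing_from(file_a, file_b)
print(missing)
```

Using sets means duplicates collapse and position is ignored; if the same tag can legitimately appear with the same value in different parents, the key would need to include the element's path.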

Top answer

1 of 7

Beyond compare software is perfect for this

2 of 7

On the phone and in bed, so if I'm wrong don't shoot me, but if it's valid XML, doesn't PowerShell have some XML parsing support? I'd try to load each file into a variable and then write it back out (possibly like this, don't quote me), and once they are formatted the "same", use WinMerge or PowerShell with Get-Content and Compare-Object:

1. Load each file and save it again with consistent formatting:

[xml]$file1 = Get-Content -Raw c:\myxml\xml1.xml
[xml]$file2 = Get-Content -Raw c:\myxml\xml2.xml
$file1.Save('c:\myxml\newxml1.xml')
$file2.Save('c:\myxml\newxml2.xml')

2. WinMerge the two files.

Or 3. Compare them in PowerShell:

Compare-Object -ReferenceObject (Get-Content c:\myxml\newxml1.xml) -DifferenceObject (Get-Content c:\myxml\newxml2.xml)

(The Compare-Object cmdlet can include the switch -IncludeEqual if you want to see the differences AND the equalities.)

ExtendsClass

extendsclass.com › xml-diff.html

XML diff - Compare xml online

Copy and paste, drag and drop an XML file, or type directly in the editors above, then click the "Compare" button; the two documents will be compared if both are valid XML. You can also click the "load XML from URL" button to load your XML data from a URL (must be https).

TextCompare

textcompare.org › xml

Online Xml Compare Tool

Find difference between 2 text files. Just input or paste original and modified text and click Compare button. Fast, Private & Unlimited.

Server Fault

serverfault.com › questions › 430671 › utility-to-logically-compare-two-xml-files

Utility to LOGICALLY compare two xml files? - Server Fault

Top answer

1 of 5

Two approaches that I use are (a) to canonicalize both XML files and then compare their serializations, and (b) to use the XPath 2.0 deep-equal() function. Both approaches are OK for telling you whether the files are the same, but not very good at telling you where they differ.

A commercial tool that specializes in this problem is DeltaXML.

If you have things that you consider equivalent, but which aren't equivalent at the XML level - for example, elements in a different order - then you may have to be prepared to do a transformation to normalize the documents before comparison.
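
The weakness noted above, that an equality check says whether documents differ but not where, can be softened with a small recursive comparison that reports the path of the first mismatch. This is a sketch in Python rather than XPath 2.0 deep-equal(); the function name first_difference is hypothetical, and the sketch ignores tail text (mixed content) and surrounding whitespace.

```python
import xml.etree.ElementTree as ET


def first_difference(a, b, path=""):
    """Return a description of the first mismatch between two elements, or None if equal."""
    path = f"{path}/{a.tag}"
    if a.tag != b.tag:
        return f"{path}: tag {a.tag!r} != {b.tag!r}"
    if a.attrib != b.attrib:
        return f"{path}: attributes differ"
    text_a, text_b = (a.text or "").strip(), (b.text or "").strip()
    if text_a != text_b:
        return f"{path}: text {text_a!r} != {text_b!r}"
    if len(a) != len(b):
        return f"{path}: {len(a)} children != {len(b)}"
    # Children are compared pairwise in document order.
    for child_a, child_b in zip(a, b):
        found = first_difference(child_a, child_b, path)
        if found:
            return found
    return None


# Hypothetical sample documents.
left = ET.fromstring('<r><a x="1">t</a></r>')
same = ET.fromstring('<r>\n  <a x="1">t</a>\n</r>')
other = ET.fromstring('<r><a x="1">u</a></r>')

print(first_difference(left, same))
print(first_difference(left, other))
```

Because children are compared in document order, elements that appear in a different order are reported as differences; a normalizing transformation would have to run first if such documents should count as equal.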

2 of 5

Good answer here:

Question: How can I diff two XML files? | Super User

Answer: How can I diff two XML files? | Super User

$ xmllint --format --exc-c14n one.xml > 1.xml
$ xmllint --format --exc-c14n two.xml > 2.xml
$ diff 1.xml 2.xml

Apologies for any failure to adhere to serverfault conventions ... I'm sure someone will let me know and I will amend appropriately.

Altova

altova.com › xmlspy-xml-editor › compare-xml

Compare XML Files | Altova

It's easy to compare XML files with the XML diff tools in XMLSpy and DiffDog. Whether you need to compare two XML files or merge XML from three documents, these tools make it easy with XML-aware diff functionality.

Xlcompare

xlcompare.com › compare-xml-files.html

Compare XML Files Online. Free. No Upload. Get the Semantic XML diffs.

October 3, 2009 - In our development process we use XML trees in most cases. In XML Compare you can choose the option that suits you best. If a text representation is more suitable for the data you have, you can use it in XML Compare.

CoreFiling

corefiling.com › home › open source tools › xml comparison tool

Free Online XML Comparison Tool

June 20, 2025 - CoreFiling XML Differences uses XML Pretty Printer and diff to display differences between XML files in an easy-to-read format.

Microsoft Community

techcommunity.microsoft.com › microsoft community hub › communities › products › powershell › windows powershell

How to compare two xml files and display the difference side by side. | Microsoft Community Hub

January 31, 2022 -

$baseServer="C:\Store\PS\referrencexml.xml"
$Server2Compare="C:\Store\PS\differencexml.xml"
$boutput = Compare-Object -ReferenceObject (Get-Content -Path...

Oxygen XML

oxygenxml.com › doc › ug-oxygen › topics › file-comparison.html

Compare Files Tool

If you are comparing XML documents using the XML Fast or XML Accurate algorithms, you can enter an XPath 2.0 expression in the Ignore nodes by XPath text field to ignore certain nodes from the comparison. The resulting comparison will show you differences between the two files. The line numbers on each side and colored marks on the right-side vertical stripe help you to quickly identify the locations of the differences.

Minifier

minifier.org › xml-compare

XML Compare - Online XML Diff Checker

Green (Insert/Change): Data or content that has been added or differs in the second file. Precisely compares XML files without lowering execution. Processes complex XML structures and determines deviations in tags, attributes, and values. Easily visualize and compare two XML files in parallel.

Blogger

javarevisited.blogspot.com › 2017 › 04 › how-to-compare-two-xml-files-in-java.html

How to compare two XML files in Java - XMLUnit Example

Now let's remove all newline characters from target.xml and convert it as one line XML file for comparison.

Online Text Compare

onlinetextcompare.com › xml

Compare XML files online

This tool lets you compare the differences between two XML queries.

Stack Overflow

stackoverflow.com › questions › 141993 › best-way-to-compare-2-xml-documents-in-java

Best way to compare 2 XML documents in Java - Stack Overflow

Top answer

1 of 16

218

Sounds like a job for XMLUnit

http://www.xmlunit.org/
https://github.com/xmlunit

Example:

public class SomeTest extends XMLTestCase {
  @Test
  public void test() throws Exception {
    String xml1 = ...
    String xml2 = ...

    XMLUnit.setIgnoreWhitespace(true); // ignore whitespace differences

    // can also compare xml Documents, InputSources, Readers, Diffs
    assertXMLEqual(xml1, xml2);  // assertXMLEqual comes from XMLTestCase
  }
}

2 of 16

The following will check if the documents are equal using standard JDK libraries.

DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
dbf.setNamespaceAware(true);
dbf.setCoalescing(true);
dbf.setIgnoringElementContentWhitespace(true);
dbf.setIgnoringComments(true);
DocumentBuilder db = dbf.newDocumentBuilder();

Document doc1 = db.parse(new File("file1.xml"));
doc1.normalizeDocument();

Document doc2 = db.parse(new File("file2.xml"));
doc2.normalizeDocument();

Assert.assertTrue(doc1.isEqualNode(doc2));

normalizeDocument() is there to make sure there are no cycles (there technically wouldn't be any)

The above code will require the white space to be the same within the elements, though, because it preserves and evaluates it. The standard XML parser that comes with Java does not allow you to set a feature to provide a canonical version or to understand xml:space. If that is going to be a problem, you may need a replacement XML parser such as Xerces, or use JDOM.

DevArt

blog.devart.com › home › how to › xml structure comparison explained

XML Structure Comparison Explained - Devart Blog

September 25, 2018 - In the example of .xaml files comparison provided above, XML element attributes were located in separate lines. This is what gave us the possibility to compare every attribute individually. Had the attributes been placed in a single line, they would have been compared as simple character sequences. The comparison algorithm tries to correlate XML elements on the basis of coincident attributes and child elements. But all attributes have the same importance during comparison.
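
The point about attributes on separate lines can be illustrated with a small Python sketch: rendering each attribute on its own line lets an ordinary line diff isolate the one attribute that changed. This is an illustration of the idea only, not Devart's algorithm; the function name attribute_lines and the sample Button elements are made up, and the rendering is meant for diffing rather than as valid XML output.

```python
import difflib
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape, quoteattr


def attribute_lines(elem, depth=0):
    """Render elem with one (sorted) attribute per line so a line diff isolates each attribute."""
    pad = "  " * depth
    lines = [f"{pad}<{elem.tag}"]
    for name in sorted(elem.attrib):
        lines.append(f"{pad}    {name}={quoteattr(elem.attrib[name])}")
    lines.append(f"{pad}>")
    text = (elem.text or "").strip()
    if text:
        lines.append(f"{pad}  {escape(text)}")
    for child in elem:
        lines.extend(attribute_lines(child, depth + 1))
    lines.append(f"{pad}</{elem.tag}>")
    return lines


# Hypothetical elements: attributes reordered, only Height actually changed.
old = ET.fromstring('<Button Width="80" Height="20" Content="OK"/>')
new = ET.fromstring('<Button Height="24" Content="OK" Width="80"/>')

changed = [line for line in difflib.ndiff(attribute_lines(old), attribute_lines(new))
           if line[0] in "+-"]
print("\n".join(changed))
```

Had both elements been diffed as single lines, the whole tag would show up as changed, including the reordered but otherwise identical Width and Content attributes.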