Tree 2 tree correction between xml documents
xmldiff [ options ] original_file modified_file
xmldiff [ options ] -r original_directoty modified_directory
xmldiff is a utility for extracting differences between two xml files. It returns a set of primitives to apply on source tree to obtain the destination tree.
The implementation is based on Change detection in hierarchically structured information, by S. Chawathe, A. Rajaraman, H. Garcia-Molina and J. Widom, Stanford University, 1996
display this help message and exit.
display version number and exit
input files are HTML instead of XML
when comparing directories, recursively compare any subdirectories found.
display output following the Xupdate xml specification (see http://www.xmldb.org/xupdate/xupdate-wd.html#N19b1de).
specify the encoding to use for output. Default is UTF-8
do not normalize spaces and new lines in text and comment nodes.
do not process comment nodes
include all external general (text) entities.
include all external parameter entities, including the external DTD subset.
display an execution profile (run slower with this option), profile saved to file (binarie form).
0 means no difference were found.
A positive number means some differences were found.
A negative number means an error occured.
xmldiff uses an algorithm with a (too) high algorithmical complexity, which makes it unsuitable to process large XML documents. If your document has more than about 100 nodes, you should probably look for an alternative solution.
Any suggestion to this issue is welcome by Logilab. Please tell us if you are aware of a simpler algorithm to solve this problem.
python, pyxml
http://www.logilab.org/project/xmldiff/
http://www.xmldb.org/xupdate/xupdate-wd.html#N19b1de
Sylvain Thénault <[email protected]>
This manpage was written by Alexandre Fayolle <[email protected]>.