Filter to put all characters() in one event
my $h = SomeHandler->new; my $f = XML::Filter::BufferText->new( Handler => $h ); my $p = SomeParser->new( Handler => $f ); $p->parse;
This is a very simple filter. One common cause of grief (and programmer error) is that \s-1XML\s0 parsers aren't required to provide character events in one chunk. They can, but are not forced to, and most don't. This filter does the trivial but oft-repeated task of putting all characters into a single event.
Note that this won't help you cases such as:
<foo> blah <!-- comment --> phubar </foo>
In the above case, given the interleaving comment, there will be two \*(C`character()\*(C' events. This may be worked around in the future if there is demand for it.
An interesting way to use this filter, instead of telling users to use it, is to return it from your handler's constructor, already configured and all. That'll make the buffering totally transparent to them (\*(C`XML::SAX::Writer\*(C' does that).
Robin Berjon, [email protected]
Copyright (c) 2001-2002 Robin Berjon. All rights reserved. This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
XML::SAX::*, XML::Generator::*, XML::Handler::*, XML::Filter::*