Token.pm "start tag" class.
    use HTML::TokeParser::Simple;
    my $p = HTML::TokeParser::Simple->new( $somefile );

    while ( my $token = $p->get_token ) {
        # This prints all text in an HTML doc (i.e., it strips the HTML)
        next unless $token->is_text;
        print $token->as_is;
    }
This class does most of the heavy lifting for HTML::TokeParser::Simple. See the HTML::TokeParser::Simple docs for details.
as_is
delete_attr
get_attr
get_attrseq
get_tag
get_token0
is_start_tag
is_tag
return_attr
return_attrseq
return_tag
return_text
rewrite_tag
set_attr
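
As a rough illustration of how several of the methods listed above fit together on a start tag, the following sketch reads an attribute, changes it, removes another, and prints the rewritten document. The HTML snippet and the variable names ($html, $out, @hrefs) are made up for this example. It assumes a version of HTML::TokeParser::Simple in which is_start_tag accepts a tag name and get_attr, set_attr, and delete_attr are available on start tag tokens. The exact spacing and quoting of the rewritten tag may vary between versions.

```perl
use strict;
use warnings;
use HTML::TokeParser::Simple;

# Hypothetical input; a scalar reference is parsed as a string, not a filename.
my $html = '<p><a href="/old" target="_blank">link</a></p>';
my $p    = HTML::TokeParser::Simple->new( \$html );

my $out = '';
my @hrefs;
while ( my $token = $p->get_token ) {
    if ( $token->is_start_tag('a') ) {
        # Read the current value before changing it.
        push @hrefs, $token->get_attr('href');

        # Both calls modify the token, so as_is reflects the change.
        $token->set_attr( 'href', '/new' );
        $token->delete_attr('target');
    }
    # Other tokens (text, end tags, and so on) pass through unchanged.
    $out .= $token->as_is;
}

print "$out\n";
```

Only start tags are modified here; every other token is emitted through as_is exactly as it was parsed, so the rest of the document is preserved.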