SYNOPSIS

 use HTML::TokeParser::Simple;
 my $p = HTML::TokeParser::Simple->new( $somefile );

 while ( my $token = $p->get_token ) {
     # This prints all text in an HTML doc (i.e., it strips the HTML)
     next unless $token->is_text;
     print $token->as_is;
 }

DESCRIPTION

This class does most of the heavy lifting for HTML::TokeParser::Simple. See the HTML::TokeParser::Simple docs for details.

OVERRIDDEN METHODS

  • as_is

  • delete_attr

  • get_attr

  • get_attrseq

  • get_tag

  • get_token0

  • is_start_tag

  • is_tag

  • return_attr

  • return_attrseq

  • return_tag

  • return_text

  • rewrite_tag

  • set_attr
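
As a minimal sketch of how a few of these methods fit together, the following rewrites the start tags of links in an HTML string. The helper name retarget_links and the sample markup are hypothetical, chosen only for illustration. The example assumes a version of HTML::TokeParser::Simple whose constructor accepts the "string => $html" form.

 use strict;
 use warnings;
 use HTML::TokeParser::Simple;

 # Hypothetical helper: add a target attribute to every <a> start tag
 # and drop any onclick handler, leaving all other tokens untouched.
 sub retarget_links {
     my ($html) = @_;

     # Assumption: this constructor form is available in the installed version.
     my $p   = HTML::TokeParser::Simple->new( string => $html );
     my $out = '';

     while ( my $token = $p->get_token ) {
         if ( $token->is_start_tag('a') ) {
             $token->set_attr( 'target', '_blank' );
             $token->delete_attr('onclick');
         }

         # as_is returns the token's text, including any changes made above.
         $out .= $token->as_is;
     }
     return $out;
 }

 print retarget_links(
     '<p>See <a href="/docs" onclick="track()">the docs</a>.</p>'
 ), "\n";

Text, end tags and other start tags pass through as_is unchanged, so only the modified tags differ from the input.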