Token.pm text class.
use HTML::TokeParser::Simple; my $p = HTML::TokeParser::Simple->new( $somefile ); while ( my $token = $p->get_token ) { # This prints all text in an HTML doc (i.e., it strips the HTML) next unless $token->is_text; print $token->as_is; }
This class represents \*(L"text\*(R" tokens. See the \*(C`HTML::TokeParser::Simple\*(C' documentation for details.
as_is
is_text