Utility to load autopagerize siteinfo stuff
use HTML::AutoPagerize; my $autopager = HTML::AutoPagerize->new; $autopager->add_site( url => 'http://.+.tumblr.com/', nextLink => '//div[@id="content" or @id="container"]/div[last()]/a[last()]', pageElement => '//div[@id="content" or @id="container"]/div[@class!="footer" or @class!="navigation"]', ); my $uri = 'http://otsune.tumblr.com/'; my $html = LWP::Simple::get($uri); my $res = $autopager->handle($uri, $html); if ($res) { my $next_link = $res->{next_link}; # URI object my $content = $res->{page_element}; # XML::XPathEngine::NodeSet object. may be empty }
HTML::AutoPagerize is an utility module to load \s-1SITEINFO\s0 defined in AutoPagerize. AutoPagerize is an userscript to automatically figure out the \*(L"next link\*(R" of the current page, then fetch the content and insert the content by extracting the \*(L"page element\*(R".
Tatsuhiko Miyagawa <[email protected]>
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.
WWW::Mechanize::AutoPager, <http://swdyh.infogami.com/autopagerize>