Find schemeless uris in arbitrary text.
require URI::Find::Schemeless; my $finder = URI::Find::Schemeless->new(\&callback); The rest is the same as URI::Find.
URI::Find finds absolute URIs in plain text with some weak heuristics for finding schemeless URIs. This subclass is for finding things which might be URIs in free text. Things like \*(L"www.foo.com\*(R" and \*(L"lifes.a.bitch.if.you.aint.got.net\*(R".
The heuristics are such that it hopefully finds a minimum of false positives, but there's no easy way for it know if \*(L"\s-1COMMAND\s0.COM\*(R" refers to a web site or a file.
top_level_domain_re
my $tld_re = $self->top_level_domain_re;
Returns the regex for matching top level \s-1DNS\s0 domains. The regex shouldn't be anchored, it shouldn't do any capturing matches, and it should make itself ignore case.
Original code by Roderick Schertler <[email protected]>, adapted by Michael G Schwern <[email protected]>.
Currently maintained by Roderick Schertler <[email protected]>.
L<URI::Find>