slackbuilds_ponce/perl/perl-www-robotrules
Robby Workman 143991a46e Entire Repo: Remove APPROVED field from .info files
This field used to make sense in our pre-git days, but
the Signed-Off-By: line serves the same purpose (and
even more) now, so APPROVED has been rejected.  ;-)

Signed-off-by: Robby Workman <rworkman@slackbuilds.org>
2012-08-14 23:22:50 -05:00
..
perl-www-robotrules.info Entire Repo: Remove APPROVED field from .info files 2012-08-14 23:22:50 -05:00
perl-www-robotrules.SlackBuild
README
slack-desc Misc slack-desc clean-ups. 2012-05-19 15:59:44 -05:00

This module parses /robots.txt files as specified in "A Standard for
Robot Exclusion", at <http://www.robotstxt.org/wc/norobots.html>
Webmasters can use the /robots.txt file to forbid conforming robots
from accessing parts of their web site.
The parsed files are kept in a WWW::RobotRules object, and this
object provides methods to check if access to a given URL is
prohibited. The same WWW::RobotRules object can be used for one
or more parsed /robots.txt files on any number of hosts.

This requires perl-uri-escape.