align="right" alt="Screenshot" hspace="10" vspace="10">
webcheck is a Web site checking tool for Web
masters. It crawls a given Web site and generates
a number of reports. The whole system is
pluggable, allowing extra reports and checks to be
added easily. It supports retrieving Web sites
over HTTP, file, and FTP protocols and produces
reports on site structure, broken links, old Web
pages, overviews of external links, and more. The
links that webcheck considers external are
configurable through regular expressions, and
webcheck honors robots.txt.
License: GNU General Public License (GPL)
Changes:
This release changes the HTML parser to
BeautifulSoup (when available). This parser is
much more error-tolerant than the old HTMLParser
based solution but is also slightly slower. Some
small output improvements were made as well as
some internal improvements to better support
Unicode content. Parsing of robots.txt files was
re-enabled and an –ignore-robots option was
added.


























