BlogNerds

Tech News, Gadgets, and Trends




Filed under freshmeat.net announcements (Global)

webcheck is a Web site checking tool for
webmasters. It crawls a given Web site and generates
a number of reports. The whole system is
pluggable, allowing extra reports and checks to be
added easily. It supports retrieving Web sites
over HTTP, file, and FTP protocols and produces
reports on site structure, broken links, old Web
pages, overviews of external links, and more. The
links that webcheck considers external are
configurable through regular expressions, and
webcheck honors robots.txt.
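
To make the last two points concrete, here is a minimal Python sketch of how a crawler might combine regular-expression link classification with a robots.txt check. It is not webcheck's actual code: the pattern list, the `is_external` and `should_crawl` helpers, the `example.org` URLs, and the `example-checker` user agent are all assumptions made up for illustration, and the robots.txt rules are parsed from an in-memory list rather than fetched.

```python
import re
from urllib.robotparser import RobotFileParser

# Hypothetical configuration: any URL that matches none of these
# patterns is treated as external. The direction of the match is an
# assumption, not webcheck's documented behaviour.
INTERNAL_PATTERNS = [re.compile(r"^https?://(www\.)?example\.org/")]


def is_external(url, internal_patterns=INTERNAL_PATTERNS):
    """Return True if the URL matches none of the internal patterns."""
    return not any(p.search(url) for p in internal_patterns)


# Parse robots.txt rules from a list of lines (no network access needed).
robots = RobotFileParser()
robots.parse(["User-agent: *", "Disallow: /private/"])


def should_crawl(url, agent="example-checker"):
    """Follow only internal links that robots.txt allows for this agent."""
    return not is_external(url) and robots.can_fetch(agent, url)
```

With these rules, `should_crawl("http://example.org/index.html")` is true, while anything under `/private/` or on another host is skipped. An external link would still typically be checked for breakage, just not crawled further.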


License: GNU General Public License (GPL)


Changes:
This release changes the HTML parser to
BeautifulSoup (when available). This parser is
much more error-tolerant than the old HTMLParser
based solution but is also slightly slower. Some
small output improvements were made as well as
some internal improvements to better support
Unicode content. Parsing of robots.txt files was
re-enabled and an --ignore-robots option was
added.
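
The "when available" fallback described above can be sketched roughly as follows. This is an illustration only, not the code from the release: it assumes modern Python 3 and the `bs4` package, whereas the 2007 release would have used an older BeautifulSoup, and the `extract_links` function and `_LinkCollector` class are hypothetical names.

```python
from html.parser import HTMLParser

try:
    # Optional third-party dependency; used only when it is installed.
    from bs4 import BeautifulSoup
except ImportError:
    BeautifulSoup = None


class _LinkCollector(HTMLParser):
    """Fallback parser: collect href values from <a> start tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html):
    """Return the href of every <a> tag, preferring BeautifulSoup."""
    if BeautifulSoup is not None:
        soup = BeautifulSoup(html, "html.parser")
        return [a["href"] for a in soup.find_all("a", href=True)]
    collector = _LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links
```

Calling `extract_links('<a href="a.html">A</a>')` returns the list of link targets from either code path, so the rest of a checker does not need to know which parser was used.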

Posted by freshmeat.net announcements (Global) on Sunday, May 13th, 2007



