cpython/Tools/webchecker
Guido van Rossum af310c1d00 Restructured Checker class to get rid of 'ext' table.
Links are now either in 'todo' or 'done', and ext links
are hadled more like local links except that no further
links are gathered (and sometimes they aren't checked,
e.g. for mailto and news URLs).  The -x option reverses
its meaning: it disables checking of ext links (they are
moved to 'done' without checking).  A new 'errors' table
collects pages with bad links as we go -- redundant,
but useful for the GUI version which needs to report
this as we go.  Some new methods, including reset().
New checkpoint format.

Adapted the GUI to the changes in the Checker class.
Added Quit and "Start over" buttons, and a checkbox
to disable checking external links.  The details
window now also shows bad links emanating from the
selected page.  Miscellaneous small chages.
1997-02-02 23:30:32 +00:00
..
README Basic README file 1997-01-30 03:24:00 +00:00
mimetypes.py mime types guesser 1997-01-30 02:44:20 +00:00
robotparser.py Skip Montanaro's robots.txt parser. 1997-01-30 03:18:23 +00:00
tktools.py Check in another copy of tktools.py... 1997-01-31 18:58:53 +00:00
wcgui.py Restructured Checker class to get rid of 'ext' table. 1997-02-02 23:30:32 +00:00
webchecker.py Restructured Checker class to get rid of 'ext' table. 1997-02-02 23:30:32 +00:00

README

Webchecker
----------

This is a simple web tree checker, useful to find bad links in a web
tree.  It currently checks links pointing within the same subweb for
validity.  The main program is "webchecker.py".  See its doc string
(or invoke it with the option "-?") for more defails.

The module robotparser.py was written by Skip Montanaro; the rest is
original work.

Jan 29, 1997.

--Guido van Rossum (home page: http://www.python.org/~guido/)