af310c1d00
Links are now either in 'todo' or 'done', and ext links are hadled more like local links except that no further links are gathered (and sometimes they aren't checked, e.g. for mailto and news URLs). The -x option reverses its meaning: it disables checking of ext links (they are moved to 'done' without checking). A new 'errors' table collects pages with bad links as we go -- redundant, but useful for the GUI version which needs to report this as we go. Some new methods, including reset(). New checkpoint format. Adapted the GUI to the changes in the Checker class. Added Quit and "Start over" buttons, and a checkbox to disable checking external links. The details window now also shows bad links emanating from the selected page. Miscellaneous small chages. |
||
---|---|---|
.. | ||
README | ||
mimetypes.py | ||
robotparser.py | ||
tktools.py | ||
wcgui.py | ||
webchecker.py |
README
Webchecker ---------- This is a simple web tree checker, useful to find bad links in a web tree. It currently checks links pointing within the same subweb for validity. The main program is "webchecker.py". See its doc string (or invoke it with the option "-?") for more defails. The module robotparser.py was written by Skip Montanaro; the rest is original work. Jan 29, 1997. --Guido van Rossum (home page: http://www.python.org/~guido/)