Commit Graph

225 Commits

Author SHA1 Message Date
Guido van Rossum 1ade44cb0c Open the output files with 'wb', not 'w'. 1997-05-15 18:25:29 +00:00
Guido van Rossum 29f6533c7f Scroll to top of info window when done. 1997-05-09 03:19:29 +00:00
Guido van Rossum 89efda363f Avoid the fancy handler for error 401 (request authentication). 1997-05-07 15:00:56 +00:00
Guido van Rossum af310c1d00 Restructured Checker class to get rid of 'ext' table.
Links are now either in 'todo' or 'done', and ext links
are hadled more like local links except that no further
links are gathered (and sometimes they aren't checked,
e.g. for mailto and news URLs).  The -x option reverses
its meaning: it disables checking of ext links (they are
moved to 'done' without checking).  A new 'errors' table
collects pages with bad links as we go -- redundant,
but useful for the GUI version which needs to report
this as we go.  Some new methods, including reset().
New checkpoint format.

Adapted the GUI to the changes in the Checker class.
Added Quit and "Start over" buttons, and a checkbox
to disable checking external links.  The details
window now also shows bad links emanating from the
selected page.  Miscellaneous small chages.
1997-02-02 23:30:32 +00:00
Guido van Rossum 4f6ecdaacf Add root URL entry box, separate start/stop/step buttons.
If the users selects an item in 'To check', start checking there.
1997-02-01 05:17:29 +00:00
Guido van Rossum 6133ec656e Process <img> and <frame> tags. Don't bother skipping second href. 1997-02-01 05:16:08 +00:00
Guido van Rossum de99d310cc Check in another copy of tktools.py... 1997-01-31 18:58:53 +00:00
Guido van Rossum 06981c328d Tk interface to webchecker. Not fully featured yet, but usable. 1997-01-31 18:58:12 +00:00
Guido van Rossum 0b0b5f0279 Spin off checking of external page in a subroutine.
Increase MAXPAGE to 150K.
Add back printing of __doc__ for usage message.
1997-01-31 18:57:23 +00:00
Guido van Rossum e5605ba3c2 Many misc changes.
- Faster HTML parser derivede from SGMLparser (Fred Gansevles).

- All manipulations of todo, done, ext, bad are done via methods, so a
derived class can override.  Also moved the 'done' marking to
dopage(), so run() is much simpler.

- Added a method status() which returns a string containing the
summary counts; added a "total" count.

- Drop the guessing of the file type before opening the document -- we
still need to check those links for validity!

- Added a subroutine to close a connection which first slurps up the
remaining data when it's an ftp URL -- apparently closing an ftp
connection without reading till the end makes it hang.

- Added -n option to skip running (only useful with -R).

- The Checker object now has an instance variable which is set to 1
when it is changed.  This is not pickled.
1997-01-31 14:43:15 +00:00
Guido van Rossum c59a5d449f Set proper User-agent header (Python-webchecker/<version>).
When -x is combined with -q, still do the checking, but don't print
the error in this phase -- they are reported by report_errors().
1997-01-30 06:04:00 +00:00
Guido van Rossum 2739cd74b3 Some refinements of the external-link checking code: insert the errors
in the 'bad' dictionary (sanitize them so they are picklable; the
sanitation code is now a subroutine); don't check mailto: URLs; omit
colon in Error message.
1997-01-30 04:26:57 +00:00
Guido van Rossum de66268588 Added -x option to check external links. Slooooow! 1997-01-30 03:58:21 +00:00
Guido van Rossum 325a64f207 Catch I/O errors when parsing robots.txt file.
Add version number, printed at startup in non-quited mode.
1997-01-30 03:30:20 +00:00
Guido van Rossum df47bafa1c Basic README file 1997-01-30 03:24:00 +00:00
Guido van Rossum 3edbb35023 Added robots.txt support, using Skip Montanaro's parser.
Fixed occasional inclusion of unpicklable objects (Message in errors).
Changed indent of a few messages.
1997-01-30 03:19:41 +00:00
Guido van Rossum bbf8c2fafd Skip Montanaro's robots.txt parser. 1997-01-30 03:18:23 +00:00
Guido van Rossum 272b37d686 web tree checker 1997-01-30 02:44:48 +00:00
Guido van Rossum d7e4705d8f mime types guesser 1997-01-30 02:44:20 +00:00
Guido van Rossum 04f2b453bb Version 1.1. Fix memory leak and expensive comparison with None. 1997-01-18 02:20:37 +00:00
Guido van Rossum f06ee5fa07 /usr/local/bin/python -> /usr/bin/env python 1996-11-27 19:52:01 +00:00
Guido van Rossum 499181a833 Changed so that it replaces /usr/local/bin/python with /usr/bin/env python. 1996-11-27 19:43:49 +00:00
Guido van Rossum d5c58c34db Changed logic so it now replaces anything that has #! and python in
the first line, replacing the entire line.
1996-11-27 19:43:01 +00:00
Guido van Rossum fc0588241c Script to fix cprt notice 1996-11-27 19:41:55 +00:00
Barry Warsaw 3ffc503682 Removed the USA'isms that aren't really true 1996-11-20 18:43:05 +00:00
Barry Warsaw 7081a69b24 /usr/bin/env is slightly more portable 1996-11-20 15:19:10 +00:00
Barry Warsaw 00bf8efe0b Added a few meta info 1996-11-20 15:17:50 +00:00
Barry Warsaw 2a6e68f693 Output original address 1996-11-19 17:59:07 +00:00
Barry Warsaw de0e8f763b Updated country codes as per ISO3166 10-Feb-1994 1996-11-18 23:22:34 +00:00
Barry Warsaw cb2b672015 implementation complete. need to update country codes 1996-11-18 22:57:43 +00:00
Barry Warsaw 3a7212c86a Added `ru' --> Russia 1996-11-18 22:03:36 +00:00
Barry Warsaw f6d53448e5 A rewrite for better Python-ish style 1996-11-18 21:26:56 +00:00
Guido van Rossum e41d00bb6a Fixed a bug: files that no longer exist remotely would never be
removed from .mirrorinfo.  Now they are (even if -r is not specified
-- the files are not removed, just their .mirrorinfo entry).

Added a feature: the -s pattern option is also used to skip local
files when removing (i.e. -r won't remove local files matching the -s
patterns).
1996-11-14 18:24:47 +00:00
Fred Drake abcb38250b (texi2html.py): Minor improvement to HTML 3.0 output for code examples. 1996-10-25 22:13:59 +00:00
Guido van Rossum d266eb460e New permission notice, includes CNRI. 1996-10-25 14:44:06 +00:00
Fred Drake 22f7f9e671 (texi2html.py): Added CLASS=Navigation attribute to DIV around top/bottom
navigation links for HTML 3 version.

	Forced a blank line above the footnotes separator for HTML 2; at
	least one page did not get this spaced correctly.
1996-10-11 16:54:00 +00:00
Fred Drake 0282726643 (texi2html.py): Tightened the generated HTML slightly. Added -p option to
print section titles even when the debugging output is not enabled.
	Added -3 option to generate HTML 3.0 constructs where meaningful.

	Removed repititive garbage generation:  the old version added simple
	descriptive comments after every datadesc/funcdesc/*desc entry:

	function(args) -- function of module xxxx
	    Description....

	These comments are no longer generated:

	function(args)
	    Description....
1996-10-09 19:05:12 +00:00
Fred Drake a39a25e5ec (texi2html.py): Use "if __name__ == '__main__':" test. Adjust indentation
to 4 spaces per level (no longer 8).

(Makefile):  Use .pyc versions of partparse.py and texi2html.py to generate
	converted documentation formats.  This reduces the startup costs;
	probably doesn't affect anyone but me in reality, but helps when
	working on the docs.
1996-09-13 14:44:34 +00:00
Guido van Rossum be5e34204e Note obsolescence 1996-09-11 23:34:32 +00:00
Guido van Rossum dd6798c691 Added NumVersion definition. 1996-08-26 14:38:40 +00:00
Guido van Rossum c30461dd38 Insert filename in IOError message. 1996-08-26 14:37:47 +00:00
Guido van Rossum 96c4dd95cf OK, now it uses sys.[exec_]prefix and supports
a -o option to specify an output directory.  *Much* more convenient.
Removed a whole lot of hullabaloo from the README file, too.
1996-08-26 05:14:20 +00:00
Guido van Rossum 6498cad34e Get rid of it totally 1996-08-26 04:10:33 +00:00
Guido van Rossum f34bf1c07b Added warning that this is unsupported 1996-08-26 03:50:10 +00:00
Guido van Rossum aa233f18b2 Changes for 1.4 install targets 1996-08-26 03:49:49 +00:00
Guido van Rossum 6100d914e8 Allow whitespace before '#' in preprocessor directives. 1996-08-22 23:12:23 +00:00
Guido van Rossum 29901ff528 Some improvements by Fred Drake. 1996-08-09 21:46:34 +00:00
Guido van Rossum 8e7a54f525 More changes by Sjoerd & Jack 1996-07-21 02:50:30 +00:00
Guido van Rossum e47d5f9c52 Addition by Jack to test for missing type support 1996-07-21 02:49:55 +00:00
Guido van Rossum d4cc04c672 Add test that script name ends in .py.
Cosmetic changes to usage message (refer to "make install" now).
1996-06-17 17:49:13 +00:00
Guido van Rossum f75f80eb48 Get struct _frozen defn from Python.h; new variable names. 1996-06-17 17:48:30 +00:00
Guido van Rossum 814842f395 Removed most of the README text since it is all about the Mac specific
examples, which no longer live here.
1996-05-24 18:43:47 +00:00
Guido van Rossum d544d0126d Moved customization to bgenlocations.py module. 1996-04-12 16:39:00 +00:00
Guido van Rossum 9a2c546e75 Create files as temp file and move to real location only when complete.
Added some try-except statements around listdir and mkdir operations.
1996-04-09 02:51:23 +00:00
Guido van Rossum a0e2422615 A few missing casts (Richard Neitzel).
Don't append Unix paths on a Mac (Jack Jansen).
1996-03-07 16:16:54 +00:00
Guido van Rossum d2966cb95a don't die on unreadable files etc. 1996-02-05 18:49:00 +00:00
Jack Jansen 7d1eba9783 Added extra module argument varstuff, to declare extra module-level vars 1996-01-05 18:05:45 +00:00
Jack Jansen 425e9eb6cb - Added color window/pixmap support
- Added support for obtaining pixmap data
- Added OptResObj_* routines for optional handles
1995-12-12 15:02:03 +00:00
Jack Jansen 1e4ce733fc Added Sint8 and Uint8 types 1995-12-09 14:02:54 +00:00
Jack Jansen 0a54ae24c3 Added cm and qt demos 1995-12-09 14:02:10 +00:00
Jack Jansen 135fb71481 BitMap object support 1995-11-16 22:47:22 +00:00
Jack Jansen 7830ab8f49 Added definitions for GrafPort object and Fixed type. 1995-11-15 15:17:23 +00:00
Jack Jansen b5cf5dad43 Changed file signature to CW7 project mgr 1995-11-14 10:40:45 +00:00
Jack Jansen e48aa966c1 Fixed bug in cts error message handling. 1995-11-10 14:56:16 +00:00
Jack Jansen 81299f10f7 Added providence mailer support 1995-10-30 10:23:10 +00:00
Jack Jansen 4ec940ad04 Added (limited) compuserve error parsing 1995-10-30 10:10:19 +00:00
Jack Jansen f39ca24ebc Removed unneeded reference to tkinter 1995-10-23 14:34:52 +00:00
Jack Jansen 945b7b8eaf comment fix 1995-10-23 13:52:52 +00:00
Sjoerd Mullender 8968624574 Script to read and parse error messages from mailer daemons.
Written by Jack, some minor changes by Sjoerd.
1995-10-19 09:35:31 +00:00
Jack Jansen 349a988ba7 Added third argument to call() 1995-10-12 13:45:25 +00:00
Jack Jansen d40f3c60ca New universal headers 1995-10-09 23:12:22 +00:00
Sjoerd Mullender edb21c4edc Fixed typo: == -> =. 1995-08-29 14:22:16 +00:00
Guido van Rossum 06f4289d3f many fixes by anonymous contributor 1995-08-28 03:01:00 +00:00
Jack Jansen b95ea18a4c Added ResObj_OptNew declaration 1995-08-17 14:32:23 +00:00
Jack Jansen ec38010808 Added basic mac documentation. 1995-08-14 11:55:07 +00:00
Guido van Rossum 150316ee2e added getpath.c; added -P exec_prefix; added explanatory note 1995-08-08 14:21:07 +00:00
Guido van Rossum e830e55606 added ftp mirror script 1995-06-20 19:31:37 +00:00
Guido van Rossum 98590187d5 removed all references to modsupport.h 1995-06-20 18:45:39 +00:00
Jack Jansen 8d2033f08b Two new methods 1995-06-20 12:42:39 +00:00
Jack Jansen 5214a29774 Fixed sentence about modulator 1995-06-20 12:32:43 +00:00
Jack Jansen 138497d65b Updated 1995-06-20 12:32:15 +00:00
Jack Jansen 9a69112f85 Jim Fulton's change to support doc strings 1995-06-20 12:26:03 +00:00
Jack Jansen 80adc227d2 Added missing setselftype method to ManualGenerator 1995-06-18 20:08:28 +00:00
Barry Warsaw 6a9df7a86c updated comments 1995-06-15 15:56:34 +00:00
Barry Warsaw 268a39fb84 converted to python 1995-06-15 15:54:16 +00:00
Jack Jansen 7d0bc8343f Ported to Universal Header 2.0.1f (i.e. CW6) 1995-06-09 20:56:31 +00:00
Jack Jansen ae8a68f40c Trying again to check it in. Changes to:
- Use Universal Headers as input
- Change addpack calls to not depend on Guido's disklayout
- Checkge testprograms to use new names for some functions
1995-06-06 12:55:40 +00:00
Jack Jansen ebed75116f Templates converted to new naming conventions (thanks to Chak Tan) 1995-05-16 13:47:03 +00:00
Jack Jansen 52e02998ef Made the example slightly more complete 1995-05-16 13:43:09 +00:00
Guido van Rossum 953dc1a079 added transferSize() method 1995-05-03 17:36:54 +00:00
Guido van Rossum a12bbff857 mods by Gregor Schmid -- very nice! 1995-05-03 14:17:36 +00:00
Guido van Rossum bf6bdb0100 removed some harsh words and added some clarifications 1995-04-05 10:59:20 +00:00
Guido van Rossum e956386283 Jaap V's version 1995-04-05 10:58:52 +00:00
Guido van Rossum 6d67b607b2 backup Makefile too 1995-04-05 10:58:39 +00:00
Guido van Rossum 189f8fb8f7 added "pystone" benchmark 1995-03-30 09:42:43 +00:00
Guido van Rossum a4007eb816 added listing of argument types encountered 1995-03-23 10:41:10 +00:00
Guido van Rossum 26a9d37e5c new, from ../Demo/www 1995-03-15 11:26:05 +00:00
Guido van Rossum 227a4232e6 the usual 1995-03-10 14:42:57 +00:00
Guido van Rossum ed3c791050 moved lots of scripts from Demo/scripts to Tools/scripts 1995-03-02 15:56:30 +00:00
Guido van Rossum fd218ca865 moved Demo/modulator to Tools/modulator 1995-03-02 15:55:28 +00:00