306 Commits

Author SHA1 Message Date
Guido van Rossum d57548023f A variant on webchecker that creates a mirror copy of a remote site. 1997-10-06 18:54:25 +00:00
Guido van Rossum 2237b73baf Several changes:
- Change the code that looks for robots.txt to always look in /, even
if the "root" path is somewhere deep down below.

- Add link processing in <AREA> tags.

- Change safeclose() to avoid crashing when the file has no geturl()
method.
1997-10-06 18:54:01 +00:00
Guido van Rossum 03be7f525e Changes by jack so the generated modules export their types
to Python.
1997-09-22 16:13:19 +00:00
Guido van Rossum b81b5c72db Moved pystone to Lib/test. Use import test.pystone to run it. 1997-09-16 18:57:42 +00:00
Guido van Rossum d2ba3690ce Skip symlinks; don't bother keeping a list of files that is never used. 1997-09-16 18:57:14 +00:00
Guido van Rossum a11b041537 Added faqwiz and webchecker. 1997-08-28 02:41:33 +00:00
Guido van Rossum 8c5fa91253 Added section about multiple FAQs. 1997-08-28 02:38:54 +00:00
Guido van Rossum f1ead1a63c New installation instructions show how to maintain multiple FAQs.
Removed bootstrap script from end of faqwiz.py module.
Added instructions to bootstrap script, too.
Version bumped to 0.8.
Added <html>...</html> feature suggested by Skip Montanaro.
Added leading text for Roulette, default to 'Hit Reload ...'.
Fix typo in default SRCDIR.
1997-08-28 02:38:01 +00:00
Guido van Rossum 8a2d216047 Removed faqmain.py -- it was an earlier implementation and no
longer relevant.
1997-08-27 22:31:18 +00:00
Guido van Rossum 92dfa4ffe1 Use correct keyword parameter when calling copy with answer option! 1997-08-18 20:56:10 +00:00
Guido van Rossum 92c90329d1 Recognize https:// as URL as well. 1997-08-18 13:59:41 +00:00
Guido van Rossum 939e4c1efb Oops, there was a bug in the output formatting - the last printed
values would overwrite the next value.
1997-08-14 23:25:20 +00:00
Guido van Rossum 3485bb6f44 Added logmerge.py 1997-08-14 22:04:26 +00:00
Guido van Rossum 6f0cf7ee20 Consolidate a bunch of CVS or RCS logs read from stdin. 1997-08-14 22:04:00 +00:00
Guido van Rossum 91c7c933cc New version (changes are so minor that I don't bother to explain them). 1997-08-14 20:20:42 +00:00
Guido van Rossum fee9faa044 Of course, this file should be empty except for an informative comment. 1997-08-14 20:19:27 +00:00
Guido van Rossum 193a11e389 Add a separate SRC directory and insert it in front of the path. 1997-08-14 20:18:18 +00:00
Guido van Rossum 1dccdc221c Add missing semicolon to '&gt;'. 1997-08-14 20:17:20 +00:00
Guido van Rossum 4d3376a80f Added cvsfiles.py and treesync.py 1997-08-14 20:16:32 +00:00
Guido van Rossum a554107802 Source tree synchronization tool (for Windows, mostly) 1997-08-14 20:15:20 +00:00
Guido van Rossum 30e53c0c39 Print a list of files under CVS. 1997-08-14 20:14:54 +00:00
Guido van Rossum 9189bdabd5 support C++ comments 1997-08-14 20:14:29 +00:00
Guido van Rossum 5526e394a2 set LOOPS to 10000 1997-08-14 20:14:12 +00:00
Guido van Rossum 77b300895c Add XXX warning about fishy line 1997-08-14 20:13:46 +00:00
Guido van Rossum 7ba3de44a2 Added Windows NT instructions. 1997-08-14 02:12:04 +00:00
Guido van Rossum 58a594829c Changes for building under windows. 1997-08-14 01:45:33 +00:00
Guido van Rossum 0b4b8a21ce Changed indents to 4 *spaces*; added Emacs variables to avoid
inserting tabs.  All this (and the previous patch) in preparation for
porting to NT.
1997-08-10 16:56:48 +00:00
Guido van Rossum 9a6e855a27 Support using -p/-P to point to the source/build directory instead of
the install directory.  Added -h option to print the full usage
message; by default, only two lines are now printed for errors.
1997-08-10 16:47:17 +00:00
Guido van Rossum b3afce5763 Updated for new style build/install; much simpler! 1997-07-19 21:58:30 +00:00
Guido van Rossum 4dddff8ddf Upped version to 0.6. 1997-07-13 03:36:53 +00:00
Guido van Rossum 9b1718c0e7 New PythonPowered logo image. 1997-07-13 03:32:28 +00:00
Guido van Rossum 9115eb522e Added a variant of the epilogue that has the PythonPowered logo. 1997-06-03 22:04:10 +00:00
Guido van Rossum 5bf4d00887 Move the second import of faqcust to the very end. 1997-06-03 22:03:22 +00:00
Guido van Rossum dafce6db7b Expanded the setup instructions. 1997-06-02 23:10:06 +00:00
Guido van Rossum 525d52f8a4 Move the roulette() functionality out of the FaqDir class into the
FaqWizard class.
1997-06-02 22:52:37 +00:00
Guido van Rossum 4f074044be Version upped. 1997-06-02 21:40:18 +00:00
Guido van Rossum 178f58a17e Stupid typos -- ImporError. 1997-06-02 21:39:15 +00:00
Guido van Rossum c22eb01bfe Bugfix: last_changed would always print current time. 1997-06-02 15:51:51 +00:00
Guido van Rossum d7918fb758 Added hint about permissions. 1997-05-30 12:01:24 +00:00
Guido van Rossum 72a342fd4f Protect roulette against empty FAQ. 1997-05-30 11:58:21 +00:00
Guido van Rossum 00494bfa15 Include the imports and chdir in the try/except. 1997-05-30 11:50:36 +00:00
Guido van Rossum 68bdad1015 Tiny script to play with it on a Mac. 1997-05-28 16:09:02 +00:00
Guido van Rossum d812c07e0e Set version to 0.4; get rid of '(' in column 1 in triple-quoted string. 1997-05-26 20:15:44 +00:00
Guido van Rossum 132e1892e1 Initial revision 1997-05-26 20:15:09 +00:00
Guido van Rossum 7a241071d4 Added marks for very recently (24 hours) and recently (7 days)
modified files.  Added and used global now, entry.getmtime(), and
entry.emit_marks().
1997-05-26 19:46:56 +00:00
Guido van Rossum fd67f738dd Added marks for very recently (24 hours) and recently (7 days)
modified files.  Throw things around a bit.
1997-05-26 19:46:18 +00:00
Guido van Rossum 8bc49c8ad1 Support for more general diffing and retrieving any old revision.
Support for index formatting with local refs.
1997-05-26 19:10:37 +00:00
Guido van Rossum 8cde0b47b8 Added keyword searching. 1997-05-26 16:35:46 +00:00
Guido van Rossum d993695b0f Added keyword searching radio buttons. 1997-05-26 16:35:27 +00:00
Guido van Rossum 030144d9f6 Don't need <BR> at end of <LI>. 1997-05-26 16:02:56 +00:00
Guido van Rossum 1f04772347 Bugfix -- should pass headers=0 when formatting query results. 1997-05-26 16:02:00 +00:00
Guido van Rossum 21c4b5f66a Add optional section titles to format_all.
Allow for missing section titles in format_all and format_index.
1997-05-26 06:28:40 +00:00
Guido van Rossum b1d8a259e3 Oops, checked in the wrong SHORTNAME value. 1997-05-26 06:14:02 +00:00
Guido van Rossum 2305231acd Added a local customization feature: load some parameters from faqcust. 1997-05-26 06:12:50 +00:00
Guido van Rossum 46a160f6de Initial revision 1997-05-26 06:11:35 +00:00
Guido van Rossum 72fdef07d3 Initial revision 1997-05-26 05:43:37 +00:00
Guido van Rossum ea31ea2859 I'm happy with this. 1997-05-26 05:43:29 +00:00
Guido van Rossum 1677e5b5dd Initial revision 1997-05-26 00:07:18 +00:00
Guido van Rossum efe640c00f Final touch -- bottom link points to FAQ Wizard home. 1997-05-23 23:07:44 +00:00
Guido van Rossum eae7d0ecf3 Forgot to remove debug stmt. 1997-05-23 22:46:16 +00:00
Guido van Rossum 9c6cedaa08 The official name of the whole thing is now "Python FAQ Wizard";
and it's in beta testing now.

Renamed the cookie id to Python-FAQ-Wizard and fixed some bugs
related to cookie processing.
1997-05-23 22:44:01 +00:00
Guido van Rossum a78a3c30dd Implemented a simple Password scheme.
Added a Help link to the FAQ Edit Wizard.
1997-05-23 22:29:24 +00:00
Guido van Rossum f8f0fb7069 Added last mtime on whole faq.
Added more elaborate explanation of version conflict.
Set fake header on commit so show() shows the new mtime.
Reorder name/email fields to be after the log message.
1997-05-23 21:21:43 +00:00
Guido van Rossum 1d57981015 Nits.
Use TARGET=rlog instead of _blank for rlog output.
Add missing mailto: to author's email address in Last changed link.
Forget about the domainname in the cookie; the default is just fine.
1997-05-23 19:18:35 +00:00
Guido van Rossum 1dcc24404b Reorg TO DO list.
Run rlog commands as pipe so we can cgi.escape(output).
Added self.escape(), like cgi.escape() but also escapes " -- for use
as double-quoted attribute values inside HTML tags.
Fix complaints from Mark Lutz: HTTP_HOST is not always set.
1997-05-23 18:53:06 +00:00
Guido van Rossum a0e9a6d332 Small nits for Grail; don't forget to cgi.translate & and <. 1997-05-23 18:13:58 +00:00
Guido van Rossum d1c1ec8a77 - Suppress the visible text in the epilogue when "edit=no" is specified.
- Add translation of *foobar* into <I>foobar</I> (but not inside <PRE>).
- Optimization for the translation: only translate when @ or / or * seen.
- Add some layout to the front page.
1997-05-23 17:45:04 +00:00
Guido van Rossum 4888c7ece5 Added section titles (for now, hardcoded in the file). 1997-05-23 15:55:19 +00:00
Guido van Rossum 5527db5b76 Added code to recognize http/ftp URLs and email addresses, and add <A>
tags for them.
1997-05-23 04:44:30 +00:00
Guido van Rossum c6447527aa Minor change: add prologue() calls to do_info and do_rlog; in
do_recent, don't display entries without a last-changed-date.
1997-05-23 00:50:01 +00:00
Guido van Rossum af5be958e3 Added menu entry on deleting FAQ entries.
Added a generic error handler function.
Added cookie support to preserve author's name and email.
1997-05-22 16:57:50 +00:00
Guido van Rossum 64099e955d HTML formatting nits.
Added "Reload this entry" link to edit complete screen.
1997-05-22 15:49:23 +00:00
Guido van Rossum ed531fd9df Mostly small changes.
- Fixed broken HTML in edit form that swallowed <BR> tags
- Removed "another one" link in roulette, replaced with "use Reload" message
- show() now has edit=1 by default; changed all calls
- Display last-changed date, author, email when present and edit=1
- Added back pointers at bottom
- Added a copy of the bootstrap script at the end of the file
1997-05-22 15:21:57 +00:00
Guido van Rossum 74427e51d7 Separate prologue/epilogue functions.
Get the current version on edit and reject commits based on an old
version.
1997-05-21 23:43:39 +00:00
Guido van Rossum 4c45422b13 Oops, left an 'n = n+1' in but n was undefined. 1997-05-21 22:42:23 +00:00
Guido van Rossum f701bf1098 Add "Log info" to view rcs log.
Speed up __getattr__ by storing the value as self.<key>.
1997-05-21 22:25:56 +00:00
Guido van Rossum d7bfa80c63 Lots of new features:
- got rid of the separate search page
- added an index (lists the questions with links to the answers)
- add a mechanism to add new entries
- add a way to list most recently edited entries first
1997-05-21 21:31:39 +00:00
Guido van Rossum 3c3354c0e7 Crude but functional, except for adding new entries. 1997-05-21 16:52:18 +00:00
Guido van Rossum adb3a9db57 Initial revision 1997-05-21 07:24:50 +00:00
Guido van Rossum 650b3aad55 Unlink before rename (for NT). 1997-05-19 15:20:49 +00:00
Guido van Rossum e6fcd33179 Correct comment; -c is off by default. 1997-05-15 21:51:16 +00:00
Guido van Rossum 1ade44cb0c Open the output files with 'wb', not 'w'. 1997-05-15 18:25:29 +00:00
Guido van Rossum 29f6533c7f Scroll to top of info window when done. 1997-05-09 03:19:29 +00:00
Guido van Rossum 89efda363f Avoid the fancy handler for error 401 (request authentication). 1997-05-07 15:00:56 +00:00
Guido van Rossum af310c1d00 Restructured Checker class to get rid of 'ext' table.
Links are now either in 'todo' or 'done', and ext links
are handled more like local links except that no further
links are gathered (and sometimes they aren't checked,
e.g. for mailto and news URLs).  The -x option reverses
its meaning: it disables checking of ext links (they are
moved to 'done' without checking).  A new 'errors' table
collects pages with bad links as we go -- redundant,
but useful for the GUI version which needs to report
this as we go.  Some new methods, including reset().
New checkpoint format.

Adapted the GUI to the changes in the Checker class.
Added Quit and "Start over" buttons, and a checkbox
to disable checking external links.  The details
window now also shows bad links emanating from the
selected page.  Miscellaneous small changes.
1997-02-02 23:30:32 +00:00
Guido van Rossum 4f6ecdaacf Add root URL entry box, separate start/stop/step buttons.
If the user selects an item in 'To check', start checking there.
1997-02-01 05:17:29 +00:00
Guido van Rossum 6133ec656e Process <img> and <frame> tags. Don't bother skipping second href. 1997-02-01 05:16:08 +00:00
Guido van Rossum de99d310cc Check in another copy of tktools.py... 1997-01-31 18:58:53 +00:00
Guido van Rossum 06981c328d Tk interface to webchecker. Not fully featured yet, but usable. 1997-01-31 18:58:12 +00:00
Guido van Rossum 0b0b5f0279 Spin off checking of external page in a subroutine.
Increase MAXPAGE to 150K.
Add back printing of __doc__ for usage message.
1997-01-31 18:57:23 +00:00
Guido van Rossum e5605ba3c2 Many misc changes.
- Faster HTML parser derived from SGMLparser (Fred Gansevles).

- All manipulations of todo, done, ext, bad are done via methods, so a
derived class can override.  Also moved the 'done' marking to
dopage(), so run() is much simpler.

- Added a method status() which returns a string containing the
summary counts; added a "total" count.

- Drop the guessing of the file type before opening the document -- we
still need to check those links for validity!

- Added a subroutine to close a connection which first slurps up the
remaining data when it's an ftp URL -- apparently closing an ftp
connection without reading till the end makes it hang.

- Added -n option to skip running (only useful with -R).

- The Checker object now has an instance variable which is set to 1
when it is changed.  This is not pickled.
1997-01-31 14:43:15 +00:00
Guido van Rossum c59a5d449f Set proper User-agent header (Python-webchecker/<version>).
When -x is combined with -q, still do the checking, but don't print
the error in this phase -- they are reported by report_errors().
1997-01-30 06:04:00 +00:00
Guido van Rossum 2739cd74b3 Some refinements of the external-link checking code: insert the errors
in the 'bad' dictionary (sanitize them so they are picklable; the
sanitation code is now a subroutine); don't check mailto: URLs; omit
colon in Error message.
1997-01-30 04:26:57 +00:00
Guido van Rossum de66268588 Added -x option to check external links. Slooooow! 1997-01-30 03:58:21 +00:00
Guido van Rossum 325a64f207 Catch I/O errors when parsing robots.txt file.
Add version number, printed at startup in non-quiet mode.
1997-01-30 03:30:20 +00:00
Guido van Rossum df47bafa1c Basic README file 1997-01-30 03:24:00 +00:00
Guido van Rossum 3edbb35023 Added robots.txt support, using Skip Montanaro's parser.
Fixed occasional inclusion of unpicklable objects (Message in errors).
Changed indent of a few messages.
1997-01-30 03:19:41 +00:00
Guido van Rossum bbf8c2fafd Skip Montanaro's robots.txt parser. 1997-01-30 03:18:23 +00:00
Guido van Rossum 272b37d686 web tree checker 1997-01-30 02:44:48 +00:00
Guido van Rossum d7e4705d8f mime types guesser 1997-01-30 02:44:20 +00:00