Commit Graph

7 Commits

Author SHA1 Message Date
R. David Murray bb7b753cfc Add missing versionchanged, correct 'throw' wording to 'raise'. 2010-12-03 04:26:18 +00:00
R. David Murray b579dba119 #1486713: Add a tolerant mode to HTMLParser.
The motivation for adding this option is that the the functionality it
provides used to be provided by sgmllib in Python2, and was used by,
for example, BeautifulSoup.  Without this option, the Python3 version
of BeautifulSoup and the many programs that use it are crippled.

The original patch was by 'kxroberto'.  I modified it heavily but kept his
heuristics and test.  I also added additional heuristics to fix #975556,
#1046092, and part of #6191.  This patch should be completely backward
compatible:  the behavior with the default strict=True is unchanged.
2010-12-03 04:06:39 +00:00
Georg Brandl 7cb1319688 Terminology fix: exceptions are raised, except in generator.throw(). 2010-08-03 12:06:29 +00:00
Georg Brandl 46aa5c5ba1 #3874: document HTMLParser.unknown_decl(). 2010-07-29 13:38:37 +00:00
Ezio Melotti 2fad00c198 Updated the last example as requested in #6350 2009-06-27 22:58:15 +00:00
Georg Brandl 877b10add4 Remove the htmllib and sgmllib modules as per PEP 3108. 2008-06-01 21:25:55 +00:00
Georg Brandl 9087b7f83b Merged revisions 63438 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk

........
  r63438 | georg.brandl | 2008-05-17 23:54:03 +0200 (Sat, 17 May 2008) | 3 lines

  Rename html.parser file, and split html.entities from htmllib
  to ease removal of the latter in Py3k.
........
2008-05-18 07:53:01 +00:00