Commit Graph

34 Commits

Author SHA1 Message Date
Ezio Melotti 36b7361fe7 HTMLParser is now able to handle slashes in the start tag. 2012-02-21 09:22:16 +02:00
Ezio Melotti 65d36dab4d #13987: HTMLParser is now able to handle malformed start tags. 2012-02-15 13:19:10 +02:00
Ezio Melotti d2307cb48a #13987: HTMLParser is now able to handle EOFs in the middle of a construct. 2012-02-15 12:44:23 +02:00
Ezio Melotti 369cbd744e Fix an index, add more tests, avoid raising errors for unknown declarations, and clean up comments. 2012-02-13 20:36:55 +02:00
Ezio Melotti f117443cb8 #13993: HTMLParser is now able to handle broken end tags. 2012-02-13 16:28:54 +02:00
Ezio Melotti 4b92cc3f79 #13960: HTMLParser is now able to handle broken comments. 2012-02-13 16:10:44 +02:00
Ezio Melotti 6b7003a18c #13576: add tests about the handling of (possibly broken) condcoms. 2011-12-19 07:28:08 +02:00
Ezio Melotti 00dc60beee #13358: HTMLParser now calls handle_data only once for each CDATA. 2011-11-18 18:00:40 +02:00
Ezio Melotti 0f1571ce7f #1745761, #755670, #13357, #12629, #1200313: improve attribute handling in HTMLParser. 2011-11-14 18:04:05 +02:00
Ezio Melotti 74592919d6 Group tests about attributes in a separate class. 2011-11-08 02:07:18 +02:00
Ezio Melotti 7e82b276dd #670664: Fix HTMLParser to correctly handle the content of ``<script>...</script>`` and ``<style>...</style>``. 2011-11-01 14:09:56 +02:00
Ezio Melotti 9f1ffb2ae9 #7311: fix HTMLParser to accept non-ASCII attribute values. 2011-04-05 20:40:52 +03:00
Senthil Kumaran 3f60f09eb2 Fix Issue10759 - HTMLParser.unescape() to handle malformed charrefs. 2010-12-28 16:05:07 +00:00
Victor Stinner 554a3b82e4 Issue #6662: Fix parsing of malformatted charref (&#bad;) 2010-05-24 21:33:24 +00:00
Benjamin Peterson b3be23ad27 remove the svn:executable property from files that don't have shebang lines 2010-03-05 03:20:06 +00:00
Georg Brandl bcdafa44f2 Remove html package and fix test_htmlparser. 2008-05-20 07:58:42 +00:00
Benjamin Peterson d21dabc890 change some imports in tests so they will not be skipped in 3.0 2008-05-18 23:07:07 +00:00
Christian Heimes c5f05e45cf Patch #2167 from calvin: Remove unused imports 2008-02-23 17:40:11 +00:00
Martin v. Löwis ab8a6bba25 Patch #912410: Replace HTML entity references for attribute values in HTMLParser. 2007-03-06 14:43:00 +00:00
Georg Brandl d09def36d5 Bug #1442874: handle "<!>", the empty SGML comment 2006-03-09 13:27:14 +00:00
Fred Drake 75d9a62fe0 add tests that make sure buffer boundaries are handled properly for SGML comments (see SF patch #901369) 2004-09-08 22:57:01 +00:00
Tim Peters 27f883687b Whitespace normalization. 2004-07-08 04:22:35 +00:00
Andrew M. Kuchling b7d8ce0275 [Bug #921657] Allow '@' in unquoted HTML attributes. Not strictly legal according to the HTML REC, but HTMLParser is already a pretty loose parser. Reported by Bernd Zimmermann. 2004-06-05 15:31:45 +00:00
Fred Drake fafd56f439 Add test that demonstrates SGML-style handling of processing instructions. 2003-04-17 22:19:26 +00:00
Fred Drake 0834d77bc4 Accept commas in unquoted attribute values. This closes SF patch #669683. 2003-03-14 16:21:57 +00:00
Barry Warsaw 04f357cffe Get rid of relative imports in all unittests. Now anything that imports e.g. test_support must do so using an absolute package name such as "import test.test_support" or "from test import test_support". This also updates the README in Lib/test, and gets rid of the duplicate data directory in Lib/test/data (replaced by Lib/email/test/data). Now Tim and Jack can have at it. :) 2002-07-23 19:04:11 +00:00
Fred Drake 073148c4ef Add a test that makes sure unclosed entity references are handled consistently. 2001-12-03 16:44:09 +00:00
Fred Drake e822049efc Adapt to use the test_main() approach. 2001-09-24 20:19:08 +00:00
Fred Drake 7cf613dc77 HTMLParser is allowed to be more strict than sgmllib, so let's not change their basic behavior: When parsing something that cannot possibly be valid in either HTML or XHTML, raise an exception. 2001-09-04 16:26:03 +00:00
Fred Drake c20a698932 Enhanced the test for DOCTYPE declarations, added a test for dealing with broken declaration-like things. 2001-09-04 15:13:04 +00:00
Fred Drake 029acfb922 Deal more appropriately with bare ampersands and pointy brackets; this module has to deal with "classic" HTML-as-deployed as well as XHTML, so we cannot be as strict as XHTML allows. This closes SF bug #453059, but uses a different fix than suggested in the bug comments. 2001-08-20 21:24:19 +00:00
Fred Drake 84bb9d8dc4 Fix stupid bug: when migrating these tests from the Zope repository, the names of the test methods were not changed from the Zope-standard "check" names to the Python-standard "test_" names, so the tests were not actually being run. Added test of hexadecimal character references as a regression check for SF bug #445196. 2001-08-03 19:53:01 +00:00
Fred Drake 50bc19fce5 Remove a duplicate test. 2001-07-16 18:50:29 +00:00
Fred Drake bd3090d4d6 Added test suite for the new HTMLParser module, originally from the TAL/PageTemplate package for Zope. This only needed a little boilerplate change; the tests themselves are unchanged. 2001-05-18 15:32:59 +00:00