Ezio Melotti
b814745226
#19480: HTMLParser now accepts all valid start-tag names as defined by the HTML5 standard.
2013-11-07 18:31:36 +02:00
Ezio Melotti
c45868ec69
#14538: HTMLParser can now correctly parse start tags that contain a bare /.
2012-04-18 19:08:41 -06:00
Ezio Melotti
36b7361fe7
HTMLParser is now able to handle slashes in the start tag.
2012-02-21 09:22:16 +02:00
Ezio Melotti
65d36dab4d
#13987: HTMLParser is now able to handle malformed start tags.
2012-02-15 13:19:10 +02:00
Ezio Melotti
d2307cb48a
#13987: HTMLParser is now able to handle EOFs in the middle of a construct.
2012-02-15 12:44:23 +02:00
Ezio Melotti
369cbd744e
Fix an index, add more tests, avoid raising errors for unknown declarations, and clean up comments.
2012-02-13 20:36:55 +02:00
Ezio Melotti
f117443cb8
#13993: HTMLParser is now able to handle broken end tags.
2012-02-13 16:28:54 +02:00
Ezio Melotti
4b92cc3f79
#13960: HTMLParser is now able to handle broken comments.
2012-02-13 16:10:44 +02:00
Ezio Melotti
6b7003a18c
#13576: add tests about the handling of (possibly broken) condcoms.
2011-12-19 07:28:08 +02:00
Ezio Melotti
00dc60beee
#13358: HTMLParser now calls handle_data only once for each CDATA.
2011-11-18 18:00:40 +02:00
Ezio Melotti
0f1571ce7f
#1745761, #755670, #13357, #12629, #1200313: improve attribute handling in HTMLParser.
2011-11-14 18:04:05 +02:00
Ezio Melotti
74592919d6
Group tests about attributes in a separate class.
2011-11-08 02:07:18 +02:00
Ezio Melotti
7e82b276dd
#670664: Fix HTMLParser to correctly handle the content of ``<script>...</script>`` and ``<style>...</style>``.
2011-11-01 14:09:56 +02:00
Ezio Melotti
9f1ffb2ae9
#7311: fix HTMLParser to accept non-ASCII attribute values.
2011-04-05 20:40:52 +03:00
Senthil Kumaran
3f60f09eb2
Fix Issue10759 - HTMLParser.unescape() to handle malformed charrefs.
2010-12-28 16:05:07 +00:00
Victor Stinner
554a3b82e4
Issue #6662: Fix parsing of malformatted charref (&#bad;)
2010-05-24 21:33:24 +00:00
Benjamin Peterson
b3be23ad27
remove the svn:executable property from files that don't have shebang lines
2010-03-05 03:20:06 +00:00
Georg Brandl
bcdafa44f2
Remove html package and fix test_htmlparser.
2008-05-20 07:58:42 +00:00
Benjamin Peterson
d21dabc890
change some imports in tests so they will not be skipped in 3.0
2008-05-18 23:07:07 +00:00
Christian Heimes
c5f05e45cf
Patch #2167 from calvin: Remove unused imports
2008-02-23 17:40:11 +00:00
Martin v. Löwis
ab8a6bba25
Patch #912410: Replace HTML entity references for attribute values
in HTMLParser.
2007-03-06 14:43:00 +00:00
Georg Brandl
d09def36d5
Bug #1442874: handle "<!>", the empty SGML comment
2006-03-09 13:27:14 +00:00
Fred Drake
75d9a62fe0
add tests that make sure buffer boundaries are handled properly for SGML comments
(see SF patch #901369)
2004-09-08 22:57:01 +00:00
Tim Peters
27f883687b
Whitespace normalization.
2004-07-08 04:22:35 +00:00
Andrew M. Kuchling
b7d8ce0275
[Bug #921657] Allow '@' in unquoted HTML attributes. Not strictly legal according to the HTML REC, but HTMLParser is already a pretty loose parser. Reported by Bernd Zimmermann.
2004-06-05 15:31:45 +00:00
Fred Drake
fafd56f439
Add test that demonstrates SGML-style handling of processing
instructions.
2003-04-17 22:19:26 +00:00
Fred Drake
0834d77bc4
Accept commas in unquoted attribute values.
This closes SF patch #669683.
2003-03-14 16:21:57 +00:00
Barry Warsaw
04f357cffe
Get rid of relative imports in all unittests. Now anything that
imports e.g. test_support must do so using an absolute package name
such as "import test.test_support" or "from test import test_support".
This also updates the README in Lib/test, and gets rid of the
duplicate data directory in Lib/test/data (replaced by
Lib/email/test/data).
Now Tim and Jack can have at it. :)
2002-07-23 19:04:11 +00:00
Fred Drake
073148c4ef
Add a test that makes sure unclosed entity references are handled consistently.
2001-12-03 16:44:09 +00:00
Fred Drake
e822049efc
Adapt to use the test_main() approach.
2001-09-24 20:19:08 +00:00
Fred Drake
7cf613dc77
HTMLParser is allowed to be more strict than sgmllib, so let's not
change their basic behavior: When parsing something that cannot possibly
be valid in either HTML or XHTML, raise an exception.
2001-09-04 16:26:03 +00:00
Fred Drake
c20a698932
Enhanced the test for DOCTYPE declarations, added a test for dealing with
broken declaration-like things.
2001-09-04 15:13:04 +00:00
Fred Drake
029acfb922
Deal more appropriately with bare ampersands and pointy brackets; this
module has to deal with "classic" HTML-as-deployed as well as XHTML, so we
cannot be as strict as XHTML allows.
This closes SF bug #453059, but uses a different fix than suggested in
the bug comments.
2001-08-20 21:24:19 +00:00
Fred Drake
84bb9d8dc4
Fix stupid bug: when migrating these tests from the Zope repository, the
names of the test methods were not changed from the Zope-standard "check"
names to the Python-standard "test_" names, so the tests were not actually
being run.
Added test of hexadecimal character references as a regression check for
SF bug #445196.
2001-08-03 19:53:01 +00:00
Fred Drake
50bc19fce5
Remove a duplicate test.
2001-07-16 18:50:29 +00:00
Fred Drake
bd3090d4d6
Added test suite for the new HTMLParser module, originally from the
TAL/PageTemplate package for Zope. This only needed a little boilerplate
change; the tests themselves are unchanged.
2001-05-18 15:32:59 +00:00