cpython/Lib
R. David Murray b579dba119 #1486713: Add a tolerant mode to HTMLParser.
The motivation for adding this option is that the the functionality it
provides used to be provided by sgmllib in Python2, and was used by,
for example, BeautifulSoup.  Without this option, the Python3 version
of BeautifulSoup and the many programs that use it are crippled.

The original patch was by 'kxroberto'.  I modified it heavily but kept his
heuristics and test.  I also added additional heuristics to fix #975556,
#1046092, and part of #6191.  This patch should be completely backward
compatible:  the behavior with the default strict=True is unchanged.
2010-12-03 04:06:39 +00:00
..
concurrent Removes an inefficient spin loop in as_completed 2010-11-17 11:06:29 +00:00
ctypes #9424: Replace deprecated assert* methods in the Python test suite. 2010-11-20 19:04:17 +00:00
curses #8861: remove unused variable. 2010-08-02 19:44:48 +00:00
dbm #9397: remove mention of dbm.bsd which does not exist anymore. 2010-07-29 16:26:56 +00:00
distutils sys.abiflags is not defined on all platforms. 2010-11-25 03:46:44 +00:00
email #8989: add 'domain' keyword to make_msgid. 2010-12-02 21:47:19 +00:00
encodings #7475: add (un)transform method to bytes/bytearray and str, add back codecs that can be used with them from Python 2. 2010-12-02 18:06:51 +00:00
html #1486713: Add a tolerant mode to HTMLParser. 2010-12-03 04:06:39 +00:00
http Fix issue3709 - BaseHTTPRequestHandler will buffer the headers and write only on end_headers call. 2010-11-21 14:36:14 +00:00
idlelib Issue 9222 Fix filetypes for open dialog 2010-11-23 06:01:31 +00:00
importlib #9424: Replace deprecated assert* methods in the Python test suite. 2010-11-20 19:04:17 +00:00
json Issue #10572: Moved json tests to Lib/test/json_tests. 2010-11-30 03:03:30 +00:00
lib2to3 revert changes in inappropiate branch 2010-11-20 19:09:08 +00:00
logging Logging: added stack_info argument. 2010-11-14 21:33:04 +00:00
msilib remove pointless coding cookies 2010-08-30 14:05:50 +00:00
multiprocessing Issue #9244: multiprocessing.pool: Worker crashes if result can't be encoded 2010-11-09 20:55:52 +00:00
plat-aix4
plat-darwin
plat-freebsd4 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd5 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd6 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd7 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-freebsd8 Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-generic
plat-linux2
plat-netbsd1
plat-next3
plat-os2emx Merged revisions 78644 via svnmerge from 2010-03-04 02:11:41 +00:00
plat-sunos5 Merged revisions 78678,78680,78682 via svnmerge from 2010-03-05 03:33:11 +00:00
plat-unixware7
pydoc_data Update pydoc topics. 2010-11-13 06:36:58 +00:00
site-packages
sqlite3 #9424: Replace deprecated assert* methods in the Python test suite. 2010-11-20 19:04:17 +00:00
test #1486713: Add a tolerant mode to HTMLParser. 2010-12-03 04:06:39 +00:00
tkinter Issue #6878: Fixed return type of tkinter methods documented to return lists. 2010-11-22 19:40:51 +00:00
turtledemo Translated German message into English. 2010-11-01 18:42:01 +00:00
unittest Issue 7911: unittest.TestCase.longMessage defaults to True for improved failure messages by default 2010-12-03 00:53:09 +00:00
urllib Fix spelling of Jamie Zawinski's surname in urllib.parse docstring (issue 10606) 2010-12-02 16:41:00 +00:00
wsgiref Implement http://bugs.python.org/issue10155 using And Clover's patch, w/added 2010-11-03 22:39:01 +00:00
xml Fix Issue10205 - XML QName error when different tags have same QName. 2010-11-09 02:36:59 +00:00
xmlrpc Code Changes as per review comments by Antoine Pitrou. 2010-11-18 17:08:48 +00:00
__future__.py
__phello__.foo.py
_abcoll.py Issue #10565: Iterator ABC should require both __next__ and __iter__. 2010-11-29 03:56:12 +00:00
_compat_pickle.py
_dummy_thread.py Fix bootstrap issues when building without threads 2010-11-05 19:58:28 +00:00
_markupbase.py Take advantage of the frozenset constant optimization. 2010-09-05 23:15:06 +00:00
_pyio.py Issue #10180: Pickling file objects is now explicitly forbidden, since 2010-11-05 21:15:39 +00:00
_strptime.py PEP 8 conformance: class_ -> cls 2010-07-14 13:46:57 +00:00
_threading_local.py Issue #9707: Rewritten reference implementation of threading.local which 2010-09-07 22:06:17 +00:00
_weakrefset.py Issue #7105: Make WeakKeyDictionary and WeakValueDictionary robust against 2010-01-08 17:54:23 +00:00
abc.py add support for abstract class and static methods #5867 2010-08-17 00:52:52 +00:00
aifc.py
antigravity.py
argparse.py Fix bug 9340 - argparse parse_known_args didn't work with subparsers 2010-11-02 12:47:22 +00:00
ast.py Fix handling on negative numbers in ast.literal_eval(). 2010-10-08 00:47:45 +00:00
asynchat.py
asyncore.py Fix issue 6706: adds new handle_accepted() method to asyncore.dispatcher 2010-10-04 21:08:36 +00:00
base64.py #1466065: add validate option to base64.b64decode 2010-11-11 20:09:20 +00:00
bdb.py Use booleans where applicable. 2010-11-29 20:19:15 +00:00
binhex.py
bisect.py
cProfile.py #9428: fix running scripts from profile/cProfile with their own name and the right namespace. Same fix as for trace.py in #1690103. 2010-08-02 12:20:23 +00:00
calendar.py #10092: Properly reset locale in Locale*Calendar classes. The context manager was buggy because setlocale() returns the *new* locale, not the old. Also add a test for this. 2010-10-19 18:54:25 +00:00
cgi.py #2830: add html.escape() helper and move cgi.escape() uses in the standard library to it. It defaults to quote=True and also escapes single quotes, which makes casual use safer. The cgi.escape() interface is not touched, but emits a (silent) PendingDeprecationWarning. 2010-10-15 15:57:45 +00:00
cgitb.py
chunk.py
cmd.py Add docstring to cmd.Cmd.do_help() 2010-09-09 03:53:22 +00:00
code.py
codecs.py #7475: add (un)transform method to bytes/bytearray and str, add back codecs that can be used with them from Python 2. 2010-12-02 18:06:51 +00:00
codeop.py
collections.py Issue 9865: add __sizeof__ to OrderedDict. 2010-09-16 19:10:17 +00:00
colorsys.py
compileall.py Fix typos and style in compileall. 2010-11-20 21:53:02 +00:00
configparser.py configparser: read-only attributes to get the section name and parser from a SectionProxy instance 2010-11-21 13:56:42 +00:00
contextlib.py Issue 9110. Adding ContextDecorator to contextlib. This enables the creation of APIs that act as decorators as well as context managers. contextlib.contextmanager changed to use ContextDecorator. 2010-06-30 12:17:50 +00:00
copy.py Fix running the copy module from the command-line (however use{ful,less} it may be). 2010-09-04 17:46:44 +00:00
copyreg.py
csv.py #5975: add unix_dialect to csv module. 2010-10-27 07:27:06 +00:00
datetime.py Issue 9183: Intern UTC timezone. 2010-10-14 17:03:51 +00:00
decimal.py Add link to specification. 2010-11-30 20:32:59 +00:00
difflib.py Issue 2986: Add autojunk paramater to SequenceMatcher to turn off heuristic. Patch by Terry Reedy, Eli Bendersky, and Simon Cross 2010-11-25 06:12:34 +00:00
dis.py As per python-dev discussion with Eli, properly document and publish dis.show_code 2010-09-10 14:08:04 +00:00
doctest.py More proper closing of files 2010-10-14 22:11:44 +00:00
dummy_threading.py
filecmp.py
fileinput.py #1286: allow using fileinput.FileInput as context manager. 2010-07-31 20:08:15 +00:00
fnmatch.py Re-apply r83871. 2010-08-13 16:26:40 +00:00
formatter.py
fractions.py Make Fraction(-1).__hash__() return -2 rather than -1 (see issue 10356). 2010-11-13 10:27:38 +00:00
ftplib.py fix ftplib resource warnings 2010-10-31 19:58:07 +00:00
functools.py Issue 10593: Adopt Nick's suggestion for an lru_cache with maxsize=None. 2010-12-01 03:45:41 +00:00
genericpath.py Merged revisions 78018,78035-78040,78042-78043,78046,78048-78052,78054,78059,78075-78080 via svnmerge from 2010-03-14 10:23:39 +00:00
getopt.py remove pointless coding cookies 2010-08-30 14:05:50 +00:00
getpass.py
gettext.py Fix interaction of custom translation classes and caching (#9042) 2010-10-04 23:52:37 +00:00
glob.py
gzip.py #10465: fix broken delegation in __getattr__ of _PaddedFile. 2010-11-20 11:25:01 +00:00
hashlib.py hashlib has two new constant attributes: algorithms_guaranteed and 2010-09-06 08:30:23 +00:00
heapq.py Remove coding cookie from heapq.py. 2010-07-04 19:23:49 +00:00
hmac.py speed hack can be implemented more cleanly by directly calling __new__ 2010-08-21 02:45:15 +00:00
imaplib.py Issue #10429: IMAP.starttls() stored the capabilities as bytes objects, 2010-11-16 17:55:26 +00:00
imghdr.py
inspect.py Issue 10220: switch to using string constants rather than integers for inspect.getgeneratorstate() return values and make debugging friendly str() and repr() for generator states a requirement in the test suite 2010-11-30 06:36:04 +00:00
io.py
keyword.py Use a context manager for some file objects. 2010-09-03 19:52:03 +00:00
linecache.py Issue #10335: Add tokenize.open(), detect the file encoding using 2010-11-09 01:08:59 +00:00
locale.py Merged revisions 80512 via svnmerge from 2010-04-27 02:45:53 +00:00
macpath.py Issue #767645: Set os.path.supports_unicode_filenames to True on Mac OS X 2010-09-13 19:41:36 +00:00
macurl2path.py
mailbox.py Fix test_mailbox by supporting context manager protocol for get_file() returns. 2010-10-30 14:33:28 +00:00
mailcap.py Issue #8496: make mailcap.lookup() always return a list, rather than an iterator. 2010-04-22 13:30:10 +00:00
mimetypes.py Fix r85774 editor fail. Had a dangling try and incorrect indent. 2010-10-21 14:45:01 +00:00
modulefinder.py Silence ResourceWarnings in modulefinder by using file context managers. 2010-10-30 00:26:48 +00:00
netrc.py #10464: fix netrc handling of lines with embedded '#" characters. 2010-12-02 02:58:07 +00:00
nntplib.py Make `usenetrc` False by default (the old behaviour of having it True by 2010-11-09 18:58:42 +00:00
ntpath.py Issue #5117: Case normalization was needed on ntpath.relpath(). And 2010-10-18 12:13:18 +00:00
nturl2path.py
numbers.py
opcode.py #4617: Previously it was illegal to delete a name from the local 2010-09-10 21:39:53 +00:00
optparse.py
os.py Issue 9299 Add exist_ok parameter to os.makedirs to suppress 'File exists' exception. Patch by Ray Allen. 2010-12-02 07:05:56 +00:00
os2emxpath.py #9018: os.path.normcase() now raises a TypeError if the argument is not str or bytes. 2010-06-25 10:56:11 +00:00
pdb.py Use booleans where applicable. 2010-11-29 20:19:15 +00:00
pickle.py Issue #9410: Various optimizations to the pickle module, leading to 2010-09-09 18:33:21 +00:00
pickletools.py Syntax cleanup. 2010-07-28 16:39:41 +00:00
pipes.py Merged revisions 80030,80067,80069,80080-80081,80084,80432-80433,80465-80470,81059,81065-81067 via svnmerge from 2010-05-19 20:57:08 +00:00
pkgutil.py Have pkgutil properly close files. 2010-10-29 22:36:53 +00:00
platform.py Use a context manager for some file objects. 2010-09-03 19:52:03 +00:00
plistlib.py Merged revisions 80279 via svnmerge from 2010-04-20 21:00:34 +00:00
poplib.py fix issue #8807: adds a context parameter to POP3_SSL class. 2010-08-17 16:09:53 +00:00
posixpath.py Issue #5117: Case normalization was needed on ntpath.relpath(). And 2010-10-18 12:13:18 +00:00
pprint.py Have pprint() respect the order in an OrderedDict. 2010-09-09 12:31:00 +00:00
profile.py #9428: fix running scripts from profile/cProfile with their own name and the right namespace. Same fix as for trace.py in #1690103. 2010-08-02 12:20:23 +00:00
pstats.py Fix issue10377 - Output from pstats - it is just secs (i.e, wallclock time) and not CPU time. 2010-11-20 17:02:50 +00:00
pty.py
py_compile.py Issue #10335: Add tokenize.open(), detect the file encoding using 2010-11-09 01:08:59 +00:00
pyclbr.py
pydoc.py #9724: add nonlocal to pydoc topics. 2010-11-20 19:54:36 +00:00
queue.py Issue 10110: Let Queue.put recognize a full queue when the maxsize parameter has been reduced. 2010-10-31 17:57:52 +00:00
quopri.py convert shebang lines: python -> python3 2010-03-11 22:53:45 +00:00
random.py Improve variable name (don't shadow a builtin). 2010-09-08 19:27:59 +00:00
re.py Adopt more descriptive attribute names as suggested on python-dev. 2010-09-04 22:46:06 +00:00
reprlib.py Fix bootstrap issues when building without threads 2010-11-05 19:58:28 +00:00
rlcompleter.py
runpy.py account for the module __dict__ being cleared when its module is 2010-10-13 01:04:36 +00:00
sched.py
shelve.py Merged revisions 78141-78142 via svnmerge from 2010-02-11 02:42:19 +00:00
shlex.py Recode modules from latin-1 to utf-8 2010-10-27 18:52:48 +00:00
shutil.py Merged revisions 80837 via svnmerge from 2010-05-05 22:43:04 +00:00
site.py Issue #6612: Fix site and sysconfig to catch os.getcwd() error, eg. if the 2010-10-12 22:23:23 +00:00
smtpd.py Fix issue 6706: adds new handle_accepted() method to asyncore.dispatcher 2010-10-04 21:08:36 +00:00
smtplib.py #10321: Add support for sending binary DATA and Message objects to smtplib 2010-11-08 17:15:13 +00:00
sndhdr.py Issue #9243: Fix sndhdr module and add unit tests, contributed by James Lee. 2010-07-13 23:04:56 +00:00
socket.py Issue #10093: ResourceWarnings are now issued when files and sockets are 2010-10-29 10:38:18 +00:00
socketserver.py Merged revisions 80484 via svnmerge from 2010-04-25 22:01:43 +00:00
sre_compile.py Issue #8990: array.fromstring() and array.tostring() get renamed to 2010-09-01 20:29:34 +00:00
sre_constants.py
sre_parse.py #6509: fix re.sub to work properly when the pattern, the string, and the replacement were all bytes. Patch by Antoine Pitrou. 2010-03-06 15:24:08 +00:00
ssl.py Issue #5639: Add a *server_hostname* argument to `SSLContext.wrap_socket` 2010-10-22 18:19:07 +00:00
stat.py
string.py #9418: first step of moving private string methods to _string module. 2010-10-14 07:04:07 +00:00
stringprep.py
struct.py Issue #8973: Add __all__ to struct module, so that help(struct) correctly 2010-06-12 15:17:02 +00:00
subprocess.py Fix #10554. Added context manager support to Popen objects. 2010-12-03 02:46:02 +00:00
sunau.py Issue #10265: Close file objects explicitly in sunau. Patch by Brian Brazil. 2010-10-31 21:27:04 +00:00
symbol.py convert shebang lines: python -> python3 2010-03-11 22:53:45 +00:00
symtable.py
sysconfig.py sys.abiflags may not be defined on all platforms. 2010-11-25 01:34:47 +00:00
tabnanny.py Issue #10335: Add tokenize.open(), detect the file encoding using 2010-11-09 01:08:59 +00:00
tarfile.py Issue #10184: Touch directories only once when extracting a tarfile. 2010-11-01 21:39:13 +00:00
telnetlib.py #7761: fix telnetlib.interact failures on Windows. 2010-10-26 12:42:24 +00:00
tempfile.py Simplify code 2010-11-09 03:43:58 +00:00
textwrap.py
this.py
threading.py Issue 10260 2010-11-18 12:46:39 +00:00
timeit.py #4810: document "--" option separator in timeit help. 2010-08-01 08:52:32 +00:00
token.py Issue #10386: Added __all__ to token module; this simplifies importing 2010-11-11 14:07:41 +00:00
tokenize.py Issue #10386: Added __all__ to token module; this simplifies importing 2010-11-11 14:07:41 +00:00
trace.py Issue #10371: Deprecated undocumented functions in the trace module. 2010-11-20 18:21:07 +00:00
traceback.py
tty.py
turtle.py Issue #6878: Fixed return type of tkinter methods documented to return lists. 2010-11-22 19:40:51 +00:00
types.py
uu.py Issue #10266: uu.decode didn't close in_file explicitly when it was given 2010-10-31 16:04:14 +00:00
uuid.py Shift the pipe-using code into an else block, then close the pipe 2010-11-05 14:48:35 +00:00
warnings.py Add a new warning gategory, ResourceWarning, as discussed on python-dev. It is silent by default, 2010-10-24 15:11:22 +00:00
wave.py #10198: fix duplicate header when writeframes() is called with an empty string. 2010-10-30 08:29:28 +00:00
weakref.py Issue #7105: Make WeakKeyDictionary and WeakValueDictionary robust against 2010-01-08 17:54:23 +00:00
webbrowser.py Remove conditional import of 'ic', that module 2010-05-30 15:46:48 +00:00
wsgiref.egg-info
xdrlib.py
zipfile.py zipfile: remove remaining ResourceWarnings 2010-11-23 00:15:02 +00:00