cpython

Commit Graph

Author	SHA1	Message	Date
Tim Peters	ea405639bf	Reserved another gc_refs value for untracked objects. Every live gc object should now have a well-defined gc_refs value, with clear transitions among gc_refs states. As a result, none of the visit_XYZ traversal callbacks need to check IS_TRACKED() anymore, and those tests were removed. (They were already looking for objects with specific gc_refs states, and the gc_refs state of an untracked object can no longer match any other gc_refs state by accident.) Added more asserts. I expect that the gc_next == NULL indicator for an untracked object is now redundant and can also be removed, but I ran out of time for this.	2002-07-02 00:52:30 +00:00
Fred Drake	7c75bf2090	Bring this back into sync with PyXML revision 1.58.	2002-07-01 14:02:31 +00:00
Tim Peters	19b74c7868	OK, I couldn't stand it <0.5 wink>: removed all uncertainty about what's in gc_refs, even at the cost of putting back a test+branch in visit_decref. The good news: since gc_refs became utterly tame then, it became clear that another special value could be useful. The move_roots() and move_root_reachable() passes have now been replaced by a single move_unreachable() pass. Besides saving a pass over the generation, this has a better effect: most of the time everything turns out to be reachable, so we were breaking the generation list apart and moving it into into the reachable list, one element at a time. Now the reachable stuff stays in the generation list, and the unreachable stuff is moved instead. This isn't quite as good as it sounds, since sometimes we guess wrongly that a thing is unreachable, and have to move it back again. Still, overall, it yields a significant (but not dramatic) boost in collection speed.	2002-07-01 03:52:19 +00:00
Tim Peters	93cd83e4ae	visit_decref(): Two optimizations. 1. You're not supposed to call this with a NULL argument, although the docs could be clearer about that. The other visit_XYZ() functions don't bother to check. This doesn't either now, although it does assert non-NULL-ness now. 2. It doesn't matter whether the object is currently tracked, so don't bother checking that either (if it isn't currently tracked, it may have some nonsense value in gc_refs, but it doesn't hurt to decrement gibberish, and it's cheaper to do so than to make everyone test for trackedness). It would be nice to get rid of the other tests on IS_TRACKED. Perhaps trackedness should not be a matter of not being in any gc list, but should be a matter of being in a new "untracked" gc list. This list simply wouldn't be involved in the collection mechanism. A newly created object would be put in the untracked list. Tracking would simply unlink it and move it into the gen0 list. Untracking would do the reverse. No test+branch needed then. visit_move() may be vulnerable then, though, and I don't know how this would work with the trashcan.	2002-06-30 21:31:03 +00:00
Tim Peters	8839617cc9	SF bug #574132 : Major GC related performance regression "The regression" is actually due to that 2.2.1 had a bug that prevented the regression (which isn't a regression at all) from showing up. "The regression" is actually a glitch in cyclic gc that's been there forever. As the generation being collected is analyzed, objects that can't be collected (because, e.g., we find they're externally referenced, or are in an unreachable cycle but have a __del__ method) are moved out of the list of candidates. A tricksy scheme uses negative values of gc_refs to mark such objects as being moved. However, the exact negative value set at the start may become "more negative" over time for objects not in the generation being collected, and the scheme was checking for an exact match on the negative value originally assigned. As a result, objects in generations older than the one being collected could get scanned too, and yanked back into a younger generation. Doing so doesn't lead to an error, but doesn't do any good, and can burn an unbounded amount of time doing useless work. A test case is simple (thanks to Kevin Jacobs for finding it!): x = [] for i in xrange(200000): x.append((1,)) Without the patch, this ends up scanning all of x on every gen0 collection, scans all of x twice on every gen1 collection, and x gets yanked back into gen1 on every gen0 collection. With the patch, once x gets to gen2, it's never scanned again until another gen2 collection, and stays in gen2. Bugfix candidate, although the code has changed enough that I think I'll need to port it by hand. 2.2.1 also has a different bug that causes bound method objects not to get tracked at all (so the test case doesn't burn absurd amounts of time in 2.2.1, but should <wink>).	2002-06-30 17:56:40 +00:00
Martin v. Löwis	6238d2b024	Patch #569753 : Remove support for WIN16. Rename all occurrences of MS_WIN32 to MS_WINDOWS.	2002-06-30 15:26:10 +00:00
Martin v. Löwis	adfa7409f8	Bump required PyXML version to 0.6.5.	2002-06-30 15:08:22 +00:00
Martin v. Löwis	7d650ca83b	Implement the encoding argument for toxml and toprettyxml. Document toprettyxml.	2002-06-30 15:05:00 +00:00
Martin v. Löwis	2ebfd09e58	Merge from PyXML: [1.3] Added documentation of the namespace URI for elements with no namespace. [1.4] New property http://www.python.org/sax/properties/encoding. [1.5] Support optional string interning in pyexpat.	2002-06-30 07:38:50 +00:00
Martin v. Löwis	0e2d881406	Add xml namespace initially (PyXML 1.19).	2002-06-30 07:32:56 +00:00
Martin v. Löwis	d1b516c274	Fix spacing.	2002-06-30 07:27:30 +00:00
Martin v. Löwis	18476a3740	Merge changes from PyXML: [1.15] Added understanding of the feature_validation, feature_external_pes, and feature_string_interning features. Added support for the feature_external_ges feature. Added support for the property_xml_string property. [1.16] Made it recognize the namespace prefixes feature. [1.17] removed erroneous first line [1.19] Support optional string interning in pyexpat. [1.21] Restore compatibility with versions of Python that did not support weak references. These do not get the cyclic reference fix, but they will continue to work as they did before. [1.22] Activate entity processing unless standalone.	2002-06-30 07:21:24 +00:00
Martin v. Löwis	b4fcf4d102	Define PyDoc_STRVAR if it is not available (PyXML 1.54). Remove support for Python 1.5 (PyXML 1.55).	2002-06-30 06:40:55 +00:00
Martin v. Löwis	6b2cf0e5ea	Undo usage of PyOS_snprintf (rev. 1.51 of PyXML).	2002-06-30 06:03:35 +00:00
Raymond Hettinger	550fd5d799	Fixed bug 574978 shutil example out of sync with source code	2002-06-30 04:43:20 +00:00
Raymond Hettinger	8a9e8b6d0e	Fix bug 575221 referred to dictionary type instead of dict.	2002-06-30 04:32:38 +00:00
Raymond Hettinger	46ac8eb3c8	Code modernization. Replace v=s[i]; del s[i] with single lookup v=s.pop(i)	2002-06-30 03:39:14 +00:00
Fred Drake	78e057a32a	Clarify the version information for the unicode() built-in. Closes SF bug #575272.	2002-06-29 16:06:47 +00:00
Barry Warsaw	1969817486	Another test of long headers.	2002-06-29 15:23:39 +00:00
Barry Warsaw	9546e7972c	Oleg Broytmann's support for RFC 2231 encoded parameters, SF patch #549133 New test cases.	2002-06-29 05:58:45 +00:00
Barry Warsaw	12566a8826	Oleg Broytmann's support for RFC 2231 encoded parameters, SF patch #549133 Specifically, decode_rfc2231(), encode_rfc2231(): Functions to encode and decode RFC 2231 style parameters. decode_params(): Function to decode a list of parameters.	2002-06-29 05:58:04 +00:00
Barry Warsaw	908dc4bea8	Oleg Broytmann's support for RFC 2231 encoded parameters, SF patch #549133 Specifically, _formatparam(): Teach this about encoded `param' arguments, which are a 3-tuple of items (charset, language, value). language is ignored. _unquotevalue(): Handle both 3-tuple RFC 2231 values and unencoded values. _get_params_preserve(): Decode the parameters before returning them. get_params(), get_param(): Use _unquotevalue(). get_filename(), get_boundary(): Teach these about encoded (3-tuple) parameters.	2002-06-29 05:56:15 +00:00
Barry Warsaw	3fdc889e76	test_multilingual(): Test for Header.__unicode__().	2002-06-29 03:27:27 +00:00
Barry Warsaw	8e69bdac33	__unicode__(): Patch # 541263 by Mikhail Zabaluev, implementation modified by Barry.	2002-06-29 03:26:58 +00:00
Greg Ward	ae64f3adcd	Add documentation for new textwrap module.	2002-06-29 02:38:50 +00:00
Greg Ward	8b46c71d5b	Typo fix.	2002-06-29 01:23:45 +00:00
Jeremy Hylton	566fe9ef66	Track change of begin() to _begin().	2002-06-28 23:54:30 +00:00
Barry Warsaw	b6a9213930	Lots of new and updated tests to check for proper ascii header folding. Note that some of the Japanese tests have changed, but I don't really know if they are correct or not. :( Someone with Japanese and RFC 2047 expertise, please take a look!	2002-06-28 23:49:33 +00:00
Barry Warsaw	ba2577b7f1	_max_append(): When adding the string `s' to its own line, it should be lstrip'd so that old continuation whitespace is replaced by that specified in Header's continuation_ws parameter.	2002-06-28 23:48:23 +00:00
Barry Warsaw	766125080f	Teach this class about "highest-level syntactic breaks" but only for headers with no charset or 'us-ascii' charsets. Actually this is only partially true: we know about semicolons (but not true parameters) and we know about whitespace (but not technically folding whitespace). Still it should be good enough for all practical purposes. Other changes include: __init__(): Add a continuation_ws argument, which defaults to a single space. Set this to change the whitespace used for continuation lines when a header must be split. Also, changed the way header line lengths are calculated, so that they take into account continuation_ws (when tabs-expanded) and any provided header_name parameter. This should do much better on returning split headers for which the first and subsequent lines must fit into a specified width. guess_maxlinelen(): Removed. I don't think we need this method as part of the public API. encode_chunks() -> _encode_chunks(): I don't think we need this one as part of the public API either.	2002-06-28 23:46:53 +00:00
Barry Warsaw	062749ac57	_split_header(): The code here was terminally broken because it didn't know anything about RFC 2047 encoded headers. Fortunately we have a perfectly good header splitter in Header.encode(). So we just call that to give us a properly formatted and split header. Header.encode() didn't know about "highest-level syntactic breaks" but that's been fixed now too.	2002-06-28 23:41:42 +00:00
Jeremy Hylton	7c75c99a10	Simplify HTTPSConnection constructor. See discussion in SF bug 458463.	2002-06-28 23:38:14 +00:00
Jeremy Hylton	13f99d7097	Close SF patch 523944: importing modules with foreign newlines. Didn't use the patch, because universal newlines support made it easy. It might be worth fixing the actual problem in the 2.2 maintenance branch, in which case the patch is still needed.	2002-06-28 23:32:51 +00:00
Fred Drake	2a3d7db93e	Added character data buffering to pyexpat parser objects. Setting the buffer_text attribute to true causes the parser to collect character data, waiting as long as possible to report it to the Python callback. This can save an enormous number of callbacks from C to Python, which can be a substantial performance improvement. buffer_text defaults to false.	2002-06-28 22:56:48 +00:00
Jeremy Hylton	3e76d7f3b3	Add Bob Kline of HTTP 100 fame.	2002-06-28 22:39:56 +00:00
Jeremy Hylton	be4fcf1875	Fixes for two separate HTTP/1.1 bugs: 100 responses and HTTPS connections. The HTTPResponse class now handles 100 continue responses, instead of choking on them. It detects them internally in the _begin() method and ignores them. Based on a patch by Bob Kline. This closes SF bugs 498149 and 551273. The FakeSocket class (for SSL) is now usable with HTTP/1.1 connections. The old version of the code could not work with persistent connections, because the makefile() implementation read until EOF before returning. If the connection is persistent, the server sends a response and leaves the connection open. A client that reads until EOF will block until the server gives up on the connection -- more than a minute in my test case. The problem was fixed by implementing a reasonable makefile(). It reads data only when it is needed by the layers above it. It's implementation uses an internal buffer with a default size of 8192. Also, rename begin() method of HTTPResponse to _begin() because it should only be called by the HTTPConnection.	2002-06-28 22:38:01 +00:00
Fred Drake	71b63ff342	pyexpat code cleanup and minor refactorings: The handlers array on each parser now has the invariant that None will never be set as a handler; it will always be NULL or a Python-level value passed in for the specific handler. have_handler(): Return true if there is a Python handler for a particular event. get_handler_name(): Return a string object giving the name of a particular handler. This caches the string object so it doesn't need to be created more than once. get_parse_result(): Helper to allow the Parse() and ParseFile() methods to share the same logic for determining the return value or exception state. PyUnknownEncodingHandler(), PyModule_AddIntConstant(): Made these helpers static. (The later is only defined for older versions of Python.) pyxml_UpdatePairedHandlers(), pyxml_SetStartElementHandler(), pyxml_SetEndElementHandler(), pyxml_SetStartNamespaceDeclHandler(), pyxml_SetEndNamespaceDeclHandler(), pyxml_SetStartCdataSection(), pyxml_SetEndCdataSection(), pyxml_SetStartDoctypeDeclHandler(), pyxml_SetEndDoctypeDeclHandler(): Removed. These are no longer needed with Expat 1.95.x. handler_info: Use the setter functions provided by Expat 1.95.x instead of the pyxml_Set*Handler() functions which have been removed. Minor code formatting changes for consistency. Trailing whitespace removed.	2002-06-28 22:29:01 +00:00
Neil Schemenauer	c9051640f8	Fix small bug. The count of objects in all generations younger then the collected one should be zeroed.	2002-06-28 19:16:04 +00:00
Mark Hammond	0d3b2fe094	Patch 574531/Bug 574570 - allow freeze on windows to use the _winreg extension.	2002-06-28 01:13:02 +00:00
Jack Jansen	2bb598067a	The standard definition file is now called mwerks_shcarbon_plugin.h.	2002-06-27 22:10:19 +00:00
Jack Jansen	1892cf0312	Started on support for using standard setup.py to build at least the "standard" modules. Unfinished, but shouldn't harm anything.	2002-06-27 22:09:19 +00:00
Jack Jansen	73aa1fff85	More fixes for building MacPython extension modules. It now actually succeeds in building various modules.	2002-06-27 22:06:49 +00:00
Fredrik Lundh	3d9addd55a	merged with SLAB codebase (version 1.0.1)	2002-06-27 21:36:21 +00:00
Fredrik Lundh	4fb7027ec0	made the code match the comments (1.5.2 compatibility)	2002-06-27 20:08:25 +00:00
Fredrik Lundh	6f7c3431c8	Fix bug #570057 : Broken pre.subn() (and pre.sub()) This should be backported to the 2.2.X series (how do I do that?)	2002-06-27 19:59:27 +00:00
Fred Drake	1add023b88	Integrate the tests for name interning from PyXML (test_pyexpat.py revision 1.12 in PyXML).	2002-06-27 19:41:51 +00:00
Fred Drake	b91a36b230	Integrate the changes from PyXML's version of pyexpat.c revisions 1.47, 1.48, 1.49 (name interning support).	2002-06-27 19:40:48 +00:00
Fred Drake	d805fefff4	Added support for some of the more recently defined macros and environments.	2002-06-27 18:38:06 +00:00
Fred Drake	2c813818c8	Clean up some markup.	2002-06-27 18:30:34 +00:00
Neal Norwitz	dcd0500664	dis.dis() also supports modules	2002-06-26 22:32:47 +00:00

1 2 3 4 5 ...

23071 Commits All Branches Search

23071 Commits

All Branches