cpython

Commit Graph

Author	SHA1	Message	Date
Barry Warsaw	142865cae1	func_getattro(), func_setattro(): Implement the new semantics for setting and deleting a function's __dict__ attribute. Deleting it, or setting it to a non-dictionary result in a TypeError. Note that getting it the first time magically initializes it to an empty dict so that func.__dict__ will always appear to be a dictionary (never None). Closes SF bug #446645.	2001-08-14 18:23:58 +00:00
Jeremy Hylton	910d7d46dc	Remove much dead code from ceval.c The descr changes moved the dispatch for calling objects from call_object() in ceval.c to PyObject_Call() in abstract.c. call_object() and the many functions it used in ceval.c were no longer used, but were not removed. Rename meth_call() as PyCFunction_Call() so that it can be called by the CALL_FUNCTION opcode in ceval.c. Also, fix error message that referred to PyEval_EvalCodeEx() by its old name eval_code2(). (I'll probably refer to it by its old name, too.)	2001-08-12 21:52:24 +00:00
Guido van Rossum	8e24818cf4	Make dynamic types work as intended. Or at least more so. XXX There are still some loose ends: repr(), str(), hash() and comparisons don't inherit a default implementation from object. This must be resolved similarly to the way it's resolved for classic instances.	2001-08-12 05:17:56 +00:00
Guido van Rossum	8de8680d07	Temporary stop-gap fix for dynamic classes, so they pass the test. XXX This is not sufficient: if a dynamic class has no __repr__ method (for instance), but later one is added, that doesn't add a tp_repr slot, so repr() doesn't call the __repr__ method. To make this work, I'll have to add default implementations of several slots to 'object'. XXX Also, dynamic types currently only inherit slots from their dominant base.	2001-08-12 03:43:35 +00:00
Guido van Rossum	13d52f0b32	- Big changes to fix SF bug #442833 (a nasty multiple inheritance problem). inherit_slots() is split in two parts: inherit_special() which inherits the flags and a few very special members from the dominant base; inherit_slots() which inherits only regular slots, and is now called for each base in the MRO in turn. These are now both void functions since they don't have error returns. - Added object.__setitem__() back -- for the same reason as object.__new__(): a subclass of object should be able to call object.__new__(). - add_wrappers() was moved around to be closer to where it is used (it was defined together with add_methods() etc., but has nothing to do with these).	2001-08-10 21:24:08 +00:00
Guido van Rossum	05ac6de2d5	Add PyDict_Merge(a, b, override): PyDict_Merge(a, b, 1) is the same as PyDict_Update(a, b). PyDict_Merge(a, b, 0) does something similar but leaves existing items unchanged.	2001-08-10 20:28:28 +00:00
Guido van Rossum	d614f97733	Change PyType_Ready() to use the READY and READYING flags. This makes it possible to detect recursive calls early (as opposed to when the stack overflows :-).	2001-08-10 17:39:49 +00:00
Tim Peters	772747b3f1	SF patch #438013 Remove 2-byte Py_UCS2 assumptions Removed all instances of Py_UCS2 from the codebase, and so also (I hope) the last remaining reliance on the platform having an integral type with exactly 16 bits. PyUnicode_DecodeUTF16() and PyUnicode_EncodeUTF16() now read and write one byte at a time.	2001-08-09 22:21:55 +00:00
Guido van Rossum	29687cd211	Sigh. Strengthen the resriction of the previous checkin: tp_new is inherited unless both: (a) the base type is 'object', and (b) the subtype is not a "heap" type.	2001-08-09 19:43:37 +00:00
Guido van Rossum	c11e192d41	Thinking back to the 2.22 revision, I didn't like what I did there one bit. For one, this class: class C(object): def __new__(myclass, ...): ... would have no way to call the __new__ method of its base class, and the workaround (to create an intermediate base class whose __new__ you can call) is ugly. So, I've come up with a better solution that restores object.__new__, but still solves the original problem, which is that built-in and extension types shouldn't inherit object.__new__. The solution is simple: only "heap types" inherit tp_new. Simpler, less code, perfect!	2001-08-09 19:38:15 +00:00
Guido van Rossum	29206bc8a3	Apply anonymous SF patch #441229 . Previously, f.read() and f.readlines() checked for errors on their file object and possibly raised an IOError, but f.readline() didn't. This patch makes f.readline() behave like the others. Note that I've added a call to clearerr() since the other calls to ferror() include that too. I have no way to test this code. :-)	2001-08-09 18:14:59 +00:00
Guido van Rossum	dc91b99f23	Proper support for binary operators, including true division and floor division. The basic binary operators now all correctly call the __rxxx__ variant when they should. In type_new(), I now make the new type a new-style number unless it inherits from an old-style number that has numeric methods. By way of cosmetics, I've changed the signatures of the SLOT<i> macros to take actual function names and operator names as strings, rather than rely on C preprocessor symbol manipulations. This makes the calls slightly more verbose, but greatly helps simple searches through the file: you can now find out where "__radd__" is used or where the function slot_nb_power() is defined and where it is used.	2001-08-08 22:26:22 +00:00
Jack Jansen	8e938b4257	Removed extraneous semicolons that caused a gazzilion "empty declaration" warnings in the MetroWerks compiler.	2001-08-08 15:29:49 +00:00
Guido van Rossum	4668b000a1	Implement PEP 238 in its (almost) full glory. This introduces: - A new operator // that means floor division (the kind of division where 1/2 is 0). - The "future division" statement ("from __future__ import division) which changes the meaning of the / operator to implement "true division" (where 1/2 is 0.5). - New overloadable operators __truediv__ and __floordiv__. - New slots in the PyNumberMethods struct for true and floor division, new abstract APIs for them, new opcodes, and so on. I emphasize that without the future division statement, the semantics of / will remain unchanged until Python 3.0. Not yet implemented are warnings (default off) when / is used with int or long arguments. This has been on display since 7/31 as SF patch #443474. Flames to /dev/null.	2001-08-08 05:00:18 +00:00
Guido van Rossum	528b7eb0b0	- Rename PyType_InitDict() to PyType_Ready(). - Add an explicit call to PyType_Ready(&PyList_Type) to pythonrun.c (just for the heck of it, really -- we should either explicitly ready all types, or none).	2001-08-07 17:24:28 +00:00
Guido van Rossum	f040ede6e8	Cosmetics: - Add comment blocks explaining add_operators() and override_slots(). (This file could use some more explaining, but this is all I had breath for today. :) - Renamed the argument 'base' of add_wrappers() to 'wraps' because it's not a base class (which is what the 'base' identifier is used for elsewhere). Small nits: - Fix add_tp_new_wrapper() to avoid overwriting an existing __new__ descriptor in tp_defined. - In add_operators(), check the return value of add_tp_new_wrapper(). Functional change: - Remove the tp_new functionality from PyBaseObject_Type; this means you can no longer instantiate the 'object' type. It's only useful as a base class. - To make up for the above loss, add tp_new to dynamic types. This has to be done in a hackish way (after override_slots() has been called, with an explicit call to add_tp_new_wrapper() at the very end) because otherwise I ran into recursive calls of slot_tp_new(). Sigh.	2001-08-07 16:40:56 +00:00
Guido van Rossum	63e0a64562	Remove spurious "closed" attribute definition from the memberlist table. (reported as an aside in SF #446049).	2001-08-06 18:51:38 +00:00
Guido van Rossum	0d231eda52	A totally new way to do the __new__ wrapper. This should address the problem brought up in SF bug #444229.	2001-08-06 16:50:37 +00:00
Guido van Rossum	2b8d7bdd77	Fix SF #442791 (revisited): No __delitem__ wrapper was defined.	2001-08-02 15:31:58 +00:00
Tim Peters	5962cbf5ba	Fix the test_weakref.py failure. Introduced by resolving "a conflict" (which didn't actually exist!) incorrectly.	2001-08-02 04:45:20 +00:00
Tim Peters	6d6c1a35e0	Merge of descr-branch back into trunk.	2001-08-02 04:15:00 +00:00
Jeremy Hylton	3ce45389bd	Add _PyUnicode_AsDefaultEncodedString to unicodeobject.h. And remove all the extern decls in the middle of .c files. Apparently, it was excluded from the header file because it is intended for internal use by the interpreter. It's still intended for internal use and documented as such in the header file.	2001-07-30 22:34:24 +00:00
Tim Peters	7321ec437b	SF bug #444510 : int() should guarantee truncation. It's guaranteed now, assuming the platform modf() works correctly.	2001-07-26 20:02:17 +00:00
Marc-André Lemburg	80d1dd5f3b	Fix for bug #444493 : u'\U00010001' segfaults with current CVS on wide builds.	2001-07-25 16:05:59 +00:00
Marc-André Lemburg	6c6bfb7c70	Make the unicode-escape and the UTF-16 codecs handle surrogates correctly and thus roundtrip-safe. Some minor cleanups of the code. Added tests for the roundtrip-safety.	2001-07-20 17:39:11 +00:00
Guido van Rossum	0d42e0c54a	#ifdef out generation of \U escapes unless Py_UNICODE_WIDE. This #caused warnings with the VMS C compiler. (SF bug #442998, in part.) On a narrow system the current code should never be executed since ch will always be < 0x10000. Marc-Andre: you may end up fixing this a different way, since I believe you have plans to generate \U for surrogate pairs. I'll leave that to you.	2001-07-20 16:36:21 +00:00
Fred Drake	1bc8fab0e7	Kill more warnings from the SGI compiler. Part of SF patch #434992.	2001-07-19 21:49:38 +00:00
Tim Peters	0d5dd68692	Python.h: Don't attempt to redefine NDEBUG if it's already defined. Others: Remove redundant includes of assert.h.	2001-07-15 18:38:47 +00:00
Tim Peters	586b2e3f3d	long_format: Simplify the overly elaborate base-is-a-power-of-2 code.	2001-07-15 09:11:14 +00:00
Guido van Rossum	3c29602d74	_Py_GetObjects(): GCC suggests to add () around && within \|\| for some code only compiled in debug mode, and I dutifully comply.	2001-07-14 17:58:00 +00:00
Tim Peters	212e614f60	divrem1 & long_format: found a clean way to factor divrem1 so that long_format can reuse a scratch area for its repeated divisions (instead of malloc/free for every digit produced); speeds str(long)/repr(long).	2001-07-14 12:23:19 +00:00
Tim Peters	c8a6b9b6d6	long_format(): Simplify new code a bit.	2001-07-14 11:01:28 +00:00
Tim Peters	fad225f1ba	long_format(): Easy speedup for output bases that aren't a power of 2 (in particular, str(long) and repr(long) use base 10, and that gets a factor of 4 speedup). Another factor of 2 can be gotten by refactoring divrem1 to support in-place division, but that started getting messy so I'm leaving that out.	2001-07-13 02:59:26 +00:00
Neil Schemenauer	10c6692774	GC for method objects.	2001-07-12 13:27:35 +00:00
Neil Schemenauer	7eac9b72d4	GC for iterator objects.	2001-07-12 13:27:25 +00:00
Neil Schemenauer	19cd292bbc	GC for frame objects.	2001-07-12 13:27:11 +00:00
Guido van Rossum	0ec9abaa2b	On long to the negative long power, let float handle it instead of raising an error. This was one of the two issues that the VPython folks were particularly problematic for their students. (The other one was integer division...) This implements (my) SF patch #440487.	2001-07-12 11:21:17 +00:00
Guido van Rossum	b82fedc7d8	On int to the negative integral power, let float handle it instead of raising an error. This was one of the two issues that the VPython folks were particularly problematic for their students. (The other one was integer division...) This implements (my) SF patch #440487.	2001-07-12 11:19:45 +00:00
Thomas Wouters	efafcea280	Re-add 'advanced' xrange features, adding DeprecationWarnings as discussed on python-dev. The features will still vanish, however, just one release later.	2001-07-09 12:30:54 +00:00
Tim Peters	6ee4234802	SF bug #439104 : Tuple richcompares has code-typo. Symptom: (1, 2, 3) <= (1, 2) returned 1. This was already fixed in CVS for tuples, but an isomorphic error was in the list richcompare code.	2001-07-06 17:45:43 +00:00
Guido van Rossum	3f56166b1a	Rip out the fancy behaviors of xrange that nobody uses: repeat, slice, contains, tolist(), and the start/stop/step attributes. This includes removing the 4th ('repeat') argument to PyRange_New().	2001-07-05 13:27:48 +00:00
Fredrik Lundh	72b068566a	removed "register const" from scalar arguments to the unicode predicates	2001-06-27 22:08:26 +00:00
Fredrik Lundh	8f4558583f	use Py_UNICODE_WIDE instead of USE_UCS4_STORAGE and Py_UNICODE_SIZE tests.	2001-06-27 18:59:43 +00:00
Martin v. Löwis	ce9b5a55e1	Encode surrogates in UTF-8 even for a wide Py_UNICODE. Implement sys.maxunicode. Explicitly wrap around upper/lower computations for wide Py_UNICODE. When decoding large characters with UTF-8, represent expected test results using the \U notation.	2001-06-27 06:28:56 +00:00
Martin v. Löwis	ac93bc2501	When decoding UTF-16, don't assume that the buffer is in native endianness when checking surrogates.	2001-06-26 22:43:40 +00:00
Martin v. Löwis	0ba70cc3c8	Support using UCS-4 as the Py_UNICODE type: Add configure option --enable-unicode. Add config.h macros Py_USING_UNICODE, PY_UNICODE_TYPE, Py_UNICODE_SIZE, SIZEOF_WCHAR_T. Define Py_UCS2. Encode and decode large UTF-8 characters into single Py_UNICODE values for wide Unicode types; likewise for UTF-16. Remove test whether sizeof Py_UNICODE is two.	2001-06-26 22:22:37 +00:00
Fredrik Lundh	ee13dba1aa	more unicode tweaks: fix unicodectype for sizeof(Py_UNICODE) > sizeof(int)	2001-06-26 20:36:12 +00:00
Barry Warsaw	66a0d1d9b9	dict_update(): Generalize this method so {}.update() accepts any "mapping" object, specifically one that supports PyMapping_Keys() and PyObject_GetItem(). This allows you to say e.g. {}.update(UserDict()) We keep the special case for concrete dict objects, although that seems moderately questionable. OTOH, the code exists and works, so why change that? .update()'s docstring already claims that D.update(E) implies calling E.keys() so it's appropriate not to transform AttributeErrors in PyMapping_Keys() to TypeErrors. Patch eyeballed by Tim.	2001-06-26 20:08:32 +00:00
Fredrik Lundh	1294ad0c59	experimental UCS-4 support: added USE_UCS4_STORAGE define to unicodeobject.h, which forces sizeof(Py_UNICODE) == sizeof(Py_UCS4). (this may be good enough for platforms that doesn't have a 16-bit type. the UTF-16 codecs don't work, though)	2001-06-26 17:17:07 +00:00
Fredrik Lundh	45714e9ecb	experimental UCS-4 support: made compare a bit more robust, in case sizeof(Py_UNICODE) >= sizeof(long). also changed surrogate expansion to work if sizeof(Py_UNICODE) > 2.	2001-06-26 16:39:36 +00:00
Fredrik Lundh	3083163dc1	experimental UCS-4 support: don't assume that MS_WIN32 implies HAVE_USABLE_WCHAR_T	2001-06-26 15:11:00 +00:00
Tim Peters	8c96369513	PyFrameObject: rename f_stackbottom to f_stacktop, since it points to the next free valuestack slot, not to the base (in America, stacks push and pop at the top -- they mutate at the bottom in Australia <winK>). eval_frame(): assert that f_stacktop isn't NULL upon entry. frame_delloc(): avoid ordered pointer comparisons involving f_stacktop when f_stacktop is NULL.	2001-06-23 05:26:56 +00:00
Tim Peters	5ca576ed0a	Merging the gen-branch into the main line, at Guido's direction. Yay! Bugfix candidate in inspect.py: it was referencing "self" outside of a method.	2001-06-18 22:08:13 +00:00
Tim Peters	1dad6a86de	SF bug 434186: 0x80000000/2 != 0x80000000>>1 i_divmod: New and simpler algorithm. Old one returned gibberish on most boxes when the numerator was -sys.maxint-1. Oddly enough, it worked in the release (not debug) build on Windows, because the compiler optimized away some tricky sign manipulations that were incorrect in this case. Makes you wonder <wink> ... Bugfix candidate.	2001-06-18 19:21:11 +00:00
Tim Peters	70128a1ba6	PyLong_{As, From}VoidPtr: cleanup, replacing assumptions in comments with #if/#error constructs.	2001-06-16 08:48:40 +00:00
Tim Peters	c605784174	dict_repr: Reuse one of the int vars (minor code simplification).	2001-06-16 07:52:53 +00:00
Tim Peters	52e155e31b	Reformat decl of new _PyString_Join. Add NEWS blurb about repr() speedup.	2001-06-16 05:42:57 +00:00
Tim Peters	a7259597f1	SF bug 433228: repr(list) woes when len(list) big. Gave Python linear-time repr() implementations for dicts, lists, strings. This means, e.g., that repr(range(50000)) is no longer 50x slower than pprint.pprint() in 2.2 <wink>. I don't consider this a bugfix candidate, as it's a performance boost. Added _PyString_Join() to the internal string API. If we want that in the public API, fine, but then it requires runtime error checks instead of asserts.	2001-06-16 05:11:17 +00:00
Tim Peters	cf37dfc3db	Change IS_LITTLE_ENDIAN macro -- a little faster now.	2001-06-14 18:42:50 +00:00
Guido van Rossum	ad98db1d9e	Fix a mis-indentation in _PyUnicode_New() that caused me to stare at some code for longer than needed.	2001-06-14 17:52:02 +00:00
Tim Peters	ede0509111	_PyLong_AsByteArray: simplify the logic for dealing with the most- significant digits sign bits. Again no change in semantics.	2001-06-14 08:53:38 +00:00
Tim Peters	ce9de2f79a	PyLong_From{Unsigned,}Long: count the # of digits first, so no more space is allocated than needed (used to allocate 80 bytes of digit space no matter how small the long input). This also runs faster, at least on 32- bit boxes.	2001-06-14 04:56:19 +00:00
Tim Peters	f251d06a66	_PyLong_FromByteArray: changed decl of "carry" to match "thisbyte". No semantic change, but a bit clearer and may help a really stupid compiler avoid pointless runtime length conversions.	2001-06-13 21:09:15 +00:00
Tim Peters	05607ad4fd	_PyLong_AsByteArray: Don't do the "delicate overflow" check unless it's truly needed; usually saves a little time, but no change in semantics.	2001-06-13 21:01:27 +00:00
Tim Peters	898cf85c25	_PyLong_AsByteArray: added assert that the input is normalized. This is outside the function's control, but is crucial to correct operation.	2001-06-13 20:50:08 +00:00
Tim Peters	9cb0c38fff	PyLong_As{Unsigned,}LongLong: fiddled final result casting.	2001-06-13 20:45:17 +00:00
Tim Peters	d1a7da6c0d	longobject.c: Replaced PyLong_{As,From}{Unsigned,}LongLong guts with calls to _PyLong_{As,From}ByteArray. _testcapimodule.c: Added strong tests of PyLong_{As,From}{Unsigned,}LongLong. Fixes SF bug #432552 PyLong_AsLongLong() problems. Possible bugfix candidate, but the fix relies on code added to longobject to support the new q/Q structmodule format codes.	2001-06-13 00:35:57 +00:00
Tim Peters	8bc84b4391	_PyLong_{As,From}ByteArray: Minor code rearrangement aimed at improving clarity. Should have no effect visible to callers.	2001-06-12 19:17:03 +00:00
Marc-André Lemburg	8c2133da7b	Fix for bug #432384 : Recursion in PyString_AsEncodedString?	2001-06-12 13:14:10 +00:00
Tim Peters	7a3bfc3a47	Added q/Q standard (x-platform 8-byte ints) mode in struct module. This completes the q/Q project. longobject.c _PyLong_AsByteArray: The original code had a gross bug: the most-significant Python digit doesn't necessarily have SHIFT significant bits, and you really need to count how many copies of the sign bit it has else spurious overflow errors result. test_struct.py: This now does exhaustive std q/Q testing at, and on both sides of, all relevant power-of-2 boundaries, both positive and negative. NEWS: Added brief dict news while I was at it.	2001-06-12 01:22:22 +00:00
Tim Peters	2a9b367385	Two new private longobject API functions, _PyLong_FromByteArray _PyLong_AsByteArray Untested and probably buggy -- they compile OK, but nothing calls them yet. Will soon be called by the struct module, to implement x-platform 'q' and 'Q'. If other people have uses for them, we could move them into the public API. See longobject.h for usage details.	2001-06-11 21:23:58 +00:00
Jack Jansen	fcc54cab10	Added a missing cast to the hashfunc initializer.	2001-06-10 21:43:28 +00:00
Martin v. Löwis	0163d6d6ef	Patch #424475 : Speed-up tp_compare usage, by special-casing the common case of objects with equal types which support tp_compare. Give type objects a tp_compare function. Also add c<0 tests before a few PyErr_Occurred tests.	2001-06-09 07:34:05 +00:00
Marc-André Lemburg	8879a33613	Fixes [ #430986 ] Buglet in PyUnicode_FromUnicode.	2001-06-07 12:26:56 +00:00
Tim Peters	afb6ae8452	Store the mask instead of the size in dictobjects. The mask is more frequently used, and in particular this allows to drop the last remaining obvious time-waster in the crucial lookdict() and lookdict_string() functions. Other changes consist mostly of changing "i < ma_size" to "i <= ma_mask" everywhere.	2001-06-04 21:00:21 +00:00
Tim Peters	453163d842	lookdict: stop more insane core-dump mutating comparison cases. Should be possible to provoke unbounded recursion now, but leaving that to someone else to provoke and repair. Bugfix candidate -- although this is getting harder to backstitch, and the cases it's protecting against are mondo contrived.	2001-06-03 04:54:32 +00:00
Tim Peters	7b5d0afb1e	lookdict: Reduce obfuscating code duplication with a judicious goto. This code is likely to get even hairier to squash core dumps due to mutating comparisons, and it's hard enough to follow without that.	2001-06-03 04:14:43 +00:00
Tim Peters	19b77cfc4b	Finish the dict->string coredump fix. Need sleep. Bugfix candidate.	2001-06-02 08:27:39 +00:00
Tim Peters	23cf6be23c	Coredumpers from Michael Hudson, mutating dicts while printing or converting to string. Critical bugfix candidate -- if you take this seriously <wink>.	2001-06-02 08:02:56 +00:00
Tim Peters	f4b33f61fb	dict_popitem(): Repaired last-second 2.1 comment, which misidentified the true reason for allocating the tuple before checking the dict size.	2001-06-02 05:42:29 +00:00
Tim Peters	eb28ef209e	New collision resolution scheme: no polynomials, simpler, faster, less code, less memory. Tests have uncovered no drawbacks. Christian and Vladimir are the other two people who have burned many brain cells on the dict code in recent years, and they like the approach too, so I'm checking it in without further ado.	2001-06-02 05:27:19 +00:00
Jeremy Hylton	9cea41c195	fix bogus indentation	2001-05-29 17:13:15 +00:00
Thomas Wouters	0dcea5973d	_PyTuple_Resize: guard against PyTuple_New() returning NULL, using Tim's suggestion (modulo style).	2001-05-29 07:58:45 +00:00
Tim Peters	4324aa3572	Cruft cleanup: Removed the unused last_is_sticky argument from the internal _PyTuple_Resize().	2001-05-28 22:30:08 +00:00
Thomas Wouters	6a922372ad	_PyTuple_Resize: take into account the empty tuple. There can be only one. Instead of raising a SystemError, just create a new tuple of the desired size. This fixes (at least) SF bug #420343.	2001-05-28 13:11:02 +00:00
Tim Peters	15d4929ae4	Implement an old idea of Christian Tismer's: use polynomial division instead of multiplication to generate the probe sequence. The idea is recorded in Python-Dev for Dec 2000, but that version is prone to rare infinite loops. The value is in getting all the bits of the hash code to participate; and, e.g., this speeds up querying every key in a dict with keys [i << 16 for i in range(20000)] by a factor of 500. Should be equally valuable in any bad case where the high-order hash bits were getting ignored. Also wrote up some of the motivations behind Python's ever-more-subtle hash table strategy.	2001-05-27 07:39:22 +00:00
Tim Peters	1af03e98d9	Change list.extend() error msgs and NEWS to reflect that list.extend() now takes any iterable argument, not only sequences. NEEDS DOC CHANGES -- but I don't think we settled on a concise way to say this stuff.	2001-05-26 19:37:54 +00:00
Tim Peters	442914d265	Cruft cleanup: removed the #ifdef'ery in support of compiling to allow multi-argument list.append(1, 2, 3) (as opposed to .append((1,2,3))).	2001-05-26 05:50:03 +00:00
Tim Peters	65b8b84839	roundupsize() and friends: fiddle over-allocation strategy for list resizing. Accurate timings are impossible on my Win98SE box, but this is obviously faster even on this box for reasonable list.append() cases. I give credit for this not to the resizing strategy but to getting rid of integer multiplication and divsion (in favor of shifting) when computing the rounded-up size. For unreasonable list.append() cases, Win98SE now displays linear behavior for one-at-time appends up to a list with about 35 million elements. Then it dies with a MemoryError, due to fatally fragmented address space (there's plenty of VM available, but by this point Win9X has broken user space into many distinct heaps none of which has enough contiguous space left to resize the list, and for whatever reason Win9x isn't coalescing the dead heaps). Before the patch it got a MemoryError for the same reason, but once the list reached about 2 million elements. Haven't yet tried on Win2K but have high hopes extreme list.append() will be much better behaved now (NT & Win2K didn't fragment address space, but suffered obvious quadratic-time behavior before as lists got large). For other systems I'm relying on common sense: replacing integer * and / by << and >> can't plausibly hurt, the number of function calls hasn't changed, and the total operation count for reasonably small lists is about the same (while the operations are cheaper now).	2001-05-26 05:28:40 +00:00
Martin v. Löwis	cd35306a25	Patch #424335 : Implement string_richcompare, remove string_compare. Use new _PyString_Eq in lookdict_string.	2001-05-24 16:56:35 +00:00
Tim Peters	f8a548c23c	dictresize(): Rebuild small tables if there are any dummies, not just if they're entirely full. Not a question of correctness, but of temporarily misplaced common sense.	2001-05-24 16:26:40 +00:00
Tim Peters	0c6010be75	Jack Jansen hit a bug in the new dict code, reported on python-dev. dictresize() was too aggressive about never ever resizing small dicts. If a small dict is entirely full, it needs to rebuild it despite that it won't actually resize it, in order to purge old dummy entries thus creating at least one virgin slot (lookdict assumes at least one such exists). Also took the opportunity to add some high-level comments to dictresize.	2001-05-23 23:33:57 +00:00
Fred Drake	0c23231f6e	Remove unused variable.	2001-05-22 22:36:52 +00:00
Tim Peters	dea48ec581	SF patch #425242 : Patch which "inlines" small dictionaries. The idea is Marc-Andre Lemburg's, the implementation is Tim's. Add a new ma_smalltable member to dictobjects, an embedded vector of MINSIZE (8) dictentry structs. Short course is that this lets us avoid additional malloc(s) for dicts with no more than 5 entries. The changes are widespread but mostly small. Long course: WRT speed, all scalar operations (getitem, setitem, delitem) on non-empty dicts benefit from no longer needing NULL-pointer checks (ma_table is never NULL anymore). Bulk operations (copy, update, resize, clearing slots during dealloc) benefit in some cases from now looping on the ma_fill count rather than on ma_size, but that was an unexpected benefit: the original reason to loop on ma_fill was to let bulk operations on empty dicts end quickly (since the NULL-pointer checks went away, empty dicts aren't special-cased any more). Special considerations: For dicts that remain empty, this change is a lose on two counts: the dict object contains 8 new dictentry slots now that weren't needed before, and dict object creation also spends time memset'ing these doomed-to-be-unsused slots to NULLs. For dicts with one or two entries that never get larger than 2, it's a mix: a malloc()/free() pair is no longer needed, and the 2-entry case gets to use 8 slots (instead of 4) thus decreasing the chance of collision. Against that, dict object creation spends time memset'ing 4 slots that aren't strictly needed in this case. For dicts with 3 through 5 entries that never get larger than 5, it's a pure win: the dict is created with all the space they need, and they never need to resize. Before they suffered two malloc()/free() calls, plus 1 dict resize, to get enough space. In addition, the 8-slot table they ended with consumed more memory overall, because of the hidden overhead due to the additional malloc. For dicts with 6 or more entries, the ma_smalltable member is wasted space, but then these are large(r) dicts so 8 slots more or less doesn't make much difference. They still benefit all the time from removing ubiquitous dynamic null-pointer checks, and get a small benefit (but relatively smaller the larger the dict) from not having to do two mallocs, two frees, and a resize on the way to getting their sixth entry. All in all it appears a small but definite general win, with larger benefits in specific cases. It's especially nice that it allowed to get rid of several branches, gotos and labels, and overall made the code smaller.	2001-05-22 20:40:22 +00:00
Guido van Rossum	5b021848ac	file_getiter(): make iter(file) be equivalent to file.xreadlines(). This should be faster. This means: (1) "for line in file:" won't work if the xreadlines module can't be imported. (2) The body of "for line in file:" shouldn't use the file directly; the effects (e.g. of file.readline(), file.seek() or even file.tell()) would be undefined because of the buffering that goes on in the xreadlines module.	2001-05-22 16:48:37 +00:00
Guido van Rossum	0ba9e3ac27	init_name_op(): add (void) to the argument list to make it a valid prototype, for gcc -Wstrict-prototypes.	2001-05-22 02:33:08 +00:00
Marc-André Lemburg	489b56e044	This patch changes the behaviour of the UTF-16 codec family. Only the UTF-16 codec will now interpret and remove a leading BOM mark. Sub- sequent BOM characters are no longer interpreted and removed. UTF-16-LE and -BE pass through all BOM mark characters. These changes should get the UTF-16 codec more in line with what the Unicode FAQ recommends w/r to BOM marks.	2001-05-21 20:30:15 +00:00
Tim Peters	91a364df17	Bugfix candidate. Two exceedingly unlikely errors in dictresize(): 1. The loop for finding the new size had an off-by-one error at the end (could over-index the polys[] vector). 2. The polys[] vector ended with a 0, apparently intended as a sentinel value but never used as such; i.e., it was never checked, so 0 could have been used as a polynomial. Neither bug could trigger unless a dict grew to 2**30 slots; since that would consume at least 12GB of memory just to hold the dict pointers, I'm betting it's not the cause of the bug Fred's tracking down <wink>.	2001-05-19 07:04:38 +00:00
Tim Peters	1928314ef4	Speed dictresize by collapsing its two passes into one; the reason given in the comments for using two passes was bogus, as the only object that can get decref'ed due to the copy is the dummy key, and decref'ing dummy can't have side effects (for one thing, dummy is immortal! for another, it's a string object, not a potentially dangerous user-defined object).	2001-05-17 22:25:34 +00:00
Tim Peters	d7ed3bf552	Speed tuple comparisons in two ways: 1. Omit the early-out EQ/NE "lengths different?" test. Was unable to find any real code where it triggered, but it always costs. The same is not true of list richcmps, where different-size lists appeared to get compared about half the time. 2. Because tuples are immutable, there's no need to refetch the lengths of both tuples from memory again on each loop trip. BUG ALERT: The tuple (and list) richcmp algorithm is arguably wrong, because it won't believe there's any difference unless Py_EQ returns false for some corresponding elements: >>> class C: ... def __lt__(x, y): return 1 ... __eq__ = __lt__ ... >>> C() < C() 1 >>> (C(),) < (C(),) 0 >>> That doesn't make sense -- provided you believe the defn. of C makes sense.	2001-05-15 20:12:59 +00:00
Marc-André Lemburg	2d9204199f	This patch changes the way the string .encode() method works slightly and introduces a new method .decode(). The major change is that strg.encode() will no longer try to convert Unicode returns from the codec into a string, but instead pass along the Unicode object as-is. The same is now true for all other codec return types. The underlying C APIs were changed accordingly. Note that even though this does have the potential of breaking existing code, the chances are low since conversion from Unicode previously took place using the default encoding which is normally set to ASCII rendering this auto-conversion mechanism useless for most Unicode encodings. The good news is that you can now use .encode() and .decode() with much greater ease and that the door was opened for better accessibility of the builtin codecs. As demonstration of the new feature, the patch includes a few new codecs which allow string to string encoding and decoding (rot13, hex, zip, uu, base64). Written by Marc-Andre Lemburg. Copyright assigned to the PSF.	2001-05-15 12:00:02 +00:00
Tim Peters	342c65e19a	Aggressive reordering of dict comparisons. In case of collision, it stands to reason that me_key is much more likely to match the key we're looking for than to match dummy, and if the key is absent me_key is much more likely to be NULL than dummy: most dicts don't even have a dummy entry. Running instrumented dict code over the test suite and some apps confirmed that matching dummy was 200-300x less frequent than matching key in practice. So this reorders the tests to try the common case first. It can lose if a large dict with many collisions is mostly deleted, not resized, and then frequently searched, but that's hardly a case we should be favoring.	2001-05-13 06:43:53 +00:00
Tim Peters	2f228e75e4	Get rid of the superstitious "~" in dict hashing's "i = (~hash) & mask". The comment following used to say: /* We use ~hash instead of hash, as degenerate hash functions, such as for ints <sigh>, can have lots of leading zeros. It's not really a performance risk, but better safe than sorry. 12-Dec-00 tim: so ~hash produces lots of leading ones instead -- what's the gain? / That is, there was never a good reason for doing it. And to the contrary, as explained on Python-Dev last December, it tended to make the sum* (i + incr) & mask (which is the first table index examined in case of collison) the same "too often" across distinct hashes. Changing to the simpler "i = hash & mask" reduced the number of string-dict collisions (== # number of times we go around the lookup for-loop) from about 6 million to 5 million during a full run of the test suite (these are approximate because the test suite does some random stuff from run to run). The number of collisions in non-string dicts also decreased, but not as dramatically. Note that this may, for a given dict, change the order (wrt previous releases) of entries exposed by .keys(), .values() and .items(). A number of std tests suffered bogus failures as a result. For dicts keyed by small ints, or (less so) by characters, the order is much more likely to be in increasing order of key now; e.g., >>> d = {} >>> for i in range(10): ... d[i] = i ... >>> d {0: 0, 1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9} >>> Unfortunately. people may latch on to that in small examples and draw a bogus conclusion. test_support.py Moved test_extcall's sortdict() into test_support, made it stronger, and imported sortdict into other std tests that needed it. test_unicode.py Excluced cp875 from the "roundtrip over range(128)" test, because cp875 doesn't have a well-defined inverse for unicode("?", "cp875"). See Python-Dev for excruciating details. Cookie.py Chaged various output functions to sort dicts before building strings from them. test_extcall Fiddled the expected-result file. This remains sensitive to native dict ordering, because, e.g., if there are multiple errors in a keyword-arg dict (and test_extcall sets up many cases like that), the specific error Python complains about first depends on native dict ordering.	2001-05-13 00:19:31 +00:00
Tim Peters	16cabc0a3d	Repair "module has no attribute xxx" error msg; bug introduced when switching from tp_getattr to tp_getattro.	2001-05-12 20:24:22 +00:00
Tim Peters	d85e102337	Variant of patch #423262 : Change module attribute get & set Allow module getattr and setattr to exploit string interning, via the previously null module object tp_getattro and tp_setattro slots. Yields a very nice speedup for things like random.random and os.path etc.	2001-05-11 21:51:48 +00:00
Jeremy Hylton	1b0feb4ada	Variant of SF patch 423181 For rich comparisons, use instance_getattr2() when possible to avoid the expense of setting an AttributeError. Also intern the name_op[] table and use the interned strings rather than creating a new string and interning it each time through.	2001-05-11 14:48:41 +00:00
Tim Peters	5acbfcc164	Cosmetic: code under "else" clause was missing indent.	2001-05-11 03:36:45 +00:00
Tim Peters	4fa58bfac2	Restore dicts' tp_compare slot, and change dict_richcompare to say it doesn't know how to do LE, LT, GE, GT. dict_richcompare can't do the latter any faster than dict_compare can. More importantly, for cmp(dict1, dict2), Python first tries rich compares with EQ, LT, and GT one at a time, even if the tp_compare slot is defined, and dict_richcompare called dict_compare for the latter two because it couldn't do them itself. The result was a lot of wasted calls to dict_compare. Now dict_richcompare gives up at once the times Python calls it with LT and GT from try_rich_to_3way_compare(), and dict_compare is called only once (when Python gets around to trying the tp_compare slot). Continued mystery: despite that this cut the number of calls to dict_compare approximately in half in test_mutants.py, the latter still runs amazingly slowly. Running under the debugger doesn't show excessive activity in the dict comparison code anymore, so I'm guessing the culprit is somewhere else -- but where? Perhaps in the element (key/value) comparison code? We clearly spend a lot of time figuring out how to compare things.	2001-05-10 21:45:19 +00:00
Tim Peters	3918fb2549	Repair typo in comment.	2001-05-10 18:58:31 +00:00
Tim Peters	95bf9390a4	SF bug #422121 Insecurities in dict comparison. Fixed a half dozen ways in which general dict comparison could crash Python (even cause Win98SE to reboot) in the presence of kay and/or value comparison routines that mutate the dict during dict comparison. Bugfix candidate.	2001-05-10 08:32:44 +00:00
Tim Peters	9c012af3c3	Heh. I need a break. After this: stropmodule & stringobject were more out of synch than I realized, and I managed to break replace's "count" argument when it was 0. All is well again. Maybe. Bugfix candidate.	2001-05-10 00:32:57 +00:00
Tim Peters	4cd44ef4bf	Fudge. stropmodule and stringobject both had copies of the buggy mymemXXX stuff, and they were already out of synch. Fix the remaining bugs in both and get them back in synch. Bugfix release candidate.	2001-05-10 00:05:33 +00:00
Tim Peters	1a97d5f098	SF patch #416247 2.1c1 stringobject: unused vrbl cleanup. Thanks to Mark Favas.	2001-05-09 20:06:00 +00:00
Tim Peters	4862ab7bf4	Sheesh -- repair the dodge around "cast isn't an lvalue" complaints to restore correct semantics.	2001-05-09 08:43:21 +00:00
Tim Peters	9e897f41db	Mark Favas reported that gcc caught me using casts as lvalues. Dodge it.	2001-05-09 07:37:07 +00:00
Tim Peters	b4bbcd76ea	Ack! Restore the COUNT_ALLOCS one_strings code.	2001-05-09 00:31:40 +00:00
Tim Peters	cf5ad5d6f6	My change to string_item() left an extra reference to each 1-character interned string created by "string"[i]. Since they're immortal anyway, this was hard to notice, but it was still wrong <wink>.	2001-05-09 00:24:55 +00:00
Tim Peters	5b4d477568	Intern 1-character strings as soon as they're created. As-is, they aren't interned when created, so the cached versions generally aren't ever interned. With the patch, the Py_INCREF(t); *p = t; Py_DECREF(s); return; indirection block in PyString_InternInPlace() is never executed during a full run of the test suite, but was executed very many times before. So I'm trading more work when creating one-character strings for doing less work later. Note that the "more work" here can happen at most 256 times per program run, so it's trivial. The same reasoning accounts for the patch's simplification of string_item (the new version can call PyString_FromStringAndSize() no more than 256 times per run, so there's no point to inlining that stuff -- if we were serious about saving time here, we'd pre-initialize the characters vector so that no runtime testing at all was needed!).	2001-05-08 22:33:50 +00:00
Tim Peters	72f98e9b83	SF bug #422177 : Results from .pyc differs from .py Store floats and doubles to full precision in marshal. Test that floats read from .pyc/.pyo closely match those read from .py. Declare PyFloat_AsString() in floatobject header file. Add new PyFloat_AsReprString() API function. Document the functions declared in floatobject.h.	2001-05-08 15:19:57 +00:00
Tim Peters	e63415ead8	SF patch #421922 : Implement rich comparison for dicts. d1 == d2 and d1 != d2 now work even if the keys and values in d1 and d2 don't support comparisons other than ==, and testing dicts for equality is faster now (especially when inequality obtains).	2001-05-08 04:38:29 +00:00
Jeremy Hylton	4c889011db	SF patch 419176 from MvL; fixed bug 418977 Two errors in dict_to_map() helper used by PyFrame_LocalsToFast().	2001-05-08 04:08:59 +00:00
Jeremy Hylton	d37292bb8d	Remove unused variable	2001-05-08 04:00:45 +00:00
Tim Peters	6d60b2e762	SF bug #422108 - Error in rich comparisons. 2.1.1 bugfix candidate too. Fix a bad (albeit unlikely) return value in try_rich_to_3way_compare(). Also document do_cmp()'s return values.	2001-05-07 20:53:51 +00:00
Tim Peters	cb8d368b82	Reimplement PySequence_Contains() and instance_contains(), so they work safely together and don't duplicate logic (the common logic was factored out into new private API function _PySequence_IterContains()). Visible change: some_complex_number in some_instance no longer blows up if some_instance has __getitem__ but neither __contains__ nor __iter__. test_iter changed to ensure that remains true.	2001-05-05 21:05:01 +00:00
Tim Peters	75f8e35ef4	Generalize PySequence_Count() (operator.countOf) to work with iterators.	2001-05-05 11:33:43 +00:00
Tim Peters	de9725f135	Make 'x in y' and 'x not in y' (PySequence_Contains) play nice w/ iterators. NEEDS DOC CHANGES A few more AttributeErrors turned into TypeErrors, but in test_contains this time. The full story for instance objects is pretty much unexplainable, because instance_contains() tries its own flavor of iteration-based containment testing first, and PySequence_Contains doesn't get a chance at it unless instance_contains() blows up. A consequence is that some_complex_number in some_instance dies with a TypeError unless some_instance.__class__ defines __iter__ but does not define __getitem__.	2001-05-05 10:06:17 +00:00
Tim Peters	2cfe368283	Make unicode.join() work nice with iterators. This also required a change to string.join(), so that when the latter figures out in midstream that it really needs unicode.join() instead, unicode.join() can actually get all the sequence elements (i.e., there's no guarantee that the sequence passed to string.join() can be iterated over again by unicode.join(), so string.join() must not pass on the original sequence object anymore).	2001-05-05 05:36:48 +00:00
Tim Peters	12d0a6c78a	Fix a tiny and unlikely memory leak. Was there before too, and actually several of these turned up and got fixed during the iteration crusade.	2001-05-05 04:10:25 +00:00
Tim Peters	6912d4ddf0	Generalize tuple() to work nicely with iterators. NEEDS DOC CHANGES. This one surprised me! While I expected tuple() to be a no-brainer, turns out it's actually dripping with consequences: 1. It will allow the popular PySequence_Fast() to work with any iterable object (code for that not yet checked in, but should be trivial). 2. It caused two std tests to fail. This because some places used PyTuple_Sequence() (the C spelling of tuple()) as an indirect way to test whether something is a sequence. But tuple() code only looked for the existence of sq->item to determine that, and e.g. an instance passed that test whether or not it supported the other operations tuple() needed (e.g., __len__). So some things the tests expected to fail with an AttributeError now fail with a TypeError instead. This looks like an improvement to me; e.g., test_coercion used to produce 559 TypeErrors and 2 AttributeErrors, and now they're all TypeErrors. The error details are more informative too, because the places calling this were looking for TypeErrors in order to replace the generic tuple() "not a sequence" msg with their own more specific text, and AttributeErrors snuck by that.	2001-05-05 03:56:37 +00:00
Tim Peters	f4848dac41	Make PyIter_Next() a little smarter (wrt its knowledge of iterator internals) so clients can be a lot dumber (wrt their knowledge).	2001-05-05 00:14:56 +00:00
Fred Drake	6aebded915	The weakref support in PyObject_InitVar() as well; this should have come out at the same time as it did from PyObject_Init() .	2001-05-03 20:04:33 +00:00
Fred Drake	ba40ec42c8	Remove unnecessary intialization for the case of weakly-referencable objects; the code necessary to accomplish this is simpler and faster if confined to the object implementations, so we only do this there. This causes no behaviorial changes beyond a (very slight) speedup.	2001-05-03 19:44:50 +00:00
Fred Drake	4dcb85b817	Since Py_TPFLAGS_HAVE_WEAKREFS is set in Py_TPFLAGS_DEFAULT, it does not need to be specified in the type structures independently. The flag exists only for binary compatibility. This is a "source cleanliness" issue and introduces no behavioral changes.	2001-05-03 16:04:13 +00:00
Guido van Rossum	b1f35bffe5	Mchael Hudson pointed out that the code for detecting changes in dictionary size was comparing ma_size, the hash table size, which is always a power of two, rather than ma_used, wich changes on each insertion or deletion. Fixed this.	2001-05-02 15:13:44 +00:00
Marc-André Lemburg	542fe56cb9	Fix for bug #417030 : "print '%*s' fails for unicode string"	2001-05-02 14:21:53 +00:00
Tim Peters	6ad22c41c2	Plug a memory leak in list(), when appending to the result list.	2001-05-02 07:12:39 +00:00
Tim Peters	f553f89d45	Generalize list(seq) to work with iterators. This also generalizes list() to no longer insist that len(seq) be defined. NEEDS DOC CHANGES. This is meant to be a model for how other functions of this ilk (max, filter, etc) can be generalized similarly. Feel encouraged to grab your favorite and convert it! Note some cute consequences: list(file) == file.readlines() == list(file.xreadlines()) list(dict) == dict.keys() list(dict.iteritems()) = dict.items() list(xrange(i, j, k)) == range(i, j, k)	2001-05-01 20:45:31 +00:00
Guido van Rossum	47668928e6	Discard a misleading comment about iter_iternext().	2001-05-01 17:01:25 +00:00
Guido van Rossum	4f288ab7d6	Printing objects to a real file still wasn't done right: if the object's type didn't define tp_print, there were still cases where the full "print uses str() which falls back to repr()" semantics weren't honored. This resulted in >>> print None <None object at 0x80bd674> >>> print type(u'') <type object at 0x80c0a80> Fixed this by always using the appropriate PyObject_Repr() or PyObject_Str() call, rather than trying to emulate what they would do. Also simplified PyObject_Str() to always fall back on PyObject_Repr() when tp_str is not defined (rather than making an extra check for instances with a __str__ method). And got rid of the special case for strings.	2001-05-01 16:53:37 +00:00
Guido van Rossum	189f1df301	Add a proper implementation for the tp_str slot (returning self, of course), so I can get rid of the special case for strings in PyObject_Str().	2001-05-01 16:51:53 +00:00
Guido van Rossum	09e563abb4	Add experimental iterkeys(), itervalues(), iteritems() to dict objects. Tests show that iteritems() is 5-10% faster than iterating over the dict and extracting the value with dict[key].	2001-05-01 12:10:21 +00:00
Guido van Rossum	82c690f11a	Well darnit! The innocuous fix I made to PyObject_Print() caused printing of instances not to look for __str__(). Fix this.	2001-04-30 14:39:18 +00:00
Tim Peters	b3d8d1f76c	A different approach to the problem reported in Patch #419651: Metrowerks on Mac adds 0x itself C std says %#x and %#X conversion of 0 do not add the 0x/0X base marker. Metrowerks apparently does. Mark Favas reported the same bug under a Compaq compiler on Tru64 Unix, but no other libc broken in this respect is known (known to be OK under MSVC and gcc). So just try the damn thing at runtime and see what the platform does. Note that we've always had bugs here, but never knew it before because a relevant test case didn't exist before 2.1.	2001-04-28 05:38:26 +00:00
Guido van Rossum	3a80c4a29c	(Adding this to the trunk as well.) Fix a very old flaw in PyObject_Print(). Amazing! When an object type defines tp_str but not tp_repr, 'print x' to a real file object would not call the tp_str slot but rather print a default style representation: <foo object at 0x....>. This even though 'print x' to a file-like-object would correctly call the tp_str slot.	2001-04-27 21:35:01 +00:00
Marc-André Lemburg	8155e0e541	This patch originated from an idea by Martin v. Loewis who submitted a patch for sharing single character Unicode objects. Martin's patch had to be reworked in a number of ways to take Unicode resizing into consideration as well. Here's what the updated patch implements: * Single character Unicode strings in the Latin-1 range are shared (not only ASCII chars as in Martin's original patch). * The ASCII and Latin-1 codecs make use of this optimization, providing a noticable speedup for single character strings. Most Unicode methods can use the optimization as well (by virtue of using PyUnicode_FromUnicode()). * Some code cleanup was done (replacing memcpy with Py_UNICODE_COPY) * The PyUnicode_Resize() can now also handle the case of resizing unicode_empty which previously resulted in an error. * Modified the internal API _PyUnicode_Resize() and the public PyUnicode_Resize() API to handle references to shared objects correctly. The _PyUnicode_Resize() signature changed due to this. * Callers of PyUnicode_FromUnicode() may now only modify the Unicode object contents of the returned object in case they called the API with NULL as content template. Note that even though this patch passes the regression tests, there may still be subtle bugs in the sharing code.	2001-04-23 14:44:21 +00:00
Guido van Rossum	213c7a6aa5	Mondo changes to the iterator stuff, without changing how Python code sees it (test_iter.py is unchanged). - Added a tp_iternext slot, which calls the iterator's next() method; this is much faster for built-in iterators over built-in types such as lists and dicts, speeding up pybench's ForLoop with about 25% compared to Python 2.1. (Now there's a good argument for iterators. ;-) - Renamed the built-in sequence iterator SeqIter, affecting the C API functions for it. (This frees up the PyIter prefix for generic iterator operations.) - Added PyIter_Check(obj), which checks that obj's type has a tp_iternext slot and that the proper feature flag is set. - Added PyIter_Next(obj) which calls the tp_iternext slot. It has a somewhat complex return condition due to the need for speed: when it returns NULL, it may not have set an exception condition, meaning the iterator is exhausted; when the exception StopIteration is set (or a derived exception class), it means the same thing; any other exception means some other error occurred.	2001-04-23 14:08:49 +00:00
Guido van Rossum	65967259f2	Oops, forgot to merge this from the iter-branch to the trunk. This adds "for line in file" iteration, as promised.	2001-04-21 13:20:18 +00:00
Tim Peters	cf96de052f	SF but #417587 : compiler warnings compiling 2.1. Repaired some of the SGI compiler warnings Sjoerd Mullender reported.	2001-04-21 02:46:11 +00:00
Guido van Rossum	05311481d4	Adding iterobject.[ch], which were accidentally not added. Sorry\!	2001-04-20 21:06:46 +00:00
Guido van Rossum	59d1d2b434	Iterators phase 1. This comprises: new slot tp_iter in type object, plus new flag Py_TPFLAGS_HAVE_ITER new C API PyObject_GetIter(), calls tp_iter new builtin iter(), with two forms: iter(obj), and iter(function, sentinel) new internal object types iterobject and calliterobject new exception StopIteration new opcodes for "for" loops, GET_ITER and FOR_ITER (also supported by dis.py) new magic number for .pyc files new special method for instances: __iter__() returns an iterator iteration over dictionaries: "for x in dict" iterates over the keys iteration over files: "for x in file" iterates over lines TODO: documentation test suite decide whether to use a different way to spell iter(function, sentinal) decide whether "for key in dict" is a good idea use iterators in map/filter/reduce, min/max, and elsewhere (in/not in?) speed tuning (make next() a slot tp_next???)	2001-04-20 19:13:02 +00:00
Guido van Rossum	55ad67d74d	Oops. Removed dictiter_new decl that wasn't supposed to go in yet.	2001-04-20 16:52:06 +00:00
Guido van Rossum	0dbb4fba4c	Implement, test and document "key in dict" and "key not in dict". I know some people don't like this -- if it's really controversial, I'll take it out again. (If it's only Alex Martelli who doesn't like it, that doesn't count as "real controversial" though. :-) That's why this is a separate checkin from the iterators stuff I'm about to check in next.	2001-04-20 16:50:40 +00:00
Tim Peters	78fe5308b4	CVS patch 416248: 2.1c1 unicodeobject: unused vrbl cleanup, from Mark Favas.	2001-04-19 21:55:14 +00:00
Jeremy Hylton	b8a93215c2	Revert previous checkin, which caused test_unicodedata to fail.	2001-04-19 16:43:49 +00:00
Martin v. Löwis	da3dc5b892	Patch #416953 : Cache ASCII characters to speed up ASCII decoding.	2001-04-18 12:49:15 +00:00
Guido van Rossum	e04eaec5b6	Tim pointed out a remaining vulnerability in popitem(): the PyTuple_New() could conceivably clear the dict, so move the test for an empty dict after the tuple allocation. It means that we waste time allocating and deallocating a 2-tuple when the dict is empty, but who cares. It also means that when the dict is empty and there's no memory to allocate a 2-tuple, we raise MemoryError, not KeyError -- but that may actually a good idea: if there's no room for a lousy 2-tuple, what are the chances that there's room for a KeyError instance?	2001-04-16 00:02:32 +00:00
Guido van Rossum	a4dd011259	Tentative fix for a problem that Tim discovered at the last moment, and reported to python-dev: because we were calling dict_resize() in PyDict_Next(), and because GC's dict_traverse() uses PyDict_Next(), and because PyTuple_New() can cause GC, and because dict_items() calls PyTuple_New(), it was possible for dict_items() to have the dict resized right under its nose. The solution is convoluted, and touches several places: keys(), values(), items(), popitem(), PyDict_Next(), and PyDict_SetItem(). There are two parts to it. First, we no longer call dict_resize() in PyDict_Next(), which seems to solve the immediate problem. But then PyDict_SetItem() must have a different policy about when it calls dict_resize(), because we want to guarantee (e.g. for an algorithm that Jeremy uses in the compiler) that you can loop over a dict using PyDict_Next() and make changes to the dict as long as those changes are only value replacements for existing keys using PyDict_SetItem(). This is done by resizing after the insertion instead of before, and by remembering the size before we insert the item, and if the size is still the same, we don't bother to even check if we might need to resize. An additional detail is that if the dict starts out empty, we must still resize it before the insertion. That was the first part. :-) The second part is to make keys(), values(), items(), and popitem() safe against side effects on the dict caused by allocations, under the assumption that if the GC can cause arbitrary Python code to run, it can cause other threads to run, and it's not inconceivable that our dict could be resized -- it would be insane to write code that relies on this, but not all code is sane. Now, I have this nagging feeling that the loops in lookdict probably are blissfully assuming that doing a simple key comparison does not change the dict's size. This is not necessarily true (the keys could be class instances after all). But that's a battle for another day.	2001-04-15 22:16:26 +00:00
Guido van Rossum	6b356e70b5	Make one more private symbol static.	2001-04-14 17:55:41 +00:00
Guido van Rossum	f68d8e52e7	Make some private symbols static.	2001-04-14 17:55:09 +00:00
Tim Peters	fff5325078	Bug 415514 reported that e.g. "%#x" % 0 blew up, at heart because C sprintf supplies a base marker if and only if the value is not 0. I then fixed that, by tolerating C's inconsistency when it does %#x, and taking away that Python produced 0x0 when formatting 0L (the "long" flavor of 0) under %#x itself. But after talking with Guido, we agreed it would be better to supply 0x for the short int case too, despite that it's inconsistent with C, because C is inconsistent with itself and with Python's hex(0) (plus, while "%#x" % 0 didn't work before, "%#x" % 0L did, and returned "0x0"). Similarly for %#X conversion.	2001-04-12 18:38:48 +00:00
Tim Peters	711088d9b8	Fix for SF bug #415514 : "%#x" % 0 caused assertion failure/abort. http://sourceforge.net/tracker/index.php?func=detail&aid=415514&group_id=5470&atid=105470 For short ints, Python defers to the platform C library to figure out what %#x should do. The code asserted that the platform C returned a string beginning with "0x". However, that's not true when-- and only when --the value being formatted is 0. Changed the code to live with C's inconsistency here. In the meantime, the problem does not arise if you format a long 0 (0L) instead. However, that's because the code we wrote to do %#x conversions on longs produces a leading "0x" regardless of value. That's probably wrong too: we should drop leading "0x", for consistency with C, when (& only when) formatting 0L. So I changed the long formatting code to do that too.	2001-04-12 00:35:51 +00:00
Marc-André Lemburg	ae605341e3	Fixed ref count bug. Patch #411191 . Found by Walter Dörwald.	2001-03-25 19:16:13 +00:00
Fred Drake	db81e8ddf8	Add support for weak references to the function and method types.	2001-03-23 04:19:27 +00:00
Fred Drake	4e262a9631	A small change to the C API for weakly-referencable types: Such types must now initialize the extra field used by the weak-ref machinery to NULL themselves, to avoid having to require PyObject_INIT() to check if the type supports weak references and do it there. This causes less work to be done for all objects (the type object does not need to be consulted to check for the Py_TPFLAGS_HAVE_WEAKREFS bit).	2001-03-22 18:26:47 +00:00
Tim Peters	6783070ebf	Make PyDict_Next safe to use for loops that merely modify the values associated with existing dict keys. This is a variant of part of Michael Hudson's patch #409864 "lazy fix for Pings bizarre scoping crash".	2001-03-21 19:23:56 +00:00
Guido van Rossum	823649d544	Move the code implementing isinstance() and issubclass() to new C APIs, PyObject_IsInstance() and PyObject_IsSubclass() -- both returning an int, or -1 for errors.	2001-03-21 18:40:58 +00:00
Jeremy Hylton	220ae7c0bf	Fix PyFrame_FastToLocals() and counterpart to deal with cells and frees. Note there doesn't seem to be any way to test LocalsToFast(), because the instructions that trigger it are illegal in nested scopes with free variables. Fix allocation strategy for cells that are also formal parameters. Instead of emitting LOAD_FAST / STORE_DEREF pairs for each parameter, have the argument handling code in eval_code2() do the right thing. A side-effect of this change is that cell variables that are also arguments are listed at the front of co_cellvars in the order they appear in the argument list.	2001-03-21 16:43:47 +00:00
Guido van Rossum	a1351fbd88	SF patch #408326 by Robin Thomas: slice objects comparable, not hashable This patch changes the behavior of slice objects in the following manner: - Slice objects are now comparable with other slice objects as though they were logically tuples of (start,stop,step). The tuple is not created in the comparison function, but the comparison behavior is logically equivalent. - Slice objects are not hashable. With the above change to being comparable, slice objects now cannot be used as keys in dictionaries. [I've edited the patch for style. Note that this fixes the problem that dict[i:j] seemed to work but was meaningless. --GvR]	2001-03-20 12:41:34 +00:00
Tim Peters	0f33604e17	SF bug [ #409448 ] Complex division is braindead http://sourceforge.net/tracker/?func=detail&aid=409448&group_id=5470&atid=105470 Now less braindead. Also added test_complex.py, which doesn't test much, but fails without this patch.	2001-03-18 08:21:57 +00:00
Jeremy Hylton	30c9f3991c	Variety of small INC/DECREF patches that fix reported memory leaks with free variables. Thanks to Martin v. Loewis for finding two of the problems. This fixes SF buf 405583. There is also a C API change: PyFrame_New() is reverting to its pre-2.1 signature. The change introduced by nested scopes was a mistake. XXX Is this okay between beta releases? cell_clear(), the GC helper, must decref its reference to break cycles. frame_dealloc() must dealloc all cell vars and free vars in addition to locals. eval_code2() setup code must INCREF cells it copies out of the closure. The STORE_DEREF opcode implementation must DECREF the object it passes to PyCell_Set().	2001-03-13 01:58:22 +00:00
Tim Peters	b2336529ef	Identifiers matching _[A-Z_]\w* are reserved for C implementations. May or may not be related to bug 407680 (obmalloc.c - looks like it's corrupted). This repairs the illegal vrbl names, but leaves a pile of illegal macro names (_THIS_xxx, _SYSTEM_xxx, _SET_HOOKS, _FETCH_HOOKS).	2001-03-11 18:36:13 +00:00
Tim Peters	7069512bd0	When 1.6 boosted the # of digits produced by repr(float), repr(complex) apparently forgot to play along. Make complex act like float.	2001-03-11 08:37:29 +00:00
Martin v. Löwis	01c6526c0e	Avoid giving prototypes on Solaris.	2001-03-06 12:14:54 +00:00
Martin v. Löwis	2b6727bd8a	Use Py_CHARMASK for ctype macros. Fixes bug #232787 .	2001-03-06 12:12:02 +00:00
Guido van Rossum	4f53da07bf	Two improvements to large file support: - In _portable_ftell(), try fgetpos() before ftello() and ftell64(). I ran into a situation on a 64-bit capable Linux where the C library's ftello() and ftell64() returned negative numbers despite fpos_t and off_t both being 64-bit types; fgetpos() did the right thing. - Define a new typedef, Py_off_t, which is either fpos_t or off_t, depending on which one is 64 bits. This removes the need for a lot of #ifdefs later on. (XXX Should this be moved to pyport.h? That file currently seems oblivious to large fille support, so for now I'll leave it here where it's needed.)	2001-03-01 18:26:53 +00:00
Jeremy Hylton	a52e8fe49a	Visit the closure during traversal and XDECREF it on during deallocation.	2001-03-01 06:06:37 +00:00
Jeremy Hylton	3f571d6497	Fix SF buf 404774 submitted by Gregory H. Ball A user program could delete a function's func_closure, which would cause it to crash when called.	2001-02-28 02:42:56 +00:00
Neil Schemenauer	a35c688055	Add Vladimir Marangozov's object allocator. It is disabled by default. This closes SF patch #401229.	2001-02-27 04:45:05 +00:00
Fred Drake	b60654bc15	The return value from PyObject_ClearWeakRefs() is no longer meaningful, so make it void.	2001-02-26 18:56:37 +00:00
Barry Warsaw	4f9b13bac8	instancemethod_setattro(): Raise TypeError if an attempt is made to set a function attribute on a method (either bound or unbound). This reverts to Python 2.0 behavior that no attributes of the method are writable, but provides a more informative error message.	2001-02-26 18:09:15 +00:00
Barry Warsaw	a903ad9855	_Py_ReleaseInternedStrings(): Private API function to decref and release the interned string dictionary. This is useful for memory use debugging because it eliminates a huge source of noise from the reports. Only defined when INTERN_STRINGS is defined.	2001-02-23 16:40:48 +00:00
Barry Warsaw	eefb107a48	_PyObject_Dump(): If argument is NULL, print "NULL" instead of crashing.	2001-02-22 22:39:18 +00:00
Guido van Rossum	2da0ea82ba	In try_3way_to_rich_compare(), swap the call to default_3way_compare() and the test for errors, so that an error in the default compare doesn't go undetected. This fixes SF Bug #132933 (submitted by effbot) -- list.sort doesn't detect comparision errors.	2001-02-22 22:18:04 +00:00
Fredrik Lundh	ccc7473fc8	reorganized PyUnicode_DecodeUnicodeEscape a bit (in order to make it less likely that bug #132817 ever appears again)	2001-02-18 22:13:49 +00:00
Guido van Rossum	b86c549c7c	Fix core dump whenever PyList_Reverse() was called. This fixes SF bug #132008, reported by Warren J. Hack. The copyright for this patch (and this patch only) belongs to CNRI, as part of the (yet to be issued) 1.6.1 release. This is now checked into the HEAD branch. Tim will check in a test case to check for this specific bug, and an assertion in PyArgs_ParseTuple() to catch similar bugs in the future.	2001-02-12 22:06:02 +00:00
Neil Schemenauer	693291ba23	Superseded by $(srcdir)/Makefile.pre.in.	2001-02-03 17:18:21 +00:00
Jeremy Hylton	2492a20579	SF patch 103543 from tg@freebsd.org: PyFPE_END_PROTECT() was called on undefined var	2001-02-01 23:53:05 +00:00
Fred Drake	41deb1efc2	PEP 205, Weak References -- initial checkin.	2001-02-01 05:27:45 +00:00
Guido van Rossum	3202c6fac8	Rename dubiously named local variable 'cmpfunc' -- this is also a typedef, and at least one compiler choked on this. (SF patch #103457, by bquinlan)	2001-01-29 23:50:25 +00:00
Jeremy Hylton	2b724da8d9	Remove f_closure slot of frameobject and use f_localsplus instead. This change eliminates an extra malloc/free when a frame with free variables is created. Any cell vars or free vars are stored in f_localsplus after the locals and before the stack. eval_code2() fills in the appropriate values after handling initialization of locals. To track the size the frame has an f_size member that tracks the total size of f_localsplus. It used to be implicitly f_nlocals + f_stacksize.	2001-01-29 22:51:52 +00:00
Jeremy Hylton	09ac89ae78	fix indentation glitch	2001-01-29 22:38:32 +00:00
Marc-André Lemburg	fde66e1bcc	Fixed .capitalize() method of Unicode objects to work like the corresponding string method. Added tests for this too. Patch written by Marc-Andre Lemburg. Copyright assigned to Guido van Rossum.	2001-01-29 11:14:16 +00:00
Moshe Zadka	497671e094	The one thing I love more then writing code is deleting code. * Removed func_hash and func_compare, so they can be treated as immutable content-less objects (address hash and comparison) * Added tests to that affect to test_funcattrs (also testing func_code is writable) * Reverse meaning of tests in test_opcodes which checked identical code gets identical functions	2001-01-29 06:21:17 +00:00
Fred Drake	5cc2c8c3c8	Re-factored PyInstance_New() into PyInstance_New() and PyInstance_NewRaw().	2001-01-28 03:53:08 +00:00
Jeremy Hylton	64949cb753	PEP 227 implementation The majority of the changes are in the compiler. The mainloop changes primarily to implement the new opcodes and to pass a function's closure to eval_code2(). Frames and functions got new slots to hold the closure. Include/compile.h Add co_freevars and co_cellvars slots to code objects. Update PyCode_New() to take freevars and cellvars as arguments Include/funcobject.h Add func_closure slot to function objects. Add GetClosure()/SetClosure() functions (and corresponding macros) for getting at the closure. Include/frameobject.h PyFrame_New() now takes a closure. Include/opcode.h Add four new opcodes: MAKE_CLOSURE, LOAD_CLOSURE, LOAD_DEREF, STORE_DEREF. Remove comment about old requirement for opcodes to fit in 7 bits. compile.c Implement changes to code objects for co_freevars and co_cellvars. Modify symbol table to use st_cur_name (string object for the name of the current scope) and st_cur_children (list of nested blocks). Also define st_nested, which might more properly be called st_cur_nested. Add several DEF_XXX flags to track def-use information for free variables. New or modified functions of note: com_make_closure(struct compiling , PyCodeObject ) Emit LOAD_CLOSURE opcodes as needed to pass cells for free variables into nested scope. com_addop_varname(struct compiling , int, char ) Emits opcodes for LOAD_DEREF and STORE_DEREF. get_ref_type(struct compiling , char name) Return NAME_CLOSURE if ref type is FREE or CELL symtable_load_symbols(struct compiling ) Decides what variables are cell or free based on def-use info. Can now raise SyntaxError if nested scopes are mixed with exec or from blah import . make_scope_info(PyObject , PyObject , int, int) Helper functions for symtable scope stack. symtable_update_free_vars(struct symtable ) After a code block has been analyzed, it must check each of its children for free variables that are not defined in the block. If a variable is free in a child and not defined in the parent, then it is defined by block the enclosing the current one or it is a global. This does the right logic. symtable_add_use() is now a macro for symtable_add_def() symtable_assign(struct symtable , node *) Use goto instead of for (;;) Fixed bug in symtable where name of keyword argument in function call was treated as assignment in the scope of the call site. Ex: def f(): g(a=2) # a was considered a local of f ceval.c eval_code2() now take one more argument, a closure. Implement LOAD_CLOSURE, LOAD_DEREF, STORE_DEREF, MAKE_CLOSURE> Also: When name error occurs for global variable, report that the name was global in the error mesage. Objects/frameobject.c Initialize f_closure to be a tuple containing space for cellvars and freevars. f_closure is NULL if neither are present. Objects/funcobject.c Add support for func_closure. Python/import.c Change the magic number. Python/marshal.c Track changes to code objects.	2001-01-25 20:06:59 +00:00
Jeremy Hylton	fbd849f201	PEP 227 implementation A cell contains a reference to a single PyObject. It could be implemented as a mutable, one-element sequence, but the separate type has less overhead.	2001-01-25 20:04:14 +00:00
Guido van Rossum	d1f06b9b2f	Check the Py_TPFLAGS_HAVE_RICHCOMPARE flag before using the tp_richcompare field! (Hopefully this will make Python 2.1 binary compatible with certain Zope extensions. :-)	2001-01-24 22:14:43 +00:00
Ka-Ping Yee	fa004ad36c	Show '\011', '\012', and '\015' as '\t', '\n', '\r' in strings. Switch from octal escapes to hex escapes for other nonprintable characters.	2001-01-24 17:19:08 +00:00
Fredrik Lundh	06d126803c	Move uchhash functionality into unicodedata (after the recent crop of changes, the files are small enough to do this). Also adds "name" and "lookup" functions to unicodedata.	2001-01-24 07:59:11 +00:00
Barry Warsaw	bbd89b66b1	PyObject_Dump() -> _PyObject_Dump() PyGC_Dump() -> _PyGC_Dump()	2001-01-24 04:18:13 +00:00
Barry Warsaw	903138f775	PyObject_Dump(): Use %p format to print the address of the pointer. PyGC_Dump(): Wrap this in a #ifdef WITH_CYCLE_GC.	2001-01-23 16:33:18 +00:00
Barry Warsaw	9bf16440f4	A few miscellaneous helpers. PyObject_Dump(): New function that is useful when debugging Python's C runtime. In something like gdb it can be a pain to get some useful information out of PyObject's. This function prints the str() of the object to stderr, along with the object's refcount and hex address. PyGC_Dump(): Similar to PyObject_Dump() but knows how to cast from the garbage collector prefix back to the PyObject structure. [See Misc/gdbinit for some useful gdb hooks] none_dealloc(): Rather than SEGV if we accidentally decref None out of existance, we assign None's and NotImplemented's destructor slot to this function, which just calls abort().	2001-01-23 16:24:35 +00:00
Guido van Rossum	0871e9315e	New special case in comparisons: None is smaller than any other object (unless the object's type overrides this comparison).	2001-01-22 19:28:09 +00:00
Guido van Rossum	8f9143da33	Once again, numeric-smelling objects compare smaller than non-numeric ones.	2001-01-22 15:59:32 +00:00
Fredrik Lundh	9e9bcda547	forgot to check in the new makeunicodedata.py script	2001-01-21 17:01:31 +00:00
Neil Schemenauer	d38855c35a	Remove a smelly export.	2001-01-21 16:25:18 +00:00
Fredrik Lundh	f60560626c	Better error message if ucnhash cannot be found (obscure attribute errors aren't that helpful), or doesn't contain what's expected from it. Also tweaked the test script so it compiles even if ucnhash is missing.	2001-01-20 11:15:25 +00:00
Barry Warsaw	b0e754d488	Tim chastens: Barry, that comment belongs in the code, not in the checkin msg. The code used to do this correctly (as you well know, since you & I went thru considerable pain to fix this the first time). However, because the reason for the convolution wasn't recorded in the code as a comment, somebody threw it all away the first time it got reworked. c-code-isn't-often-self-explanatory-ly y'rs - tim default_3way_compare(): Stick the checkin message from 2.110 in a comment.	2001-01-20 06:24:55 +00:00
Barry Warsaw	71ff8d5dc5	default_3way_compare(): When comparing the pointers, they must be cast to integer types (i.e. Py_uintptr_t, our spelling of C9X's uintptr_t). ANSI specifies that pointer compares other than == and != to non-related structures are undefined. This quiets an Insure portability warning.	2001-01-20 06:08:10 +00:00
Barry Warsaw	0395fdd3a9	Application and elaboration of patch #103305 to fix core dumps when del'ing func.func_dict. I took the opportunity to also clean up some other nits with the code, namely core dumps when del'ing func_defaults and KeyError instead of AttributeError when del'ing a non-existant function attribute. Specifically, func_memberlist: Move func_dict and __dict__ into here instead of special casing them in the setattro and getattro methods. I don't remember why I took them out of here before I first uploaded the PEP 232 patch. :/ func_getattro(): No need to special case __dict__/func_dict since their now in the func_memberlist and PyMember_Get() should Do The Right Thing (i.e. transforms NULL values into Py_None). func_setattro(): Document the intended behavior of del'ing or setting to None one of the special func_* attributes. I.e.: func_code - can only be set to a code object. It can't be del'd or set to None. func_defaults - can be del'd. Can only be set to None or a tuple. func_dict - can be del'd. Can only be set to None or a dictionary. Fix core dumps and incorrect exceptions as described above. Also, if we're del'ing an arbitrary function attribute but func_dict is NULL, don't create func_dict before discovering that we'll get an AttributeError anyway.	2001-01-19 19:53:29 +00:00
Fredrik Lundh	0fdb90cafe	refactored the unicodeobject/ucnhash interface, to hide the implementation details inside the ucnhash module. also cleaned up the unicode copyright blurb a little; Secret Labs' internal revision history isn't that interesting...	2001-01-19 09:45:02 +00:00
Tim Peters	19fe14e76a	Derivative of patch #102549 , "simpler, faster(!) implementation of string.join". Also fixes two long-standing bugs (present in 2.0): 1. .join() didn't check that the result size fit in an int. 2. string.join(s) when len(s)==1 returned s[0] regardless of s[0]'s type; e.g., "".join([3]) returned 3 (overly optimistic optimization). I resisted a keen temptation to make .join() apply str() automagically.	2001-01-19 03:03:47 +00:00
Guido van Rossum	65e8bd7fd5	Rich comparisons fallout: instance_hash() should check for both __cmp__ and __eq__ absent before deciding to do a quickie based on the object address. (Tim Peters discovered this.)	2001-01-18 23:46:31 +00:00
Guido van Rossum	41c3244875	Rich comparisons fallout: PyObject_Hash() should check for both tp_compare and tp_richcompare NULL before deciding to do a quickie based on the object address. (Tim Peters discovered this.)	2001-01-18 23:33:37 +00:00
Guido van Rossum	a3af41d564	Changes to recursive-object comparisons, having to do with a test case I found where rich comparison of unequal recursive objects gave unintuituve results. In a discussion with Tim, where we discovered that our intuition on when a<=b should be true was failing, we decided to outlaw ordering comparisons on recursive objects. (Once we have fixed our intuition and designed a matching algorithm that's practical and reasonable to implement, we can allow such orderings again.) - Refactored the recursive-object comparison framework; more is now done in the support routines so less needs to be done in the calling routines (even at the expense of slowing it down a bit -- this should normally never be invoked, it's mostly just there to avoid blowing up the interpreter). - Changed the framework so that the comparison operator used is also stored. (The dictionary now stores triples (v, w, op) instead of pairs (v, w).) - Changed the nesting limit to a more reasonable small 20; this only slows down comparisons of very deeply nested objects (unlikely to occur in practice), while speeding up comparisons of recursive objects (previously, this would first waste time and space on 500 nested comparisons before it would start detecting recursion). - Changed rich comparisons for recursive objects to raise a ValueError exception when recursion is detected for ordering oprators (<, <=, >, >=). Unrelated change: - Moved PyObject_Unicode() to just under PyObject_Str(), where it belongs. MAL's patch must've inserted in a random spot between two functions in the file -- between two helpers for rich comparison...	2001-01-18 22:07:06 +00:00
Tim Peters	60f42b50d8	Move distributed and duplicated config for stat() and fstat() into pyport.h.	2001-01-18 03:03:16 +00:00
Guido van Rossum	be4cbb1668	Use rich comparisons to fulfill an old wish: complex numbers now raise exceptions when compared using <, <=, > or >=. NOTE: This is a tentative change: this means that cmp() involving complex numbers will raise an exception when the numbers differ, and that in turn means that e.g. dictionaries and certain other compounds (e.g. UserLists) containing complex numbers can't be compared either. So we'll have to decide whether this is acceptable. The alpha test cycle is a good time to keep an eye on this!	2001-01-18 01:12:39 +00:00
Guido van Rossum	b932420cc7	Rich comparisons: - Use PyObject_RichCompareBool() when comparing keys; this makes the error handling cleaner. - There were two implementations for dictionary comparison, an old one (#ifdef'ed out) and a new one. Got rid of the old one, which was abandoned years ago. - In the characterize() function, part of dictionary comparison, use PyObject_RichCompareBool() to compare keys and values instead. But continue to use PyObject_Compare() for comparing the final (deciding) elements. - Align the comments in the type struct initializer. Note: I don't implement rich comparison for dictionaries -- there doesn't seem to be much to be gained. (The existing comparison already decides that shorter dicts are always smaller than longer dicts.)	2001-01-18 00:39:02 +00:00
Guido van Rossum	f77bc62e73	Same treatment as listobject.c: - tuplecontains(): call RichCompare(Py_EQ). - Get rid of tuplecompare(), in favor of new tuplerichcompare() (a clone of list_compare()). - Aligned the comments for large struct initializers.	2001-01-18 00:00:53 +00:00
Guido van Rossum	24f67d568c	Fix a leak in instance_coerce(). This was introduced by Neil's earlier coercion changes, not by rich comparisons. When a coercion function returns 1 (meaning it cannot do it), it should not INCREF the arguments. When no __coerce__() method was found, instance_coerce() originally returned 0, pretending it did it. Neil changed the return value to 1, more accurately reflecting that it didn't do anything, but forgot to take out the two INCREF calls.	2001-01-17 23:43:43 +00:00
Guido van Rossum	65e1cea6e3	Convert to rich comparisons: - sort's docompare() calls RichCompare(Py_LT). - list_contains(), list_index(), listcount(), listremove() call RichCompare(Py_EQ). - Get rid of list_compare(), in favor of new list_richcompare(). The latter does some nice shortcuts, like when == or != is requested, it first compares the lengths for trivial accept/reject. Then it goes over the items until it finds an index where the items differe; then it does more shortcut magic to minimize the number of additional comparisons. - Aligned the comments for large struct initializers.	2001-01-17 22:11:59 +00:00
Guido van Rossum	2ffbf6b112	Deal properly (?) with comparing recursive datastructures. - Use the compare nesting level and in-progress dictionary properly in PyObject_RichCompare(). - Change the in-progress code to use static variables instead of globals (both the nesting level and the key for the thread dict were globals but have no reason to be globals; the key can even be a function-static variable in get_inprogress_dict()). - Rewrote try_rich_to_3way_compare() to benefit from the similarity of the three cases, making it table-driven. - In try_rich_to_3way_compare(), test for EQ before LT and GT. This turns out essential when comparing recursive UserList instances; with the old code, these would recurse into rich comparison three times for each nesting level up to NESTING_LIMIT/2, making the total number of calls in the order of 3**(NESTING_LIMIT/2)! NOTE: I'm not 100% comfortable with this. It works for the standard test suite (which compares a few trivial recursive data structures only), but I'm not sure that the in-progress dictionary is used properly by the rich comparison code. Jeremy suggested that maybe the operation should be included in the dict. Currently I presume that objects in the dict are equal unless proven otherwise, and I set the outcome for the rich comparison accordingly: true for operators EQ, LE, GE, and false for the other three. But Jeremy seems to think that there may be counter-examples where this doesn't do the right thing.	2001-01-17 21:27:02 +00:00
Marc-André Lemburg	ad7c98e264	This patch adds a new builtin unistr() which behaves like str() except that it always returns Unicode objects. A new C API PyObject_Unicode() is also provided. This closes patch #101664. Written by Marc-Andre Lemburg. Copyright assigned to Guido van Rossum.	2001-01-17 17:09:53 +00:00
Guido van Rossum	f916e7aa62	Rich comparisons fall-out: - Get rid of float_cmp(). - Renamed Py_TPFLAGS_NEWSTYLENUMBER to Py_TPFLAGS_CHECKTYPES.	2001-01-17 15:33:42 +00:00
Guido van Rossum	6fd867b04d	Rich comparisons fall-out: - Get rid of long_cmp(). - Renamed Py_TPFLAGS_NEWSTYLENUMBER to Py_TPFLAGS_CHECKTYPES.	2001-01-17 15:33:18 +00:00
Guido van Rossum	3968e4c0f5	Rich comparisons fall-out: - Get rid of int_cmp(). - Renamed Py_TPFLAGS_NEWSTYLENUMBER to Py_TPFLAGS_CHECKTYPES.	2001-01-17 15:32:23 +00:00
Guido van Rossum	c31896960a	Rich comparisons fall-out: - Renamed Py_TPFLAGS_NEWSTYLENUMBER to Py_TPFLAGS_CHECKTYPES. - Use PyObject_RichCompareBool() in PySequence_Contains().	2001-01-17 15:29:42 +00:00
Guido van Rossum	8998b4f691	Rich comparisons. - Got rid of instance_cmp(); refactored instance_compare(). - Added instance_richcompare() which calls __lt__() etc. Some unrelated stuff mixed in: - Aligned comments in various large struct initializers. - Better test to avoid recursion if __coerce__ returns self as the first argument (this is an unrelated fix by Neil Schemenauer!). - Style nit: don't use Py_DECREF(Py_NotImplemented); use Py_DECREF(result) -- it just looks better. :-)	2001-01-17 15:28:20 +00:00
Guido van Rossum	e797ec1cb8	Rich comparisons. Refactored internal routine do_cmp() and added APIs PyObject_RichCompare() and PyObject_RichCompareBool(). XXX Note: the code that checks for deeply nested rich comparisons is bogus -- it assumes the two objects are always identical, rather than using the same logic as PyObject_Compare(). I'll fix that later.	2001-01-17 15:24:28 +00:00
Guido van Rossum	e54e0be3b6	Rationalizing the fallback code for portable fseek -- this is all much simpler if we use fgetpos and fsetpos, rather than trying to mess with platform-specific TELL64 alternatives. Of course, this hasn't been tested on a 64-bit platform, so I may have to withdraw this -- but I'm hopeful, and Trent Mick supports this patch!	2001-01-16 20:53:31 +00:00
Marc-André Lemburg	3a645e4dd4	Added checks to prevent PyUnicode_Count() from dumping core in case the parameters are out of bounds and fixes error handling for .count(), .startswith() and .endswith() for the case of mixed string/Unicode objects. This patch adds Python style index semantics to PyUnicode_Count() indices (including the special handling of negative indices). The patch is an extended version of patch #103249 submitted by Michael Hudson (mwh) on SF. It also includes new test cases.	2001-01-16 11:54:12 +00:00
Barry Warsaw	d6a9e84c81	Committing PEP 232, function attribute feature, approved by Guido. Closes SF patch #103123. funcobject.h: PyFunctionObject: add the func_dict slot. funcobject.c: PyFunction_New(): Initialize the func_dict slot to NULL. func_getattr(): Rename to func_getattro() and change the signature. It's more efficient to use attro methods and dig the C string out than it is to re-convert a C string to a PyString. Also, add support for getting the __dict__ (a.k.a. func_dict) attribute, and for getting an arbitrary function attribute. func_setattr(): Rename to func_setattro() and change the signature for the same reason. Also add support for setting __dict__ (a.k.a. func_dict) and any arbitrary function attribute. func_dealloc(): Be sure to DECREF the func_dict slot. func_traverse(): Be sure to traverse func_dict too. PyFunction_Type: make the necessary func_?etattro() changes. classobject.c: instancemethod_memberlist: Add __dict__ instancemethod_setattro(): New method to set arbitrary attributes on methods (really the underlying im_func). Raise TypeError when the instance is bound or when you're trying to set one of the reserved im_* attributes. instancemethod_getattr(): Renamed to instancemethod_getattro() since that's what it really is. Also, added support fo getting arbitrary attributes through the im_func. PyMethod_Type: Do the ?etattr{,o} dance.	2001-01-15 20:40:19 +00:00
Guido van Rossum	65e0b99b61	SF patch #103158 by Greg Ball: Don't do unsafe arithmetic in xrange object. This fixes potential overflows in xrange()'s internal calculations on 64-bit platforms. The fix is complicated because the sq_length slot function can only return an int; we want to support xrange(sys.maxint), which is a 64-bit quantity on most 64-bit platforms (except Win64). The solution is hacky but the best possible: when the range is that long, we can use it in a for loop but we can't ask for its length (nor can we actually iterate beyond 2**31-1, because the sq_item slot function has the same restrictions on its arguments. Fixing those restrictions is a project for another day...	2001-01-15 18:58:56 +00:00
Tim Peters	142297ac92	Speed getline_via_fgets(), by supplying two "fast paths", although one is faster than the other. Should be faster for Mark Favas's 254-character mail log lines, and is 3-4% quicker for my test case with much shorter lines (but they're typical of my text files, and I'm tired of optimizing for everyone else at my expense <wink> -- in fact, the only one who loses here is Guido ...).	2001-01-15 10:36:56 +00:00
Tim Peters	f29b64d243	Use the "MS" getline hack (fgets()) by default on non-get_unlocked platforms. See NEWS for details.	2001-01-15 06:33:19 +00:00
Guido van Rossum	e07d5cf966	Jeff Epler's patch adding an xreadlines() method. (It just imports the xreadlines module and lets it do its thing.)	2001-01-09 21:50:24 +00:00
Guido van Rossum	dcf5715db1	Tsk, tsk, tsk. Treat FreeBSD the same as the other BSDs when defining a fallback for TELL64. Fixes SF Bug #128119.	2001-01-09 02:00:11 +00:00
Neil Schemenauer	010b0cc218	Fix a silly bug in float_pow. Sorry Tim.	2001-01-08 06:29:50 +00:00
Tim Peters	1c73323d6f	A few reformats; no logic changes.	2001-01-08 04:02:07 +00:00
Guido van Rossum	8628206b95	Let's hope that three time's a charm... Tim discovered another "bug" in my get_line() code: while the comments said that n<0 was invalid, it was in fact still called with n<0 (when PyFile_GetLine() was called with n<0). In that case fortunately executed the same code as for n==0. Changed the comment to admit this fact, and changed Tim's MS speed hack code to use 'n <= 0' as the criteria for the speed hack.	2001-01-08 01:26:47 +00:00
Tim Peters	15b838521f	Fiddled ms_getline_hack after talking w/ Guido: made clearer that the code duplication is to let us get away without a realloc whenever possible; boosted the init buf size (the cutoff at which we can get away without a realloc) from 100 to 200 so that more files can enjoy this boost; and allowed other threads to run in all cases. The last two cost something, but not significantly: in my fat test case, less than a 1% slowdown total. Since my test case has a great many short lines, that's probably the worst slowdown, too. While the logic barely changed, there were lots of edits. This also gets rid of the reference to fp->_cnt, so the last platform assumption being made here is that fgets doesn't overwrite bytes capriciously (== beyond the terminating null byte it must write).	2001-01-08 00:53:12 +00:00
Tim Peters	86821b2563	MS Win32 .readline() speedup, as discussed on Python-Dev. This is a tricky variant that never needs to "search from the right". Also fixed unlikely memory leak in get_line, if string size overflows INTMAX. Also new std test test_bufio to make sure .readline() works.	2001-01-07 21:19:34 +00:00
Guido van Rossum	4ddf0a01f7	Tim noticed that I had botched get_line_raw(). Looking again, I realized that this behavior is already present in PyFile_GetLine(), which is the only place that needs it. A little refactoring of that function made get_line_raw() redundant.	2001-01-07 20:51:39 +00:00
Marc-André Lemburg	ec233e5803	This patch adds a new feature to the builtin charmap codec: The mapping dictionaries can now contain 1-n mappings, meaning that character ordinals may be mapped to strings or Unicode object, e.g. 0x0078 ('x') -> u"abc", causing the ordinal to be replaced by the complete string or Unicode object instead of just one character. Another feature introduced by the patch is that of mapping oridnals to the emtpy string. This allows removing characters. The patch is different from patch #103100 in that it does not cause a performance hit for the normal use case of 1-1 mappings. Written by Marc-Andre Lemburg, copyright assigned to Guido van Rossum.	2001-01-06 14:59:58 +00:00
Guido van Rossum	1187aa4d33	Restructured get_line() for clarity and speed. - The raw_input() functionality is moved to a separate function. - Drop GNU getline() in favor of getc_unlocked(), which exists on more platforms (and is even a tad faster on my system).	2001-01-05 14:43:05 +00:00
Neil Schemenauer	5ed85ec0c0	Changes for PEP 208. PyObject_Compare has been rewritten. Instances no longer get special treatment. The Py_NotImplemented type is here as well.	2001-01-04 01:48:10 +00:00
Neil Schemenauer	ba872e2534	Make long a new style number type. Sequence repeat is now done here now as well.	2001-01-04 01:46:03 +00:00
Neil Schemenauer	139e72ad1a	Make int a new style number type. Sequence repeat is now done here now as well.	2001-01-04 01:45:33 +00:00
Neil Schemenauer	32117e5c29	Make float a new style number type.	2001-01-04 01:44:34 +00:00
Neil Schemenauer	29bfc07183	Make instances a new style number type. See PEP 208 for details. Instance types no longer get special treatment from abstract.c so more number number methods have to be implemented.	2001-01-04 01:43:46 +00:00
Neil Schemenauer	5a1f015bee	Massive changes as per PEP 208. Read it for details.	2001-01-04 01:39:06 +00:00
Jeremy Hylton	1fb6088e86	dict_update has two boundary conditions: a.update(a) and a.update({}) Added test for second one.	2001-01-03 22:34:59 +00:00
Jeremy Hylton	db60bb5aad	fix leak	2001-01-03 22:32:16 +00:00
Marc-André Lemburg	a866df806d	This patch changes the default behaviour of the builtin charmap codec to not apply Latin-1 mappings for keys which are not found in the mapping dictionaries, but instead treat them as undefined mappings. The patch was originally written by Martin v. Loewis with some additional (cosmetic) changes and an updated test script by Marc-Andre Lemburg. The standard codecs were recreated from the most current files available at the Unicode.org site using the Tools/scripts/gencodec.py tool. This patch closes the bugs #116285 and #119960.	2001-01-03 21:29:14 +00:00
Neil Schemenauer	10e31cf82e	Add garbage collection for module objects. Closes patch #102939 and fixes bug #126345.	2001-01-02 15:58:27 +00:00
Fred Drake	e7e190ef97	Make the indentation consistently use tabs instead of using spaces just in one place.	2000-12-20 00:55:07 +00:00
Andrew M. Kuchling	f947ffe951	Patch #102940 : use only printable Unicode chars in reporting incorrect % characters; characters outside the printable range are replaced with '?'	2000-12-19 22:49:06 +00:00
Andrew M. Kuchling	932af110d3	Patch #102868 from cgw: fix memory leak when an EOF is encountered using GNU libc's getline()	2000-12-19 20:59:04 +00:00
Guido van Rossum	cda4f9a8dc	Fix off-by-one error in split_substring(). Fixes SF bug #122162 .	2000-12-19 02:23:19 +00:00
Andrew M. Kuchling	6ca8917758	[ Patch #102852 ] Make % error a bit more informative by indicates the index at which an unknown %-escape was found	2000-12-15 13:07:46 +00:00
Guido van Rossum	adf5410dc4	Test for NULL returned from PyObject_NEW().	2000-12-14 15:09:46 +00:00
Guido van Rossum	9e8f4ea0aa	Test for NULL returned from PyObject_NEW().	2000-12-14 14:59:53 +00:00
Tim Peters	f7f88b11e4	Add long-overdue docstrings to dict methods.	2000-12-13 23:18:45 +00:00
Tim Peters	0e76ab2ecc	Use METH_VARARGS instead of "1" in list method table.	2000-12-13 22:35:46 +00:00
Tim Peters	f1c7c884b3	Typo repair in comments. Fell for GregS's .popitem() poke.	2000-12-13 19:58:25 +00:00
Tim Peters	ea8f2bf9ca	Bring comments up to date (e.g., they still said the table had to be a prime size, which is in fact never true anymore ...).	2000-12-13 01:02:46 +00:00
Guido van Rossum	ba6ab84e73	Add popitem() -- SF patch #102733 .	2000-12-12 22:02:18 +00:00
Fred Drake	49312a52ec	Jeffrey D. Collins <tokeneater@users.sourceforge.net>: Fix type of the self parameter to some string object methods. This closes patch #102670.	2000-12-06 14:27:49 +00:00
Moshe Zadka	5725d1eb03	Backing out my changes. Improved version coming soon to a Source Forge near you!	2000-11-30 19:30:21 +00:00
Andrew M. Kuchling	1221e6df3d	Only use getline() when compiling using glibc	2000-11-30 18:27:50 +00:00
Moshe Zadka	1a62750eda	Added .first{item,value,key}() to dictionaries. Complete with docos and tests. OKed by Guido.	2000-11-30 12:31:03 +00:00
Tim Peters	a3a3a030af	Fox for SF bug #123859 : %[duxXo] long formats inconsistent.	2000-11-30 05:22:44 +00:00
Andrew M. Kuchling	4b2b445f28	Patch #102469 : Use glibc's getline() extension when reading unbounded lines	2000-11-29 02:53:22 +00:00
Guido van Rossum	d7aa0f245f	Update dependencies per /F.	2000-11-28 12:09:18 +00:00
Guido van Rossum	2ccda8a7c4	SF patch #102548 , fix for bug #121013 , by mwh@users.sourceforge.net. Fixes a typo that caused "".join(u"this is a test") to dump core.	2000-11-27 18:46:26 +00:00
Guido van Rossum	ecaa77798b	Added _HAVE_BSDI and __APPLE__ to the list of platforms that require a hack for TELL64()... Sounds like there's something else going on really. Does anybody have a clue I can buy?	2000-11-13 19:48:22 +00:00
Fred Drake	0b796fa5c5	Fixed support for containment test when a negative step is used; this really closes bug #121965. Added three attributes to the xrange object: start, stop, and step. These are the same as for the slice objects.	2000-11-08 19:42:43 +00:00
Fred Drake	a91e1650aa	In the containment test, get the boundary condition right. ">" was used where ">=" should have been. This closes bug #121965.	2000-11-08 18:37:05 +00:00
Fredrik Lundh	fad27aee11	Added 38,642 missing characters to the Unicode database (first-last ranges) -- but thanks to the 2.0 compression scheme, this doesn't add a single byte to the resulting binaries (!) Closes bug #117524	2000-11-03 20:24:15 +00:00
Fred Drake	661ea26b3d	Ka-Ping Yee <ping@lfw.org>: Changes to error messages to increase consistency & clarity. This (mostly) closes SourceForge patch #101839.	2000-10-24 19:57:45 +00:00
Marc-André Lemburg	53f3d4ac74	[ Bug #116174 ] using %% in cstrings sometimes fails with unicode paramsFix for the bug reported in Bug #116174 : "%% %s" % u"abc" failed due to the way string formatting delegated work to the Unicode formatting function.	2000-10-07 08:54:09 +00:00
Fred Drake	db810ac2b8	Donn Cave <donn@oz.net>: Fix large file support for BeOS. This closes SourceForge patch #101773. Refer to the patch discussion for information on possible alternate fixes.	2000-10-06 20:42:33 +00:00
Tim Peters	c54d19043a	SF bug 115831 and Ping's SF patch 101751, 0.0**-2.0 returns inf rather than raise ValueError. Checked in the patch as far as it went, but also changed all of ints, longs and floats to raise ZeroDivisionError instead when raising 0 to a negative number. This is what 754-inspired stds require, as the "true result" is an infinity obtained from finite operands, i.e. it's a singularity. Also changed float pow to not be so timid about using its square-and-multiply algorithm. Note that what math.pow does is unrelated to what builtin pow does, and will still vary by platform.	2000-10-06 00:36:09 +00:00
Neil Schemenauer	08b53e6a2a	Simplify _PyTuple_Resize by not using the tuple free list and dropping support for the last_is_sticky flag. A few hard to find bugs may be fixed by this patch since the old code was buggy.	2000-10-05 19:36:49 +00:00
Thomas Wouters	dc9100f57d	Fix for SF bug #115987 : PyInstance_HalfBinOp does not initialize the result-object-pointer that is passed in, when an exception occurs during coercion. The pointer has to be explicitly initialized in the caller to avoid putting trash on the Python stack.	2000-10-05 12:43:25 +00:00
Tim Peters	d57731f74b	Move LONG_BIT from intobject.c to pyport.h. #error if it's already been #define'd to an unreasonable value (several recent gcc systems have misdefined it, causing bogus overflows in integer multiplication). Nuke CHAR_BIT entirely.	2000-10-05 01:42:25 +00:00
Neil Schemenauer	e3550a65eb	- fix a GC bug caused by malloc() failing	2000-10-04 16:20:41 +00:00
Barry Warsaw	5b4c22806f	_PyUnicode_Fini(): Initialize the local freelist walking variable `u' after unicode_empty has been freed, otherwise it might not point to the real start of the unicode_freelist. Final closure for SF bug #110681, Jitterbug PR#398.	2000-10-03 20:45:26 +00:00
Guido van Rossum	4ae8ef84da	In _PyUnicode_Fini(), decref unicode_empty before tearng down the free list. Discovered by Barry, fix approved by MAL.	2000-10-03 18:09:04 +00:00
Fred Drake	d5fadf75e4	Rationalize use of limits.h, moving the inclusion to Python.h. Add definitions of INT_MAX and LONG_MAX to pyport.h. Remove includes of limits.h and conditional definitions of INT_MAX and LONG_MAX elsewhere. This closes SourceForge patch #101659 and bug #115323.	2000-09-26 05:46:01 +00:00
Fredrik Lundh	375732cd41	- don't set the titlecase flag for uppercase letters (sorry, tim)	2000-09-25 23:03:34 +00:00
Fredrik Lundh	9e7dd4c185	unicode database compression, step 3: - use unidb compression for the unicodectype module. smaller, faster, and slightly more portable...	2000-09-25 21:48:13 +00:00
Fredrik Lundh	69b58e2772	unicode database compression, step 3: - use unidb compression for the unicodectype module. smaller, faster, and slightly more portable... (note: this commit doesn't include the unicodectype.c file itself; I'm still waiting for the reviewers...)	2000-09-25 21:12:34 +00:00
Tim Peters	858346e484	Replace SIGFPE paranoia around strtod and atof. I don't believe these fncs are allowed to raise SIGFPE (see the C std), but OK by me if people using --with-fpectl want to pay for checking anyway.	2000-09-25 21:01:28 +00:00
Tim Peters	ef14d73b7a	Fix for SF bug 110624: float literals behave inconsistently. I fixed the specific complaint but left the (many) large issues untouched. See the (very long) bug report discussion for why: http://sourceforge.net/bugs/?func=detailbug&group_id=5470&bug_id=110624 Note that while I left the interface to the undocumented public API function PyFloat_FromString alone, its 2nd argument is useless. From a comment block in the code: RED_FLAG 22-Sep-2000 tim PyFloat_FromString's pend argument is braindead. Prior to this RED_FLAG, 1. If v was a regular string, pend was set to point to its terminating null byte. That's useless (the caller can find that without any help from this function!). 2. If v was a Unicode string, or an object convertible to a character buffer, pend was set to point into stack trash (the auto temp vector holding the character buffer). That was downright dangerous. Since we can't change the interface of a public API function, pend is still supported but now officially useless: if pend is not NULL, *pend is set to NULL.	2000-09-23 03:39:17 +00:00
Guido van Rossum	1a5e5830a7	Untested patch by Ty Sarna to make TELL64 work on older NetBSD systems. According to Justin Pettit, this also works on OpenBSD, so I've added that symbol as well.	2000-09-21 22:15:29 +00:00
Guido van Rossum	1e3c8ccb9b	As suggested by Toby Dickenson, setting ob_type to NULL in _Py_Dealloc(), is a bad idea (and always was!). So let's drop it.	2000-09-21 16:25:33 +00:00
Tim Peters	38fd5b6413	Derived from Martin's SF patch 110609: support unbounded ints in %d,i,u,x,X,o formats. Note a curious extension to the std C rules: x, X and o formatting can never produce a sign character in C, so the '+' and ' ' flags are meaningless for them. But unbounded ints can produce a sign character under these conversions (no fixed- width bitstring is wide enough to hold all negative values in 2's-comp form). So these flags become meaningful in Python when formatting a Python long which is too big to fit in a C long. This required shuffling around existing code, which hacked x and X conversions to death when both the '#' and '0' flags were specified: the hacks weren't strong enough to deal with the simultaneous possibility of the ' ' or '+' flags too, since signs were always meaningless before for x and X conversions. Isomorphic shuffling was required in unicodeobject.c. Also added dozens of non-trivial new unbounded-int test cases to test_format.py.	2000-09-21 05:43:11 +00:00
Marc-André Lemburg	d1ba443206	This patch adds a new Python C API called PyString_AsStringAndSize() which implements the automatic conversion from Unicode to a string object using the default encoding. The new API is then put to use to have eval() and exec accept Unicode objects as code parameter. This closes bugs #110924 and #113890. As side-effect, the traditional C APIs PyString_Size() and PyString_AsString() will also accept Unicode objects as parameters.	2000-09-19 21:04:18 +00:00
Marc-André Lemburg	e44e507b0e	PyObject_SetAttr() and PyObject_GetAttr() now also accept Unicode objects for the attribute name. Unicode objects are converted to a string using the default encoding before trying the lookup. Note that previously it was allowed to pass arbitrary objects as attribute name in case the tp_getattro/setattro slots were defined. This patch fixes this by applying an explicit string check first: all uses of these slots expect string objects and do not check for the type resulting in a core dump. The tp_getattro/setattro are still useful as optimization for lookups using interned string objects though. This patch fixes bug #113829.	2000-09-18 16:20:57 +00:00

... 4 5 6 7 8 ...

1301 Commits