cpython

Commit Graph

Author	SHA1	Message	Date
Tim Peters	6000464d08	Added new function k_lopsided_mul(), which is much more efficient than k_mul() when inputs have vastly different sizes, and a little more efficient when they're close to a factor of 2 out of whack. I consider this done now, although I'll set up some more correctness tests to run overnight.	2002-08-12 22:01:34 +00:00
Tim Peters	547607c4bf	k_mul(): Moved an assert down. In a debug build, interrupting a multiply via Ctrl+C could cause a NULL-pointer dereference due to the assert.	2002-08-12 19:43:49 +00:00
Tim Peters	70b041bbe7	k_mul(): Heh -- I checked in two fixes for the last problem. Only keep the good one <wink>. Also checked in a test-aid by mistake.	2002-08-12 19:38:01 +00:00
Tim Peters	d8b2173ef9	k_mul(): White-box testing turned up that (ah+al)(bh+bl) can, in rare cases, overflow the allocated result object by 1 bit. In such cases, it would have been brought back into range if we subtracted albl and ahbh from it first, but I don't want to do that because it hurts cache behavior. Instead we just ignore the excess bit when it appears -- in effect, this is forcing unsigned mod BASE*(asize + bsize) arithmetic in a case where that doesn't happen all by itself.	2002-08-12 19:30:26 +00:00
Guido van Rossum	3747a0f04c	Fix MSVC warnings.	2002-08-12 19:25:08 +00:00
Guido van Rossum	ad47da072a	Refactor how __dict__ and __weakref__ interact with __slots__. 1. You can now have __dict__ and/or __weakref__ in your __slots__ (before only __weakref__ was supported). This is treated differently than before: it merely sets a flag that the object should support the corresponding magic. 2. Dynamic types now always have descriptors __dict__ and __weakref__ thrust upon them. If the type in fact does not support one or the other, that descriptor's __get__ method will raise AttributeError. 3. (This is the reason for all this; it fixes SF bug 575229, reported by Cesar Douady.) Given this code: class A(object): __slots__ = [] class B(object): pass class C(A, B): __slots__ = [] the class object for C was broken; its size was less than that of B, and some descriptors on B could cause a segfault. C now correctly inherits __weakrefs__ and __dict__ from B, even though A is the "primary" base (C.__base__ is A). 4. Some code cleanup, and a few comments added.	2002-08-12 19:05:44 +00:00
Tim Peters	115c888b97	x_mul(): Made life easier for C optimizers in the "grade school" algorithm. MSVC 6 wasn't impressed <wink>. Something odd: the x_mul algorithm appears to get substantially worse than quadratic time as the inputs grow larger: bits in each input x_mul time k_mul time ------------------ ---------- ---------- 15360 0.01 0.00 30720 0.04 0.01 61440 0.16 0.04 122880 0.64 0.14 245760 2.56 0.40 491520 10.76 1.23 983040 71.28 3.69 1966080 459.31 11.07 That is, x_mul is perfectly quadratic-time until a little burp at 2.56->10.76, and after that goes to hell in a hurry. Under Karatsuba, doubling the input size "should take" 3 times longer instead of 4, and that remains the case throughout this range. I conclude that my "be nice to the cache" reworkings of k_mul() are paying.	2002-08-12 18:25:43 +00:00
Tim Peters	d64c1def7c	k_mul() and long_mul(): I'm confident that the Karatsuba algorithm is correct now, so added some final comments, did some cleanup, and enabled it for all long-int multiplies. The KARAT envar no longer matters, although I left some #if 0'ed code in there for my own use (temporary). k_mul() is still much slower than x_mul() if the inputs have very differenent sizes, and that still needs to be addressed.	2002-08-12 17:36:03 +00:00
Tim Peters	738eda742c	k_mul: Rearranged computation for better cache use. Ignored overflow (it's possible, but should be harmless -- this requires more thought, and allocating enough space in advance to prevent it requires exactly as much thought, to know exactly how much that is -- the end result certainly fits in the allocated space -- hmm, but that's really all the thought it needs! borrows/carries out of the high digits really are harmless).	2002-08-12 15:08:20 +00:00
Tim Peters	44121a6bc9	x_mul(): This failed to normalize its result. k_mul(): This didn't allocate enough result space when one input had more than twice as many bits as the other. This was partly hidden by that x_mul() didn't normalize its result. The Karatsuba recurrence is pretty much hosed if the inputs aren't roughly the same size. If one has at least twice as many bits as the other, we get a degenerate case where the "high half" of the smaller input is 0. Added a special case for that, for speed, but despite that it helped, this can still be much slower than the "grade school" method. It seems to take a really wild imbalance to trigger that; e.g., a 2*22-bit input times a 1000-bit input on my box runs about twice as slow under k_mul than under x_mul. This still needs to be addressed. I'm also not sure that allocating a->ob_size + b->ob_size digits is enough, given that this is computing k = (ah+al)(bh+bl) instead of k = (ah-al)*(bl-bh); i.e., it's certainly enough for the final result, but it's vaguely possible that adding in the "artificially" large k may overflow that temporarily. If so, an assert will trigger in the debug build, but we'll probably compute the right result anyway(!).	2002-08-12 06:17:58 +00:00
Tim Peters	877a212678	Introduced helper functions v_iadd and v_isub, for in-place digit-vector addition and subtraction. Reworked the tail end of k_mul() to use them. This saves oodles of one-shot longobject allocations (this is a triply- recursive routine, so saving one allocation in the body saves 3**n allocations at depth n; we actually save 2 allocations in the body).	2002-08-12 05:09:36 +00:00
Tim Peters	fc07e56844	k_mul(): Repaired another typo in another comment.	2002-08-12 02:54:10 +00:00
Tim Peters	18c15b9bbd	k_mul(): Repaired typo in comment.	2002-08-12 02:43:58 +00:00
Tim Peters	5af4e6c739	Cautious introduction of a patch that started from SF 560379: Karatsuba multiplication. Lots of things were changed from that. This needs a lot more testing, for correctness and speed, the latter especially when bit lengths are unbalanced. For now, the Karatsuba code gets invoked if and only if envar KARAT exists.	2002-08-12 02:31:19 +00:00
Tim Peters	da1a2212c8	int_lshift(): Simplified/sped overflow-checking.	2002-08-11 17:54:42 +00:00
Guido van Rossum	643d59cbd6	Use a better check for overflow from a<<b.	2002-08-11 14:04:13 +00:00
Marc-André Lemburg	cc8764ca9d	Add C API PyUnicode_FromOrdinal() which exposes unichr() at C level. u'%c' will now raise a ValueError in case the argument is an integer outside the valid range of Unicode code point ordinals. Closes SF bug #593581.	2002-08-11 12:23:04 +00:00
Guido van Rossum	078151da90	Implement stage B0 of PEP 237: add warnings for operations that currently return inconsistent results for ints and longs; in particular: hex/oct/%u/%o/%x/%X of negative short ints, and x<<n that either loses bits or changes sign. (No warnings for repr() of a long, though that will also change to lose the trailing 'L' eventually.) This introduces some warnings in the test suite; I'll take care of those later.	2002-08-11 04:24:12 +00:00
Tim Peters	3ddb856ed1	Fixed new typos, added a little info about ~sort versus "hint"s.	2002-08-10 07:04:01 +00:00
Guido van Rossum	40af889081	Disallow class assignment completely unless both old and new are heap types. This prevents nonsense like 2.__class__ = bool or True.__class__ = int.	2002-08-10 05:42:07 +00:00
Tim Peters	e05f65a0c6	1. Combined the base and length arrays into a single array of structs. This is friendlier for caches. 2. Cut MIN_GALLOP to 7, but added a per-sort min_gallop vrbl that adapts the "get into galloping mode" threshold higher when galloping isn't paying, and lower when it is. There's no known case where this hurts. It's (of course) neutral for /sort, \sort and =sort. It also happens to be neutral for !sort. It cuts a tiny # of compares in 3sort and +sort. For sort, it reduces the # of compares to better than what this used to do when MIN_GALLOP was hardcoded to 10 (it did about 0.1% more sort compares before, but given how close we are to the limit, this is "a lot"!). %sort used to do about 1.5% more compares, and ~sort about 3.6% more. Here are exact counts: i *sort 3sort +sort %sort ~sort !sort 15 449235 33019 33016 51328 188720 65534 before 448885 33016 33007 50426 182083 65534 after 0.08% 0.01% 0.03% 1.79% 3.65% 0.00% %ch from after 16 963714 65824 65809 103409 377634 131070 962991 65821 65808 101667 364341 131070 0.08% 0.00% 0.00% 1.71% 3.65% 0.00% 17 2059092 131413 131362 209130 755476 262142 2057533 131410 131361 206193 728871 262142 0.08% 0.00% 0.00% 1.42% 3.65% 0.00% 18 4380687 262440 262460 421998 1511174 524286 4377402 262437 262459 416347 1457945 524286 0.08% 0.00% 0.00% 1.36% 3.65% 0.00% 19 9285709 524581 524634 848590 3022584 1048574 9278734 524580 524633 837947 2916107 1048574 0.08% 0.00% 0.00% 1.27% 3.65% 0.00% 20 19621118 1048960 1048942 1715806 6045418 2097150 19606028 1048958 1048941 1694896 5832445 2097150 0.08% 0.00% 0.00% 1.23% 3.65% 0.00% 3. Added some key asserts I overlooked before. 4. Updated the doc file.	2002-08-10 05:21:15 +00:00
Tim Peters	b80595f44a	The samplesort-vs-mergesort #-of-comparisons comparisons were captured before %sort was introduced. Redid them (the numbers change, but the conclusions don't). Also did the samplesort counts with the released 2.2.1, as they're slightly different under the last CVS 2.3 samplesort (some higher, some lower -- CVS had been changed to stop doing the special-case business on recursive samplesort calls).	2002-08-10 03:04:33 +00:00
Fred Drake	f16c3dc81b	Add support for the iterator protocol to weakref proxy objects. Part of fixing SF bug #591704.	2002-08-09 18:34:16 +00:00
Guido van Rossum	f36921c4b0	Unicode replace() method with empty pattern argument should fail, like it does for 8-bit strings.	2002-08-09 15:36:48 +00:00
Neil Schemenauer	3bc3f28dbe	Only call sq_repeat if the object does not have a nb_multiply slot. One example of where this changes behavior is when a new-style instance defines '__mul__' and '__rmul__' and is multiplied by an int. Before the change the '__rmul__' method is never called, even if the int is the left operand.	2002-08-09 15:20:48 +00:00
Tim Peters	671764beb0	Repaired a braino in the description of bad minrun values.	2002-08-09 05:06:44 +00:00
Guido van Rossum	721f62e200	Major speedup for new-style class creation. Turns out there was some trampolining going on with the tp_new descriptor, where the inherited PyType_GenericNew was overwritten with the much slower slot_tp_new which would end up calling tp_new_wrapper which would eventually call PyType_GenericNew. Add a special case for this to update_one_slot(). XXX Hope there isn't a loophole in this. I'll buy the first person to point out a bug in the reasoning a beer. Backport candidate (but I won't do it).	2002-08-09 02:14:34 +00:00
Raymond Hettinger	48923c5533	Moved special case for tuples from iterobject.c to tupleobject.c. Makes the code in iterobject.c cleaner and speeds-up the general case by not checking for tuples everytime. SF Patch #592065.	2002-08-09 01:30:17 +00:00
Guido van Rossum	7bed213224	Significant speedup in new-style object creation: in slot_tp_new(), intern the string "__new__" so we can call PyObject_GetAttr() rather than PyObject_GetAttrString(). (Though it's a mystery why slot_tp_new is being called when a class doesn't define __new__. I'll look into that tomorrow.) 2.2 backport candidate (but I won't do it).	2002-08-08 21:57:53 +00:00
Guido van Rossum	febd61dc02	A modest speedup of object deallocation. call_finalizer() did rather a lot of work: it had to save and restore the current exception around a call to lookup_maybe(), because that could fail in rare cases, and most objects don't have a __del__ method, so the whole exercise was usually a waste of time. Changed this to cache the __del__ method in the type object just like all other special methods, in a new slot tp_del. So now subtype_dealloc() can test whether tp_del is NULL and skip the whole exercise if it is. The new slot doesn't need a new flag bit: subtype_dealloc() is only called if the type was dynamically allocated by type_new(), so it's guaranteed to have all current slots. Types defined in C cannot fill in tp_del with a function of their own, so there's no corresponding "wrapper". (That functionality is already available through tp_dealloc.)	2002-08-08 20:55:20 +00:00
Tim Peters	6c511e6d1c	Added info about highwater heap-memory use for the sortperf.py tests; + a couple of minor edits elsewhere.	2002-08-08 01:55:16 +00:00
Tim Peters	6063e2615f	PyList_Reverse(): This was leaking a reference to Py_None on every call. I believe I introduced this bug when I refactored the reversal code so that the mergesort could use it too. It's not a problem on the 2.2 branch.	2002-08-08 01:06:39 +00:00
Guido van Rossum	0906e07442	Fix a subtle bug in the trashcan code I added yesterday to subtype_dealloc(). When call_finalizer() failed, it would return without going through the trashcan end macro, thereby unbalancing the trashcan nesting level counter, and thereby defeating the test case (slottrash() in test_descr.py). This in turn meant that the assert in the GC_UNTRACK macro wasn't triggered by the slottrash() test despite a bug in the code: _PyTrash_destroy_chain() calls the dealloc routine with an object that's untracked, and the assert in the GC_UNTRACK macro would fail on this; but because of an earlier test that resurrects an object, causing call_finalizer() to fail and the trashcan nesting level to be unbalanced, so _PyTrash_destroy_chain() was never called. Calling the slottrash() test in isolation did trigger the assert, however. So the fix is twofold: (1) call the GC_UnTrack() function instead of the GC_UNTRACK macro, because the function is safe when the object is already untracked; (2) when call_finalizer() fails, jump to a label that exits through the trashcan end macro, keeping the trashcan nesting balanced.	2002-08-07 20:42:09 +00:00
Martin v. Löwis	3f19b10ca5	Replace abort with Py_FatalError.	2002-08-07 16:21:51 +00:00
Neal Norwitz	657d222700	Make more functions static	2002-08-06 22:12:52 +00:00
Neal Norwitz	d8b995f5e8	Make readahead functions static	2002-08-06 21:50:54 +00:00
Guido van Rossum	22b1387c51	Fix SF bug 574207 (chained __slots__ dealloc segfault). This is inspired by SF patch 581742 (by Jonathan Hogg, who also submitted the bug report, and two other suggested patches), but separates the non-GC case from the GC case to avoid testing for GC several times. Had to fix an assert() from call_finalizer() that asserted that the object wasn't untracked, because it's possible that the object isn't GC'ed!	2002-08-06 21:41:44 +00:00
Barry Warsaw	6a043f3fe8	PyUnicode_Contains(): The memcmp() call didn't take into account the width of Py_UNICODE. Good catch, MAL.	2002-08-06 19:03:17 +00:00
Barry Warsaw	817918cc3c	Committing patch #591250 which provides "str1 in str2" when str1 is a string of longer than 1 character.	2002-08-06 16:58:21 +00:00
Guido van Rossum	7a6e95948c	SF patch 580331 by Oren Tirosh: make file objects their own iterator. For a file f, iter(f) now returns f (unless f is closed), and f.next() is similar to f.readline() when EOF is not reached; however, f.next() uses a readahead buffer that messes up the file position, so mixing f.next() and f.readline() (or other methods) doesn't work right. Calling f.seek() drops the readahead buffer, but other operations don't. The real purpose of this change is to reduce the confusion between objects and their iterators. By making a file its own iterator, it's made clearer that using the iterator modifies the file object's state (in particular the current position). A nice side effect is that this speeds up "for line in f:" by not having to use the xreadlines module. The f.xreadlines() method is still supported for backwards compatibility, though it is the same as iter(f) now. (I made some cosmetic changes to Oren's code, and added a test for "file closed" to file_iternext() and file_iter().)	2002-08-06 15:55:28 +00:00
Raymond Hettinger	bc552ce1b8	SF 582071 clarified the .split() method's docstring to note that sep=None will trigger splitting on any whitespace.	2002-08-05 06:28:21 +00:00
Tim Peters	66860f6da4	Sped the usual case for sorting by calling PyObject_RichCompareBool directly when no comparison function is specified. This saves a layer of function call on every compare then. Measured speedups: i 2*i sort \sort /sort 3sort +sort %sort ~sort =sort !sort 15 32768 12.5% 0.0% 0.0% 100.0% 0.0% 50.0% 100.0% 100.0% -50.0% 16 65536 8.7% 0.0% 0.0% 0.0% 0.0% 0.0% 12.5% 0.0% 0.0% 17 131072 8.0% 25.0% 0.0% 25.0% 0.0% 14.3% 5.9% 0.0% 0.0% 18 262144 6.3% -10.0% 12.5% 11.1% 0.0% 6.3% 5.6% 12.5% 0.0% 19 524288 5.3% 5.9% 0.0% 5.6% 0.0% 5.9% 5.4% 0.0% 2.9% 20 1048576 5.3% 2.9% 2.9% 5.1% 2.8% 1.3% 5.9% 2.9% 4.2% The best indicators are those that take significant time (larger i), and where sort doesn't do very few compares (so sort and ~sort benefit most reliably). The large numbers are due to roundoff noise combined with platform variability; e.g., the 14.3% speedup for %sort at i=17 reflects a printed elapsed time of 0.18 seconds falling to 0.17, but a change in the last digit isn't really meaningful (indeed, if it really took 0.175 seconds, one electron having a lazy nanosecond could shift it to either value <wink>). Similarly the 25% at 3sort i=17 was a meaningless change from 0.05 to 0.04. However, almost all the "meaningless changes" were in the same direction, which is good. The before-and-after times for sort are clearest: before after 0.18 0.16 0.25 0.23 0.54 0.50 1.18 1.11 2.57 2.44 5.58 5.30	2002-08-04 17:47:26 +00:00
Tim Peters	6bdbc9e0b1	SF bug 590366: Small typo in listsort:ParseTuple The PyArg_ParseTuple() error string still said "msort". Changed to "sort".	2002-08-03 02:28:24 +00:00
Guido van Rossum	f4be427c46	Tim found that once test_longexp has run, test_sort takes very much longer to run than normal. A profiler run showed that this was due to PyFrame_New() taking up an unreasonable amount of time. A little thinking showed that this was due to the while loop clearing the space available for the stack. The solution is to only clear the local variables (and cells and free variables), not the space available for the stack, since anything beyond the stack top is considered to be garbage anyway. Also, use memset() instead of a while loop counting backwards. This should be a time savings for normal code too! (By a probably unmeasurable amount. :-)	2002-08-01 18:50:33 +00:00
Guido van Rossum	0dbab4c560	SF patch 588728 (Nathan Srebro). The __delete__ method wrapper for descriptors was not supported (I added a test, too.) 2.2 bugfix candidate.	2002-08-01 14:39:25 +00:00
Tim Peters	a64dc245ac	Replaced samplesort with a stable, adaptive mergesort.	2002-08-01 02:13:36 +00:00
Tim Peters	92f81f2e63	Checking in the doc file for "timsort". There's way too much here to stuff into code comments, and lots of it is going to be useful again (but hard to predict exactly which parts of it ...).	2002-08-01 00:59:42 +00:00
Neal Norwitz	cee5ca060b	SF patch #587889 , fix memory leak of tp_doc	2002-07-30 00:42:06 +00:00
Michael W. Hudson	56796f672f	Fix for [ 587875 ] crash on deleting extended slice The array code got simpler, always a good thing!	2002-07-29 14:35:04 +00:00
Mark Hammond	a290527376	Excise DL_IMPORT/EXPORT from object.h, and related files. This patch also adds 'extern' to PyAPI_DATA rather than at each declaration, as discussed with Tim and Guido.	2002-07-29 13:42:14 +00:00
Neal Norwitz	88fe4ff5a9	Fix the problem of not raising a TypeError exception when doing: '%g' % '1' '%d' % '1' Add a test for these conditions Fix the test so that if not exception is raise, this is a failure	2002-07-28 16:44:23 +00:00
Martin v. Löwis	673c0a2247	Patch #574867 : Correct list.extend docstring.	2002-07-28 16:35:57 +00:00
Neal Norwitz	7beeed5dfd	SF patch #577031 , remove PyArg_Parse() since it's deprecated	2002-07-28 15:19:47 +00:00
Martin v. Löwis	75d2d94e0f	Patch #554716 : Use __va_copy where available.	2002-07-28 10:23:27 +00:00
Skip Montanaro	35b37a5c11	tighten up the unicode object's docstring a tad	2002-07-26 16:22:46 +00:00
Jeremy Hylton	73a088e3fa	Don't be so hasty. If PyInt_AsLong() raises an error, don't set ValueError.	2002-07-25 16:43:29 +00:00
Jeremy Hylton	f20fcf9fed	Complain if __len__() returns < 0, just like classic classes. Fixes SF bug #575773. Bug fix candidate.	2002-07-25 16:06:15 +00:00
Michael W. Hudson	206d8f818f	Silly typo. Not sure how that got in.	2002-07-19 15:52:38 +00:00
Michael W. Hudson	f0d777c56b	A few days ago, Guido said (in the thread "[Python-Dev] Python version of PySlice_GetIndicesEx"): > OK. Michael, if you want to check in indices(), go ahead. Then I did what was needed, but didn't check it in. Here it is.	2002-07-19 15:47:06 +00:00
Tim Peters	330f9e9581	More sort cleanup: Moved the special cases from samplesortslice into listsort. If the former calls itself recursively, they're a waste of time, since it's called on a random permutation of a random subset of elements. OTOH, for exactly the same reason, they're an immeasurably small waste of time (the odds of finding exploitable order in a random permutation are ~= 0, so the special-case loops looking for order give up quickly). The point is more for conceptual clarity. Also changed some "assert comments" into real asserts; when this code was first written, Python.h didn't supply assert.h.	2002-07-19 07:05:44 +00:00
Tim Peters	0fe977c4a9	binarysort() cleanup: Documented the key invariants, explained why they imply this is a stable sort, and added some asserts.	2002-07-19 06:12:32 +00:00
Tim Peters	326b44871e	listreverse(): Don't call the new reverse_slice unless the list has something in it (else ob_item may be a NULL pointer).	2002-07-19 04:04:16 +00:00
Tim Peters	a8c974c157	Cleanup yielding a small speed boost: before rich comparisons were introduced, list.sort() was rewritten to use only the "< or not <?" distinction. After rich comparisons were introduced, docompare() was fiddled to translate a Py_LT Boolean result into the old "-1 for <, 0 for ==, 1 for >" flavor of outcome, and the sorting code was left alone. This left things more obscure than they should be, and turns out it also cost measurable cycles. So: The old CMPERROR novelty is gone. docompare() is renamed to islt(), and now has the same return conditinos as PyObject_RichCompareBool. The SETK macro is renamed to ISLT, and is even weirder than before (don't complain unless you want to maintain the sort code <wink>). Overall, this yields a 1-2% speedup in the usual (no explicit function passed to list.sort()) case when sorting arrays of floats (as sortperf.py does). The boost is higher for arrays of ints.	2002-07-19 03:30:57 +00:00
Tim Peters	3b01a1217f	Trimmed trailing whitespace.	2002-07-19 02:35:45 +00:00
Tim Peters	8e2e7ca330	Cleanup: Define one internal utility for reversing a list slice, and use that everywhere.	2002-07-19 02:33:08 +00:00
Jeremy Hylton	d1fedb6ab5	Remove extraneous semicolon. (Silences compiler warning for Compaq C++ 6.5 on Tru64.)	2002-07-18 18:49:52 +00:00
Jeremy Hylton	938ace69a0	staticforward bites the dust. The staticforward define was needed to support certain broken C compilers (notably SCO ODT 3.0, perhaps early AIX as well) botched the static keyword when it was used with a forward declaration of a static initialized structure. Standard C allows the forward declaration with static, and we've decided to stop catering to broken C compilers. (In fact, we expect that the compilers are all fixed eight years later.) I'm leaving staticforward and statichere defined in object.h as static. This is only for backwards compatibility with C extensions that might still use it. XXX I haven't updated the documentation.	2002-07-17 16:30:39 +00:00
Guido van Rossum	ca5ed5b875	Remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set (fortunately, because using the tp_iternext implementation for the the next() implementation is buggy). Also changed the allocation order in enum_next() so that the underlying iterator is only moved ahead when we have successfully allocated the result tuple and index.	2002-07-16 21:02:42 +00:00
Guido van Rossum	86d593e110	Remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. Also removed the redundant (and expensive!) call to raise StopIteration from rangeiter_next().	2002-07-16 20:47:50 +00:00
Guido van Rossum	2147df748f	Make StopIteration a sink state. This is done by clearing out the di_dict field when the end of the list is reached. Also make the error ("dictionary changed size during iteration") a sticky state. Also remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. That's a good thing, because the implementation given here was buggy (it never raised StopIteration).	2002-07-16 20:30:22 +00:00
Guido van Rossum	613bed3726	Make StopIteration a sink state. This is done by clearing out the object references (it_seq for seqiterobject, it_callable and it_sentinel for calliterobject) when the end of the list is reached. Also remove the next() methods -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. That's a good thing, because the implementation given here was buggy (it never raised StopIteration).	2002-07-16 20:24:46 +00:00
Guido van Rossum	6b6272c857	Whitespace normalization.	2002-07-16 20:10:23 +00:00
Guido van Rossum	86103ae531	Make StopIteration a sink state. This is done by clearing out the it_seq field when the end of the list is reached. Also remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. That's a good thing, because the implementation given here was buggy (it never raised StopIteration).	2002-07-16 20:07:32 +00:00
Jeremy Hylton	719841e2fb	The object returned by tp_new() may not have a tp_init. If the object is an ExtensionClass, for example, the slot is not even defined. So we must check that the type has the slot (implied by HAVE_CLASS) before calling tp_init().	2002-07-16 19:39:38 +00:00
Guido van Rossum	5086e49a6e	Make list_iter() really static.	2002-07-16 15:56:52 +00:00
Guido van Rossum	03013a0130	valid_identifier(): use an unsigned char* so that isalpha() will do the right thing even if char is unsigned.	2002-07-16 14:30:28 +00:00
Tim Peters	58cf361e35	docompare(): Another reasonable optimization from Jonathan Hogg for the explicit comparison function case: use PyObject_Call instead of PyEval_CallObject. Same thing in context, but gives a 2.4% overall speedup when sorting a list of ints via list.sort(__builtin__.cmp).	2002-07-15 05:16:13 +00:00
Tim Peters	7a1f91709b	WINDOWS_LEAN_AND_MEAN: There is no such symbol, although a very few MSDN sample programs use it, apparently in error. The correct name is WIN32_LEAN_AND_MEAN. After switching to the correct name, in two cases more was needed because the code actually relied on things that disappear when WIN32_LEAN_AND_MEAN is defined.	2002-07-14 22:14:19 +00:00
Guido van Rossum	b6d29b7856	Undef MIN and MAX before defining them, to avoid warnings on certain platforms.	2002-07-13 14:31:51 +00:00
Jeremy Hylton	a4b4c3bf05	Don't declare a function with staticforward. Just declare it static so that lame (BAD_STATIC_FORWARD) compilers don't see a mismatch between the prototype and the function.	2002-07-13 03:51:17 +00:00
Tim Peters	f2a0473350	docompare(): Use PyTuple_New instead of Py_BuildValue to build compare's arg tuple. This was suggested on c.l.py but afraid I can't find the msg again for proper attribution. For list.sort(cmp) where list is a list of random ints, and cmp is __builtin__.cmp, this yields an overall 50-60% speedup on my Win2K box. Of course this is a best case, because the overhead of calling cmp relative to the cost of actually comparing two ints is at an extreme. Nevertheless it's huge bang for the buck. An additionak 20-30% can be bought by making the arg tuple an immortal static (avoiding all but "the first" PyTuple_New), but that's tricky to make correct since docompare needs to be reentrant. So this picks the cherry and leaves the pits for Fred <wink>. Note that this makes no difference to the list.sort() case; an arg tuple gets built only if the user specifies an explicit sort function.	2002-07-11 21:46:16 +00:00
Jeremy Hylton	df3f793516	Extend function() to support an optional closure argument. Also, simplify some ref counting for other optional arguments.	2002-07-11 18:30:27 +00:00
Tim Peters	3459251d5a	object.h special-build macro minefield: renamed all the new lexical helper macros to something saner, and used them appropriately in other files too, to reduce #ifdef blocks. classobject.c, instance_dealloc(): One of my worst Python Memories is trying to fix this routine a few years ago when COUNT_ALLOCS was defined but Py_TRACE_REFS wasn't. The special-build code here is way too complicated. Now it's much simpler. Difference: in a Py_TRACE_REFS build, the instance is no longer in the doubly-linked list of live objects while its __del__ method is executing, and that may be visible via sys.getobjects() called from a __del__ method. Tough -- the object is presumed dead while its __del__ is executing anyway, and not calling _Py_NewReference() at the start allows enormous code simplification. typeobject.c, call_finalizer(): The special-build instance_dealloc() pain apparently spread to here too via cut-'n-paste, and this is much simpler now too. In addition, I didn't understand why this routine was calling _PyObject_GC_TRACK() after a resurrection, since there's no plausible way _PyObject_GC_UNTRACK() could have been called on the object by this point. I suspect it was left over from pasting the instance_delloc() code. Instead asserted that the object is still tracked. Caution: I suspect we don't have a test that actually exercises the subtype_dealloc() __del__-resurrected-me code.	2002-07-11 06:23:50 +00:00
Tim Peters	889f61dcfb	Documented PYMALLOC_DEBUG. This completes primary coverage of all the "special builds" I ever use. If you use others, document them here, or don't be surprised if I rip out the code for them <0.5 wink>.	2002-07-10 19:29:49 +00:00
Tim Peters	7c321a80f9	The Py_REF_DEBUG/COUNT_ALLOCS/Py_TRACE_REFS macro minefield: added more trivial lexical helper macros so that uses of these guys expand to nothing at all when they're not enabled. This should help sub- standard compilers that can't do a good job of optimizing away the previous "(void)0" expressions. Py_DECREF: There's only one definition of this now. Yay! That was that last one in the family defined multiple times in an #ifdef maze. Py_FatalError(): Changed the char* signature to const char*. _Py_NegativeRefcount(): New helper function for the Py_REF_DEBUG expansion of Py_DECREF. Calling an external function cuts down on the volume of generated code. The previous inline expansion of abort() didn't work as intended on Windows (the program often kept going, and the error msg scrolled off the screen unseen). _Py_NegativeRefcount calls Py_FatalError instead, which captures our best knowledge of how to abort effectively across platforms.	2002-07-09 02:57:01 +00:00
Tim Peters	c6a3ff634a	SF bug 578752: COUNT_ALLOCS vs heap types Repair segfaults and infinite loops in COUNT_ALLOCS builds in the presence of new-style (heap-allocated) classes/types. Bugfix candidate. I'll backport this to 2.2. It's irrelevant in 2.1.	2002-07-08 22:11:52 +00:00
Tim Peters	4be93d0e84	Rearranged and added comments to object.h, to clarify many things that have taken me "too long" to reverse-engineer over the years. Vastly reduced the nesting level and redundancy of #ifdef-ery. Took a light stab at repairing comments that are no longer true. sys_gettotalrefcount(): Changed to enable under Py_REF_DEBUG. It was enabled under Py_TRACE_REFS, which was much heavier than necessary. sys.gettotalrefcount() is now available in a Py_REF_DEBUG-only build.	2002-07-07 19:59:50 +00:00
Tim Peters	a6269a8ec5	Removed 3 unlikely #includes that were only needed for the non-gc flavor of the trashcan code.	2002-07-07 16:52:50 +00:00
Tim Peters	803526b9e2	Trashcan cleanup: Now that cyclic gc is always there, the trashcan mechanism is no longer evil: it no longer plays dangerous games with the type pointer or refcounts, and objects in extension modules can play along too without needing to edit the core first. Rewrote all the comments to explain this, and (I hope) give clear guidance to extension authors who do want to play along. Documented all the functions. Added more asserts (it may no longer be evil, but it's still dangerous <0.9 wink>). Rearranged the generated code to make it clearer, and to tolerate either the presence or absence of a semicolon after the macros. Rewrote _PyTrash_destroy_chain() to call tp_dealloc directly; it was doing a Py_DECREF again, and that has all sorts of obscure distorting effects in non-release builds (Py_DECREF was already called on the object!). Removed Christian's little "embedded change log" comments -- that's what checkin messages are for, and since it was impossible to correlate the comments with the code that changed, I found them merely distracting.	2002-07-07 05:13:56 +00:00
Tim Peters	943382c8e5	Removed WITH_CYCLE_GC #ifdef-ery. Holes: + I'm not sure what to do about configure.in. Left it alone. + Ditto pyexpat.c. Fred or Martin will know what to do.	2002-07-07 03:59:34 +00:00
Martin v. Löwis	6238d2b024	Patch #569753 : Remove support for WIN16. Rename all occurrences of MS_WIN32 to MS_WINDOWS.	2002-06-30 15:26:10 +00:00
Raymond Hettinger	5a04aec384	Fix SF bug 546434 -- buffer slice type inconsistent.	2002-06-25 00:25:30 +00:00
Raymond Hettinger	ab5dae35ca	Fix SF bug 572567: Memory leak in object comparison.	2002-06-24 13:08:16 +00:00
Jeremy Hylton	8b47dffc93	Fix for SF bug 571885 When resizing a tuple, zero out the memory starting at the end of the old tuple not at the beginning of the old tuple.	2002-06-20 23:13:17 +00:00
Raymond Hettinger	0ae0c07661	SF 569257 -- Name mangle double underscored variable names in __slots__.	2002-06-20 22:23:15 +00:00
Michael W. Hudson	9c14badc5f	Fix the bug described in http://mail.python.org/pipermail/python-dev/2002-June/025461.html with test cases. Also includes extended slice support for arrays, which I thought I'd already checked in but obviously not.	2002-06-19 15:44:15 +00:00
Guido van Rossum	63517577fd	Patch from SF bug 570483 (Tim Northover). In a fresh interpreter, type.mro(tuple) would segfault, because PyType_Ready() isn't called for tuple yet. To fix, call PyType_Ready(type) if type->tp_dict is NULL.	2002-06-18 16:44:57 +00:00
Michael W. Hudson	b1e8154013	About the new but unreferenced new_class, Guido sez: > Looks like an experiment by Oren Tirosh that didn't get nuked. I > think you can safely lose it. It's gone.	2002-06-18 12:38:06 +00:00
Guido van Rossum	bea18ccde6	SF patch 568629 by Oren Tirosh: types made callable. These built-in functions are replaced by their (now callable) type: slice() buffer() and these types can also be called (but have no built-in named function named after them) classobj (type name used to be "class") code function instance instancemethod (type name used to be "instance method") The module "new" has been replaced with a small backward compatibility placeholder in Python. A large portion of the patch simply removes the new module from various platform-specific build recipes. The following binary Mac project files still have references to it: Mac/Build/PythonCore.mcp Mac/Build/PythonStandSmall.mcp Mac/Build/PythonStandalone.mcp [I've tweaked the code layout and the doc strings here and there, and added a comment to types.py about StringTypes vs. basestring. --Guido]	2002-06-14 20:41:17 +00:00
Guido van Rossum	59e6c53920	Inexplicably, recurse_down_subclasses() was comparing the object gotten from a weak reference to NULL instead of to None. This caused the following assert() to fail (but only in 2.2 in the debug build -- I have to find a better test case). Will backport.	2002-06-14 02:27:07 +00:00
Neal Norwitz	2c2e827029	Missed one use of new PyDoc_STRVAR macro	2002-06-14 02:04:18 +00:00
Neal Norwitz	1f68fc7fa5	SF bug # 493951 string.{starts,ends}with vs slices Handle negative indices similar to slices.	2002-06-14 00:50:42 +00:00
Neal Norwitz	4178515035	SF # 533070 Silence AIX C Compiler Warnings Warning caused by using &func. & is not necessary.	2002-06-13 21:42:51 +00:00
Guido van Rossum	e7b8ecf196	Major cleanup operation: whenever there's a call that looks for an optional attribute, only clear the exception when the internal getattr operation raised AttributeError. Many places in this file already had that policy; but just as many didn't, and there didn't seem to be any rhyme or reason to it. Be consistently cautious. Question: should I backport this? On the one hand it's a bugfix. On the other hand it's a change in behavior. Certain forms of buggy or just weird code would work in the past but raise an exception under the new rules; e.g. if you define a __getattr__ method that raises a non-AttributeError exception.	2002-06-13 21:42:04 +00:00
Guido van Rossum	16b93b3d0e	Fix for SF bug 532646. This is a little simpler than what Neal suggested there, based upon a better analysis (__getattr__ is a red herring). Will backport to 2.2.	2002-06-13 21:32:51 +00:00
Neal Norwitz	51290d369d	SF # 561244 Micro optimizations Cleanup code a bit and return as early as possible.	2002-06-13 21:32:44 +00:00
Neal Norwitz	20e72130c4	Fix typo in exception message	2002-06-13 21:25:17 +00:00
Neal Norwitz	35fc7606f0	SF #561244 Micro optimizations Convert loops to memset()s.	2002-06-13 21:11:11 +00:00
Martin v. Löwis	14f8b4cfcb	Patch #568124 : Add doc string macros.	2002-06-13 20:33:02 +00:00
Guido van Rossum	09638c16d8	Hopefully this addresses the remaining issues of SF bugs 459235 and 473985. Through a subtle rearrangement of some members in the etype struct (!), mapping methods are now preferred over sequence methods, which is necessary to support str.__getitem__("hello", slice(4)) etc.	2002-06-13 19:17:46 +00:00
Guido van Rossum	65ce6de35c	Rearrange the #ifndef WITHOUT_COMPLEX so it can be picked up from pyconfig.h.	2002-06-13 17:07:07 +00:00
Michael W. Hudson	589dc93620	Fix for problem reported by Neal Norwitz. Tighten up calculation of slicelength. Include his test case.	2002-06-11 13:38:42 +00:00
Guido van Rossum	75a20b19ef	Fold remaining long lines.	2002-06-11 12:22:28 +00:00
Michael W. Hudson	5efaf7eac8	This is my nearly two year old patch [ 400998 ] experimental support for extended slicing on lists somewhat spruced up and better tested than it was when I wrote it. Includes docs & tests. The whatsnew section needs expanding, and arrays should support extended slices -- later.	2002-06-11 10:55:12 +00:00
Guido van Rossum	cab05807fc	Undo the last chunk of the previous patch, putting back a useful assert into PyType_Ready(): now that we're not clearing tp_dict, we can assert that it's non-NULL again.	2002-06-10 15:29:03 +00:00
Guido van Rossum	a386209754	In the recent python-dev thread "Bizarre new test failure", we discovered that subtype_traverse must traverse the type if it is a heap type, because otherwise some cycles involving a type and its instance would not be collected. Simplest example: while 1: class C(object): pass C.ref = C() This program grows without bounds before this fix. (It grows ever slower since it spends ever more time in the collector.) Simply adding the right visit() call to subtype_traverse() revealed other problems. With MvL's help we re-learned that type_clear() doesn't have to clear all references, only the ones that may not be cleared by other means. Careful analysis (see comments in the code) revealed that only tp_mro needs to be cleared. (The previous checkin to this file adds a test for tp_mro==NULL to _PyType_Lookup() that's essential to prevent crashes due to tp_mro being NULL when subtype_dealloc() tries to look for a __del__ method.) The same kind of analysis also revealed that subtype_clear() doesn't need to clear the instance dict. With this fix, a useful property of the collector is once again guaranteed: a single gc.collect() call will clear out all garbage. (It didn't always before, which put us on the track of this bug.) Will backport to 2.2.	2002-06-10 15:24:42 +00:00
Guido van Rossum	2309498595	Three's a charm: yet another fix for SF bug 551412. Thinking again about the test case, slot_nb_power gets called on behalf of its second argument, but with a non-None modulus it wouldn't check this, and believes it is called on behalf of its first argument. Fix this properly, and get rid of the code in _PyType_Lookup() that tries to call _PyType_Ready(). But do leave a check for a NULL tp_mro there, because this can still legitimately occur. I'll fix this in 2.2.x too.	2002-06-10 14:30:43 +00:00
Raymond Hettinger	478d47a168	Close SF bug 563740. complex() now finds __complex__() in new style classes. Made conversion failure error messages consistent between types. Added related unittests.	2002-06-06 15:45:38 +00:00
Neal Norwitz	56f46f8d8c	Pyrangeiter_Type && range_iter should be static	2002-06-06 14:58:21 +00:00
Raymond Hettinger	c4c453f5ae	Skip Montanaro's patch, SF 559833, exposing xrange type in builtins. Also, added more regression tests to cover the new type and test its conformity with range().	2002-06-05 23:12:45 +00:00
Raymond Hettinger	48165d40cb	SF 564601 adding rangeiterobject to make xrange() iterate like range().	2002-06-05 20:08:48 +00:00
Thomas Heller	aee2d5f975	Better isinstance error message. Closes SF patch # 560250. Bugfix candidate IMO.	2002-06-05 12:55:19 +00:00
Guido van Rossum	9923ffe2c0	Address SF bug 519621: slots weren't traversed by GC. While I was at it, I added a tp_clear handler and changed the tp_dealloc handler to use the clear_slots helper for the tp_clear handler. Also tightened the rules for slot names: they must now be proper identifiers (ignoring the dirty little fact that <ctype.h> is locale sensitive). Also set mp->flags = READONLY for the __weakref__ pseudo-slot. Most of this is a 2.2 bugfix candidate; I'll apply it there myself.	2002-06-04 19:52:53 +00:00
Raymond Hettinger	e53e7a2c7d	Inverted test for small speedup	2002-06-04 18:45:50 +00:00
Guido van Rossum	ed3e09f289	Add a docstring to the module type.	2002-06-04 06:02:35 +00:00
Guido van Rossum	c3a787e090	Surprising fix for SF bug 563060: module can be used as base class. Change the module constructor (module_init) to have the signature __init__(name:str, doc=None); this prevents the call from type_new() to succeed. While we're at it, prevent repeated calling of module_init for the same module from leaking the dict, changing the semantics so that __dict__ is only initialized if NULL. Also adding a unittest, test_module.py. This is an incompatibility with 2.2, if anybody was instantiating the module class before, their argument list was probably empty; so this can't be backported to 2.2.x.	2002-06-04 05:52:47 +00:00
Guido van Rossum	b65c65b301	Address the residual issue with the fix for SF 551412 in _PyType_Lookup(). Decided to clear the error condition in the unfortunate but unlikely case that PyType_Ready() fails. Will fix in 2.2.x too.	2002-06-03 19:52:41 +00:00
Tim Peters	93b2cc4e97	A bogus assert in the new listiter code prevented starting Python in a debug build. Repaired that, and rewrote other parts to reduce long-winded casting.	2002-06-01 05:22:55 +00:00
Raymond Hettinger	14bd6de0ec	SF 560736. Optimize list iteration by filling the tp_iter slot.	2002-05-31 21:40:38 +00:00
Neal Norwitz	3f8dae73c7	Fix typo	2002-05-31 20:23:33 +00:00
Guido van Rossum	a407300cd7	Implement the intention of SF patch 472523 (but coded differently). In the past, an object's tp_compare could return any value. In 2.2 the docs were tightened to require it to return -1, 0 or 1; and -1 for an error. We now issue a warning if the value is not in this range. When an exception is raised, we allow -1 or -2 as return value, since -2 will the recommended return value for errors in the future. (Eventually tp_compare will also be allowed to return +2, to indicate NotImplemented; but that can only be implemented once we know all extensions return a value in [-2...1]. Or perhaps it will require the type to set a flag bit.) I haven't decided yet whether to backport this to 2.2.x. The patch applies fine. But is it fair to start warning in 2.2.2 about code that worked flawlessly in 2.2.1?	2002-05-31 20:03:54 +00:00
Neal Norwitz	32a7e7f6b6	Change name from string to basestring	2002-05-31 19:58:02 +00:00
Marc-André Lemburg	4164439240	Fix a possible segfault. Found be Neal Norvitz.	2002-05-29 13:46:29 +00:00
Marc-André Lemburg	4da6fd63bc	Fix for bug [ 561796 ] string.find causes lazy error	2002-05-29 11:33:13 +00:00
Guido van Rossum	9fc8a29663	Fix for SF bug 551412. When _PyType_Lookup() is called on a type whose tp_mro hasn't been initialized, it would dump core. Fix this by checking for NULL and calling PyType_Ready(). Will fix this in 2.2.1 too.	2002-05-24 21:40:08 +00:00
Guido van Rossum	cacfc07d08	- A new type object, 'string', is added. This is a common base type for 'str' and 'unicode', and can be used instead of types.StringTypes, e.g. to test whether something is "a string": isinstance(x, string) is True for Unicode and 8-bit strings. This is an abstract base class and cannot be instantiated directly.	2002-05-24 19:01:59 +00:00
Guido van Rossum	9ee4b94f51	Add a safeguard against setting the class to something with a different free or alloc slot.	2002-05-24 18:47:47 +00:00
Tim Peters	bf9cb3ff1a	Use function instead of macro spellings for PyObject_memorystuff.	2002-05-23 15:51:20 +00:00
Neal Norwitz	d4e5be5340	Closes: #556025 seg fault when doing list(xrange(1e9)) A MemoryError is now raised when the list cannot be created. There is a test, but as the comment says, it really only works for 32 bit systems. I don't know how to improve the test for other systems (ie, 64 bit or systems where the data size != addressable size, e.g. 64 bit data, but 48 bit addressable memory)	2002-05-22 23:19:17 +00:00
Barry Warsaw	4be55b5cef	file_doc: Add some description of the U mode character, but only when WITH_UNIVERSAL_NEWLINES is enabled.	2002-05-22 20:37:53 +00:00
Raymond Hettinger	0ebac97058	Patch 549187. Improve string formatting error message.	2002-05-21 15:14:57 +00:00
Guido van Rossum	4402241450	Jim Fulton reported a segfault in dir(). A heavily proxied object returned a proxy for __class__ whose __bases__ was also a proxy. The merge_class_dict() helper for dir() assumed incorrectly that __bases__ would always be a tuple and used the in-line tuple API on the proxy. I will backport this to 2.2 as well.	2002-05-13 18:29:46 +00:00
Walter Dörwald	775c11f07a	Add #ifdef PY_USING_UNICODE sections, so that stringobject.c compiles again with --disable-unicode. Fixes SF bug http://www.python.org/sf/554912	2002-05-13 09:00:41 +00:00
Martin v. Löwis	e452659237	Patch #551410 : Implement tp_getiter.	2002-05-08 08:49:27 +00:00
Martin v. Löwis	01f94bda38	Patch #552433 : Special-case tuples. Avoid sub-type checking for lists. Avoid checks for negative indices and duplicate checks for support of the sequence protocol.	2002-05-08 08:44:21 +00:00
Tim Peters	4ce71f77c3	PyObject_Realloc(): If a small block is shrinking, bite the expense of copying it if at least 25% of the input block can be reclaimed.	2002-05-02 20:19:34 +00:00
Fred Drake	d9018323c0	Remove old deprecated features from the xrange object.	2002-05-02 19:56:55 +00:00
Fred Drake	edb51bb7e8	Fix attribute access for the xrange objects. The tp_getattr and tp_getattro handlers were both set, but were not compatible. This change uses only the tp_getattro handler with a more "modern" approach. This fixes SF bug #551285.	2002-05-02 16:05:27 +00:00
Skip Montanaro	71390a9a94	clarify message when raising TypeError to indicate that float() accepts strings or numbers	2002-05-02 13:03:22 +00:00
Tim Peters	29c0afcfec	Just added comments, and cleared some XXX questions, related to int memory management.	2002-04-28 16:57:34 +00:00
Tim Peters	449b5a8da1	_PyObject_DebugCheckAddress(): If the leading pad bytes are corrupt, display a msg warning that the count of bytes requested may be bogus, and that a segfault may happen next.	2002-04-28 06:14:45 +00:00
Tim Peters	5de9842b34	Repair widespread misuse of _PyString_Resize. Since it's clear people don't understand how this function works, also beefed up the docs. The most common usage error is of this form (often spread out across gotos): if (_PyString_Resize(&s, n) < 0) { Py_DECREF(s); s = NULL; goto outtahere; } The error is that if _PyString_Resize runs out of memory, it automatically decrefs the input string object s (which also deallocates it, since its refcount must be 1 upon entry), and sets s to NULL. So if the "if" branch ever triggers, it's an error to call Py_DECREF(s): s is already NULL! A correct way to write the above is the simpler (and intended) if (_PyString_Resize(&s, n) < 0) goto outtahere; Bugfix candidate.	2002-04-27 18:44:32 +00:00
Tim Peters	602f740bc2	SF patch 549375: Compromise PyUnicode_EncodeUTF8 This implements ideas from Marc-Andre, Martin, Guido and me on Python-Dev. "Short" Unicode strings are encoded into a "big enough" stack buffer, then exactly as much string space as they turn out to need is allocated at the end. This should have speed benefits akin to Martin's "measure once, allocate once" strategy, but without needing a distinct measuring pass. "Long" Unicode strings allocate as much heap space as they could possibly need (4 x # Unicode chars), and do a realloc at the end to return the untouched excess. Since the overallocation is likely to be substantial, this shouldn't burden the platform realloc with unusably small excess blocks. Also simplified uses of the PyString_xyz functions. Also added a release- build check that 4*size doesn't overflow a C int. Sooner or later, that's going to happen.	2002-04-27 18:03:26 +00:00
Guido van Rossum	7dab2426ca	- New builtin function enumerate(x), from PEP 279. Example: enumerate("abc") is an iterator returning (0,"a"), (1,"b"), (2,"c"). The argument can be an arbitrary iterable object.	2002-04-26 19:40:56 +00:00
Guido van Rossum	517c7d4fd3	PyNumber_CoerceEx: this took a shortcut (not doing anything) when the left and right type were of the same type and not classic instances. This shortcut is dangerous for proxy types, because it means that coerce(Proxy(1), Proxy(2.1)) leaves Proxy(1) unchanged rather than turning it into Proxy(1.0). In an ever-so-slight change of semantics, I now only take the shortcut when the left and right types are of the same type and don't have the CHECKTYPES feature. It so happens that classic instances have this flag, so the shortcut is still skipped in this case (i.e. nothing changes for classic instances). Proxies also have this flag set (otherwise implementing numeric operations on proxies would become nightmarish) and this means that the shortcut is also skipped there, as desired. It so happens that int, long and float also have this flag set; that means that e.g. coerce(1, 1) will now invoke int_coerce(). This is fine: int_coerce() can deal with this, and I'm not worried about the performance; int_coerce() is only invoked when the user explicitly calls coerce(), which should be rarer than rare.	2002-04-26 02:49:14 +00:00
Guido van Rossum	9364698101	Make sure that tp_free frees the int the same way as tp_dealloc would. This fixes the problem that Barry reported on python-dev: >>> 23000 .__class__ = bool crashes in the deallocator. This was because int inherited tp_free from object, which uses the default allocator. 2.2. Bugfix candidate.	2002-04-26 00:53:34 +00:00
Guido van Rossum	c95cc87447	Clean up the layout of the bool_as_number struct initializer.	2002-04-25 20:01:10 +00:00
Barry Warsaw	f16951cffe	abstract_get_bases(): Clarify exactly what the return values and states can be for this function, and ensure that only AttributeErrors are masked. Any other exception raised via the equivalent of getattr(cls, '__bases__') should be propagated up. abstract_issubclass(): If abstract_get_bases() returns NULL, we must call PyErr_Occurred() to see if an exception is being propagated, and return -1 or 0 as appropriate. This is the specific fix for a problem whereby if getattr(derived, '__bases__') raised an exception, an "undetected error" would occur (under a debug build). This nasty situation was uncovered when writing a security proxy extension type for the Zope3 project, where the security proxy raised a Forbidden exception on getattr of __bases__. PyObject_IsInstance(), PyObject_IsSubclass(): After both calls to abstract_get_bases(), where we're setting the TypeError if the return value is NULL, we must first check to see if an exception occurred, and /not/ mask an existing exception. Neil Schemenauer should double check that these changes don't break his ExtensionClass examples (there aren't any test cases for those examples and abstract_get_bases() was added by him in response to problems with ExtensionClass). Neil, please add test cases if possible! I belive this is a bug fix candidate for Python 2.2.2.	2002-04-23 22:45:44 +00:00
Jeremy Hylton	c4ad0bcbe5	Clarify return value of PyLong_AsLongLong(). The function is documented to return -1 on error. If res was < 0, it returned res. It wasn't clear that the invariant was res < 0 iff res == -1.	2002-04-23 20:01:20 +00:00
Tim Peters	030a5cebf4	unicode_memchr(): Squashed gratuitous int-vs-size_t mismatch (which gives a compiler wng under MSVC because of the resulting signed-vs- unsigned comparison).	2002-04-22 19:00:10 +00:00
Walter Dörwald	de02bcb265	Apply patch diff.txt from SF feature request http://www.python.org/sf/444708 This adds the optional argument for str.strip to unicode.strip too and makes it possible to call str.strip with a unicode argument and unicode.strip with a str argument.	2002-04-22 17:42:37 +00:00
Tim Peters	e1682a80fa	Py_UniversalNewlineFread(): small speed boost on non-Windows boxes.	2002-04-21 18:15:20 +00:00
Tim Peters	0eca65c4c5	PyUnicode_EncodeUTF8(): tightened the memory asserts a bit, and at least tried to catch some possible arithmetic overflows in the debug build.	2002-04-21 17:28:06 +00:00
Martin v. Löwis	2a7ff35a07	Back out 2.140.	2002-04-21 09:59:45 +00:00
Tim Peters	058b141ef7	Py_UniversalNewlineFread(): Many changes. + Continued looping until n bytes in the buffer have been filled, not just when n bytes have been read from the file. This repairs the bug that f.readlines() only sucked up the first 8192 bytes of the file on Windows when universal newlines was enabled and f was opened in U mode (see Python-Dev -- this was the ultimate cause of the test_inspect.py failure). + Changed prototye to take a char* buffer (void* doesn't make much sense). + Squashed size_t vs int mismatches (in particular, besides the unsigned vs signed distinction, size_t may be larger than int). + Gets out under all error conditions now (it's possible for fread() to suffer an error even if it returns a number larger than 0 -- any "short read" is an error or EOF condition). + Rearranged and simplified declarations.	2002-04-21 07:29:14 +00:00
Tim Peters	7e3d961fc1	PyUnicode_EncodeUTF8: squash compiler wng. The difference of two pointers is a signed type. Changing "allocated" to a signed int makes undetected overflow more likely, but there was no overflow detection before either.	2002-04-21 03:26:37 +00:00
Martin v. Löwis	a4eb14b7a4	Patch #495401 : Count number of required bytes for encoding UTF-8 before allocating the target buffer.	2002-04-20 13:44:01 +00:00
Jeremy Hylton	24ea8d3d9c	Fix SF bug #505315 : Make free and cell vars show up consistently in locals(). PyFrame_FastToLocals() and PyFrame_LocalsToFast() had a return if f_nlocals was 0. I think this was a holdover from the pre 2.1 days when regular locals were the only kind of local variables. The change makes it possible to use a free variable in eval or exec if it the variable is also used elsewhere in the same block, which is what the documentation says.	2002-04-20 04:46:55 +00:00
Tim Peters	08d821582f	_PyObject_DebugMallocStats(): Added some potentially expensive internal consistency checks, enabled only in a debug (Py_DEBUG) build. Note that this never gets called automatically unless PYMALLOC_DEBUG is #define'd too, and the envar PYTHONMALLOCSTATS exists.	2002-04-18 22:25:03 +00:00
Tim Peters	64d80c9f40	PyObject_Malloc: make a tiny bit faster for platforms where malloc(0) doesn't return NULL. PyObject_Realloc: better comment for why we don't call PyObject_Malloc(0).	2002-04-18 21:58:56 +00:00
Tim Peters	3e12071dbe	Remove some long-disabled debugging boilerplate.	2002-04-18 21:37:03 +00:00
Tim Peters	2b85897189	type_get_doc(): Squash compiler wng about incompatible ptr types.	2002-04-18 04:12:28 +00:00
Guido van Rossum	6ca7d41c1f	SF bug 542984. Change type_get_doc (the get function for __doc__) to look in tp_dict more often, and if it finds a descriptor in tp_dict, to call it (with a NULL instance). This means you can add a __doc__ descriptor to a new-style class that returns instance docs when called on an instance, and class docs when called on a class -- or the same docs in either case, but lazily computed. I'll also check this into the 2.2 maintenance branch.	2002-04-18 00:22:00 +00:00
Guido van Rossum	e8fc640349	SF bug 544647. PyNumber_InPlaceMultiply insisted on calling sq_inplace_repeat if it existed, even if nb_inplace_multiply also existed and the arguments weren't right for sq_inplace_repeat. Change this to only use sq_inplace_repeat if nb_inplace_multiply isn't defined. Bugfix candidate.	2002-04-16 16:44:51 +00:00
Guido van Rossum	7766091e04	Whitespace normalization and fold some long lines.	2002-04-16 16:32:50 +00:00
Walter Dörwald	0fe940c862	Return the orginal string only if it's a real str or unicode instance, otherwise make a copy.	2002-04-15 18:42:15 +00:00
Guido van Rossum	3aa3fc46c8	Remove 'const' from local variable declaration in string_zfill() -- it isn't constant, so why bother. Folded long lines. Whitespace normalization.	2002-04-15 13:48:52 +00:00
Walter Dörwald	068325ef92	Apply the second version of SF patch http://www.python.org/sf/536241 Add a method zfill to str, unicode and UserString and change Lib/string.py accordingly. This activates the zfill version in unicodeobject.c that was commented out and implements the same in stringobject.c. It also adds the test for unicode support in Lib/string.py back in and uses repr() instead() of str() (as it was before Lib/string.py 1.62)	2002-04-15 13:36:47 +00:00
Guido van Rossum	69cf3c7641	Deprecate % as well. The message for deprecation of //, % and divmod is the same in all three cases (mostly because // calls divmod :-).	2002-04-15 12:39:12 +00:00
Guido van Rossum	9ec4c78a54	SF bug #543387 . Complex numbers implement divmod() and //, neither of which makes one lick of sense. Unfortunately this is documented, so I'm adding a deprecation warning now, so we can delete this silliness, oh, around 2005 or so. Bugfix candidate (At least for 2.2.2, I think.)	2002-04-15 01:41:56 +00:00
Guido van Rossum	b6b8942f53	SF bug #541883 (Vincent Fiack). A stupid bug in object_set_class(): didn't check for value==NULL before checking its type. Bugfix candidate.	2002-04-15 01:03:30 +00:00
Tim Peters	077f27141f	SF bug 543840: complex(string) accepts strings with \0 complex_subtype_from_string(): this stopped parsing at the first 0 byte, as if that were the end of the input string. Bugfix candidate.	2002-04-14 22:04:03 +00:00
Jack Jansen	7b8c7546eb	Mass checkin of universal newline support. Highlights: import and friends will understand any of \r, \n and \r\n as end of line. Python file input will do the same if you use mode 'U'. Everything can be disabled by configuring with --without-universal-newlines. See PEP278 for details.	2002-04-14 20:12:41 +00:00
Guido van Rossum	5961042061	Fold long lines. (Walter, please take note! :-)	2002-04-13 14:06:36 +00:00
Tim Peters	0e871188e8	_PyObject_DebugDumpStats: renamed to _PyObject_DebugMallocStats. Added code to call this when PYMALLOC_DEBUG is enabled, and envar PYTHONMALLOCSTATS is set, whenever a new arena is obtained and once late in the Python shutdown process.	2002-04-13 08:29:14 +00:00
Tim Peters	b7ba743312	SF bug 543148: Memory leak with stackframes + inspect. Put a bound on the number of frameobjects that can live in the frameobject free_list. Am also backporting to 2.2. I don't intend to backport to 2.1 (too much work -- lots of cyclic structures leak there, and the GC API).	2002-04-13 05:21:47 +00:00
Guido van Rossum	018b0eb0f5	Partially implement SF feature request 444708. Add optional arg to string methods strip(), lstrip(), rstrip(). The optional arg specifies characters to delete. Also for UserString. Still to do: - Misc/NEWS - LaTeX docs (I did the docstrings though) - Unicode methods, and Unicode support in the string methods.	2002-04-13 00:56:08 +00:00
Tim Peters	8a8cdfd0f5	Small anal correctness tweaks: _PyObject_DebugMalloc: explicitly cast PyObject_Malloc's result to the target pointer type. _PyObject_DebugDumpStats: change decl of arena_alignment from unsigned int to unsigned long. This is for the 2.3 release only (it's new code).	2002-04-12 20:49:36 +00:00
Guido van Rossum	e027d9818f	Add Raymond Hettinger's d.pop(). See SF patch 539949.	2002-04-12 15:11:59 +00:00
Tim Peters	85cc1c4368	_PyObject_DebugRealloc(): rewritten to let the underlying realloc do most of the work. In particular, if the underlying realloc is able to grow the memory block in place, great (this routine used to do a fresh malloc + memcpy every time a block grew). BTW, I'm not so keen here on avoiding possible quadratic-time realloc patterns as I am on making the debug pymalloc more invisible (the more it uses memory "just like" the underlying allocator, the better the chance that a suspected memory corruption bug won't vanish when the debug malloc is turned on).	2002-04-12 08:52:50 +00:00
Tim Peters	f539c68ccd	_PyObject_DebugDumpAddress(): clarify an output message.	2002-04-12 07:43:07 +00:00
Tim Peters	f6fb501c57	PYMALLOC_{CLEAN, DEAD, FORBIDDEN}BYTE symbols: remove the PYMALLOC_ prefix. These symbols are private to the file, and the PYMALLOC_ gets in the way (overly long code lines, comments, and error messages).	2002-04-12 07:38:53 +00:00
Tim Peters	af3e8de580	First stab at rationalizing the PyMem_ API. Mixing PyObject_xyz with PyMem_{Del, DEL} doesn't work yet (compilation problems). pyport.h: _PyMem_EXTRA is gone. pmem.h: Repaired comments. PyMem_{Malloc, MALLOC} and PyMem_{Realloc, REALLOC} now make the same x-platform guarantees when asking for 0 bytes, and when passing a NULL pointer to the latter. object.c: PyMem_{Malloc, Realloc} just call their macro versions now, since the latter take care of the x-platform 0 and NULL stuff by themselves now. pypcre.c, grow_stack(): So sue me. On two lines, this called PyMem_RESIZE to grow a "const" area. It's not legit to realloc a const area, so the compiler warned given the new expansion of PyMem_RESIZE. It would have gotten the same warning before if it had used PyMem_Resize() instead; the older macro version, but not the function version, silently cast away the constness. IMO that was a wrong thing to do, and the docs say the macro versions of PyMem_xyz are deprecated anyway. If somebody else is resizing const areas with the macro spelling, they'll get a warning when they recompile now too.	2002-04-12 07:22:56 +00:00
Neil Schemenauer	d2560cd37c	Move PyObject_Malloc and PyObject_Free here from object.c. Remove PyMalloc_ prefix and use PyObject_ instead. I'm not sure about the debugging functions. Perhaps they should stay as PyMalloc_.	2002-04-12 03:10:20 +00:00
Neil Schemenauer	bdf0eedb68	Move PyObject_Malloc and PyObject_Free to obmalloc.c.	2002-04-12 03:08:42 +00:00
Neil Schemenauer	58aa861fa2	Remove PyMalloc_*.	2002-04-12 03:07:20 +00:00
Neil Schemenauer	09a2ae5882	Change signature of _PyObject_GC_Malloc to match PyObject_MALLOC. PyObject_Del and PyObject_GC_Del can now be used as a function designators.	2002-04-12 03:06:53 +00:00
Neil Schemenauer	626d774df6	PyObject_GC_Del can now be used as a function designator.	2002-04-12 03:05:52 +00:00
Neil Schemenauer	7465ad2fc9	Remove PyMalloc_New and PyMalloc_Del.	2002-04-12 03:05:37 +00:00
Neil Schemenauer	510492e985	Remove PyMalloc_New, _PyMalloc_MALLOC, and PyMalloc_Del.	2002-04-12 03:05:19 +00:00

... 2 3 4 5 6 ...

1803 Commits