cpython

Commit Graph

Author	SHA1	Message	Date
Raymond Hettinger	48923c5533	Moved special case for tuples from iterobject.c to tupleobject.c. Makes the code in iterobject.c cleaner and speeds-up the general case by not checking for tuples everytime. SF Patch #592065.	2002-08-09 01:30:17 +00:00
Guido van Rossum	7bed213224	Significant speedup in new-style object creation: in slot_tp_new(), intern the string "__new__" so we can call PyObject_GetAttr() rather than PyObject_GetAttrString(). (Though it's a mystery why slot_tp_new is being called when a class doesn't define __new__. I'll look into that tomorrow.) 2.2 backport candidate (but I won't do it).	2002-08-08 21:57:53 +00:00
Guido van Rossum	febd61dc02	A modest speedup of object deallocation. call_finalizer() did rather a lot of work: it had to save and restore the current exception around a call to lookup_maybe(), because that could fail in rare cases, and most objects don't have a __del__ method, so the whole exercise was usually a waste of time. Changed this to cache the __del__ method in the type object just like all other special methods, in a new slot tp_del. So now subtype_dealloc() can test whether tp_del is NULL and skip the whole exercise if it is. The new slot doesn't need a new flag bit: subtype_dealloc() is only called if the type was dynamically allocated by type_new(), so it's guaranteed to have all current slots. Types defined in C cannot fill in tp_del with a function of their own, so there's no corresponding "wrapper". (That functionality is already available through tp_dealloc.)	2002-08-08 20:55:20 +00:00
Tim Peters	6c511e6d1c	Added info about highwater heap-memory use for the sortperf.py tests; + a couple of minor edits elsewhere.	2002-08-08 01:55:16 +00:00
Tim Peters	6063e2615f	PyList_Reverse(): This was leaking a reference to Py_None on every call. I believe I introduced this bug when I refactored the reversal code so that the mergesort could use it too. It's not a problem on the 2.2 branch.	2002-08-08 01:06:39 +00:00
Guido van Rossum	0906e07442	Fix a subtle bug in the trashcan code I added yesterday to subtype_dealloc(). When call_finalizer() failed, it would return without going through the trashcan end macro, thereby unbalancing the trashcan nesting level counter, and thereby defeating the test case (slottrash() in test_descr.py). This in turn meant that the assert in the GC_UNTRACK macro wasn't triggered by the slottrash() test despite a bug in the code: _PyTrash_destroy_chain() calls the dealloc routine with an object that's untracked, and the assert in the GC_UNTRACK macro would fail on this; but because of an earlier test that resurrects an object, causing call_finalizer() to fail and the trashcan nesting level to be unbalanced, so _PyTrash_destroy_chain() was never called. Calling the slottrash() test in isolation did trigger the assert, however. So the fix is twofold: (1) call the GC_UnTrack() function instead of the GC_UNTRACK macro, because the function is safe when the object is already untracked; (2) when call_finalizer() fails, jump to a label that exits through the trashcan end macro, keeping the trashcan nesting balanced.	2002-08-07 20:42:09 +00:00
Martin v. Löwis	3f19b10ca5	Replace abort with Py_FatalError.	2002-08-07 16:21:51 +00:00
Neal Norwitz	657d222700	Make more functions static	2002-08-06 22:12:52 +00:00
Neal Norwitz	d8b995f5e8	Make readahead functions static	2002-08-06 21:50:54 +00:00
Guido van Rossum	22b1387c51	Fix SF bug 574207 (chained __slots__ dealloc segfault). This is inspired by SF patch 581742 (by Jonathan Hogg, who also submitted the bug report, and two other suggested patches), but separates the non-GC case from the GC case to avoid testing for GC several times. Had to fix an assert() from call_finalizer() that asserted that the object wasn't untracked, because it's possible that the object isn't GC'ed!	2002-08-06 21:41:44 +00:00
Barry Warsaw	6a043f3fe8	PyUnicode_Contains(): The memcmp() call didn't take into account the width of Py_UNICODE. Good catch, MAL.	2002-08-06 19:03:17 +00:00
Barry Warsaw	817918cc3c	Committing patch #591250 which provides "str1 in str2" when str1 is a string of longer than 1 character.	2002-08-06 16:58:21 +00:00
Guido van Rossum	7a6e95948c	SF patch 580331 by Oren Tirosh: make file objects their own iterator. For a file f, iter(f) now returns f (unless f is closed), and f.next() is similar to f.readline() when EOF is not reached; however, f.next() uses a readahead buffer that messes up the file position, so mixing f.next() and f.readline() (or other methods) doesn't work right. Calling f.seek() drops the readahead buffer, but other operations don't. The real purpose of this change is to reduce the confusion between objects and their iterators. By making a file its own iterator, it's made clearer that using the iterator modifies the file object's state (in particular the current position). A nice side effect is that this speeds up "for line in f:" by not having to use the xreadlines module. The f.xreadlines() method is still supported for backwards compatibility, though it is the same as iter(f) now. (I made some cosmetic changes to Oren's code, and added a test for "file closed" to file_iternext() and file_iter().)	2002-08-06 15:55:28 +00:00
Raymond Hettinger	bc552ce1b8	SF 582071 clarified the .split() method's docstring to note that sep=None will trigger splitting on any whitespace.	2002-08-05 06:28:21 +00:00
Tim Peters	66860f6da4	Sped the usual case for sorting by calling PyObject_RichCompareBool directly when no comparison function is specified. This saves a layer of function call on every compare then. Measured speedups: i 2*i sort \sort /sort 3sort +sort %sort ~sort =sort !sort 15 32768 12.5% 0.0% 0.0% 100.0% 0.0% 50.0% 100.0% 100.0% -50.0% 16 65536 8.7% 0.0% 0.0% 0.0% 0.0% 0.0% 12.5% 0.0% 0.0% 17 131072 8.0% 25.0% 0.0% 25.0% 0.0% 14.3% 5.9% 0.0% 0.0% 18 262144 6.3% -10.0% 12.5% 11.1% 0.0% 6.3% 5.6% 12.5% 0.0% 19 524288 5.3% 5.9% 0.0% 5.6% 0.0% 5.9% 5.4% 0.0% 2.9% 20 1048576 5.3% 2.9% 2.9% 5.1% 2.8% 1.3% 5.9% 2.9% 4.2% The best indicators are those that take significant time (larger i), and where sort doesn't do very few compares (so sort and ~sort benefit most reliably). The large numbers are due to roundoff noise combined with platform variability; e.g., the 14.3% speedup for %sort at i=17 reflects a printed elapsed time of 0.18 seconds falling to 0.17, but a change in the last digit isn't really meaningful (indeed, if it really took 0.175 seconds, one electron having a lazy nanosecond could shift it to either value <wink>). Similarly the 25% at 3sort i=17 was a meaningless change from 0.05 to 0.04. However, almost all the "meaningless changes" were in the same direction, which is good. The before-and-after times for sort are clearest: before after 0.18 0.16 0.25 0.23 0.54 0.50 1.18 1.11 2.57 2.44 5.58 5.30	2002-08-04 17:47:26 +00:00
Tim Peters	6bdbc9e0b1	SF bug 590366: Small typo in listsort:ParseTuple The PyArg_ParseTuple() error string still said "msort". Changed to "sort".	2002-08-03 02:28:24 +00:00
Guido van Rossum	f4be427c46	Tim found that once test_longexp has run, test_sort takes very much longer to run than normal. A profiler run showed that this was due to PyFrame_New() taking up an unreasonable amount of time. A little thinking showed that this was due to the while loop clearing the space available for the stack. The solution is to only clear the local variables (and cells and free variables), not the space available for the stack, since anything beyond the stack top is considered to be garbage anyway. Also, use memset() instead of a while loop counting backwards. This should be a time savings for normal code too! (By a probably unmeasurable amount. :-)	2002-08-01 18:50:33 +00:00
Guido van Rossum	0dbab4c560	SF patch 588728 (Nathan Srebro). The __delete__ method wrapper for descriptors was not supported (I added a test, too.) 2.2 bugfix candidate.	2002-08-01 14:39:25 +00:00
Tim Peters	a64dc245ac	Replaced samplesort with a stable, adaptive mergesort.	2002-08-01 02:13:36 +00:00
Tim Peters	92f81f2e63	Checking in the doc file for "timsort". There's way too much here to stuff into code comments, and lots of it is going to be useful again (but hard to predict exactly which parts of it ...).	2002-08-01 00:59:42 +00:00
Neal Norwitz	cee5ca060b	SF patch #587889 , fix memory leak of tp_doc	2002-07-30 00:42:06 +00:00
Michael W. Hudson	56796f672f	Fix for [ 587875 ] crash on deleting extended slice The array code got simpler, always a good thing!	2002-07-29 14:35:04 +00:00
Mark Hammond	a290527376	Excise DL_IMPORT/EXPORT from object.h, and related files. This patch also adds 'extern' to PyAPI_DATA rather than at each declaration, as discussed with Tim and Guido.	2002-07-29 13:42:14 +00:00
Neal Norwitz	88fe4ff5a9	Fix the problem of not raising a TypeError exception when doing: '%g' % '1' '%d' % '1' Add a test for these conditions Fix the test so that if not exception is raise, this is a failure	2002-07-28 16:44:23 +00:00
Martin v. Löwis	673c0a2247	Patch #574867 : Correct list.extend docstring.	2002-07-28 16:35:57 +00:00
Neal Norwitz	7beeed5dfd	SF patch #577031 , remove PyArg_Parse() since it's deprecated	2002-07-28 15:19:47 +00:00
Martin v. Löwis	75d2d94e0f	Patch #554716 : Use __va_copy where available.	2002-07-28 10:23:27 +00:00
Skip Montanaro	35b37a5c11	tighten up the unicode object's docstring a tad	2002-07-26 16:22:46 +00:00
Jeremy Hylton	73a088e3fa	Don't be so hasty. If PyInt_AsLong() raises an error, don't set ValueError.	2002-07-25 16:43:29 +00:00
Jeremy Hylton	f20fcf9fed	Complain if __len__() returns < 0, just like classic classes. Fixes SF bug #575773. Bug fix candidate.	2002-07-25 16:06:15 +00:00
Michael W. Hudson	206d8f818f	Silly typo. Not sure how that got in.	2002-07-19 15:52:38 +00:00
Michael W. Hudson	f0d777c56b	A few days ago, Guido said (in the thread "[Python-Dev] Python version of PySlice_GetIndicesEx"): > OK. Michael, if you want to check in indices(), go ahead. Then I did what was needed, but didn't check it in. Here it is.	2002-07-19 15:47:06 +00:00
Tim Peters	330f9e9581	More sort cleanup: Moved the special cases from samplesortslice into listsort. If the former calls itself recursively, they're a waste of time, since it's called on a random permutation of a random subset of elements. OTOH, for exactly the same reason, they're an immeasurably small waste of time (the odds of finding exploitable order in a random permutation are ~= 0, so the special-case loops looking for order give up quickly). The point is more for conceptual clarity. Also changed some "assert comments" into real asserts; when this code was first written, Python.h didn't supply assert.h.	2002-07-19 07:05:44 +00:00
Tim Peters	0fe977c4a9	binarysort() cleanup: Documented the key invariants, explained why they imply this is a stable sort, and added some asserts.	2002-07-19 06:12:32 +00:00
Tim Peters	326b44871e	listreverse(): Don't call the new reverse_slice unless the list has something in it (else ob_item may be a NULL pointer).	2002-07-19 04:04:16 +00:00
Tim Peters	a8c974c157	Cleanup yielding a small speed boost: before rich comparisons were introduced, list.sort() was rewritten to use only the "< or not <?" distinction. After rich comparisons were introduced, docompare() was fiddled to translate a Py_LT Boolean result into the old "-1 for <, 0 for ==, 1 for >" flavor of outcome, and the sorting code was left alone. This left things more obscure than they should be, and turns out it also cost measurable cycles. So: The old CMPERROR novelty is gone. docompare() is renamed to islt(), and now has the same return conditinos as PyObject_RichCompareBool. The SETK macro is renamed to ISLT, and is even weirder than before (don't complain unless you want to maintain the sort code <wink>). Overall, this yields a 1-2% speedup in the usual (no explicit function passed to list.sort()) case when sorting arrays of floats (as sortperf.py does). The boost is higher for arrays of ints.	2002-07-19 03:30:57 +00:00
Tim Peters	3b01a1217f	Trimmed trailing whitespace.	2002-07-19 02:35:45 +00:00
Tim Peters	8e2e7ca330	Cleanup: Define one internal utility for reversing a list slice, and use that everywhere.	2002-07-19 02:33:08 +00:00
Jeremy Hylton	d1fedb6ab5	Remove extraneous semicolon. (Silences compiler warning for Compaq C++ 6.5 on Tru64.)	2002-07-18 18:49:52 +00:00
Jeremy Hylton	938ace69a0	staticforward bites the dust. The staticforward define was needed to support certain broken C compilers (notably SCO ODT 3.0, perhaps early AIX as well) botched the static keyword when it was used with a forward declaration of a static initialized structure. Standard C allows the forward declaration with static, and we've decided to stop catering to broken C compilers. (In fact, we expect that the compilers are all fixed eight years later.) I'm leaving staticforward and statichere defined in object.h as static. This is only for backwards compatibility with C extensions that might still use it. XXX I haven't updated the documentation.	2002-07-17 16:30:39 +00:00
Guido van Rossum	ca5ed5b875	Remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set (fortunately, because using the tp_iternext implementation for the the next() implementation is buggy). Also changed the allocation order in enum_next() so that the underlying iterator is only moved ahead when we have successfully allocated the result tuple and index.	2002-07-16 21:02:42 +00:00
Guido van Rossum	86d593e110	Remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. Also removed the redundant (and expensive!) call to raise StopIteration from rangeiter_next().	2002-07-16 20:47:50 +00:00
Guido van Rossum	2147df748f	Make StopIteration a sink state. This is done by clearing out the di_dict field when the end of the list is reached. Also make the error ("dictionary changed size during iteration") a sticky state. Also remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. That's a good thing, because the implementation given here was buggy (it never raised StopIteration).	2002-07-16 20:30:22 +00:00
Guido van Rossum	613bed3726	Make StopIteration a sink state. This is done by clearing out the object references (it_seq for seqiterobject, it_callable and it_sentinel for calliterobject) when the end of the list is reached. Also remove the next() methods -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. That's a good thing, because the implementation given here was buggy (it never raised StopIteration).	2002-07-16 20:24:46 +00:00
Guido van Rossum	6b6272c857	Whitespace normalization.	2002-07-16 20:10:23 +00:00
Guido van Rossum	86103ae531	Make StopIteration a sink state. This is done by clearing out the it_seq field when the end of the list is reached. Also remove the next() method -- one is supplied automatically by PyType_Ready() because the tp_iternext slot is set. That's a good thing, because the implementation given here was buggy (it never raised StopIteration).	2002-07-16 20:07:32 +00:00
Jeremy Hylton	719841e2fb	The object returned by tp_new() may not have a tp_init. If the object is an ExtensionClass, for example, the slot is not even defined. So we must check that the type has the slot (implied by HAVE_CLASS) before calling tp_init().	2002-07-16 19:39:38 +00:00
Guido van Rossum	5086e49a6e	Make list_iter() really static.	2002-07-16 15:56:52 +00:00
Guido van Rossum	03013a0130	valid_identifier(): use an unsigned char* so that isalpha() will do the right thing even if char is unsigned.	2002-07-16 14:30:28 +00:00
Tim Peters	58cf361e35	docompare(): Another reasonable optimization from Jonathan Hogg for the explicit comparison function case: use PyObject_Call instead of PyEval_CallObject. Same thing in context, but gives a 2.4% overall speedup when sorting a list of ints via list.sort(__builtin__.cmp).	2002-07-15 05:16:13 +00:00
Tim Peters	7a1f91709b	WINDOWS_LEAN_AND_MEAN: There is no such symbol, although a very few MSDN sample programs use it, apparently in error. The correct name is WIN32_LEAN_AND_MEAN. After switching to the correct name, in two cases more was needed because the code actually relied on things that disappear when WIN32_LEAN_AND_MEAN is defined.	2002-07-14 22:14:19 +00:00
Guido van Rossum	b6d29b7856	Undef MIN and MAX before defining them, to avoid warnings on certain platforms.	2002-07-13 14:31:51 +00:00
Jeremy Hylton	a4b4c3bf05	Don't declare a function with staticforward. Just declare it static so that lame (BAD_STATIC_FORWARD) compilers don't see a mismatch between the prototype and the function.	2002-07-13 03:51:17 +00:00
Tim Peters	f2a0473350	docompare(): Use PyTuple_New instead of Py_BuildValue to build compare's arg tuple. This was suggested on c.l.py but afraid I can't find the msg again for proper attribution. For list.sort(cmp) where list is a list of random ints, and cmp is __builtin__.cmp, this yields an overall 50-60% speedup on my Win2K box. Of course this is a best case, because the overhead of calling cmp relative to the cost of actually comparing two ints is at an extreme. Nevertheless it's huge bang for the buck. An additionak 20-30% can be bought by making the arg tuple an immortal static (avoiding all but "the first" PyTuple_New), but that's tricky to make correct since docompare needs to be reentrant. So this picks the cherry and leaves the pits for Fred <wink>. Note that this makes no difference to the list.sort() case; an arg tuple gets built only if the user specifies an explicit sort function.	2002-07-11 21:46:16 +00:00
Jeremy Hylton	df3f793516	Extend function() to support an optional closure argument. Also, simplify some ref counting for other optional arguments.	2002-07-11 18:30:27 +00:00
Tim Peters	3459251d5a	object.h special-build macro minefield: renamed all the new lexical helper macros to something saner, and used them appropriately in other files too, to reduce #ifdef blocks. classobject.c, instance_dealloc(): One of my worst Python Memories is trying to fix this routine a few years ago when COUNT_ALLOCS was defined but Py_TRACE_REFS wasn't. The special-build code here is way too complicated. Now it's much simpler. Difference: in a Py_TRACE_REFS build, the instance is no longer in the doubly-linked list of live objects while its __del__ method is executing, and that may be visible via sys.getobjects() called from a __del__ method. Tough -- the object is presumed dead while its __del__ is executing anyway, and not calling _Py_NewReference() at the start allows enormous code simplification. typeobject.c, call_finalizer(): The special-build instance_dealloc() pain apparently spread to here too via cut-'n-paste, and this is much simpler now too. In addition, I didn't understand why this routine was calling _PyObject_GC_TRACK() after a resurrection, since there's no plausible way _PyObject_GC_UNTRACK() could have been called on the object by this point. I suspect it was left over from pasting the instance_delloc() code. Instead asserted that the object is still tracked. Caution: I suspect we don't have a test that actually exercises the subtype_dealloc() __del__-resurrected-me code.	2002-07-11 06:23:50 +00:00
Tim Peters	889f61dcfb	Documented PYMALLOC_DEBUG. This completes primary coverage of all the "special builds" I ever use. If you use others, document them here, or don't be surprised if I rip out the code for them <0.5 wink>.	2002-07-10 19:29:49 +00:00
Tim Peters	7c321a80f9	The Py_REF_DEBUG/COUNT_ALLOCS/Py_TRACE_REFS macro minefield: added more trivial lexical helper macros so that uses of these guys expand to nothing at all when they're not enabled. This should help sub- standard compilers that can't do a good job of optimizing away the previous "(void)0" expressions. Py_DECREF: There's only one definition of this now. Yay! That was that last one in the family defined multiple times in an #ifdef maze. Py_FatalError(): Changed the char* signature to const char*. _Py_NegativeRefcount(): New helper function for the Py_REF_DEBUG expansion of Py_DECREF. Calling an external function cuts down on the volume of generated code. The previous inline expansion of abort() didn't work as intended on Windows (the program often kept going, and the error msg scrolled off the screen unseen). _Py_NegativeRefcount calls Py_FatalError instead, which captures our best knowledge of how to abort effectively across platforms.	2002-07-09 02:57:01 +00:00
Tim Peters	c6a3ff634a	SF bug 578752: COUNT_ALLOCS vs heap types Repair segfaults and infinite loops in COUNT_ALLOCS builds in the presence of new-style (heap-allocated) classes/types. Bugfix candidate. I'll backport this to 2.2. It's irrelevant in 2.1.	2002-07-08 22:11:52 +00:00
Tim Peters	4be93d0e84	Rearranged and added comments to object.h, to clarify many things that have taken me "too long" to reverse-engineer over the years. Vastly reduced the nesting level and redundancy of #ifdef-ery. Took a light stab at repairing comments that are no longer true. sys_gettotalrefcount(): Changed to enable under Py_REF_DEBUG. It was enabled under Py_TRACE_REFS, which was much heavier than necessary. sys.gettotalrefcount() is now available in a Py_REF_DEBUG-only build.	2002-07-07 19:59:50 +00:00
Tim Peters	a6269a8ec5	Removed 3 unlikely #includes that were only needed for the non-gc flavor of the trashcan code.	2002-07-07 16:52:50 +00:00
Tim Peters	803526b9e2	Trashcan cleanup: Now that cyclic gc is always there, the trashcan mechanism is no longer evil: it no longer plays dangerous games with the type pointer or refcounts, and objects in extension modules can play along too without needing to edit the core first. Rewrote all the comments to explain this, and (I hope) give clear guidance to extension authors who do want to play along. Documented all the functions. Added more asserts (it may no longer be evil, but it's still dangerous <0.9 wink>). Rearranged the generated code to make it clearer, and to tolerate either the presence or absence of a semicolon after the macros. Rewrote _PyTrash_destroy_chain() to call tp_dealloc directly; it was doing a Py_DECREF again, and that has all sorts of obscure distorting effects in non-release builds (Py_DECREF was already called on the object!). Removed Christian's little "embedded change log" comments -- that's what checkin messages are for, and since it was impossible to correlate the comments with the code that changed, I found them merely distracting.	2002-07-07 05:13:56 +00:00
Tim Peters	943382c8e5	Removed WITH_CYCLE_GC #ifdef-ery. Holes: + I'm not sure what to do about configure.in. Left it alone. + Ditto pyexpat.c. Fred or Martin will know what to do.	2002-07-07 03:59:34 +00:00
Martin v. Löwis	6238d2b024	Patch #569753 : Remove support for WIN16. Rename all occurrences of MS_WIN32 to MS_WINDOWS.	2002-06-30 15:26:10 +00:00
Raymond Hettinger	5a04aec384	Fix SF bug 546434 -- buffer slice type inconsistent.	2002-06-25 00:25:30 +00:00
Raymond Hettinger	ab5dae35ca	Fix SF bug 572567: Memory leak in object comparison.	2002-06-24 13:08:16 +00:00
Jeremy Hylton	8b47dffc93	Fix for SF bug 571885 When resizing a tuple, zero out the memory starting at the end of the old tuple not at the beginning of the old tuple.	2002-06-20 23:13:17 +00:00
Raymond Hettinger	0ae0c07661	SF 569257 -- Name mangle double underscored variable names in __slots__.	2002-06-20 22:23:15 +00:00
Michael W. Hudson	9c14badc5f	Fix the bug described in http://mail.python.org/pipermail/python-dev/2002-June/025461.html with test cases. Also includes extended slice support for arrays, which I thought I'd already checked in but obviously not.	2002-06-19 15:44:15 +00:00
Guido van Rossum	63517577fd	Patch from SF bug 570483 (Tim Northover). In a fresh interpreter, type.mro(tuple) would segfault, because PyType_Ready() isn't called for tuple yet. To fix, call PyType_Ready(type) if type->tp_dict is NULL.	2002-06-18 16:44:57 +00:00
Michael W. Hudson	b1e8154013	About the new but unreferenced new_class, Guido sez: > Looks like an experiment by Oren Tirosh that didn't get nuked. I > think you can safely lose it. It's gone.	2002-06-18 12:38:06 +00:00
Guido van Rossum	bea18ccde6	SF patch 568629 by Oren Tirosh: types made callable. These built-in functions are replaced by their (now callable) type: slice() buffer() and these types can also be called (but have no built-in named function named after them) classobj (type name used to be "class") code function instance instancemethod (type name used to be "instance method") The module "new" has been replaced with a small backward compatibility placeholder in Python. A large portion of the patch simply removes the new module from various platform-specific build recipes. The following binary Mac project files still have references to it: Mac/Build/PythonCore.mcp Mac/Build/PythonStandSmall.mcp Mac/Build/PythonStandalone.mcp [I've tweaked the code layout and the doc strings here and there, and added a comment to types.py about StringTypes vs. basestring. --Guido]	2002-06-14 20:41:17 +00:00
Guido van Rossum	59e6c53920	Inexplicably, recurse_down_subclasses() was comparing the object gotten from a weak reference to NULL instead of to None. This caused the following assert() to fail (but only in 2.2 in the debug build -- I have to find a better test case). Will backport.	2002-06-14 02:27:07 +00:00
Neal Norwitz	2c2e827029	Missed one use of new PyDoc_STRVAR macro	2002-06-14 02:04:18 +00:00
Neal Norwitz	1f68fc7fa5	SF bug # 493951 string.{starts,ends}with vs slices Handle negative indices similar to slices.	2002-06-14 00:50:42 +00:00
Neal Norwitz	4178515035	SF # 533070 Silence AIX C Compiler Warnings Warning caused by using &func. & is not necessary.	2002-06-13 21:42:51 +00:00
Guido van Rossum	e7b8ecf196	Major cleanup operation: whenever there's a call that looks for an optional attribute, only clear the exception when the internal getattr operation raised AttributeError. Many places in this file already had that policy; but just as many didn't, and there didn't seem to be any rhyme or reason to it. Be consistently cautious. Question: should I backport this? On the one hand it's a bugfix. On the other hand it's a change in behavior. Certain forms of buggy or just weird code would work in the past but raise an exception under the new rules; e.g. if you define a __getattr__ method that raises a non-AttributeError exception.	2002-06-13 21:42:04 +00:00
Guido van Rossum	16b93b3d0e	Fix for SF bug 532646. This is a little simpler than what Neal suggested there, based upon a better analysis (__getattr__ is a red herring). Will backport to 2.2.	2002-06-13 21:32:51 +00:00
Neal Norwitz	51290d369d	SF # 561244 Micro optimizations Cleanup code a bit and return as early as possible.	2002-06-13 21:32:44 +00:00
Neal Norwitz	20e72130c4	Fix typo in exception message	2002-06-13 21:25:17 +00:00
Neal Norwitz	35fc7606f0	SF #561244 Micro optimizations Convert loops to memset()s.	2002-06-13 21:11:11 +00:00
Martin v. Löwis	14f8b4cfcb	Patch #568124 : Add doc string macros.	2002-06-13 20:33:02 +00:00
Guido van Rossum	09638c16d8	Hopefully this addresses the remaining issues of SF bugs 459235 and 473985. Through a subtle rearrangement of some members in the etype struct (!), mapping methods are now preferred over sequence methods, which is necessary to support str.__getitem__("hello", slice(4)) etc.	2002-06-13 19:17:46 +00:00
Guido van Rossum	65ce6de35c	Rearrange the #ifndef WITHOUT_COMPLEX so it can be picked up from pyconfig.h.	2002-06-13 17:07:07 +00:00
Michael W. Hudson	589dc93620	Fix for problem reported by Neal Norwitz. Tighten up calculation of slicelength. Include his test case.	2002-06-11 13:38:42 +00:00
Guido van Rossum	75a20b19ef	Fold remaining long lines.	2002-06-11 12:22:28 +00:00
Michael W. Hudson	5efaf7eac8	This is my nearly two year old patch [ 400998 ] experimental support for extended slicing on lists somewhat spruced up and better tested than it was when I wrote it. Includes docs & tests. The whatsnew section needs expanding, and arrays should support extended slices -- later.	2002-06-11 10:55:12 +00:00
Guido van Rossum	cab05807fc	Undo the last chunk of the previous patch, putting back a useful assert into PyType_Ready(): now that we're not clearing tp_dict, we can assert that it's non-NULL again.	2002-06-10 15:29:03 +00:00
Guido van Rossum	a386209754	In the recent python-dev thread "Bizarre new test failure", we discovered that subtype_traverse must traverse the type if it is a heap type, because otherwise some cycles involving a type and its instance would not be collected. Simplest example: while 1: class C(object): pass C.ref = C() This program grows without bounds before this fix. (It grows ever slower since it spends ever more time in the collector.) Simply adding the right visit() call to subtype_traverse() revealed other problems. With MvL's help we re-learned that type_clear() doesn't have to clear all references, only the ones that may not be cleared by other means. Careful analysis (see comments in the code) revealed that only tp_mro needs to be cleared. (The previous checkin to this file adds a test for tp_mro==NULL to _PyType_Lookup() that's essential to prevent crashes due to tp_mro being NULL when subtype_dealloc() tries to look for a __del__ method.) The same kind of analysis also revealed that subtype_clear() doesn't need to clear the instance dict. With this fix, a useful property of the collector is once again guaranteed: a single gc.collect() call will clear out all garbage. (It didn't always before, which put us on the track of this bug.) Will backport to 2.2.	2002-06-10 15:24:42 +00:00
Guido van Rossum	2309498595	Three's a charm: yet another fix for SF bug 551412. Thinking again about the test case, slot_nb_power gets called on behalf of its second argument, but with a non-None modulus it wouldn't check this, and believes it is called on behalf of its first argument. Fix this properly, and get rid of the code in _PyType_Lookup() that tries to call _PyType_Ready(). But do leave a check for a NULL tp_mro there, because this can still legitimately occur. I'll fix this in 2.2.x too.	2002-06-10 14:30:43 +00:00
Raymond Hettinger	478d47a168	Close SF bug 563740. complex() now finds __complex__() in new style classes. Made conversion failure error messages consistent between types. Added related unittests.	2002-06-06 15:45:38 +00:00
Neal Norwitz	56f46f8d8c	Pyrangeiter_Type && range_iter should be static	2002-06-06 14:58:21 +00:00
Raymond Hettinger	c4c453f5ae	Skip Montanaro's patch, SF 559833, exposing xrange type in builtins. Also, added more regression tests to cover the new type and test its conformity with range().	2002-06-05 23:12:45 +00:00
Raymond Hettinger	48165d40cb	SF 564601 adding rangeiterobject to make xrange() iterate like range().	2002-06-05 20:08:48 +00:00
Thomas Heller	aee2d5f975	Better isinstance error message. Closes SF patch # 560250. Bugfix candidate IMO.	2002-06-05 12:55:19 +00:00
Guido van Rossum	9923ffe2c0	Address SF bug 519621: slots weren't traversed by GC. While I was at it, I added a tp_clear handler and changed the tp_dealloc handler to use the clear_slots helper for the tp_clear handler. Also tightened the rules for slot names: they must now be proper identifiers (ignoring the dirty little fact that <ctype.h> is locale sensitive). Also set mp->flags = READONLY for the __weakref__ pseudo-slot. Most of this is a 2.2 bugfix candidate; I'll apply it there myself.	2002-06-04 19:52:53 +00:00
Raymond Hettinger	e53e7a2c7d	Inverted test for small speedup	2002-06-04 18:45:50 +00:00
Guido van Rossum	ed3e09f289	Add a docstring to the module type.	2002-06-04 06:02:35 +00:00
Guido van Rossum	c3a787e090	Surprising fix for SF bug 563060: module can be used as base class. Change the module constructor (module_init) to have the signature __init__(name:str, doc=None); this prevents the call from type_new() to succeed. While we're at it, prevent repeated calling of module_init for the same module from leaking the dict, changing the semantics so that __dict__ is only initialized if NULL. Also adding a unittest, test_module.py. This is an incompatibility with 2.2, if anybody was instantiating the module class before, their argument list was probably empty; so this can't be backported to 2.2.x.	2002-06-04 05:52:47 +00:00
Guido van Rossum	b65c65b301	Address the residual issue with the fix for SF 551412 in _PyType_Lookup(). Decided to clear the error condition in the unfortunate but unlikely case that PyType_Ready() fails. Will fix in 2.2.x too.	2002-06-03 19:52:41 +00:00
Tim Peters	93b2cc4e97	A bogus assert in the new listiter code prevented starting Python in a debug build. Repaired that, and rewrote other parts to reduce long-winded casting.	2002-06-01 05:22:55 +00:00
Raymond Hettinger	14bd6de0ec	SF 560736. Optimize list iteration by filling the tp_iter slot.	2002-05-31 21:40:38 +00:00
Neal Norwitz	3f8dae73c7	Fix typo	2002-05-31 20:23:33 +00:00
Guido van Rossum	a407300cd7	Implement the intention of SF patch 472523 (but coded differently). In the past, an object's tp_compare could return any value. In 2.2 the docs were tightened to require it to return -1, 0 or 1; and -1 for an error. We now issue a warning if the value is not in this range. When an exception is raised, we allow -1 or -2 as return value, since -2 will the recommended return value for errors in the future. (Eventually tp_compare will also be allowed to return +2, to indicate NotImplemented; but that can only be implemented once we know all extensions return a value in [-2...1]. Or perhaps it will require the type to set a flag bit.) I haven't decided yet whether to backport this to 2.2.x. The patch applies fine. But is it fair to start warning in 2.2.2 about code that worked flawlessly in 2.2.1?	2002-05-31 20:03:54 +00:00
Neal Norwitz	32a7e7f6b6	Change name from string to basestring	2002-05-31 19:58:02 +00:00
Marc-André Lemburg	4164439240	Fix a possible segfault. Found be Neal Norvitz.	2002-05-29 13:46:29 +00:00
Marc-André Lemburg	4da6fd63bc	Fix for bug [ 561796 ] string.find causes lazy error	2002-05-29 11:33:13 +00:00
Guido van Rossum	9fc8a29663	Fix for SF bug 551412. When _PyType_Lookup() is called on a type whose tp_mro hasn't been initialized, it would dump core. Fix this by checking for NULL and calling PyType_Ready(). Will fix this in 2.2.1 too.	2002-05-24 21:40:08 +00:00
Guido van Rossum	cacfc07d08	- A new type object, 'string', is added. This is a common base type for 'str' and 'unicode', and can be used instead of types.StringTypes, e.g. to test whether something is "a string": isinstance(x, string) is True for Unicode and 8-bit strings. This is an abstract base class and cannot be instantiated directly.	2002-05-24 19:01:59 +00:00
Guido van Rossum	9ee4b94f51	Add a safeguard against setting the class to something with a different free or alloc slot.	2002-05-24 18:47:47 +00:00
Tim Peters	bf9cb3ff1a	Use function instead of macro spellings for PyObject_memorystuff.	2002-05-23 15:51:20 +00:00
Neal Norwitz	d4e5be5340	Closes: #556025 seg fault when doing list(xrange(1e9)) A MemoryError is now raised when the list cannot be created. There is a test, but as the comment says, it really only works for 32 bit systems. I don't know how to improve the test for other systems (ie, 64 bit or systems where the data size != addressable size, e.g. 64 bit data, but 48 bit addressable memory)	2002-05-22 23:19:17 +00:00
Barry Warsaw	4be55b5cef	file_doc: Add some description of the U mode character, but only when WITH_UNIVERSAL_NEWLINES is enabled.	2002-05-22 20:37:53 +00:00
Raymond Hettinger	0ebac97058	Patch 549187. Improve string formatting error message.	2002-05-21 15:14:57 +00:00
Guido van Rossum	4402241450	Jim Fulton reported a segfault in dir(). A heavily proxied object returned a proxy for __class__ whose __bases__ was also a proxy. The merge_class_dict() helper for dir() assumed incorrectly that __bases__ would always be a tuple and used the in-line tuple API on the proxy. I will backport this to 2.2 as well.	2002-05-13 18:29:46 +00:00
Walter Dörwald	775c11f07a	Add #ifdef PY_USING_UNICODE sections, so that stringobject.c compiles again with --disable-unicode. Fixes SF bug http://www.python.org/sf/554912	2002-05-13 09:00:41 +00:00
Martin v. Löwis	e452659237	Patch #551410 : Implement tp_getiter.	2002-05-08 08:49:27 +00:00
Martin v. Löwis	01f94bda38	Patch #552433 : Special-case tuples. Avoid sub-type checking for lists. Avoid checks for negative indices and duplicate checks for support of the sequence protocol.	2002-05-08 08:44:21 +00:00
Tim Peters	4ce71f77c3	PyObject_Realloc(): If a small block is shrinking, bite the expense of copying it if at least 25% of the input block can be reclaimed.	2002-05-02 20:19:34 +00:00
Fred Drake	d9018323c0	Remove old deprecated features from the xrange object.	2002-05-02 19:56:55 +00:00
Fred Drake	edb51bb7e8	Fix attribute access for the xrange objects. The tp_getattr and tp_getattro handlers were both set, but were not compatible. This change uses only the tp_getattro handler with a more "modern" approach. This fixes SF bug #551285.	2002-05-02 16:05:27 +00:00
Skip Montanaro	71390a9a94	clarify message when raising TypeError to indicate that float() accepts strings or numbers	2002-05-02 13:03:22 +00:00
Tim Peters	29c0afcfec	Just added comments, and cleared some XXX questions, related to int memory management.	2002-04-28 16:57:34 +00:00
Tim Peters	449b5a8da1	_PyObject_DebugCheckAddress(): If the leading pad bytes are corrupt, display a msg warning that the count of bytes requested may be bogus, and that a segfault may happen next.	2002-04-28 06:14:45 +00:00
Tim Peters	5de9842b34	Repair widespread misuse of _PyString_Resize. Since it's clear people don't understand how this function works, also beefed up the docs. The most common usage error is of this form (often spread out across gotos): if (_PyString_Resize(&s, n) < 0) { Py_DECREF(s); s = NULL; goto outtahere; } The error is that if _PyString_Resize runs out of memory, it automatically decrefs the input string object s (which also deallocates it, since its refcount must be 1 upon entry), and sets s to NULL. So if the "if" branch ever triggers, it's an error to call Py_DECREF(s): s is already NULL! A correct way to write the above is the simpler (and intended) if (_PyString_Resize(&s, n) < 0) goto outtahere; Bugfix candidate.	2002-04-27 18:44:32 +00:00
Tim Peters	602f740bc2	SF patch 549375: Compromise PyUnicode_EncodeUTF8 This implements ideas from Marc-Andre, Martin, Guido and me on Python-Dev. "Short" Unicode strings are encoded into a "big enough" stack buffer, then exactly as much string space as they turn out to need is allocated at the end. This should have speed benefits akin to Martin's "measure once, allocate once" strategy, but without needing a distinct measuring pass. "Long" Unicode strings allocate as much heap space as they could possibly need (4 x # Unicode chars), and do a realloc at the end to return the untouched excess. Since the overallocation is likely to be substantial, this shouldn't burden the platform realloc with unusably small excess blocks. Also simplified uses of the PyString_xyz functions. Also added a release- build check that 4*size doesn't overflow a C int. Sooner or later, that's going to happen.	2002-04-27 18:03:26 +00:00
Guido van Rossum	7dab2426ca	- New builtin function enumerate(x), from PEP 279. Example: enumerate("abc") is an iterator returning (0,"a"), (1,"b"), (2,"c"). The argument can be an arbitrary iterable object.	2002-04-26 19:40:56 +00:00
Guido van Rossum	517c7d4fd3	PyNumber_CoerceEx: this took a shortcut (not doing anything) when the left and right type were of the same type and not classic instances. This shortcut is dangerous for proxy types, because it means that coerce(Proxy(1), Proxy(2.1)) leaves Proxy(1) unchanged rather than turning it into Proxy(1.0). In an ever-so-slight change of semantics, I now only take the shortcut when the left and right types are of the same type and don't have the CHECKTYPES feature. It so happens that classic instances have this flag, so the shortcut is still skipped in this case (i.e. nothing changes for classic instances). Proxies also have this flag set (otherwise implementing numeric operations on proxies would become nightmarish) and this means that the shortcut is also skipped there, as desired. It so happens that int, long and float also have this flag set; that means that e.g. coerce(1, 1) will now invoke int_coerce(). This is fine: int_coerce() can deal with this, and I'm not worried about the performance; int_coerce() is only invoked when the user explicitly calls coerce(), which should be rarer than rare.	2002-04-26 02:49:14 +00:00
Guido van Rossum	9364698101	Make sure that tp_free frees the int the same way as tp_dealloc would. This fixes the problem that Barry reported on python-dev: >>> 23000 .__class__ = bool crashes in the deallocator. This was because int inherited tp_free from object, which uses the default allocator. 2.2. Bugfix candidate.	2002-04-26 00:53:34 +00:00
Guido van Rossum	c95cc87447	Clean up the layout of the bool_as_number struct initializer.	2002-04-25 20:01:10 +00:00
Barry Warsaw	f16951cffe	abstract_get_bases(): Clarify exactly what the return values and states can be for this function, and ensure that only AttributeErrors are masked. Any other exception raised via the equivalent of getattr(cls, '__bases__') should be propagated up. abstract_issubclass(): If abstract_get_bases() returns NULL, we must call PyErr_Occurred() to see if an exception is being propagated, and return -1 or 0 as appropriate. This is the specific fix for a problem whereby if getattr(derived, '__bases__') raised an exception, an "undetected error" would occur (under a debug build). This nasty situation was uncovered when writing a security proxy extension type for the Zope3 project, where the security proxy raised a Forbidden exception on getattr of __bases__. PyObject_IsInstance(), PyObject_IsSubclass(): After both calls to abstract_get_bases(), where we're setting the TypeError if the return value is NULL, we must first check to see if an exception occurred, and /not/ mask an existing exception. Neil Schemenauer should double check that these changes don't break his ExtensionClass examples (there aren't any test cases for those examples and abstract_get_bases() was added by him in response to problems with ExtensionClass). Neil, please add test cases if possible! I belive this is a bug fix candidate for Python 2.2.2.	2002-04-23 22:45:44 +00:00
Jeremy Hylton	c4ad0bcbe5	Clarify return value of PyLong_AsLongLong(). The function is documented to return -1 on error. If res was < 0, it returned res. It wasn't clear that the invariant was res < 0 iff res == -1.	2002-04-23 20:01:20 +00:00
Tim Peters	030a5cebf4	unicode_memchr(): Squashed gratuitous int-vs-size_t mismatch (which gives a compiler wng under MSVC because of the resulting signed-vs- unsigned comparison).	2002-04-22 19:00:10 +00:00
Walter Dörwald	de02bcb265	Apply patch diff.txt from SF feature request http://www.python.org/sf/444708 This adds the optional argument for str.strip to unicode.strip too and makes it possible to call str.strip with a unicode argument and unicode.strip with a str argument.	2002-04-22 17:42:37 +00:00
Tim Peters	e1682a80fa	Py_UniversalNewlineFread(): small speed boost on non-Windows boxes.	2002-04-21 18:15:20 +00:00
Tim Peters	0eca65c4c5	PyUnicode_EncodeUTF8(): tightened the memory asserts a bit, and at least tried to catch some possible arithmetic overflows in the debug build.	2002-04-21 17:28:06 +00:00
Martin v. Löwis	2a7ff35a07	Back out 2.140.	2002-04-21 09:59:45 +00:00
Tim Peters	058b141ef7	Py_UniversalNewlineFread(): Many changes. + Continued looping until n bytes in the buffer have been filled, not just when n bytes have been read from the file. This repairs the bug that f.readlines() only sucked up the first 8192 bytes of the file on Windows when universal newlines was enabled and f was opened in U mode (see Python-Dev -- this was the ultimate cause of the test_inspect.py failure). + Changed prototye to take a char* buffer (void* doesn't make much sense). + Squashed size_t vs int mismatches (in particular, besides the unsigned vs signed distinction, size_t may be larger than int). + Gets out under all error conditions now (it's possible for fread() to suffer an error even if it returns a number larger than 0 -- any "short read" is an error or EOF condition). + Rearranged and simplified declarations.	2002-04-21 07:29:14 +00:00
Tim Peters	7e3d961fc1	PyUnicode_EncodeUTF8: squash compiler wng. The difference of two pointers is a signed type. Changing "allocated" to a signed int makes undetected overflow more likely, but there was no overflow detection before either.	2002-04-21 03:26:37 +00:00
Martin v. Löwis	a4eb14b7a4	Patch #495401 : Count number of required bytes for encoding UTF-8 before allocating the target buffer.	2002-04-20 13:44:01 +00:00
Jeremy Hylton	24ea8d3d9c	Fix SF bug #505315 : Make free and cell vars show up consistently in locals(). PyFrame_FastToLocals() and PyFrame_LocalsToFast() had a return if f_nlocals was 0. I think this was a holdover from the pre 2.1 days when regular locals were the only kind of local variables. The change makes it possible to use a free variable in eval or exec if it the variable is also used elsewhere in the same block, which is what the documentation says.	2002-04-20 04:46:55 +00:00
Tim Peters	08d821582f	_PyObject_DebugMallocStats(): Added some potentially expensive internal consistency checks, enabled only in a debug (Py_DEBUG) build. Note that this never gets called automatically unless PYMALLOC_DEBUG is #define'd too, and the envar PYTHONMALLOCSTATS exists.	2002-04-18 22:25:03 +00:00
Tim Peters	64d80c9f40	PyObject_Malloc: make a tiny bit faster for platforms where malloc(0) doesn't return NULL. PyObject_Realloc: better comment for why we don't call PyObject_Malloc(0).	2002-04-18 21:58:56 +00:00
Tim Peters	3e12071dbe	Remove some long-disabled debugging boilerplate.	2002-04-18 21:37:03 +00:00
Tim Peters	2b85897189	type_get_doc(): Squash compiler wng about incompatible ptr types.	2002-04-18 04:12:28 +00:00
Guido van Rossum	6ca7d41c1f	SF bug 542984. Change type_get_doc (the get function for __doc__) to look in tp_dict more often, and if it finds a descriptor in tp_dict, to call it (with a NULL instance). This means you can add a __doc__ descriptor to a new-style class that returns instance docs when called on an instance, and class docs when called on a class -- or the same docs in either case, but lazily computed. I'll also check this into the 2.2 maintenance branch.	2002-04-18 00:22:00 +00:00
Guido van Rossum	e8fc640349	SF bug 544647. PyNumber_InPlaceMultiply insisted on calling sq_inplace_repeat if it existed, even if nb_inplace_multiply also existed and the arguments weren't right for sq_inplace_repeat. Change this to only use sq_inplace_repeat if nb_inplace_multiply isn't defined. Bugfix candidate.	2002-04-16 16:44:51 +00:00
Guido van Rossum	7766091e04	Whitespace normalization and fold some long lines.	2002-04-16 16:32:50 +00:00
Walter Dörwald	0fe940c862	Return the orginal string only if it's a real str or unicode instance, otherwise make a copy.	2002-04-15 18:42:15 +00:00
Guido van Rossum	3aa3fc46c8	Remove 'const' from local variable declaration in string_zfill() -- it isn't constant, so why bother. Folded long lines. Whitespace normalization.	2002-04-15 13:48:52 +00:00
Walter Dörwald	068325ef92	Apply the second version of SF patch http://www.python.org/sf/536241 Add a method zfill to str, unicode and UserString and change Lib/string.py accordingly. This activates the zfill version in unicodeobject.c that was commented out and implements the same in stringobject.c. It also adds the test for unicode support in Lib/string.py back in and uses repr() instead() of str() (as it was before Lib/string.py 1.62)	2002-04-15 13:36:47 +00:00
Guido van Rossum	69cf3c7641	Deprecate % as well. The message for deprecation of //, % and divmod is the same in all three cases (mostly because // calls divmod :-).	2002-04-15 12:39:12 +00:00
Guido van Rossum	9ec4c78a54	SF bug #543387 . Complex numbers implement divmod() and //, neither of which makes one lick of sense. Unfortunately this is documented, so I'm adding a deprecation warning now, so we can delete this silliness, oh, around 2005 or so. Bugfix candidate (At least for 2.2.2, I think.)	2002-04-15 01:41:56 +00:00
Guido van Rossum	b6b8942f53	SF bug #541883 (Vincent Fiack). A stupid bug in object_set_class(): didn't check for value==NULL before checking its type. Bugfix candidate.	2002-04-15 01:03:30 +00:00
Tim Peters	077f27141f	SF bug 543840: complex(string) accepts strings with \0 complex_subtype_from_string(): this stopped parsing at the first 0 byte, as if that were the end of the input string. Bugfix candidate.	2002-04-14 22:04:03 +00:00
Jack Jansen	7b8c7546eb	Mass checkin of universal newline support. Highlights: import and friends will understand any of \r, \n and \r\n as end of line. Python file input will do the same if you use mode 'U'. Everything can be disabled by configuring with --without-universal-newlines. See PEP278 for details.	2002-04-14 20:12:41 +00:00
Guido van Rossum	5961042061	Fold long lines. (Walter, please take note! :-)	2002-04-13 14:06:36 +00:00
Tim Peters	0e871188e8	_PyObject_DebugDumpStats: renamed to _PyObject_DebugMallocStats. Added code to call this when PYMALLOC_DEBUG is enabled, and envar PYTHONMALLOCSTATS is set, whenever a new arena is obtained and once late in the Python shutdown process.	2002-04-13 08:29:14 +00:00
Tim Peters	b7ba743312	SF bug 543148: Memory leak with stackframes + inspect. Put a bound on the number of frameobjects that can live in the frameobject free_list. Am also backporting to 2.2. I don't intend to backport to 2.1 (too much work -- lots of cyclic structures leak there, and the GC API).	2002-04-13 05:21:47 +00:00
Guido van Rossum	018b0eb0f5	Partially implement SF feature request 444708. Add optional arg to string methods strip(), lstrip(), rstrip(). The optional arg specifies characters to delete. Also for UserString. Still to do: - Misc/NEWS - LaTeX docs (I did the docstrings though) - Unicode methods, and Unicode support in the string methods.	2002-04-13 00:56:08 +00:00
Tim Peters	8a8cdfd0f5	Small anal correctness tweaks: _PyObject_DebugMalloc: explicitly cast PyObject_Malloc's result to the target pointer type. _PyObject_DebugDumpStats: change decl of arena_alignment from unsigned int to unsigned long. This is for the 2.3 release only (it's new code).	2002-04-12 20:49:36 +00:00
Guido van Rossum	e027d9818f	Add Raymond Hettinger's d.pop(). See SF patch 539949.	2002-04-12 15:11:59 +00:00
Tim Peters	85cc1c4368	_PyObject_DebugRealloc(): rewritten to let the underlying realloc do most of the work. In particular, if the underlying realloc is able to grow the memory block in place, great (this routine used to do a fresh malloc + memcpy every time a block grew). BTW, I'm not so keen here on avoiding possible quadratic-time realloc patterns as I am on making the debug pymalloc more invisible (the more it uses memory "just like" the underlying allocator, the better the chance that a suspected memory corruption bug won't vanish when the debug malloc is turned on).	2002-04-12 08:52:50 +00:00
Tim Peters	f539c68ccd	_PyObject_DebugDumpAddress(): clarify an output message.	2002-04-12 07:43:07 +00:00
Tim Peters	f6fb501c57	PYMALLOC_{CLEAN, DEAD, FORBIDDEN}BYTE symbols: remove the PYMALLOC_ prefix. These symbols are private to the file, and the PYMALLOC_ gets in the way (overly long code lines, comments, and error messages).	2002-04-12 07:38:53 +00:00
Tim Peters	af3e8de580	First stab at rationalizing the PyMem_ API. Mixing PyObject_xyz with PyMem_{Del, DEL} doesn't work yet (compilation problems). pyport.h: _PyMem_EXTRA is gone. pmem.h: Repaired comments. PyMem_{Malloc, MALLOC} and PyMem_{Realloc, REALLOC} now make the same x-platform guarantees when asking for 0 bytes, and when passing a NULL pointer to the latter. object.c: PyMem_{Malloc, Realloc} just call their macro versions now, since the latter take care of the x-platform 0 and NULL stuff by themselves now. pypcre.c, grow_stack(): So sue me. On two lines, this called PyMem_RESIZE to grow a "const" area. It's not legit to realloc a const area, so the compiler warned given the new expansion of PyMem_RESIZE. It would have gotten the same warning before if it had used PyMem_Resize() instead; the older macro version, but not the function version, silently cast away the constness. IMO that was a wrong thing to do, and the docs say the macro versions of PyMem_xyz are deprecated anyway. If somebody else is resizing const areas with the macro spelling, they'll get a warning when they recompile now too.	2002-04-12 07:22:56 +00:00
Neil Schemenauer	d2560cd37c	Move PyObject_Malloc and PyObject_Free here from object.c. Remove PyMalloc_ prefix and use PyObject_ instead. I'm not sure about the debugging functions. Perhaps they should stay as PyMalloc_.	2002-04-12 03:10:20 +00:00
Neil Schemenauer	bdf0eedb68	Move PyObject_Malloc and PyObject_Free to obmalloc.c.	2002-04-12 03:08:42 +00:00
Neil Schemenauer	58aa861fa2	Remove PyMalloc_*.	2002-04-12 03:07:20 +00:00
Neil Schemenauer	09a2ae5882	Change signature of _PyObject_GC_Malloc to match PyObject_MALLOC. PyObject_Del and PyObject_GC_Del can now be used as a function designators.	2002-04-12 03:06:53 +00:00
Neil Schemenauer	626d774df6	PyObject_GC_Del can now be used as a function designator.	2002-04-12 03:05:52 +00:00
Neil Schemenauer	7465ad2fc9	Remove PyMalloc_New and PyMalloc_Del.	2002-04-12 03:05:37 +00:00
Neil Schemenauer	510492e985	Remove PyMalloc_New, _PyMalloc_MALLOC, and PyMalloc_Del.	2002-04-12 03:05:19 +00:00
Neil Schemenauer	9acae5a0a6	Remove PyMalloc_New and PyMalloc_Del.	2002-04-12 02:44:55 +00:00
Neil Schemenauer	99b5d28467	PyObject_GC_Del can now be used as a function designator.	2002-04-12 02:44:22 +00:00
Neil Schemenauer	aa769ae468	PyObject_Del can now be used as a function designator.	2002-04-12 02:44:10 +00:00
Neil Schemenauer	6189b89cc5	PyObject_GC_Del and PyObject_Del can now be used as a function designators. Remove PyMalloc_New.	2002-04-12 02:43:00 +00:00
Tim Peters	52aefc8a7b	SF bug 542181: Realloc behavior The bug report pointed out a bogosity in the comment block explaining thread safety for arena management. Repaired that comment, repaired a couple others while I was at it, and added an assert. _PyMalloc_DebugRealloc: If this needed to get more memory, but couldn't, it erroneously freed the original memory. Repaired that. This is for 2.3 only (unless we decide to backport the new pymalloc).	2002-04-11 06:36:45 +00:00
Marc-André Lemburg	68e69338ae	Bug fix for UTF-8 encoding bug (buffer overrun) #541828 .	2002-04-10 20:36:13 +00:00
Marc-André Lemburg	ce0b664af2	Added test case for UTF-8 encoding bug #541828 .	2002-04-10 17:18:02 +00:00
Tim Peters	2ea9111cf1	SF bug 538827: Python open w/ MSVC6: bad error msgs. open_the_file: Some (not all) flavors of Windows set errno to EINVAL when passed a syntactically invalid filename. Python turned that into an incomprehensible complaint about the mode string. Fixed by special-casing MSVC.	2002-04-08 04:13:12 +00:00
Guido van Rossum	0986d8250f	- A type can now inherit its metatype from its base type. Previously, when PyType_Ready() was called, if ob_type was found to be NULL, it was always set to &PyType_Type; now it is set to base->ob_type, where base is tp_base, defaulting to &PyObject_Type. - PyType_Ready() accidentally did not inherit tp_is_gc; now it does. Bugfix candidate.	2002-04-08 01:38:42 +00:00
Guido van Rossum	7f7666ff43	isatty() should return a bool.	2002-04-07 06:28:00 +00:00
Tim Peters	49f26817eb	Minor improvements to the stats output dump, including adding commas to the big numbers.	2002-04-06 01:45:35 +00:00
Guido van Rossum	8ace1ab53a	- Changed new-style class instantiation so that when C's __new__ method returns something that's not a C instance, its __init__ is not called. [SF bug #537450]	2002-04-06 01:05:01 +00:00
Guido van Rossum	6b8ab74c8a	Don't inherit tp_new! This is a retraction of half of the previous checkin. And since that one was, this one is also a: Bugfix candidate.	2002-04-05 22:04:18 +00:00
Guido van Rossum	cc8fe0407a	Inherit tp_new and tp_is_gc. Bugfix candidate.	2002-04-05 17:10:16 +00:00
Tim Peters	ffdd22f1af	Repair an incomprehensible comment.	2002-04-05 06:24:54 +00:00
Tim Peters	16bcb6b1af	_PyMalloc_DebugDumpStats(): vastly improved the output, and it now accounts for every byte.	2002-04-05 05:45:31 +00:00
Tim Peters	e70ddf3a99	Widespread, but mostly in _PyMalloc_Malloc: optimize away all expensive runtime multiplications and divisions, via the scheme developed with Vladimir Marangozov on Python-Dev. The pool_header struct loses its capacity member, but gains nextoffset and maxnextoffset members; this still leaves it at 32 bytes on a 32-bit box (it has to be padded to a multiple of 8 bytes).	2002-04-05 04:32:29 +00:00
Guido van Rossum	c334df5727	A much revised version of SF patch 514662, by Naofumi Honda. This speeds up __getitem__ and __setitem__ in subclasses of built-in sequences. It's much revised because I took the opportunity to refactor the code somewhat (moving a large section of duplicated code to a helper function) and added comments to a series of functions.	2002-04-04 23:44:47 +00:00
Guido van Rossum	bfc2e5ee8e	Clarifying code rearrangement and comments by David Abrahams. I've got to admit that I haven't reviewed this carefully, but it looks okay from 30,000 views, and doesn't break anything. (SF patch 536407.)	2002-04-04 17:50:54 +00:00
Tim Peters	b7265dbe3e	_PyMalloc_Realloc(): removed a now-pointless cast.	2002-04-04 05:08:31 +00:00
Tim Peters	84c1b97467	_PyMalloc_{Malloc, Realloc}: Strive to meet the doc's promises about what these do given a 0 size argument. This is so that when pymalloc is enabled, we don't need to wrap pymalloc calls in goofy little routines special-casing 0. Note that it's virtually impossible to meet the doc's promise that malloc(0) will never return NULL; this makes a best effort, but not an insane effort. The code does promise that realloc(not-NULL, 0) will never return NULL (malloc(0) is much harder). _PyMalloc_Realloc: Changed to take over all requests for 0 bytes, and rearranged to be a little quicker in expected cases. All over the place: when resorting to the platform allocator, call free/malloc/realloc directly, without indirecting thru macros. This should avoid needing a nightmarish pile of #ifdef-ery if PYMALLOC_DEBUG is changed so that pymalloc takes over all Py(Mem, Object} memory operations (which would add useful debugging info to PyMem_xyz allocations too).	2002-04-04 04:44:32 +00:00
Guido van Rossum	645a22e007	As Neal pointed out, bool_print was an order of magnitude too complex.	2002-04-04 01:00:42 +00:00
Guido van Rossum	5f8203679d	Oops. Here are the new files. My apologies.	2002-04-03 23:01:45 +00:00
Guido van Rossum	77f6a65eb0	Add the 'bool' type and its values 'False' and 'True', as described in PEP 285. Everything described in the PEP is here, and there is even some documentation. I had to fix 12 unit tests; all but one of these were printing Boolean outcomes that changed from 0/1 to False/True. (The exception is test_unicode.py, which did a type(x) == type(y) style comparison. I could've fixed that with a single line using issubtype(x, type(y)), but instead chose to be explicit about those places where a bool is expected. Still to do: perhaps more documentation; change standard library modules to return False/True from predicates.	2002-04-03 22:41:51 +00:00
Fred Drake	cd874edaaa	Fix the names of the classmethod and staticmethod constructors as passed to PyArg_ParseTuple() as part of the format string.	2002-04-03 21:42:45 +00:00
Guido van Rossum	d464107ecf	Fold some long lines. Delete blank initial line.	2002-04-03 02:13:37 +00:00
Guido van Rossum	155db9aa22	SF patch 537536 by Phillip J. Eby, fix for SF bug 535444, super() broken w/ classmethods. Bugfix candidate.	2002-04-02 17:53:47 +00:00
Tim Peters	6169f09d64	Fixed errors in two comments.	2002-04-01 20:12:59 +00:00
Tim Peters	338e010b45	Restructured my pool-management overview in terms of the three possible pool states. I think it's much clearer now. Added a new long overdue block-management overview comment block. I believe the comments are in good shape now. Added two comments about possible small optimizations (one getting rid of runtime multiplications at the cost of a new pool_header member; the other getting rid of runtime divisions and the pool_header capacity member, at the cost of a static const vector of 32 uints).	2002-04-01 19:23:44 +00:00
Tim Peters	7ccfadf3a8	New PYMALLOC_DEBUG function void _PyMalloc_DebugDumpStats(void). This displays stats about the # of arenas, pools, blocks and bytes, to stderr, both used and reserved but unused. CAUTION: Because PYMALLOC_DEBUG is on, the debug malloc routine adds 16 bytes to each request. This makes each block appear two size classes higher than it would be if PYMALLOC_DEBUG weren't on. So far, playing with this confirms the obvious: there's a lot of activity in the "small dict" size class, but nothing in the core makes any use of the 8-byte or 16-byte classes.	2002-04-01 06:04:21 +00:00
Neal Norwitz	62f5a9d6c2	Convert file.readinto() to stop using METH_OLDARGS & PyArg_Parse. Add test for file.readinto().	2002-04-01 00:09:00 +00:00
Neal Norwitz	93c1e23667	Use METH_VARARGS rather than METH_OLDARGS implicitly (args are ignored)	2002-03-31 16:06:11 +00:00
Tim Peters	57b17ad6ae	Add one more assert that indirectly interlocking conditions are consistent with each other.	2002-03-31 02:59:48 +00:00
Tim Peters	4c5be0ce09	Fixed an error in a new assert.	2002-03-31 02:52:29 +00:00
Tim Peters	b1da050131	Fixed a typo in a new comment.	2002-03-31 02:51:40 +00:00
Tim Peters	2c95c99a64	_PyMalloc_Free(): As was already done for _PyMalloc_Malloc, rearranged the code so that the most frequent cases come first. Added comments. Found a hidden assumption that a pool contains room for at least two blocks, and added an assert to catch a violation if it ever happens in a place where that matters. Gave the normal "I allocated this block" case a longer basic block to work with before it has to do its first branch (via breaking apart an embedded assignment in an "if", and hoisting common code out of both branches).	2002-03-31 02:18:01 +00:00
Tim Peters	1e16db6d3b	Added a long-overdue comment block giving an overview of pool operations and terminology, plus explanation of some extreme obscurities.	2002-03-31 01:05:22 +00:00
Tim Peters	c2ce91af5f	It's once again thought safe to call the pymalloc free/realloc with an address obtained from system malloc/realloc without holding the GIL. When the vector of arena base addresses has to grow, the old vector is deliberately leaked. This makes "stale" x-thread references safe. arenas and narenas are also declared volatile, and changed in an order that prevents a thread from picking up a value of narenas too large for the value of arenas it sees. Added more asserts. Fixed an old inaccurate comment. Added a comment explaining why it's safe to call pymalloc free/realloc with an address obtained from system malloc/realloc even when arenas is still NULL (this is obscure, since the ADDRESS_IN_RANGE macro appears <wink> to index into arenas).	2002-03-30 21:36:04 +00:00
Tim Peters	7b85b4aa7f	new_arena(): In error cases, reset the number of available pools to 0. Else the pymalloc malloc will go insane the next time it's called.	2002-03-30 10:42:09 +00:00
Tim Peters	1d99af8d69	Changed the #-of-arenas counters to uints -- no need to be insane about this. But added an overflow check just in case there is. Got rid of the ushort macro. It wasn't used anymore (it was only used in the no-longer-exists off_t macro), and there's no plausible use for it.	2002-03-30 10:35:09 +00:00
Tim Peters	8deda70b16	Eliminate DONT_SHARE_SHORT_STRINGS.	2002-03-30 10:06:07 +00:00
Tim Peters	df4d1377ed	Turns out the off_t macro isn't used anymore, so got rid of it.	2002-03-30 07:07:24 +00:00
Tim Peters	3c83df2047	Now that we're no longer linking arenas together, there's no need to waste the first pool if malloc happens to return a pool-aligned address. This means the number of pools per arena can now vary by 1. Unfortunately, the code counted up from 0 to a presumed constant number of pools. So changed the increasing "watermark" counter to a decreasing "nfreepools" counter instead, and fiddled various stuff accordingly. This also allowed getting rid of two more macros. Also changed the code to align the first address to a pool boundary instead of a page boundary. These are two parallel sets of macro #defines that happen to be identical now, but the page macros are in theory more restrictive (bigger), and there's simply no reason I can see that it wasn't aligning to the less restrictive pool size all along (the code only relies on pool alignment). Hmm. The "page size" macros aren't used for anything except defining the pool size macros, and the comments claim the latter isn't necessary. So this has the feel of a layer of indirection that doesn't serve a purpose; should probably get rid of the page macros now.	2002-03-30 07:04:41 +00:00
Tim Peters	12300686ca	Retract the claim that this is always safe if PyMem_{Del, DEL, Free, FREE} are called without the GIL. It's incredibly unlikely to fail, but I can't make this bulletproof without either adding a lock for exclusion, or giving up on growing the arena base-address vector (it would be safe if this were a static array).	2002-03-30 06:20:23 +00:00
Tim Peters	d97a1c008c	Lots of changes: + A new scheme for determining whether an address belongs to a pymalloc arena. This should be 100% reliable. The poolp->pooladdr and poolp->magic members are gone. A new poolp->arenaindex member takes their place. Note that the pool header overhead doesn't actually shrink, though, since the header is padded to a multiple of 8 bytes. + _PyMalloc_Free and _PyMalloc_Realloc should now be safe to call for any legit address, whether obtained from a _PyMalloc function or from the system malloc/realloc. It should even be safe to call _PyMalloc_Free when not holding the GIL, provided that the passed-in address was obtained from system malloc/realloc. Since this is accomplished without any locks, you better believe the code is subtle. I hope it's sufficiently commented. + The above implies we don't need the new PyMalloc_{New, NewVar, Del} API anymore, and could switch back to PyObject_XXX without breaking existing code mixing PyObject_XXX with PyMem_{Del, DEL, Free, FREE}. Nothing is done here about that yet, and I'd like to see this new code exercised more first. + The small object threshhold is boosted to 256 (the max). We should play with that some more, but the old 64 was way too small for 2.3. + Getting a new arena is now done via new function new_arena(). + Removed some unused macros, and squashed out some macros that were used only once to define other macros. + Arenas are no longer linked together. A new vector of arena base addresses had to be created anyway to make address classification bulletproof. + A lot of the patch size is an illusion: given the way address classification works now, it was more convenient to switch the sense of the prime "if" tests in the realloc and free functions, so the "if" and "else" blocks got swapped. + Assorted minor code, comment and whitespace cleanup. Back to the Windows installer <wink>.	2002-03-30 06:09:22 +00:00
Tim Peters	1f7df3595a	Remove the CACHE_HASH and INTERN_STRINGS preprocessor symbols.	2002-03-29 03:29:08 +00:00
Neil Schemenauer	f589c059f4	If the GC is enabled then don't use the ob_type pointer to create a list of trash objects. Use the gc_prev pointer instead.	2002-03-29 03:05:54 +00:00
Neil Schemenauer	bd02b14255	Add missing "void" to function.	2002-03-28 21:05:38 +00:00
Guido van Rossum	ff413af605	This is Neil's fix for SF bug 535905 (Evil Trashcan and GC interaction). The fix makes it possible to call PyObject_GC_UnTrack() more than once on the same object, and then move the PyObject_GC_UnTrack() call to before the trashcan code is invoked. BUGFIX CANDIDATE!	2002-03-28 20:34:59 +00:00
Tim Peters	d1139e043c	PYMALLOC_DEBUG routines: The "check API family" gimmick was going nowhere fast, and just cluttered the code. Get rid of it for now. If a compelling case can be made for it, easy to restore it later.	2002-03-28 07:32:11 +00:00
Fred Drake	7bf9715a8b	Introduce two new flag bits that can be set in a PyMethodDef method descriptor, as used for the tp_methods slot of a type. These new flag bits are both optional, and mutually exclusive. Most methods will not use either. These flags are used to create special method types which exist in the same namespace as normal methods without having to use tedious construction code to insert the new special method objects in the type's tp_dict after PyType_Ready() has been called. If METH_CLASS is specified, the method will represent a class method like that returned by the classmethod() built-in. If METH_STATIC is specified, the method will represent a static method like that returned by the staticmethod() built-in. These flags may not be used in the PyMethodDef table for modules since these special method types are not meaningful in that case; a ValueError will be raised if these flags are found in that context.	2002-03-28 05:33:33 +00:00
Neil Schemenauer	38a8916134	Remove weakref free list. This has the side effect of fixing a memory management bug. Also, move some duplicated code into the new_weakref fucntion.	2002-03-27 15:18:21 +00:00
Walter Dörwald	e990c79fa8	Add missing methods iterkeys, itervalues and iteritems to dict-proxy objects. Add real docstrings to all methods.	2002-03-25 17:43:22 +00:00
Walter Dörwald	8c077227f2	Fix whitespace.	2002-03-25 11:16:18 +00:00
Tim Peters	e085017ab7	_PyMalloc_DebugRealloc(): simplify decl of "fresh". Assorted: bump the serial number via a trivial new bumpserialno() function. The point is to give a single place to set a breakpoint when waiting for a specific serial number.	2002-03-24 00:34:21 +00:00
Tim Peters	62c06ba6a9	Minor code cleanup -- no semantic changes.	2002-03-23 22:28:18 +00:00
Neil Schemenauer	3a204a7e48	Grow the string buffer at a mildly exponential rate for the getc version of get_line. This makes test_bufio finish in 1.7 seconds instead of 57 seconds on my machine (with Py_DEBUG defined). Also, rename the local variables n1 and n2 to used_v_size and total_v_size.	2002-03-23 19:41:34 +00:00
Tim Peters	ddea208be9	Give Python a debug-mode pymalloc, much as sketched on Python-Dev. When WITH_PYMALLOC is defined, define PYMALLOC_DEBUG to enable the debug allocator. This can be done independent of build type (release or debug). A debug build automatically defines PYMALLOC_DEBUG when pymalloc is enabled. It's a detected error to define PYMALLOC_DEBUG when pymalloc isn't enabled. Two debugging entry points defined only under PYMALLOC_DEBUG: + _PyMalloc_DebugCheckAddress(const void p) can be used (e.g., from gdb) to sanity-check a memory block obtained from pymalloc. It sprays info to stderr (see next) and dies via Py_FatalError if the block is detectably damaged. + _PyMalloc_DebugDumpAddress(const void p) can be used to spray info about a debug memory block to stderr. A tiny start at implementing "API family" checks isn't good for anything yet. _PyMalloc_DebugRealloc() has been optimized to do little when the new size is <= old size. However, if the new size is larger, it really can't call the underlying realloc() routine without either violating its contract, or knowing something non-trivial about how the underlying realloc() works. A memcpy is always done in this case. This was a disaster for (and only) one of the std tests: test_bufio creates single text file lines up to a million characters long. On Windows, fileobject.c's get_line() uses the horridly funky getline_via_fgets(), which keeps growing and growing a string object hoping to find a newline. It grew the string object 1000 bytes each time, so for a million-character string it took approximately forever (I gave up after a few minutes). So, also: fileobject.c, getline_via_fgets(): When a single line is outrageously long, grow the string object at a mildly exponential rate, instead of just 1000 bytes at a time. That's enough so that a debug-build test_bufio finishes in about 5 seconds on my Win98SE box. I'm curious to try this on Win2K, because it has very different memory behavior than Win9X, and test_bufio always took a factor of 10 longer to complete on Win2K. It could be that the endless reallocs were simply killing it on Win2K even in the release build.	2002-03-23 10:03:50 +00:00
Neil Schemenauer	ed19b88f0b	Check in (hopefully) corrected version of last change.	2002-03-23 02:06:50 +00:00
Tim Peters	ce7fb9b515	Just whitespace fiddling.	2002-03-23 00:28:57 +00:00
Tim Peters	1221c0a435	Build obmalloc.c directly instead of #include'ing from object.c. Also move all _PyMalloc_XXX entry points into obmalloc.c. The Windows build works fine. The Unix build is changed here (Makefile.pre.in), but not tested. No other platform's build process has been fiddled.	2002-03-23 00:20:15 +00:00
Neil Schemenauer	12a6d942d8	Undo last commit. It's causing the tests to file.	2002-03-22 23:50:30 +00:00
Neil Schemenauer	558ba52f10	Remove malloc hooks.	2002-03-22 23:20:15 +00:00
Neil Schemenauer	398b9f6d6d	Disallow open()ing of directories. Closes SF bug 487277.	2002-03-22 20:38:57 +00:00
Neil Schemenauer	dcc819a5c9	Use pymalloc if it's enabled.	2002-03-22 15:33:15 +00:00
Neil Schemenauer	a1a9c51a3e	Add pymalloc object memory management functions. These must be available even if pymalloc is disabled since extension modules might use them.	2002-03-22 15:28:30 +00:00
Tim Peters	bab22beda8	SF bug 533198: Complex power underflow raises exception. Konrad was too kind. Not only did it raise an exception, the specific exception it raised made no sense. These are old bugs in complex_pow() and friends: 1. Raising 0 to a negative power isn't a range error, it's a domain error, so changed c_pow() to set errno to EDOM in that case instead of ERANGE. 2. Changed complex_pow() to: A. Used the Py_ADJUST_ERANGE2 macro to try to clear errno of a spurious ERANGE error due to underflow in the libm pow() called by c_pow(). B. Produced different exceptions depending on the errno value: i) For errno==EDOM, raise ZeroDivisionError instead of ValueError. This is for consistency with the non-complex cases 0.0-2 and 0-2 and 0L**-2. ii) For errno==ERANGE, raise OverflowError. Bugfix candidate.	2002-03-22 02:48:46 +00:00
Martin v. Löwis	047c05ebc4	Do not insert characters for unicode-escape decoders if the error mode is "ignore". Fixes #529104.	2002-03-21 08:55:28 +00:00
Neil Schemenauer	25f3dc21b5	Drop the PyCore_* memory API.	2002-03-18 21:06:21 +00:00
Neil Schemenauer	d91eec9df3	Re-enable GC of method objects.	2002-03-18 20:44:53 +00:00
Neil Schemenauer	dbf409fbfc	Re-enable GC of iter objects.	2002-03-18 20:43:51 +00:00
Neil Schemenauer	11f5be8d88	Simpilify PyCore_* macros by assuming the function prototypes for malloc() and free() don't change.	2002-03-18 18:13:41 +00:00
Guido van Rossum	7e30548285	Fix for SF bug 528132 (Armin Rigo): classmethod().__get__() segfault The proper fix is not quite what was submitted; it's really better to take the class of the object passed rather than calling PyMethod_New with NULL pointer args, because that can then cause other core dumps later. I also added a testcase for the fix to classmethods() in test_descr.py. I've already applied this to the 2.2 branch.	2002-03-18 03:09:06 +00:00
Tim Peters	2400831773	SF patch 530070: pydoc regression, from Martin and Guido. Change the way __doc__ is handled, to avoid blowing up on non-string __doc__ values.	2002-03-17 18:56:20 +00:00
Martin v. Löwis	f6eebbb435	Patch #530105 : Allow file object may to be subtyped	2002-03-15 17:42:16 +00:00
Martin v. Löwis	0c160a08f2	Patch #517521 : Consider byte strings before Unicode strings in PyObject_Get/SetAttr.	2002-03-15 13:40:30 +00:00
Guido van Rossum	0628dcfe1f	"Fix" for SF bug #520644 : __slots__ are not pickled. As promised in my response to the bug report, I'm not really fixing it; in fact, one could argule over what the proper fix should do. Instead, I'm adding a little magic that raises TypeError if you try to pickle an instance of a class that has __slots__ but doesn't define or override __getstate__. This is done by adding a bozo __getstate__ that always raises TypeError.	2002-03-14 23:03:14 +00:00
Guido van Rossum	cd637aae56	Fix for SF bug #529050 - ModuleType.__new__ crash. There were several places that assumed the md_dict field was always set, but it needn't be. Fixed these to be more careful. I changed PyModule_GetDict() to initialize md_dict to a new dictionary if it's NULL. Bugfix candidate.	2002-03-12 20:37:02 +00:00
Tim Peters	8f01b680c8	Change Windows file.truncate() to (a) restore the original file position, and (b) stop trying to prevent file growth. Beef up the file.truncate() docs. Change test_largefile.py to stop assuming that f.truncate() moves the file pointer to the truncation point, and to verify instead that it leaves the file position alone. Remove the test for what happens when a specified size exceeds the original file size (it's ill-defined, according to the Single Unix Spec).	2002-03-12 03:04:44 +00:00
Tim Peters	fb05db2cae	file_truncate(): provide full "large file" support on Windows, by dropping MS's inadequate _chsize() function. This was inspired by SF patch 498109 ("fileobject truncate support for win32"), which I rejected. libstdtypes.tex: Someone who knows should update the availability blurb. For example, if it's available on Linux, it would be good to say so. test_largefile: Uncommented the file.truncate() tests, and reworked to do more. The old comment about "permission errors" in the truncation tests under Windows was almost certainly due to that the file wasn't open for write access at this point, so of course MS wouldn't let you truncate it. I'd be appalled if a Unixish system did. CAUTION: Someone should run this test on Linux (etc) too. The truncation part was commented out before. Note that test_largefile isn't run by default.	2002-03-11 00:24:00 +00:00
Guido van Rossum	dfce3bf908	Bugfix candidate. Adapter from SF patch 528038; fixes SF bug 527816. The wrapper for __nonzero__ should be wrap_inquiry rather than wrap_unaryfunc, since the slot returns an int, not a PyObject *.	2002-03-10 14:11:16 +00:00
Martin v. Löwis	c8bb9eba31	Patch #494047 : removes 64-bit ?: to cope on plan9.	2002-03-09 12:02:59 +00:00
Tim Peters	dc5a508761	SF bug 525705: [2.2] underflow raise OverflowException. Another year in the quest to out-guess random C behavior. Added macros Py_ADJUST_ERANGE1(X) and Py_ADJUST_ERANGE2(X, Y). The latter is useful for functions with complex results. Two corrections to errno- after-libm-call are attempted: 1. If the platform set errno to ERANGE due to underflow, clear errno. Some unknown subset of libm versions and link options do this. It's allowed by C89, but I never figured anyone would do it. 2. If the platform did not set errno but overflow occurred, force errno to ERANGE. C89 required setting errno to ERANGE, but C99 doesn't. Some unknown subset of libm versions and link options do it the C99 way now. Bugfix candidate, but hold off until some Linux people actually try it, with and without -lieee. I'll send a help plea to Python-Dev.	2002-03-09 04:58:24 +00:00
Jeremy Hylton	0522d9891a	Fix leak of NotImplemented in previous checkin to PyNumber_Add(). If result == Py_NotImplemented, always DECREF it before assigning a new value to result.	2002-03-08 21:28:54 +00:00
Jeremy Hylton	6ae6a43a77	Fix for SF bug 516727: MyInt(2) + "3" -> NotImplemented PyNumber_Add() tries the nb_add slot first, then falls back to sq_concat. However, tt didn't check the return value of sq_concat. If sq_concat returns NotImplemented, raise the standard TypeError.	2002-03-08 21:11:37 +00:00
Michael W. Hudson	70ffddf8af	Guido pointed out that I was missing a couple decrefs.	2002-03-07 15:13:40 +00:00
Michael W. Hudson	ce358e3015	Apply (my) patch: [ 526072 ] pickling os.stat results round II structseq's constructors can now take "invisible" fields in a dict. Gave the constructors better error messages. their __reduce__ method puts these fields in a dict. (this is all in aid of getting os.stat_result's to pickle portably) Also fixes [ 526039 ] devious code can crash structseqs Thought needed about how much of this counts as a bugfix. Certainly #526039 needs to be fixed.	2002-03-06 17:07:49 +00:00
Michael W. Hudson	7bb466a1a5	A fix & test for [ 496873 ] structseqs unpicklable by adding a __reduce__ method to structseqs. Will also commit this to the 2.2.1 branch momentarily.	2002-03-05 13:27:58 +00:00
Tim Peters	a5d78cc208	Whether platform malloc(0) returns NULL has nothing to do with whether platform realloc(p, 0) returns NULL, so MALLOC_ZERO_RETURNS_NULL can be correctly undefined yet realloc(p, 0) can return NULL anyway. Prevent realloc(p, 0) doing free(p) and returning NULL via a different hack. Would probably be better to get rid of MALLOC_ZERO_RETURNS_NULL entirely. Bugfix candidate.	2002-03-02 08:43:19 +00:00
Tim Peters	5329cdb3ce	_PyLong_Copy(): was creating a copy of the absolute value, but should copy the sign too. Added a test to test_descr to ensure that it does. Bugfix candidate.	2002-03-02 04:18:04 +00:00
Tim Peters	db30ac41de	Revert the last odd change to PyNumber_Long: the problem it was trying to fix was almost certainly a bug in _PyLong_Copy (which I'll fix next).	2002-03-02 04:14:21 +00:00
Guido van Rossum	2eb0b87d14	SF patch 514641 (Naofumi Honda) - Negative ob_size of LongObjects Due to the bizarre definition of _PyLong_Copy(), creating an instance of a subclass of long with a negative value could cause core dumps later on. Unfortunately it looks like the behavior of _PyLong_Copy() is quite intentional, so the fix is more work than feels comfortable. This fix is almost, but not quite, the code that Naofumi Honda added; in addition, I added a test case.	2002-03-01 22:24:49 +00:00
Andrew MacIntyre	5e9c80d906	%#x/%#X format conversion cleanup (see patch #450267 ): Objects/ stringobject.c unicodeobject.c	2002-02-28 11:38:24 +00:00
Andrew MacIntyre	c487439aa7	OS/2 EMX port changes (Objects part of patch #450267 ): Objects/ fileobject.c stringobject.c unicodeobject.c This commit doesn't include the cleanup patches for stringobject.c and unicodeobject.c which are shown separately in the patch manager. Those patches will be regenerated and applied in a subsequent commit, so as to preserve a fallback position (this commit to those files).	2002-02-26 11:36:35 +00:00
Martin v. Löwis	f9bd6b09e1	Allow __doc__ to be of arbitrary type. Patch by James Henstridge, fixes #504343. 2.2.1 candidate.	2002-02-18 17:46:48 +00:00
Martin v. Löwis	a5854c24a2	Patch #508038 : Do not use a type as a variable name.	2002-02-16 23:39:10 +00:00
Guido van Rossum	fa2e4c27d2	Declare real and imag as read-only attributes. This fixes SF bug #514858 (Gregory Smith): complex not entirely immutable 2.2.1 Bugfix candidate!	2002-02-08 21:26:07 +00:00
Marc-André Lemburg	bd3be8f0ca	Fix to the UTF-8 encoder: it failed on 0-length input strings. Fix for the UTF-8 decoder: it will now accept isolated surrogates (previously it raised an exception which causes round-trips to fail). Added new tests for UTF-8 round-trip safety (we rely on UTF-8 for marshalling Unicode objects, so we better make sure it works for all Unicode code points, including isolated surrogates). Bumped the PYC magic in a non-standard way -- please review. This was needed because the old PYC format used illegal UTF-8 sequences for isolated high surrogates which now raise an exception.	2002-02-07 11:33:49 +00:00
Marc-André Lemburg	dc724d6e35	Cosmetics.	2002-02-06 18:20:19 +00:00
Marc-André Lemburg	e7c6ee4b8a	Whitespace fixes.	2002-02-06 18:18:03 +00:00
Marc-André Lemburg	3688a882d3	Fix for the UTF-8 memory allocation bug and the UTF-8 encoding bug related to lone high surrogates.	2002-02-06 18:09:02 +00:00
Guido van Rossum	e75bfde7e9	Bugfix candidate. Fix SF bug #511603: Error calling str on subclass of int Explicitly fill in tp_str with the same pointer as tp_repr.	2002-02-01 15:34:10 +00:00
Neal Norwitz	2a47c0fa23	Fix spelling mistakes. Bugfix candidates.	2002-01-29 00:53:41 +00:00
Martin v. Löwis	1f803f782c	Updated patch #487906 : Revise inline docs.	2002-01-16 10:53:24 +00:00
Martin v. Löwis	cdc4451222	Include <unistd.h> in Python.h. Fixes #500924 .	2002-01-12 11:05:12 +00:00
Martin v. Löwis	b0d71d0ec6	Implement PyObject_DelItemString. Fixes #498915 .	2002-01-05 10:50:30 +00:00
Neal Norwitz	649b75954a	SF Patch #494863 , file.xreadlines() should raise ValueError if file is closed This makes xreadlines behave like all other file methods (other than close() which just returns).	2002-01-01 19:07:13 +00:00
Fred Drake	2a908f6b7b	proxy_compare(): Make sure that we unwrap both objects being compared if both are proxy objects.	2001-12-19 16:44:30 +00:00
Guido van Rossum	f884b74910	- PyType_Ready(): Initialize the ob_type field to &PyType_Type if it's NULL, so that you can call PyType_Ready() to initialize a type that is to be separately compiled with C on Windows. inherit_special(): Add a long comment explaining that you have to set tp_new if your base class is PyBaseObject_Type.	2001-12-17 17:14:22 +00:00
Sjoerd Mullender	564980bd86	Portability fix: Not every compiler implements the extension of unescaped newlines in strings.	2001-12-17 11:39:56 +00:00
Guido van Rossum	33c1a8893d	SF patch #493452 : docstrings for staticmethod/classmethod (Skip Montanaro) (With minor adjustments.)	2001-12-17 02:53:53 +00:00
Guido van Rossum	2b8235e485	SF bug #493561 : incorrect format string descrobject.c (Neal Norwitz) %300s should be %.300s, twice.	2001-12-15 05:00:30 +00:00
Guido van Rossum	e54616cb6f	(Merge into trunk.) Fix for SF bug #492345. (I could've sworn I checked this in, but apparently I didn't!) This code: class Classic: pass class New(Classic): __metaclass__ = type attempts to create a new-style class with only classic bases -- but it doesn't work right. Attempts to fix it so it works caused problems elsewhere, so I'm now raising a TypeError in this case.	2001-12-14 04:19:56 +00:00
Fred Drake	526c7a0101	Ensure that complex() only accepts a string argument as the first arg, and only if there is no second arg. This closes SF patch #479551.	2001-12-13 19:52:22 +00:00
Tim Peters	77d8a4fc91	float_floor_div: An expression like 3.//1j crashed the interpreter, or delivered bizarre results. Check float_divmod for a Py_NotImplemented return and pass it along (instead of treating Py_NotImplemented as a 2-tuple). CONVERT_TO_DOUBLE: Added comments; this macro is obscure.	2001-12-11 20:31:34 +00:00
Tim Peters	63a3571e17	float_int_div(): For clarity, move this closer to the other float division functions, and rename to float_floor_div.	2001-12-11 19:57:24 +00:00
Tim Peters	f582b82fe9	SF bug #491415 PyDict_UpdateFromSeq2() unused PyDict_UpdateFromSeq2(): removed it. PyDict_MergeFromSeq2(): made it public and documented it. PyDict_Merge() docs: updated to reveal <wink> that the second argument can be any mapping object.	2001-12-11 18:51:08 +00:00
Fred Drake	ef8ebd1e74	Make sure that when we invoke callback functions associated with weak references, we do not allow any outstanding exceptions "leak" into the callback's execution state. This closes SF bug #478534.	2001-12-10 23:44:54 +00:00
Guido van Rossum	7171f1c8d8	Well what do you know. The Python implementation contained the same bug as the C code. :-(	2001-12-10 18:06:21 +00:00
Guido van Rossum	ba2485f947	Fix the Python property class in a comment right.	2001-12-10 18:03:34 +00:00
Guido van Rossum	b75ba918d6	property_descr_get(): Fix a curious bug in the property() type: when no get function was defined, the property's doc string was inaccessible. This was because the test for prop_get was made before the test for a NULL/None object argument. Also changed the property class defined in Python in a comment to test for NULL to decide between get and delete; this makes it less Python but then, assigning None to a property doesn't delete it!	2001-12-10 18:00:15 +00:00
Guido van Rossum	169192e818	SF patch #491049 (David Jacobs): Small PyString_FromString optimization PyString_FromString(): Since the length of the string is already being stored in size, changed the strcpy() to a memcpy() for a small speed improvement.	2001-12-10 15:45:54 +00:00
Guido van Rossum	f70590f990	_PyTuple_Resize(): this dumped core on tuple(globals()) for me. Turns out the for loop at the end intended to zero out new items wasn't doing anything, because sv->ob_size was already equal to newsize. The fix slightly refactors the function, introducing a variable oldsize and doing away with sizediff (which was used only once), and using oldsize and newsize consistently. I also added comments explaining what the two for loops do. (Looking at the CVS annotation of this function, it's no miracle a bug crept in -- this has been patched by many different folks! :-)	2001-12-07 20:00:04 +00:00
Tim Peters	62de65b25e	PyString_FromString: this requires its argument be non-NULL, but doesn't check it. Added an assert() to that effect.	2001-12-06 20:29:32 +00:00
Guido van Rossum	604ddf80d8	Fix for #489669 (Neil Norwitz): memory leak in test_descr (unicode). This is best reproduced by while 1: class U(unicode): pass U(u"xxxxxx") The unicode_dealloc() code wasn't properly freeing the str and defenc fields of the Unicode object when freeing a subtype instance. Fixed this by a subtle refactoring that actually reduces the amount of code slightly.	2001-12-06 20:03:56 +00:00
Jeremy Hylton	1a48ca8c53	Fix memory leak in dict_to_map(), SF bug [ #485152 ] memory leak in test_scope. PyCell_Set() incremenets the reference count, so the earlier XINCREF causes a leak. Also make a number of small performance improvements to the code on the assumption that most of the time variables are not rebound across a FastToLocals() / LocalsToFast() pair. Replace uses of PyCell_Set() and PyCell_Get() with PyCell_SET() and PyCell_GET(), since the frame is guaranteed to contain cells.	2001-12-06 15:48:16 +00:00
Jeremy Hylton	7802a53e38	Little stuff. Add a missing DECREF in an obscure corner. If the str() or repr() of an object passed to a string interpolation -- e.g. "%s" % obj -- returns a non-string, the returned object was leaked. Repair an indentation glitch. Replace a bunch of PyString_AsString() calls (and their ilk) with macros.	2001-12-06 15:18:48 +00:00
Guido van Rossum	14227b4dd4	The previous checkin to clear __slots__ variables did a little bit of the work each time it found another base class. All the work is contiguous, so we might as well do it all at once at the end.	2001-12-06 02:35:58 +00:00
Guido van Rossum	33bab01da6	Fix SF bug #489581 : __slots__ leak. It was easier than I thought, assuming that no other things contribute to the instance size besides slots -- a pretty good bet. With a test suite, no less!	2001-12-05 22:45:48 +00:00
Guido van Rossum	d331cb5502	At the PythonLabs meeting someone mentioned it would make Jim really happy if one could delete the __dict__ attribute of an instance. I love to make Jim happy, so here goes... - New-style objects now support deleting their __dict__. This is for all intents and purposes equivalent to assigning a brand new empty dictionary, but saves space if the object is not used further.	2001-12-05 19:46:42 +00:00
Tim Peters	a3c01ce696	SF bug #488480 : integer multiply to return -max_int-1. int_mul(): new and vastly simpler overflow checking. Whether it's faster or slower will likely vary across platforms, favoring boxes with fast floating point. OTOH, we no longer have to worry about people shipping broken LONG_BIT definitions <0.9 wink>.	2001-12-04 23:05:10 +00:00
Guido van Rossum	64b206c19e	Fix SF bug #486144 : Uninitialized __slot__ vrbl is None. There's now a new structmember code, T_OBJECT_EX, which is used for all __slot__ variables (except __weakref__, which has special behavior anyway). This new code raises AttributeError when the variable is NULL rather than converting NULL to None.	2001-12-04 17:13:22 +00:00
Guido van Rossum	03b3f04542	long_mul(): The PyNumber_Multiply() call can return a long if the result would overflow an int. Check for this. (SF bug #488482, Armin Rigo.)	2001-12-04 16:36:39 +00:00
Guido van Rossum	ebca9fc1ba	PyObject_Generic{Get,Set}Attr(): ensure that the attribute name is a string object (or a Unicode that's trivially converted to ASCII). PyObject_GetAttr(): add an 'else' to the Unicode test like PyObject_SetAttr() already has.	2001-12-04 15:54:53 +00:00
Guido van Rossum	be5234610a	function_call(): Remove a bogus (and I mean really bogus) call to Py_DECREF(arg) after the PyErr_NoMemory() call. (Armin Rigo, SF bug #488477.)	2001-12-03 19:22:38 +00:00
Guido van Rossum	dbb53d9918	Fix of SF bug #475877 (Mutable subtype instances are hashable). Rather than tweaking the inheritance of type object slots (which turns out to be too messy to try), this fix adds a __hash__ to the list and dict types (the only mutable types I'm aware of) that explicitly raises an error. This has the advantage that list.__hash__([]) also raises an error (previously, this would invoke object.__hash__([]), returning the argument's address); ditto for dict.__hash__. The disadvantage for this fix is that 3rd party mutable types aren't automatically fixed. This should be added to the rules for creating subclassable extension types: if you don't want your object to be hashable, add a tp_hash function that raises an exception. Also, it's possible that I've forgotten about other mutable types for which this should be done.	2001-12-03 16:32:18 +00:00
Guido van Rossum	5b443c6282	Address SF patch #480716 as well as related issues. SF patch #480716 by Greg Chapman fixes the problem that super's __get__ method always returns an instance of super, even when the instance whose __get__ method is called is an instance of a subclass of super. Other issues fixed: - super(C, C()).__class__ would return the __class__ attribute of C() rather than the __class__ attribute of the super object. This is confusing. To fix this, I decided to change the semantics of super so that it only applies to code attributes, not to data attributes. After all, overriding data attributes is not supported anyway. - While super(C, x) carefully checked that x is an instance of C, super(C).__get__(x) made no such check, allowing for a loophole. This is now fixed.	2001-12-03 15:38:28 +00:00
Martin v. Löwis	8f1ea71eab	Add more inline documentation, as contributed in #487906 .	2001-12-03 08:24:52 +00:00
Tim Peters	9161c8b0a1	PyString_FromFormatV, string_repr: document why these use sprintf instead of PyOS_snprintf; add some relevant comments and asserts.	2001-12-03 01:55:38 +00:00
Guido van Rossum	1d5b3f29ff	Fix for SF bug #485678 . slot_tp_descr_set(): When deleting an attribute described by a descriptor implemented in Python, the descriptor's __del__ method is called by the slot_tp_descr_set dispatch function. This is bogus -- __del__ already has a different meaning. Renaming this use of __del__ is renamed to __delete__.	2001-12-03 00:08:33 +00:00
Martin v. Löwis	d132750206	Patch 487906: update inline docs.	2001-12-02 18:09:41 +00:00
Tim Peters	422210426e	SF bug #487743 : test_builtin fails on 64 bit platform. Bugfix candidate. int_repr(): we've never had a buffer big enough to hold the largest possible result on a 64-bit box. Now that we're using snprintf instead of sprintf, this can lead to nonsense results instead of random stack corruption.	2001-12-01 02:52:56 +00:00
Jack Jansen	b3be216b41	Merged changes made on r22b2-branch between r22b2 and r22b2-mac (the changes from start of branch upto r22b2 were already merged, of course).	2001-11-30 14:16:36 +00:00
Tim Peters	97019e4110	PyFloat_AsStringEx(): This function takes an output char* but doesn't pass the buffer length. Stop using it. It should be deprecated, but too late in the release cycle to do that now. New static format_float() does the same thing but requires passing the buffer length too. Use it instead.	2001-11-28 22:43:45 +00:00
Tim Peters	c1bbcb87aa	PyFile_WriteString(): change prototype so that the string arg is const char* instead of char. The change is conceptually correct, and indirectly fixes a compiler wng introduced when somebody else innocently passed a const char to this function.	2001-11-28 22:13:25 +00:00
Barry Warsaw	d586756dc5	weakref_repr(), proxy_repr(): Conversion of sprintf() to PyOS_snprintf() for buffer overrun avoidance.	2001-11-28 21:01:56 +00:00
Barry Warsaw	e5c492d72a	formatfloat(), formatint(): Conversion of sprintf() to PyOS_snprintf() for buffer overrun avoidance.	2001-11-28 21:00:41 +00:00
Barry Warsaw	312af42b47	structseq_new(): Conversion of sprintf() to PyOS_snprintf() for buffer overrun avoidance.	2001-11-28 20:56:44 +00:00
Barry Warsaw	6197509f24	PyInt_FromString(), int_repr(), int_oct(), int_hex(): Conversion of sprintf() to PyOS_snprintf() for buffer overrun avoidance.	2001-11-28 20:55:34 +00:00
Barry Warsaw	af8aef9ee2	PyFloat_FromString(): Conversion of sprintf() to PyOS_snprintf() for buffer overrun avoidance.	2001-11-28 20:52:21 +00:00
Barry Warsaw	01d697a067	complex_to_buf(), complex_subtype_from_c_complex(): Conversion of sprintf() to PyOS_snprintf() for buffer overrun avoidance. complex_print(), complex_repr(), complex_str(): Call complex_to_buf() passing in sizeof(buf).	2001-11-28 20:50:56 +00:00
Tim Peters	885d457709	sprintf -> PyOS_snprintf in some "obviously safe" cases. Also changed <>-style #includes to ""-style in some places where the former didn't make sense.	2001-11-28 20:27:42 +00:00
Marc-André Lemburg	11326de657	Fix for bug #485951 : repr diff between string and unicode.	2001-11-28 12:56:20 +00:00
Marc-André Lemburg	d4c0a9c59b	Fixes for possible buffer overflows in sprintf() usages.	2001-11-28 11:47:00 +00:00
Guido van Rossum	64585f6afb	PyObject_GetItem(), PyObject_SetItem(), PyObject_DelItem(): Fix a few confusing error messages. If a new-style class has no sequence or mapping behavior, attempting to use the indexing notation with a non-integer key would complain that the sequence index must be an integer, rather than complaining that the operation is not supported.	2001-11-24 18:24:47 +00:00
Marc-André Lemburg	72f8213ba4	Fix for bug #438164 : %-formatting using Unicode objects. This patch also does away with an incompatibility between Jython and CPython.	2001-11-20 15:18:49 +00:00
Tim Peters	a91e9646e0	Changing diapers reminded Guido that he wanted to allow for some measure of multiple inheritance from a mix of new- and classic-style classes. This is his patch, plus a start at some test cases from me. Will check in more, plus a NEWS blurb, later tonight.	2001-11-14 23:32:33 +00:00
Jeremy Hylton	89c3a22a27	Add PyObject_CheckReadBuffer(), which returns true if its argument supports the single-segment readable buffer interface. Add documentation for this and other PyObject_XXXBuffer() calls.	2001-11-09 21:59:42 +00:00
Tim Peters	a27a150ea5	open_the_file(): Explicitly set errno to 0 before calling fopen().	2001-11-09 20:59:14 +00:00
Tim Peters	114486701a	open_the_file(): this routine has a borrowed reference to the file object, so the "Metroworks only" section should not decref it in case of error (the caller is responsible for decref'ing in case of error -- and does).	2001-11-09 19:23:47 +00:00
Jeremy Hylton	41c8321252	Fix SF buf #476953 : Bad more for opening file gives bad msg. If fopen() fails with EINVAL it means that the mode argument is invalid. Return the mode in the error message instead of the filename.	2001-11-09 16:17:24 +00:00
Tim Peters	6f97e493e1	long_true_divide(): decref its converted arguments. test_long_future.py run in an infinite loop no longer grows. Thanks to Neal Norwitz for determining that test leaked!	2001-11-04 23:09:40 +00:00
Tim Peters	67754e993e	Rehabilitated the fast-path richcmp code, and sped it up. It wasn't helping for types that defined tp_richcmp but not tp_compare, although that's when it's most valuable, and strings moved into that category since the fast path was first introduced. Now it helps for same-type non-Instance objects that define rich or 3-way compares. For all the edits here, the rest just amounts to moving the fast path from do_richcmp into PyObject_RichCompare, saving a layer of function call (measurable on my box!). This loses when NESTING_LIMIT is exceeded, but I don't care about that (fast-paths are for normal cases, not pathologies). Also added a tasteful <wink> label to get out of PyObject_RichCompare, as the if/else nesting in this routine was getting incomprehensible.	2001-11-04 07:29:31 +00:00
Tim Peters	c99213f993	No code change -- just trying to document the return conditions for all the internal comparison routines.	2001-11-04 05:57:16 +00:00
Tim Peters	4e8ab5db38	float_divmod(): the code wasn't sick enough to stop the MS optimizer from optimizing away mod's sign adjustment when mod == 0; so it got the intended result only in the debug build.	2001-11-01 23:59:56 +00:00
Tim Peters	d2e40d6691	SF bug #477221 : abs and divmod act oddly with -0.0 Try to ensure that divmod(-0.0, 1.0) -> (-0.0, +0.0) across platforms. It always did on Windows, and still does. It didn't on Linux. Alas, there's no platform-independent way to write a test case for this. Bugfix candidate.	2001-11-01 23:12:27 +00:00
Tim Peters	faf0cd21ed	float_abs() again: Guido pointed out that this could screw up in the presence of NaNs. So pass the issue on to the platform libm fabs(); after all, fabs() is a std C function because you can't implement it correctly in portable C89.	2001-11-01 21:51:15 +00:00
Fred Drake	573395a7a8	PyFunction_Call() did not check the result of PyObject_Repr() for NULL, and should just avoid calling it in the first place to avoid waiting for a repr of a large object like a dict or list. The result of PyObject_Repr() was being leaked as well. Bugfix candidate!	2001-11-01 20:26:12 +00:00
Tim Peters	d2364e8e2d	SF bug #477221 : abs and divmod act oddly with -0.0. Partial fix. float_abs(): ensure abs(-0.0) returns +0.0. Bugfix candidate.	2001-11-01 20:09:42 +00:00
Michael W. Hudson	e2ec3ebcb8	fix for [ #476557 ] Wrong error message for file.write(a, b) Makes file.write a METH_VARARGS function.	2001-10-31 18:51:01 +00:00
Tim Peters	c2fe618575	Fix bad bug in structseq slicing (NULL pointers in result). Reported by Jack Jansen on python-dev. Add simple test case. Move vereq() from test_descr to test_support (it's handy!).	2001-10-30 23:20:46 +00:00
Guido van Rossum	d82fb78b5c	Add values to tp_getattro and tp_flags so that dir(Ellipsis) will return the same as dir(None).	2001-10-30 02:40:52 +00:00
Tim Peters	a427a2b8d0	Rename "dictionary" (type and constructor) to "dict".	2001-10-29 22:25:45 +00:00
Guido van Rossum	7ad2d1eb8e	Add __del__ callbacks. They are too useful to leave out. XXX Remaining problems: - The GC module doesn't know about these; I think it has its reasons to disallow calling __del__, but for now, __del__ on new-style objects is called when the GC module discards an object, for better or for worse. - The code to call a __del__ handler is really ridiculously complicated, due to all the different debug #ifdefs. I've copied this from the similar code in classobject.c, so I'm pretty sure I did it right, but it's not pretty. :-( - No tests yet.	2001-10-29 22:11:00 +00:00
Guido van Rossum	afe7a94089	When overriding __str__ or __repr__, set the tp_print slot to NULL.	2001-10-29 14:33:44 +00:00
Fred Drake	b0c079e3e5	PyObject_CallFunctionObArgs() ---> PyObject_CallFunctionObjArgs() PyObject_CallMethodObArgs() ---> PyObject_CallMethodObjArgs()	2001-10-28 02:39:03 +00:00
Tim Peters	3abca127fe	SF bug #475327 : type() produces incorrect error msg object.h: Added PyType_CheckExact macro. typeobject.c, type_new(): + Use the new macro. + Assert that the arguments have the right types rather than do incomplete runtime checks "sometimes". + If this isn't the 1-argument flavor() of type, and there aren't 3 args total, produce a "types() takes 1 or 3 args" msg before PyArg_ParseTupleAndKeywords produces a "takes exactly 3" msg.	2001-10-27 19:37:48 +00:00
Tim Peters	4d85953fe6	dictionary() constructor: + Change keyword arg name from "x" to "items". People passing a mapping object can stretch their imaginations <wink>. + Simplify the docstring text.	2001-10-27 18:27:48 +00:00
Fred Drake	b92cf067c6	PyObject_CallFunction(), PyObject_CallMethod(): Make sure we do not touch the va_list until we are sure we have a format string and need to use it; this avoid premature initialization and having to finalize it several different places because of error returns.	2001-10-27 06:16:31 +00:00
Fred Drake	c916f5a390	Be smarter about clearing the weakref lists for instances, instance methods, and functions: we only need to call PyObject_ClearWeakRefs() if the weakref list is non-NULL. Since these objects are common but weakrefs are still unusual, saving the call at deallocation time makes a lot of sense.	2001-10-26 17:56:51 +00:00
Fred Drake	b421b8c191	Added two new functions to conveniently call functions/methods from C. PyObject_CallFunctionObArgs() and PyObject_CallMethodObArgs() have the advantage that no format strings need to be parsed. The CallMethod variant also avoids creating a new string object in order to retrieve a method from an object as well.	2001-10-26 16:21:32 +00:00
Tim Peters	1fc240e851	Generalize dictionary() to accept a sequence of 2-sequences. At the outer level, the iterator protocol is used for memory-efficiency (the outer sequence may be very large if fully materialized); at the inner level, PySequence_Fast() is used for time-efficiency (these should always be sequences of length 2). dictobject.c, new functions PyDict_{Merge,Update}FromSeq2. These are wholly analogous to PyDict_{Merge,Update}, but process a sequence-of-2- sequences argument instead of a mapping object. For now, I left these functions file static, so no corresponding doc changes. It's tempting to change dict.update() to allow a sequence-of-2-seqs argument too. Also changed the name of dictionary's keyword argument from "mapping" to "x". Got a better name? "mapping_or_sequence_of_pairs" isn't attractive, although more so than "mosop" <wink>. abstract.h, abstract.tex: Added new PySequence_Fast_GET_SIZE function, much faster than going thru the all-purpose PySequence_Size. libfuncs.tex: - Document dictionary(). - Fiddle tuple() and list() to admit that their argument is optional. - The long-winded repetitions of "a sequence, a container that supports iteration, or an iterator object" is getting to be a PITA. Many months ago I suggested factoring this out into "iterable object", where the definition of that could include being explicit about generators too (as is, I'm not sure a reader outside of PythonLabs could guess that "an iterator object" includes a generator call). - Please check my curly braces -- I'm going blind <0.9 wink>. abstract.c, PySequence_Tuple(): When PyObject_GetIter() fails, leave its error msg alone now (the msg it produces has improved since PySequence_Tuple was generalized to accept iterable objects, and PySequence_Tuple was also stomping on the msg in cases it shouldn't have even before PyObject_GetIter grew a better msg).	2001-10-26 05:06:50 +00:00
Guido van Rossum	6661be3bed	Allow assignment to newinstance.__dict__.	2001-10-26 04:26:12 +00:00
Guido van Rossum	70e3688364	complex_subtype_from_string(): move the declaration of s_buffer[] out of the if block where it was before. The name is only used inside that if block, but the storage is referenced outside it via the 's' variable. (This patch was part of SF patch #474590 -- RISC OS support.)	2001-10-25 18:07:22 +00:00
Guido van Rossum	e2ae77b8b8	SF patch #474590 -- RISC OS support	2001-10-24 20:42:55 +00:00
Guido van Rossum	00ebd46dfc	SF patch #474175 (Jay T Miller): file.readinto arg parsing bug The C-code in fileobject.readinto(buffer) which parses the arguments assumes that size_t is interchangeable with int: size_t ntodo, ndone, nnow; if (f->f_fp == NULL) return err_closed(); if (!PyArg_Parse(args, "w#", &ptr, &ntodo)) return NULL; This causes a problem on Alpha / Tru64 / OSF1 v5.1 where size_t is a long and sizeof(long) != sizeof(int). The patch I'm proposing declares ntodo as an int. An alternative might be to redefine w# to expect size_t. [We can't change w# because there are probably third party modules relying on it. GvR]	2001-10-23 21:25:24 +00:00
Jeremy Hylton	996fad315c	Referencable is not a word, so don't use it in an error message <wink>.	2001-10-22 16:31:40 +00:00
Jeremy Hylton	39a362d9f4	cleanup indentation	2001-10-22 16:30:36 +00:00
Guido van Rossum	5c66a26dee	Make the error message for unsupported operand types cleaner, in response to a message by Laura Creighton on c.l.py. E.g. >>> 0+'' TypeError: unsupported operand types for +: 'int' and 'str' (previously this did not mention the operand types) >>> ''+0 TypeError: cannot concatenate 'str' and 'int' objects	2001-10-22 04:12:44 +00:00
Guido van Rossum	56ff387a7e	Fix for SF bug #472940 : can't getattr() attribute shown by dir() There really isn't a good reason for instance method objects to have their own __dict__, __doc__ and __name__ properties that just delegate the request to the function (callable); the default attribute behavior already does this. The test suite had to be fixed because the error changes from TypeError to AttributeError.	2001-10-22 02:00:09 +00:00
Guido van Rossum	c8e5645f15	Methods of built-in types now properly check for keyword arguments (formerly these were silently ignored). The only built-in methods that take keyword arguments are __call__, __init__ and __new__.	2001-10-22 00:43:43 +00:00
Neil Schemenauer	f23473f008	Add missing "static" declarations (found by "make smelly").	2001-10-21 22:28:58 +00:00
Neil Schemenauer	2677512fc1	Adding missing "static" declarations (found by "make smelly").	2001-10-21 22:26:43 +00:00
Guido van Rossum	6d204074cb	Big internal change that should have no external effects: unify the 'slotdef' structure typedef and 'struct wrapperbase'. By adding the wrapper docstrings to the slotdef structure, the slotdefs array can serve as the data structure that drives add_operators(); the wrapper descriptor contains a pointer to slotdef structure. This replaces lots of custom code from add_operators() by a loop over the slotdefs array, and does away with all the tab_xxx tables.	2001-10-21 00:44:31 +00:00
Thomas Heller	fdc1bd305b	Fix for Bug #216405 : use the correct base for a buffer object in _PyBuffer_FromObject.	2001-10-19 13:49:35 +00:00
Marc-André Lemburg	b5507ecd3c	Additional test and documentation for the unicode() changes. This patch should also be applied to the 2.2b1 trunk.	2001-10-19 12:02:29 +00:00
Guido van Rossum	b8c65bc27f	SF patch #470578 : Fixes to synchronize unicode() and str() This patch implements what we have discussed on python-dev late in September: str(obj) and unicode(obj) should behave similar, while the old behaviour is retained for unicode(obj, encoding, errors). The patch also adds a new feature with which objects can provide unicode(obj) with input data: the __unicode__ method. Currently no new tp_unicode slot is implemented; this is left as option for the future. Note that PyUnicode_FromEncodedObject() no longer accepts Unicode objects as input. The API name already suggests that Unicode objects do not belong in the list of acceptable objects and the functionality was only needed because PyUnicode_FromEncodedObject() was being used directly by unicode(). The latter was changed in the discussed way: * unicode(obj) calls PyObject_Unicode() * unicode(obj, encoding, errors) calls PyUnicode_FromEncodedObject() One thing left open to discussion is whether to leave the PyUnicode_FromObject() API as a thin API extension on top of PyUnicode_FromEncodedObject() or to turn it into a (macro) alias for PyObject_Unicode() and deprecate it. Doing so would have some surprising consequences though, e.g. u"abc" + 123 would turn out as u"abc123"... [Marc-Andre didn't have time to check this in before the deadline. I hope this is OK, Marc-Andre! You can still make changes and commit them on the trunk after the branch has been made, but then please mail Barry a context diff if you want the change to be merged into the 2.2b1 release branch. GvR]	2001-10-19 02:01:31 +00:00
Guido van Rossum	e82f75aa20	Missing file structseq.c for SF patch #462296	2001-10-18 20:47:51 +00:00
Fred Drake	31f4d1fa4b	Remove an unnecessary check for NULL.	2001-10-18 19:21:46 +00:00
Fred Drake	73006d0237	When weakref proxies are involved in binary & ternary slot operations, the left-hand operand may not be the proxy in all cases. If it isn't, we end up doing two things: a) unwrapping something that isn't a PyWeakReference (later resulting in a core dump) and b) passing a proxy as the right-hand operand anyway, even though that can't be handled by the actual handler (maybe eventually causing a core dump). This is fixed by always unwrapping all the proxies involved before passing anything to the actual handler.	2001-10-18 18:04:18 +00:00
Guido van Rossum	f76de62f7d	Fix SF bug #472234 : type(obj) calls type->tp_init (Roeland Rengelink) The fix is a band-aid: type_call() now makes the same exception for a single-argument call to type() as type_new() was already making.	2001-10-18 15:49:21 +00:00
Neil Schemenauer	6b47129424	Fix error checking done by abstract_issubclass and abstract_isinstance. isinstance() now allows any object as the first argument and a class, a type or something with a __bases__ tuple attribute for the second argument. This closes SF patch #464992.	2001-10-18 03:18:43 +00:00
Guido van Rossum	915f0eb212	Protect references to tp_descr_get and tp_dict with the appropriate test: PyType_HasFeature(t, Py_TPFLAGS_HAVE_CLASS).	2001-10-17 20:26:38 +00:00
Guido van Rossum	14a6f8378e	Remove a bunch of stuff that's no longer needed now that update_slot() and fixup_slot_dispatchers() always select the proper slot dispatcher. This affects slot_sq_item(), slot_tp_getattro(), and slot_tp_getattr_hook().	2001-10-17 13:59:09 +00:00
Guido van Rossum	caf59043d1	slot_sq_item(): ensure that self is an instance of the wrapper's d_type before calling the wrapped function. fixup_slot_dispatchers(): fix indentation.	2001-10-17 07:15:43 +00:00
Tim Peters	c993315b18	SF bug [#468061 ] __str__ ignored in str subclass. object.c, PyObject_Str: Don't try to optimize anything except exact string objects here; in particular, let str subclasses go thru tp_str, same as non-str objects. This allows overrides of tp_str to take effect. stringobject.c: + string_print (str's tp_print): If the argument isn't an exact string object, get one from PyObject_Str. + string_str (str's tp_str): Make a genuine-string copy of the object if it's of a proper str subclass type. str() applied to a str subclass that doesn't override __str__ ends up here. test_descr.py: New str_of_str_subclass() test.	2001-10-16 20:18:24 +00:00
Guido van Rossum	b85a8b7bc7	Refactored the update_slot() code a bit to be hopefully slightly more efficient: - recurse down subclasses only once rather than for each affected slot; - short-circuit recursing down subclasses when a subclass has its own definition of the name that caused the update_slot() calls in the first place; - inline collect_ptrs().	2001-10-16 17:00:48 +00:00
Guido van Rossum	687ae00460	Get rid of __defined__ and tp_defined -- there's no need to distinguish __dict__ and __defined__ any more. In the C structure, tp_cache takes its place -- but this hasn't been implemented yet.	2001-10-15 22:03:32 +00:00
Guido van Rossum	2f3ca6eeb6	Completely get rid of __dynamic__ and the corresponding Py_TPFLAGS_DYNAMICTYPE bit. There is no longer a performance benefit, and I don't really see the use case any more.	2001-10-15 21:05:10 +00:00
Guido van Rossum	825d875371	Add (void *) casts to solve some problems on HP-UX 11.0, as discussed on SF bug #467145.	2001-10-15 19:44:24 +00:00
Guido van Rossum	d396b9c9c3	Redid the slot computation. The initial slot assignments are now done using the same algorithm as the slot updates. The slotdefs array is now sorted by slot offset and has an interned string object corresponding to the name added to each item. More can be done but I need to commit this first as a working intermediate stage.	2001-10-13 20:02:41 +00:00
Fred Drake	2bae4face2	Remove extra "]" in splitlines() docstring. Reported by Neal Norwitz.	2001-10-13 15:57:55 +00:00
Guido van Rossum	79fd0fcae4	Band-aid solution to SF bug #470634 : readlines() on linux requires 2 ^D's. The problem is that if fread() returns a short count, we attempt another fread() the next time through the loop, and apparently glibc clears or ignores the eof condition so the second fread() requires another ^D to make it see the eof condition. According to the man page (and the C std, I hope) fread() can only return a short count on error or eof. I'm using that in the band-aid solution to avoid calling fread() a second time after a short read. Note that xreadlines() still has this problem: it calls readlines(sizehint) until it gets a zero-length return. Since xreadlines() is mostly used for reading real files, I won't worry about this until we get a bug report.	2001-10-12 20:01:53 +00:00
Guido van Rossum	5af588b7f0	Now that COPYBUF is a new local macro, add #undef COPYBUF.	2001-10-12 14:13:21 +00:00
Tim Peters	fc57ccb982	SF bug [#470040 ] ParseTuple t# vs subclasses. inherit_slots(): tp_as_buffer was getting inherited as if it were a method pointer, rather than a pointer to a vector of method pointers. As a result, inheriting from a type that implemented buffer methods was ineffective, leaving all the tp_as_buffer slots NULL in the subclass.	2001-10-12 02:38:24 +00:00
Guido van Rossum	875eeaa193	Another step in the right direction: when a new class's attribute corresponding to a dispatch slot (e.g. __getitem__ or __add__) is set, calculate the proper dispatch slot and propagate the change to all subclasses. Because of multiple inheritance, there's no easy way to avoid always recursing down the tree of subclasses. Who cares? (There's more to do, but this works. There's also a test for this now.)	2001-10-11 18:33:53 +00:00
Jack Jansen	2771b5b52b	Rather gross workaround for a bug in the mac GUSI I/O library: lseek(fp, 0L, SEEK_CUR) can make a filedescriptor unusable. This workaround is expected to last only a few weeks (until GUSI is fixed), but without it test_email fails.	2001-10-10 22:03:27 +00:00
Guido van Rossum	fd38f8e638	The slot definition table entry for mp_getitem had a bogus wrapper function, which caused test_minidom to fail. Fixed this.	2001-10-09 20:17:57 +00:00
Guido van Rossum	7b9144b2ee	Halfway checkin. This is still messy, but it's beginning to address the problem that slots weren't inherited properly. override_slots() no longer exists; in its place comes fixup_slot_dispatchers() which does more and different work and is table-based. (Eventually I want this table also to replace all the little tab_foo tables.) Also add a wrapper for __delslice__; this required a change in test_descrtut.py.	2001-10-09 19:39:46 +00:00
Guido van Rossum	0eb2a6e974	It turned out not so difficult to support old-style numbers (those without the Py_TPFLAGS_CHECKTYPES flag) in the wrappers. This required a few changes in test_descr.py to cope with the fact that the complex type has __int__, __long__ and __float__ methods that always raise an exception.	2001-10-09 11:07:24 +00:00
Tim Peters	44383384b3	type_subclasses(): debug build was broken due to typo in new assert().	2001-10-08 16:49:26 +00:00
Guido van Rossum	1c45073aba	Keep track of a type's subclasses (subtypes), in tp_subclasses, which is a list of weak references to types (new-style classes). Make this accessible to Python as the function __subclasses__ which returns a list of types -- we don't want Python programmers to be able to manipulate the raw list. In order to make this possible, I also had to add weak reference support to type objects. This will eventually be used together with a trap on attribute assignment for dynamic classes for a major speed-up without losing the dynamic properties of types: when a __foo__ method is added to a class, the class and all its subclasses will get an appropriate tp_foo slot function.	2001-10-08 15:18:27 +00:00
Guido van Rossum	03290ecbf1	Implement isinstance(x, (A, B, ...)). Note that we only allow tuples, not other sequences (then we'd have to except strings, and we'd still be susceptible to recursive attacks).	2001-10-07 20:54:12 +00:00
Tim Peters	f2a67daca2	Guido suggests, and I agree, to insist that SIZEOF_VOID_P be a power of 2. This simplifies the rounding in _PyObject_VAR_SIZE, allows to restore the pre-rounding calling sequence, and allows some nice little simplifications in its callers. I'm still making it return a size_t, though.	2001-10-07 03:54:51 +00:00
Tim Peters	6d483d3477	_PyObject_VAR_SIZE: always round up to a multiple-of-pointer-size value. As Guido suggested, this makes the new subclassing code substantially simpler. But the mechanics of doing it w/ C macro semantics are a mess, and _PyObject_VAR_SIZE has a new calling sequence now. Question: The PyObject_NEW_VAR macro appears to be part of the public API. Regardless of what it expands to, the notion that it has to round up the memory it allocates is new, and extensions containing the old PyObject_NEW_VAR macro expansion (which was embedded in the PyObject_NEW_VAR expansion) won't do this rounding. But the rounding isn't actually needed except for new-style instances with dict pointers after a variable-length blob of embedded data. So my guess is that we do not need to bump the API version for this (as the rounding isn't needed for anything an extension can do unless it's recompiled anyway). What's your guess?	2001-10-06 21:27:34 +00:00
Tim Peters	406fe3b1c0	Repaired the debug Windows deaths in test_descr, by allocating enough pad memory to properly align the __dict__ pointer in all cases. gcmodule.c/objimpl.h, _PyObject_GC_Malloc: + Added a "padding" argument so that this flavor of malloc can allocate enough bytes for alignment padding (it can't know this is needed, but its callers do). typeobject.c, PyType_GenericAlloc: + Allocated enough bytes to align the __dict__ pointer. + Sped and simplified the round-up-to-PTRSIZE logic. + Added blank lines so I could parse the if/else blocks <0.7 wink>.	2001-10-06 19:04:01 +00:00
Tim Peters	7254e5a3ed	_PyObject_GetDictPtr(): + Use the _PyObject_VAR_SIZE macro to compute object size. + Break the computation into lines convenient for debugger inspection. + Speed the round-up-to-pointer-size computation.	2001-10-06 17:45:17 +00:00
Fred Drake	b3f0d349b6	PyObject_ClearWeakRefs() is now a real function instead of a function pointer; the implementation is in Objects/weakrefobject.c.	2001-10-05 21:58:11 +00:00
Fred Drake	8844d5264f	The weak reference implementation, separated from the weakref module.	2001-10-05 21:52:26 +00:00
Guido van Rossum	9475a2310d	Enable GC for new-style instances. This touches lots of files, since many types were subclassable but had a xxx_dealloc function that called PyObject_DEL(self) directly instead of deferring to self->ob_type->tp_free(self). It is permissible to set tp_free in the type object directly to _PyObject_Del, for non-GC types, or to _PyObject_GC_Del, for GC types. Still, PyObject_DEL was a tad faster, so I'm fearing that our pystone rating is going down again. I'm not sure if doing something like void xxx_dealloc(PyObject *self) { if (PyXxxCheckExact(self)) PyObject_DEL(self); else self->ob_type->tp_free(self); } is any faster than always calling the else branch, so I haven't attempted that -- however those types whose own dealloc is fancier (int, float, unicode) do use this pattern.	2001-10-05 20:51:39 +00:00
Guido van Rossum	50fda3ba26	Make new classes dynamic by default.	2001-10-04 19:46:06 +00:00
Tim Peters	59f809d3bc	type_new(): cast PyObject_MALLOC's result to char*, for clarity.	2001-10-04 05:43:02 +00:00
Tim Peters	2f93e28a19	SF bug [#467331 ] ClassType.__doc__ always None. For a dynamically constructed type object, fill in the tp_doc slot with a copy of the argument dict's "__doc__" value, provided the latter exists and is a string. NOTE: I don't know what to do if it's a Unicode string, so in that case tp_doc is left NULL (which shows up as Py_None if you do Class.__doc__). Note that tp_doc holds a char, not a general PyObject.	2001-10-04 05:27:00 +00:00
Guido van Rossum	1e1de1cf35	typeobject.c, slot_tp_gettattr_hook(): fix the speedup hack -- the test for getattribute==NULL was bogus because it always found object.__getattribute__. Pick it apart using the trick we learned from slot_sq_item, and if it's just a wrapper around PyObject_GenericGetAttr, zap it. Also added a long XXX comment explaining the consequences.	2001-10-03 13:58:35 +00:00
Guido van Rossum	f4593e0b65	EXPERIMENTAL speedup of slot_sq_item. This sped up the following test dramatically: class T(tuple): __dynamic__ = 1 t = T(range(1000)) for i in range(1000): tt = tuple(t) The speedup was about 5x compared to the previous state of CVS (1.7 vs. 8.8, in arbitrary time units). But it's still more than twice as slow as as the same test with __dynamic__ = 0 (0.8). I'm not sure that I really want to go through the trouble of this kind of speedup for every slot. Even doing it just for the most popular slots will be a major effort (the new slot_sq_item is 40+ lines, while the old one was one line with a powerful macro -- unfortunately the speedup comes from expanding the macro and doing things in a way specific to the slot signature). An alternative that I'm currently considering is sketched in PLAN.txt: trap setattr on type objects. But this will require keeping track of all derived types using weak references.	2001-10-03 12:09:30 +00:00
Guido van Rossum	da21c0110b	call_method(), call_maybe(): fix a performance bug: the argument pointing to a static variable to hold the object form of the string was never used, causing endless calls to PyString_InternFromString(). One particular test (with lots of __getitem__ calls) became a third faster with this!	2001-10-03 00:50:18 +00:00
Tim Peters	c15c4f1f39	SF bug [#467265 ] Compile errors on SuSe Linux on IBM/s390. Unknown whether this fixes it. - stringobject.c, PyString_FromFormatV: don't assume that va_list is of a type that can be copied via an initializer. - errors.c, PyErr_Format: add a va_end() to balance the va_start().	2001-10-02 21:32:07 +00:00
Guido van Rossum	048eb75c2d	Add Garbage Collection support to new-style classes (not yet to their instances). Also added GC support to various auxiliary types: super, property, descriptors, wrappers, dictproxy. (Only type objects have a tp_clear field; the other types are.) One change was necessary to the GC infrastructure. We have statically allocated type objects that don't have a GC header (and can't easily be given one) and heap-allocated type objects that do have a GC header. Giving these different metatypes would be really ugly: I tried, and I had to modify pickle.py, cPickle.c, copy.py, add a new invent a new name for the new metatype and make it a built-in, change affected tests... In short, a mess. So instead, we add a new type slot tp_is_gc, which is a simple Boolean function that determines whether a particular instance has GC headers or not. This slot is only relevant for types that have the (new) GC flag bit set. If the tp_is_gc slot is NULL (by far the most common case), all instances of the type are deemed to have GC headers. This slot is called by the PyObject_IS_GC() macro (which is only used twice, both times in gcmodule.c). I also changed the extern declarations for a bunch of GC-related functions (_PyObject_GC_Del etc.): these always exist but objimpl.h only declared them when WITH_CYCLE_GC was defined, but I needed to be able to reference them without #ifdefs. (When WITH_CYCLE_GC is not defined, they do the same as their non-GC counterparts anyway.)	2001-10-02 21:24:57 +00:00
Guido van Rossum	55f2099b2f	Miscellaneous code fiddling: - SLOT1BINFULL() macro: changed this to check for __rop__ overriding __op__, like binary_op1() in abstract.c -- the latter only calls the slot function once if both types use the same slot function, so the slot function must make both calls -- which it already did for the __op__, __rop__ order, but not yet for the __rop__, __op__ order when B.__class__ is a subclass of A.__class__. - slot_sq_contains(), slot_nb_nonzero(): use lookup_maybe() rather than lookup_method() which sets an exception which we then clear. - slot_nb_coerce(): don't give up when left argument's __coerce__ returns NotImplemented, but give the right argument a chance.	2001-10-01 17:18:22 +00:00
Guido van Rossum	89c4264792	binary_op1(), ternary_op(): rearrange the code so that slotw is tested (to see whether __rop__ should go before __op__) only when slotv is set. This saves a test+branch when only slotw is set.	2001-10-01 17:10:18 +00:00
Guido van Rossum	2611162345	slot_sq_length(): squash a leak.	2001-10-01 16:42:49 +00:00
Guido van Rossum	25d1807d23	slot_tp_new(): newargs was leaking.	2001-10-01 15:55:28 +00:00
Guido van Rossum	d016e45fdb	Fix typo found by doerwalter.	2001-10-01 13:17:24 +00:00
Tim Peters	8b13b3ede2	SF bug [#466173 ] unpack TypeError unclear Replaced 3 instances of "iter() of non-sequence" with "iteration over non-sequence". Restored "unpack non-sequence" for stuff like "a, b = 1".	2001-09-30 05:58:42 +00:00
Tim Peters	d38b1c74f3	SF [#466125 ] PyLong_AsLongLong works for any integer. Generalize PyLong_AsLongLong to accept int arguments too. The real point is so that PyArg_ParseTuple's 'L' code does too. That code was undocumented (AFAICT), so documented it.	2001-09-30 05:09:37 +00:00
Guido van Rossum	84675acb49	The changes to ternary_op could cause a core dump. Fix this, and rewrite the code a bit to avoid calling the same slot more than once.	2001-09-29 01:05:03 +00:00
Guido van Rossum	4bb1e36eec	It's a fact: for binary operators, under certain circumstances, __rop__ now takes precendence over __op__. Those circumstances are: - Both arguments are new-style classes - Both arguments are new-style numbers - Their implementation slots for tp_op differ - Their types differ - The right argument's type is a subtype of the left argument's type Also did this for the ternary operator (pow) -- only the binary case is dealt with properly though, since __rpow__ is not supported anyway.	2001-09-28 23:49:48 +00:00
Guido van Rossum	9bea3abf0d	Ouch. The wrapper for __rpow__ was the same as for __pow__, resulting in bizarre outcomes. Test forthcoming.	2001-09-28 22:58:52 +00:00
Guido van Rossum	2ed6bf87c9	Merge branch changes (coercion, rich comparisons) into trunk.	2001-09-27 20:30:07 +00:00
Guido van Rossum	874f15aa28	add_operators(): the __floordiv__ and __truediv__ descriptors (and their 'i' and 'r' variants) were not being generated if the corresponding nb_ slots were present in the type object. I bet this is because floor and true division were introduced after I last looked at that part of the code.	2001-09-25 21:16:33 +00:00
Guido van Rossum	3926a63d05	- Provisional support for pickling new-style objects. () - Made cls.__module__ writable. - Ensure that obj.__dict__ is returned as {}, not None, even upon first reference; it simply springs into life when you ask for it. () The pickling support is provisional for the following reasons: - It doesn't support classes with __slots__. - It relies on additional support in copy_reg.py: the C method __reduce__, defined in the object class, really calls calling copy_reg._reduce(obj). Eventually the Python code in copy_reg.py needs to be migrated to C, but I'd like to experiment with the Python implementation first. The _reduce() code also relies on an additional helper function, _reconstructor(), defined in copy_reg.py; this should also be reimplemented in C.	2001-09-25 16:25:58 +00:00
Guido van Rossum	a4cb78874c	Change repr() of a new-style class to say <class 'ClassName'> rather than <type 'ClassName'>. Exception: if it's a built-in type or an extension type, continue to call it <type 'ClassName>. Call me a wimp, but I don't want to break more user code than necessary.	2001-09-25 03:56:29 +00:00
Guido van Rossum	5c294fb0e6	Make __class__ assignment possible, when the object structures are the same. I hope the test for structural equivalence is stringent enough. It only allows the assignment if the old and new types: - have the same basic size - have the same item size - have the same dict offset - have the same weaklist offset - have the same GC flag bit - have a common base that is the same except for maybe the dict and weaklist (which may have been added separately at the same offsets in both types)	2001-09-25 03:43:42 +00:00
Tim Peters	66c1a525e0	Make properties discoverable from Python: - property() now takes 4 keyword arguments: fget, fset, fdel, doc. Note that the real purpose of the 'f' prefix is to make fdel fit in ('del' is a keyword, so can't used as a keyword argument name). - These map to visible readonly attributes 'fget', 'fset', 'fdel', and '__doc__' in the property object. - fget/fset/fdel weren't discoverable from Python before. - __doc__ is new, and allows to associate a docstring with a property.	2001-09-24 21:17:50 +00:00
Guido van Rossum	3d45d8f12e	Another comparison patch-up: comparing a type with a dynamic metatype to one with a static metatype raised an obscure error.	2001-09-24 18:47:40 +00:00
Guido van Rossum	2205642fe0	Do the same thing to complex that I did to str: the rich comparison function returns NotImplemented when comparing objects whose tp_richcompare slot is not itself.	2001-09-24 17:52:04 +00:00
Guido van Rossum	bb77e6801e	Change string comparison so that it applies even when one (or both) arguments are subclasses of str, as long as they don't override rich comparison.	2001-09-24 16:51:54 +00:00
Guido van Rossum	ff0e6d6ef5	Fix the baffler that Tim reported: sometimes the repr() of an object looks like <X object at ...>, sometimes it says <X instance at ...>. Make this uniformly say <X object at ...>.	2001-09-24 16:03:59 +00:00
Tim Peters	2c9aa5ea8d	Generalize file.writelines() to allow iterable objects.	2001-09-23 04:06:05 +00:00
Guido van Rossum	19c1cd5b35	Add the __getattr__ hook back. The rules are now: - if __getattribute__ exists, it is called first; if it doesn't exists, PyObject_GenericGetAttr is called first. - if the above raises AttributeError, and __getattr__ exists, it is called.	2001-09-21 21:24:49 +00:00
Guido van Rossum	867a8d2e26	Change the name of the __getattr__ special method for new-style classes to __getattribute__, to make it crystal-clear that it doesn't have the same semantics as overriding __getattr__ on classic classes. This is a halfway checkin -- I'll proceed to add a __getattr__ hook that works the way it works in classic classes.	2001-09-21 19:29:08 +00:00
Guido van Rossum	ad9744a67a	Fix a bug in rendering of \\ by repr() -- it rendered as \\\ instead of \\.	2001-09-21 15:38:17 +00:00
Guido van Rossum	32d34c809f	Add optional docstrings to getset descriptors. Fortunately, there's no backwards compatibility to worry about, so I just pushed the 'closure' struct member to the back -- it's never used in the current code base (I may eliminate it, but that's more work because the getter and setter signatures would have to change.) As examples, I added actual docstrings to the getset attributes of a few types: file.closed, xxsubtype.spamdict.state.	2001-09-20 21:45:26 +00:00
Guido van Rossum	6f7993765a	Add optional docstrings to member descriptors. For backwards compatibility, this required all places where an array of "struct memberlist" structures was declared that is referenced from a type's tp_members slot to change the type of the structure to PyMemberDef; "struct memberlist" is now only used by old code that still calls PyMember_Get/Set. The code in PyObject_GenericGetAttr/SetAttr now calls the new APIs PyMember_GetOne/SetOne, which take a PyMemberDef argument. As examples, I added actual docstrings to the attributes of a few types: file, complex, instance method, super, and xxsubtype.spamlist. Also converted the symtable to new style getattr.	2001-09-20 20:46:19 +00:00
Marc-André Lemburg	3508e30861	Fix Unicode .join() method to raise a TypeError for sequence elements which are not Unicode objects or strings. (This matches the string.join() behaviour.) Fix a memory leak in the .join() method which occurs in case the Unicode resize fails. Restore the test_unicode output.	2001-09-20 17:22:58 +00:00
Guido van Rossum	dd4d1c4f5d	_PyObject_GetDictPtr(): when the offset is negative, always align -- we can't trust that tp_basicsize is aligned. Fixes SF bug #462848.	2001-09-20 13:38:22 +00:00
Marc-André Lemburg	6871f6ac57	Implement the changes proposed in patch #413333 . unicode(obj) now works just like str(obj) in that it tries __str__/tp_str on the object in case it finds that the object is not a string or buffer.	2001-09-20 12:53:16 +00:00
Marc-André Lemburg	c60e6f7771	Patch #435971 : UTF-7 codec by Brian Quinlan.	2001-09-20 10:35:46 +00:00
Tim Peters	efc3a3af3b	SF bug [#463093 ] File methods need doc strings. Now they don't.	2001-09-20 07:55:22 +00:00
Martin v. Löwis	2777c021fc	Patch #462849 : Pass Unicode objects to file's .write method.	2001-09-19 13:47:32 +00:00
Guido van Rossum	1952e388ca	Add additional coercion support for "self subtypes" to int, long, float (compare the recent checkin to complex). Added tests for these.	2001-09-19 01:25:16 +00:00
Guido van Rossum	638059603c	complex_coerce(): add explicit PyComplex_Check() test. Previously, complex_coerce() would never be called with a complex argument, because PyNumber_Coerce[Ex] doesn't bother calling the type's coercion method if the values already have the same type. But now, of course, it's possible to pass an instance of a complex subtype, and those must be accepted.	2001-09-19 01:13:10 +00:00
Guido van Rossum	ab3b0343b8	Hopefully fix 3-way comparisons. This unfortunately adds yet another hack, and it's even more disgusting than a PyInstance_Check() call. If the tp_compare slot is the slot used for overrides in Python, it's always called. Add some tests that show what should work too.	2001-09-18 20:38:53 +00:00
Martin v. Löwis	cf95f9cacb	Properly repr classes without module names.	2001-09-18 20:23:28 +00:00
Guido van Rossum	ceccae5365	wrap_cmpfunc(): added a safety check for the __cmp__ wrapper. We can only safely call a type's tp_compare slot if the second argument is also an instance of the same type. I hate to think what e.g. int_compare() would do with a second argument that's a float!	2001-09-18 20:03:57 +00:00
Guido van Rossum	f0b35e1501	Redo the PyMethod attributes using a dir()-friendly approach, creating descriptors for each attribute. The getattr() implementation is similar to PyObject_GenericGetAttr(), but delegates to im_self instead of looking in __dict__; I couldn't do this as a wrapper around PyObject_GenericGetAttr(). XXX A problem here is that this is a case of delegation. dir() doesn't see exactly the same attributes that are actually defined; e.g. if the delegate is a Python function object, it supports attributes like func_code etc., but these are not visible to dir(); on the other hand, dynamic function attributes (stored in the function's __dict__) are visible to dir(). Maybe we need a mechanism to tell dir() about the delegation mechanism? I vaguely recall seeing a request in the newsgroup for a more formal definition of attribute delegation too. Sigh, time for a new PEP.	2001-09-18 03:53:24 +00:00
Tim Peters	26f68f5957	type_new(): Didn't compile anymore, due to change in struct memberlist definition. Guido, what else did you forget to check in <wink>?	2001-09-18 00:23:33 +00:00
Guido van Rossum	d9d1d4ac6f	Rewrite function attributes to use the generic routines properly. This uses the new "restricted" feature of structmember, and getset descriptors for some of the type checks.	2001-09-17 23:46:56 +00:00
Tim Peters	305b5857f6	PyObject_Dir(): Merge in __members__ and __methods__ too (if they exist, and are lists, and then just the string elements (if any)). There are good and bad reasons for this. The good reason is to support dir() "like before" on objects of extension types that haven't migrated to the class introspection API yet. The bad reason is that Python's own method objects are such a type, and this is the quickest way to get their im_self etc attrs to "show up" via dir(). It looks much messier to move them to the new scheme, as their current getattr implementation presents a view of their attrs that's a untion of their own attrs plus their im_func's attrs. In particular, methodobject.__dict__ actually returns methodobject.im_func.__dict__, and if that's important to preserve it doesn't seem to fit the class introspection model at all.	2001-09-17 02:38:46 +00:00
Tim Peters	bc7e863ce2	merge_class_dict(): Clear the error if __bases__ doesn't exist.	2001-09-16 20:33:22 +00:00
Guido van Rossum	7e35d57c0c	A fix for SF bug #461546 (bug in long_mul). Both int and long multiplication are changed to be more careful in their assumptions about when one of the arguments is a sequence: the assumption that at least one of the arguments must be an int (or long, respectively) is still held, but the assumption that these don't smell like sequences is no longer true: a subtype of int or long may well have a sequence-repeat thingie!	2001-09-15 03:14:32 +00:00
Guido van Rossum	a8c60f478c	tp_new_wrapper(): A subtle change in the check for safe use. Allow staticbase != type, as long as their tp_new slots are the same.	2001-09-14 19:43:36 +00:00
Guido van Rossum	f21c6be7bd	Add call_maybe(): a variant of call_method() that returns NotImplemented when the lookup fails, and use this for binary operators. Also lookup_maybe() which doesn't raise an exception when the lookup fails (still returning NULL).	2001-09-14 17:51:50 +00:00
Guido van Rossum	717ce00c7c	call_method(): - Don't turn a non-tuple argument into a one-tuple. Rather, the caller must pass a format that causes Py_VaBuildValue() to return a tuple. - Speed things up by calling PyObject_Call (which is fairly low-level and straightforward) rather than PyObject_CallObject (which calls PyEval_CallObjectWithKeywords which calls PyObject_Call, and nothing is really done in the mean time except some tests for NULL args and valid types, which are already guaranteed). - Cosmetics. Other places: - Make sure that the format argument to call_method() is surrounded by parentheses, so it will cause a tuple to be created. - Replace a few calls to PyEval_CallObject() with a surefire tuple for args to calls to PyObject_Call(). (A few calls to PyEval_CallObject() remain that have NULL for args.)	2001-09-14 16:58:08 +00:00
Guido van Rossum	5560b7492c	PyObject_CallObject(): this may as well call PyEval_CallObject() directly, as the only thing done here (replace NULL args with an empty tuple) is also done there. XXX Maybe we should take one step further and equate the two at the macro level? That's harder though because PyEval_Call* is declared in a header that's not included standard. But it is silly that PyObject_CallObject calls PyEval_CallObject which calls back to PyObject_Call. Maybe PyEval_CallObject should be moved into this file instead? All I know is that there are too many call APIs! The differences between PyObject_Call and PyEval_CallObjectWithKeywords is that the latter allows args to be NULL, and does explicit type checks for args and kwds.	2001-09-14 16:47:50 +00:00
Guido van Rossum	5f5512d246	_PyObject_Dump(): print the type of the object. This is by far the most frequently interesting information IMO. Also tidy up the output.	2001-09-14 15:50:08 +00:00
Tim Peters	4441001b56	The end of [#460467 ] file objects should be subclassable. A surprising number of changes to split tp_new into tp_new and tp_init. Turned out the older PyFile_FromFile() didn't initialize the memory it allocated in all (error) cases, which caused new sanity asserts elsewhere to fail left & right (and could have, e.g., caused file_dealloc to try decrefing random addresses).	2001-09-14 03:26:08 +00:00
Tim Peters	0ab085c4cb	Changed the dict implementation to take "string shortcuts" only when keys are true strings -- no subclasses need apply. This may be debatable. The problem is that a str subclass may very well want to override __eq__ and/or __hash__ (see the new example of case-insensitive strings in test_descr), but go-fast shortcuts for strings are ubiquitous in our dicts (and subclass overrides aren't even looked for then). Another go-fast reason for the change is that PyCheck_StringExact() is a quicker test than PyCheck_String(), and we make such a test on virtually every access to every dict. OTOH, a str subclass may also be perfectly happy using the base str eq and hash, and this change slows them a lot. But those cases are still hypothetical, while Python's own reliance on true-string dicts is not.	2001-09-14 00:25:33 +00:00
Tim Peters	742dfd6f17	Get rid of builtin_open() entirely (the C code and docstring, not the builtin function); Guido pointed out that it could be just another name in the __builtin__ dict for the file constructor now.	2001-09-13 21:49:44 +00:00
Tim Peters	8fa45677c1	Now that file objects are subclassable, you can get at the file constructor just by doing type(f) where f is any file object. This left a hole in restricted execution mode that rexec.py can't plug by itself (although it can plug part of it; the rest is plugged in fileobject.c now).	2001-09-13 21:01:29 +00:00
Tim Peters	3f996e7266	type_call(): Change in policy. The keyword args (if any) are now passed on to the tp_new slot (if non-NULL), as well as to the tp_init slot (if any). A sane type implementing both tp_new and tp_init should probably pay attention to the arguments in only one of them.	2001-09-13 19:18:27 +00:00
Skip Montanaro	bafedecc06	based upon a suggestion in c.l.py, this slight expansion of the OverflowError message seems reasonable.	2001-09-13 19:05:30 +00:00
Tim Peters	59c9a645e2	SF bug [#460467 ] file objects should be subclassable. Preliminary support. What's here works, but needs fine-tuning.	2001-09-13 05:38:56 +00:00
Tim Peters	2400fa4ad1	Again perhaps the end of [#460020 ] bug or feature: unicode() and subclasses. Inhibited complex unary plus optimization when applied to a complex subtype. Added PyComplex_CheckExact macro. Some comments and minor code fiddling.	2001-09-12 19:12:49 +00:00
Tim Peters	111f60964e	If interning an instance of a string subclass, intern a real string object with the same value instead. This ensures that a string (or string subclass) object's ob_sinterned pointer is always a str (or NULL), and that the dict of interned strings only has strs as keys.	2001-09-12 07:54:51 +00:00
Tim Peters	af90b3e610	str_subtype_new, unicode_subtype_new: + These were leaving the hash fields at 0, which all string and unicode routines believe is a legitimate hash code. As a result, hash() applied to str and unicode subclass instances always returned 0, which in turn confused dict operations, etc. + Changed local names "new"; no point to antagonizing C++ compilers.	2001-09-12 05:18:58 +00:00
Tim Peters	7a29bd5861	More on bug 460020: disable many optimizations of unicode subclasses.	2001-09-12 03:03:31 +00:00
Tim Peters	8fa5dd0601	More bug 460020: lots of string optimizations inhibited for string subclasses, all "the usual" ones (slicing etc), plus replace, translate, ljust, rjust, center and strip. I don't know how to be sure they've all been caught. Question: Should we complain if someone tries to intern an instance of a string subclass? I hate to slow any code on those paths.	2001-09-12 02:18:30 +00:00
Tim Peters	40c397dd56	long_invert(): tiny speed and space optimization.	2001-09-11 23:24:22 +00:00
Tim Peters	69c2de3ad6	More bug 460020. Disable a number of long optimizations for long subclasses.	2001-09-11 22:31:33 +00:00
Tim Peters	0280cf79a7	More bug 460020: when F is a subclass of float, disable the unary plus optimization (+F(whatever)).	2001-09-11 21:53:35 +00:00
Tim Peters	73a1dfe367	More bug 460020. When I is a subclass of int, disable the +I(whatever), I(0) << whatever, I(0) >> whatever, I(whatever) << 0 and I(whatever) >> 0 optimizations.	2001-09-11 21:44:14 +00:00
Tim Peters	7b07a41e9f	The endless 460020 bug. Disable t[:], t0, t1 optimizations when t is of a tuple subclass type.	2001-09-11 19:48:03 +00:00
Guido van Rossum	dea6ef9bfd	Replace a few places where X->ob_type was compared to &PyXXX_Type with calls to PyXXX_CheckExact(X).	2001-09-11 16:13:52 +00:00
Tim Peters	78e0fc74bc	Possibly the end of SF [#460020 ] bug or feature: unicode() and subclasses. Changed unicode(i) to return a true Unicode object when i is an instance of a unicode subclass. Added PyUnicode_CheckExact macro.	2001-09-11 03:07:38 +00:00
Tim Peters	0ebeb584a4	PyUnicode_FromEncodedObject(): Repair memory leak in an error case.	2001-09-11 02:00:50 +00:00
Tim Peters	5a49ade70e	More on SF bug [#460020 ] bug or feature: unicode() and subclasses. Repaired str(i) to return a genuine string when i is an instance of a str subclass. New PyString_CheckExact() macro.	2001-09-11 01:41:59 +00:00
Tim Peters	8ff70a9606	Fix tortured comment -- I must be on drugs today.	2001-09-10 23:53:53 +00:00
Tim Peters	4c3a0a35cd	More on SF bug [#460020 ] bug or feature: unicode() and subclasses. tuple(i) repaired to return a true tuple when i is an instance of a tuple subclass. Added PyTuple_CheckExact macro. PySequence_Tuple(): if a tuple-like object isn't exactly a tuple, it's not safe to return the object as-is -- make a new tuple of it instead.	2001-09-10 23:37:46 +00:00
Tim Peters	7a50f2536e	More for SF bug [#460020 ] bug or feature: unicode() and subclasses Repair float constructor to return a true float when passed a subclass instance. New PyFloat_CheckExact macro.	2001-09-10 21:28:20 +00:00
Tim Peters	64b5ce3a69	SF bug #460020 : bug or feature: unicode() and subclasses. Given an immutable type M, and an instance I of a subclass of M, the constructor call M(I) was just returning I as-is; but it should return a new instance of M. This fixes it for M in {int, long}. Strings, floats and tuples remain to be done. Added new macros PyInt_CheckExact and PyLong_CheckExact, to more easily distinguish between "is" and "is a" (i.e., only an int passes PyInt_CheckExact, while any sublass of int passes PyInt_Check). Added private API function _PyLong_Copy.	2001-09-10 20:52:51 +00:00
Guido van Rossum	8b4e43e768	_portable_fseek(): Subtlety on Windows: if we change test_largefile.py to use a file > 4GB, it still fails. A debug session suggests this is because fseek(fp, 0, 2) refuses to seek to the end of the file when the file is > 4GB, because it uses the SetFilePointer() in 32-bit mode. But it only fails when we seek relative to the end of the file, because in the other seek modes only calls to fgetpos() and fsetpos() are made, which use Get/SetFilePointer() in 64-bit mode. Solution: #ifdef MS_WInDOWS, replace the call to fseek(fp, ...) with a call to _lseeki64(fileno(fp), ...). Make sure to call fflush(fp) first. (XXX Could also replace the entire branch with a call to _lseeki64(). Would that be more efficient? Certainly less generated code.) (XXX This needs more testing. I can't actually test that it works for files >4GB on my Win98 machine, because the filesystem here won't let me create files >=4GB at all. Tim should test this on his Win2K machine.)	2001-09-10 20:43:35 +00:00
Guido van Rossum	8dbd3d8c50	PyObject_Dir(): - use PyModule_Check() instead of PyObject_TypeCheck(), now we can. - don't assert that the __dict__ gotten out of a module is always a dictionary; check its type, and raise an exception if it's not.	2001-09-10 18:27:43 +00:00
Tim Peters	16a77adfbd	Generalize operator.indexOf (PySequence_Index) to work with any iterable object. I'm not sure how that got overlooked before! Got rid of the internal _PySequence_IterContains, introduced a new internal _PySequence_IterSearch, and rewrote all the iteration-based "count of", "index of", and "is the object in it or not?" routines to just call the new function. I suppose it's slower this way, but the code duplication was getting depressing.	2001-09-08 04:00:12 +00:00
Guido van Rossum	28d80b1058	PyClass_New(): put the extended Don Beaudry hook back in. When one of the base classes is not a classic class, and its class (the metaclass) is callable, call the metaclass to do the deed. One effect of this is that, when mixing classic and new-style classes amongst the bases of a class, it doesn't matter whether the first base class is a classic class or not: you will always get the error "TypeError: metatype conflict among bases". (Formerly, with a classic class first, you'd get "TypeError: PyClass_New: base must be a class".) Another effect is that multiple inheritance from ExtensionClass.Base, with a classic class as the first class, transfers control to the ExtensionClass.Base class. This is what we need for SF #443239 (and also for running Zope under 2.2a4, before ExtensionClass is replaced).	2001-09-07 21:08:32 +00:00
Guido van Rossum	8700b4281a	PySequence_Check(), PyMapping_Check(): only return true if the corresponding "getitem" operation (sq_item or mp_subscript) is implemented. I realize that "sequence-ness" and "mapping-ness" are poorly defined (and the tests may still be wrong for user-defined instances, which always have both slots filled), but I believe that a sequence that doesn't support its getitem operation should not be considered a sequence. All other operations are optional though. For example, the ZODB BTree tests crashed because PySequence_Check() returned true for a dictionary! (In 2.2, the dictionary type has a tp_as_sequence pointer, but the only field filled is sq_contains, so you can write "if key in dict".) With this fix, all standalone ZODB tests succeed.	2001-09-07 20:20:11 +00:00
Guido van Rossum	9478d07ee7	PyType_IsSubtype(): test tp_flags for HAVE_CLASS bit before accessing a->tp_mro. If a doesn't have class, it's considered a subclass only of itself or of 'object'. This one fix is enough to prevent the ExtensionClass test suite from dumping core, but that doesn't say much (it's a rather small test suite). Also note that for ExtensionClass-defined types, a different subclass test may be needed. But I haven't checked whether PyType_IsSubtype() is actually used in situations where this matters -- probably it doesn't, since we also don't check for classic classes.	2001-09-07 18:52:13 +00:00
Tim Peters	e56ed9ba15	long_true_divide: reliably force underflow to 0 when the denominator has more bits than the numerator than can be counted in a C int (yes, that's unlikely, and no, I'm not adding a test case with a 2 gigabit long).	2001-09-06 21:59:14 +00:00
Guido van Rossum	8bce4acb17	Rename 'getset' to 'property'.	2001-09-06 21:56:42 +00:00
Martin v. Löwis	387c547fd3	Revert parts of patch #453627 , documenting the resulting test failures instead.	2001-09-06 08:16:17 +00:00
Tim Peters	6e13a562ae	Enable large file support on Win32 systems. Curious: the MS docs say stati64 etc are supported even on Win95, but Win95 doesn't support a filesystem that allows partitions > 2 Gb. test_largefile: This was opening its test file in text mode. I have no idea how that worked under Win64, but it sure needs binary mode on Win98. BTW, on Win98 test_largefile runs quickly (under a second).	2001-09-06 00:32:15 +00:00
Tim Peters	97f4a33e12	Better error msg for 3-arg pow with a float argument.	2001-09-05 23:49:24 +00:00
Guido van Rossum	b479dc561c	Add PyMethod_Function(), PyMethod_Self(), PyMethod_Class() back. While not even documented, they were clearly part of the C API, there's no great difficulty to support them, and it has the cool effect of not requiring any changes to ExtensionClass.c.	2001-09-05 22:52:50 +00:00
Tim Peters	a40c793d06	Rework the way we try to check for libm overflow, given that C99 no longer requires that errno ever get set, and it looks like glibc is already playing that game. New rules: + Never use HUGE_VAL. Use the new Py_HUGE_VAL instead. + Never believe errno. If overflow is the only thing you're interested in, use the new Py_OVERFLOWED(x) macro. If you're interested in any libm errors, use the new Py_SET_ERANGE_IF_OVERFLOW(x) macro, which attempts to set errno the way C89 said it worked. Unfortunately, none of these are reliable, but they work on Windows and I expect under glibc too.	2001-09-05 22:36:56 +00:00
Guido van Rossum	b855216099	Changes to automatically enable large file support on some systems. I believe this works on Linux (tested both on a system with large file support and one without it), and it may work on Solaris 2.7. The changes are twofold: (1) The configure script now boldly tries to set the two symbols that are recommended (for Solaris and Linux), and then tries a test script that does some simple seeking without writing. (2) The _portable_{fseek,ftell} functions are a little more systematic in how they try the different large file support options: first try fseeko/ftello, but only if off_t is large; then try fseek64/ftell64; then try hacking with fgetpos/fsetpos. I'm keeping my fingers crossed. The meaning of the HAVE_LARGEFILE_SUPPORT macro is not at all clear. I'll see if I can get it to work on Windows as well.	2001-09-05 14:58:11 +00:00
Martin v. Löwis	655c9557f6	Patch #453627 : Define the following macros when compiling on a UnixWare 7.x system: SCO_ATAN2_BUG, SCO_ACCEPT_BUG, and STRICT_SYSV_CURSES. Work aroudn a bug in the SCO UnixWare atan2() implementation.	2001-09-05 14:45:54 +00:00
Tim Peters	4c483c4d8e	Make the error msgs in our pow() implementations consistent.	2001-09-05 06:24:58 +00:00
Tim Peters	57f282a2a0	Try to recover from that glibc's ldexp apparently doesn't set errno on overflow. Needs testing on Linux (test_long.py and test_long_future.py especially).	2001-09-05 05:38:10 +00:00
Tim Peters	7eea37e831	At Guido's suggestion, here's a new C API function, PyObject_Dir(), like __builtin__.dir(). Moved the guts from bltinmodule.c to object.c.	2001-09-04 22:08:56 +00:00
Tim Peters	e2a600099d	Change long/long true division to return as many good bits as it can; e.g., (1L << 40000)/(1L << 40001) returns 0.5, not Inf or NaN or whatever.	2001-09-04 06:17:36 +00:00
Tim Peters	9c1d7fd5f2	Move int_true_divide next to the other division routines.	2001-09-04 05:52:47 +00:00
Tim Peters	20dab9f168	Move long_true_divide next to the other division routines (for clarity!).	2001-09-04 05:31:47 +00:00
Tim Peters	9fffa3eea3	Raise OverflowError when appropriate on long->float conversion. Most of the fiddling is simply due to that no caller of PyLong_AsDouble ever checked for failure (so that's fixing old bugs). PyLong_AsDouble is much faster for big inputs now too, but that's more of a happy consequence than a design goal.	2001-09-04 05:14:19 +00:00
Guido van Rossum	1832de4bc0	PEP 238 documented -Qwarn as warning only for classic int or long division, and this makes sense. Add -Qwarnall to warn for all classic divisions, as required by the fixdiv.py tool.	2001-09-04 03:51:09 +00:00
Tim Peters	a1c1b0f468	Introduce new private API function _PyLong_AsScaledDouble. Not used yet, but will be the foundation for Good Things: + Speed PyLong_AsDouble. + Give PyLong_AsDouble the ability to detect overflow. + Make true division of long/long nearly as accurate as possible (no spurious infinities or NaNs). + Return non-insane results from math.log and math.log10 when passing a long that can't be approximated by a double better than HUGE_VAL.	2001-09-04 02:50:49 +00:00
Tim Peters	32f453eaa4	New restriction on pow(x, y, z): If z is not None, x and y must be of integer types, and y must be >= 0. See discussion at http://sf.net/tracker/index.php?func=detail&aid=457066&group_id=5470&atid=105470	2001-09-03 08:35:41 +00:00
Tim Peters	b95ec09a44	Repair typo in comment.	2001-09-02 18:35:54 +00:00
Tim Peters	25786c0851	Make dictionary() a real constructor. Accepts at most one argument, "a mapping object", in the same sense dict.update(x) requires of x (that x has a keys() method and a getitem). Questionable: The other type constructors accept a keyword argument, so I did that here too (e.g., dictionary(mapping={1:2}) works). But type_call doesn't pass the keyword args to the tp_new slot (it passes NULL), it only passes them to the tp_init slot, so getting at them required adding a tp_init slot to dicts. Looks like that makes the normal case (i.e., no args at all) a little slower (the time it takes to call dict.tp_init and have it figure out there's nothing to do).	2001-09-02 08:22:48 +00:00
Tim Peters	1b8ca0d87a	Rewrite the tuple() docstring to parallel the list() docstring.	2001-09-02 06:42:25 +00:00
Tim Peters	9577761337	Repair apparent cut'n'pasteo in tuple() docstring.	2001-09-02 06:29:48 +00:00
Guido van Rossum	393661d15f	Add warning mode for classic division, almost exactly as specified in PEP 238. Changes: - add a new flag variable Py_DivisionWarningFlag, declared in pydebug.h, defined in object.c, set in main.c, and used in {int,long,float,complex}object.c. When this flag is set, the classic division operator issues a DeprecationWarning message. - add a new API PyRun_SimpleStringFlags() to match PyRun_SimpleString(). The main() function calls this so that commands run with -c can also benefit from -Dnew. - While I was at it, I changed the usage message in main() somewhat: alphabetized the options, split it in four parts to fit in under 512 bytes (not that I still believe this is necessary -- doc strings elsewhere are much longer), and perhaps most visibly, don't display the full list of options on each command line error. Instead, the full list is only displayed when -h is used, and otherwise a brief reminder of -h is displayed. When -h is used, write to stdout so that you can do `python -h \| more'. Notes: - I don't want to use the -W option to control whether the classic division warning is issued or not, because the machinery to decide whether to display the warning or not is very expensive (it involves calling into the warnings.py module). You can use -Werror to turn the warnings into exceptions though. - The -Dnew option doesn't select future division for all of the program -- only for the __main__ module. I don't know if I'll ever change this -- it would require changes to the .pyc file magic number to do it right, and a more global notion of compiler flags. - You can usefully combine -Dwarn and -Dnew: this gives the __main__ module new division, and warns about classic division everywhere else.	2001-08-31 17:40:15 +00:00
Guido van Rossum	29d55a38ce	Fix a memory leak in str_subtype_new(). (All the other xxx_subtype_new() functions are OK, but I goofed up in this one. :-( )	2001-08-31 16:11:15 +00:00
Guido van Rossum	41eb14dffa	Give 'super' a decent repr(), and readonly attributes to access the type and obj properties. The "bogus super object" message is gone -- this will now just raise an AttributeError.	2001-08-30 23:13:11 +00:00
Guido van Rossum	21922aa939	PyObject_Repr(): add missing ">" back at end of format string: "<%s object at %p>".	2001-08-30 20:26:05 +00:00
Tim Peters	017cb2c7d8	Squash new compiler wng.	2001-08-30 20:07:55 +00:00
Guido van Rossum	6fb3fdec7c	Pytype_GenericAlloc(): round up size so we zap all four bytes of the __dict__ slot for string subtypes. subtype_dealloc(): properly use _PyObject_GetDictPtr() to get the (potentially negative) dict offset. Don't copy things into local variables that are used only once. type_new(): properly calculate a negative dict offset when tp_itemsize is nonzero. The __dict__ attribute, if present, is now a calculated attribute rather than a structure member.	2001-08-30 20:00:07 +00:00
Guido van Rossum	4b8c0f6d7d	More stuff discovered while writing the simplest of testcases: tupledealloc(): only feed the free list when the type is really a tuple, not a subtype. Otherwise, use PyObject_GC_Del(). _PyTuple_Resize(): disallow using this for tuple subtypes.	2001-08-30 18:31:30 +00:00
Guido van Rossum	13228a6f09	Ah, the joy of writing test cases... long_subtype_new(): fix a typo (type->ob_size instead of tmp->ob_size).	2001-08-30 15:54:44 +00:00
Sjoerd Mullender	38b88c233a	Removed some unreachable break statements to silence SGI compiler.	2001-08-30 13:58:58 +00:00
Tim Peters	52e0717215	Give the internal immutable list type .extend and .pop methods (they "should have" been added here when they were added to lists).	2001-08-30 06:15:32 +00:00
Guido van Rossum	c41418751f	Safety measures now that str and tuple are subclassable: If tp_itemsize of the basetype is nonzero, only allow empty __slots__ (declaring that no __dict__ should be added), and don't add a weakref offset.	2001-08-30 04:43:35 +00:00
Guido van Rossum	31bcff8815	Make 'super' subclassable. (Not sure how useful this is yet. :-)	2001-08-30 04:37:15 +00:00
Guido van Rossum	e023fe0eef	Make unicode subclassable.	2001-08-30 03:12:59 +00:00
Guido van Rossum	ae960afb5e	Make str and tuple types subclassable.	2001-08-30 03:11:59 +00:00
Guido van Rossum	147b13c069	Make getset subclassable.	2001-08-30 03:10:36 +00:00
Guido van Rossum	d93dce1699	Fix typo: double semicolons.	2001-08-30 03:09:31 +00:00
Tim Peters	deb77e8394	Squash new compiler wng in debug build.	2001-08-30 00:32:51 +00:00
Neil Schemenauer	c806c8858d	Use new GC API. Remove usage of BASICSIZE macros.	2001-08-29 23:54:54 +00:00
Neil Schemenauer	e83c00efd0	Use new GC API.	2001-08-29 23:54:21 +00:00
Neil Schemenauer	fd34369ecb	Remove GC related code. It lives in gcmodule now.	2001-08-29 23:54:03 +00:00
Neil Schemenauer	4f4817fee8	Make frames a PyVarObject. Use new GC API.	2001-08-29 23:52:17 +00:00
Guido van Rossum	bef1417f9f	Make int, long and float subclassable. This uses a slightly wimpy and wasteful approach, but it works. :-)	2001-08-29 15:47:46 +00:00
Guido van Rossum	e705ef1bce	Fix super() so that it is usable for static methods (like __new__) as well. In particular, the second argument can now be a subclass of the first as well (normally it must be an instance though).	2001-08-29 15:47:06 +00:00
Guido van Rossum	5592e4d7d5	Fix a typo in SLOT0 macro for the declaration of cache_str. Dunno why I didn't catch this before.	2001-08-28 18:28:21 +00:00
Guido van Rossum	2730b13202	Finish the previous checkin: also avoid getattr when calling the method directly.	2001-08-28 18:22:14 +00:00
Guido van Rossum	607187325f	Change in policy: when a slot_tp_xxx function looks for the __xxx__ method, don't use getattr, but only look in the dict of the type and base types. This prevents picking up all sorts of weird stuff, including things defined by the metaclass when the object is a class (type). For this purpose, a helper function lookup_method() was added. One or two other places also use this.	2001-08-28 17:47:51 +00:00
Barry Warsaw	7c47beb860	Two improvements suggested by Greg Stein: PyString_FromFormatV(): In the final resize at the end, we can use PyString_AS_STRING() since we know the object is a string and can avoid the typechecking. PyString_FromFormat(): GS sez: "For safety/propriety, you should call va_end() on the vargs variable."	2001-08-27 03:11:09 +00:00
Tim Peters	6af5bbb565	PyString_FromFormatV: Massage platform %p output to match what gcc does, at least in the first two characters. %p is ill-defined, and people will forever commit bad tests otherwise ("bad" in the sense that they fall over (at least on Windows) for lack of a leading '0x'; 5 of the 7 tests in test_repr.py failed on Windows for that reason this time around).	2001-08-25 03:02:28 +00:00
Guido van Rossum	239abf7e23	getset_init(): the function name in the PyArg_ParseTuple() format should just be "getset", not "getset.__init__".	2001-08-24 18:49:10 +00:00
Guido van Rossum	a15dece519	Improve the error message issued when an unbound method is called with an inappropriate first argument. Now that there are more ways for this to fail, make sure to report the name of the class of the expected instance and of the actual instance.	2001-08-24 18:48:27 +00:00
Barry Warsaw	7ce3694a52	repr's converted to using PyString_FromFormat() instead of sprintf'ing into a hardcoded char* buffer. Closes patch #454743.	2001-08-24 18:34:26 +00:00
Barry Warsaw	dadace004b	PyString_FromFormat() and PyString_FromFormatV(): Largely ripped from PyErr_Format() these new C API methods can be used instead of sprintf()'s into hardcoded char* buffers. This allows us to fix many situation where long package, module, or class names get truncated in reprs. PyString_FromFormat() is the varargs variety. PyString_FromFormatV() is the va_list variety Original PyErr_Format() code was modified to allow %p and %ld expansions. Many reprs were converted to this, checkins coming soo. Not changed: complex_repr(), float_repr(), float_print(), float_str(), int_repr(). There may be other candidates not yet converted. Closes patch #454743.	2001-08-24 18:32:06 +00:00
Guido van Rossum	705f0f5a91	Add 'super', another new object type with magical properties. super(type) -> unbound super object super(type, obj) -> bound super object; requires isinstance(obj, type) Typical use to call a cooperative superclass method: class C(B): def meth(self, arg): super(C, self).meth(arg);	2001-08-24 16:47:00 +00:00
Guido van Rossum	271410ad18	Change the getset type to take an optional third function argument: the delete function. (Question: should the attribute name also be recorded in the getset object? That makes the protocol more work, but may give us better error messages.)	2001-08-24 15:23:20 +00:00
Guido van Rossum	845fc48bf7	getset_descr_set(): guard against deletion (indicated by a set call with a NULL value), in a somewhat lame way: call the set() function with one argument. Should I add a 3rd function, 'del', instead?	2001-08-24 10:17:36 +00:00
Guido van Rossum	2c25239215	slot_tp_descr_get(): guard against NULL obj or type (bug reported by Thomas Hellor on python-dev). slot_tp_descr_set(): if value is NULL, call __del__ instead of __set__.	2001-08-24 10:13:31 +00:00
Guido van Rossum	91c0d8a922	getset_init(): make the arguments optional. getset_doc: add docstring.	2001-08-24 09:55:51 +00:00
Tim Peters	96685bfbf0	float_pow: Put all of the burden on the libm pow in normal cases. powu: Deleted. This started with a nonsensical error msg: >>> x = -1. >>> import sys >>> x*(-sys.maxint-1L) Traceback (most recent call last): File "<stdin>", line 1, in ? ValueError: negative number cannot be raised to a fractional power >>> The special-casing in float_pow was simply wrong in this case (there's not even anything peculiar about these inputs), and I don't see any point to it in any* case: a decent libm pow should have worst-case error under 1 ULP, so in particular should deliver the exact result whenever the exact result is representable (else its error is at least 1 ULP). Thus our special fiddling for integral values "shouldn't" buy anything in accuracy, and, to the contrary, repeated multiplication is less accurate than a decent pow when the true result isn't exactly representable. So just letting pow() do its job here (we may not be able to trust libm x-platform in exceptional cases, but these are normal cases).	2001-08-23 22:31:37 +00:00
Guido van Rossum	29a62dd6eb	Add new built-in type 'getset' (PyGetSet_Type). This implements the 'getset' class from test_binop.py.	2001-08-23 21:40:38 +00:00
Guido van Rossum	0b13116a62	err_ovf(): only raise OverflowError when OverflowWarning was raised.	2001-08-23 21:32:40 +00:00
Tim Peters	31960db5a5	int_pow(): Repair typo when passing on to float pow (the 2nd argument was being passed as both the 2nd and 3rd args). Regression test will follow.	2001-08-23 21:28:33 +00:00
Guido van Rossum	e27f795b72	Change all case where we used to raise OverflowError to issue a warning and then redo the operation using long ints.	2001-08-23 02:59:04 +00:00
Barry Warsaw	60f018846d	Merge changes from r22a2-branch back into trunk. Also, change patch level to 2.2a2+	2001-08-22 19:24:42 +00:00
Guido van Rossum	5d815f323b	Address SF bug #442813 . The sequence getitem wrappers should do interpretation of negative indices, since neither the sq_*item slots nor the slot_ wrappers do this. (Slices are a different story, there the size wrapping is done too early.)	2001-08-17 21:57:47 +00:00
Guido van Rossum	9676b22cd7	Weak reference support, closing SF bug #451773 . Classes that don't use __slots__ have a __weakref__ member added in the same way as __dict__ is added (i.e. only if the base didn't already have one). Classes using __slots__ can enable weak referenceability by adding '__weakref__' to the __slots__ list. Renamed the __weaklistoffset__ class member to __weakrefoffset__ -- it's not always a list, it seems. (Is tp_weaklistoffset a historical misnomer, or do I misunderstand this?)	2001-08-17 20:32:36 +00:00
Martin v. Löwis	339d0f720e	Patch #445762 : Support --disable-unicode - Do not compile unicodeobject, unicodectype, and unicodedata if Unicode is disabled - check for Py_USING_UNICODE in all places that use Unicode functions - disables unicode literals, and the builtin functions - add the types.StringTypes list - remove Unicode literals from most tests.	2001-08-17 18:39:25 +00:00
Guido van Rossum	1a49350e8d	type_new(): look for __dynamic__ at the module level (after looking in the class dict). Anything but a nonnegative int in either place is ignored (before, a non-Boolean was an error). The default is still static -- in a comparative test, Jeremy's Tools/compiler package ran twice as slow (compiling itself) using dynamic as the default. (The static version, which requires a few tweaks to avoid modifying class variables, runs at about the same speed as the classic version.) slot_tp_descr_get(): this also needed fallback behavior. slot_tp_getattro(): remove a debug fprintf() call.	2001-08-17 16:47:50 +00:00
Guido van Rossum	4066769b91	Fix core dump in repr() of instancemethod whose class==NULL.	2001-08-17 13:59:27 +00:00
Guido van Rossum	f23c41d56a	instance_getattr2(): rewritten to remove unnecessary stuff and streamlined a bit. instancemethod_descr_get(): don't bind an unbound method of a class that's not a base class of the argument class.	2001-08-17 13:43:27 +00:00
Guido van Rossum	cdf0d75897	Instance methods: allow a NULL value for im_class.	2001-08-17 12:07:34 +00:00
Guido van Rossum	8d32c8b59f	type_new(): only defer to the winning metatype if it's different from the metatype passed in as an argument. This prevents infinite recursion when a metatype written in Python calls type.__new__() as a "super" call. Also tweaked some comments.	2001-08-17 11:18:38 +00:00
Guido van Rossum	501c7c7d0e	classobject.c:instancemethod_descr_get(): when a bound method is assigned to a class variable and then accessed via an instance, it should not be rebound. test_descr.py:methods(): test for the condition above.	2001-08-16 20:41:56 +00:00
Barry Warsaw	2907fe6ce7	module_repr(): Instead of fixing the maximum buf size to 400, calculate it on the fly. This way even modules with long package names get an accurate repr instead of a truncated one. The extra malloc/free cost shouldn't be a problem in a repr function. Closes SF bug #437984	2001-08-16 20:39:24 +00:00
Guido van Rossum	76e6963fc1	Fix object_repr() to include the module (using the same rules as type_repr() for when to show or not to show it).	2001-08-16 18:52:43 +00:00
Martin v. Löwis	e3eb1f2b23	Patch #427190 : Implement and use METH_NOARGS and METH_O.	2001-08-16 13:15:00 +00:00
Guido van Rossum	c35422109b	Fix SF bug #442501 : calculate __module__ properly. - type_module(), type_name(): if tp_name contains one or more period, the part before the last period is __module__, the part after that is __name__. Otherwise, for non-heap types, __module__ is "__builtin__". For heap types, __module__ is looked up in tp_defined. - type_new(): heap types have their __module__ set from globals().__name__; a pre-existing __module__ in their dict is not overridden. This is not inherited. - type_repr(): if __module__ exists and is not "__builtin__", it is included in the string representation (just as it already is for classes). For example <type '__main__.C'>.	2001-08-16 09:18:56 +00:00
Guido van Rossum	8098ddbe81	Subtle change to make None.__class__ work: - descrobject.c:descr_check(): only believe None means the same as NULL if the type given is None's type. - typeobject.c:wrap_descr_get(): don't "conventiently" default an absent type to the type of the object argument. Let the called function figure it out.	2001-08-16 08:27:33 +00:00
Guido van Rossum	ba21a49f9d	Add a function _Py_ReadyTypes() which initializes various and sundry types -- currently Type, List, None and NotImplemented. To be called from Py_Initialize() instead of accumulating calls there. Also rename type(None) to NoneType and type(NotImplemented) to NotImplementedType -- naming the type identical to the object was confusing.	2001-08-16 08:17:26 +00:00
Guido van Rossum	82fc51c19c	Update to MvL's patch #424475 to avoid returning 2 when tp_compare returns that. (This fix is also by MvL; checkin it in because I want to make more changes here. I'm still not 100% satisfied -- see comments attached to the patch.)	2001-08-16 08:02:45 +00:00
Guido van Rossum	b8f636641f	- Another big step in the right direction. All the overridable operators for which a default implementation exist now work, both in dynamic classes and in static classes, overridden or not. This affects __repr__, __str__, __hash__, __contains__, __nonzero__, __cmp__, and the rich comparisons (__lt__ etc.). For dynamic classes, this meant copying a lot of code from classobject! (XXX There are still some holes, because the comparison code in object.c uses PyInstance_Check(), meaning new-style classes don't get the same dispensation. This needs more thinking.) - Add object.__hash__, object.__repr__, object.__str__. The __str__ dispatcher now calls the __repr__ dispatcher, as it should. - For static classes, the tp_compare, tp_richcompare and tp_hash slots are now inherited together, or not at all. (XXX I fear there are still some situations where you can inherit __hash__ when you shouldn't, but mostly it's OK now, and I think there's no way we can get that 100% right.)	2001-08-15 23:57:02 +00:00
Guido van Rossum	23cc2b4991	PyMethod_Type: add a tp_descr_get slot function to ensure proper binding of unbound methods.	2001-08-15 17:52:31 +00:00
Guido van Rossum	4dd64ab5ea	Non-function fields, like tp_dictoffset and tp_weaklistoffset, should be inherited in inherit_special(), otherwise dynamic types don't inherit these. Also added some XXX comments about open ends.	2001-08-14 20:04:48 +00:00
Barry Warsaw	142865cae1	func_getattro(), func_setattro(): Implement the new semantics for setting and deleting a function's __dict__ attribute. Deleting it, or setting it to a non-dictionary result in a TypeError. Note that getting it the first time magically initializes it to an empty dict so that func.__dict__ will always appear to be a dictionary (never None). Closes SF bug #446645.	2001-08-14 18:23:58 +00:00
Jeremy Hylton	910d7d46dc	Remove much dead code from ceval.c The descr changes moved the dispatch for calling objects from call_object() in ceval.c to PyObject_Call() in abstract.c. call_object() and the many functions it used in ceval.c were no longer used, but were not removed. Rename meth_call() as PyCFunction_Call() so that it can be called by the CALL_FUNCTION opcode in ceval.c. Also, fix error message that referred to PyEval_EvalCodeEx() by its old name eval_code2(). (I'll probably refer to it by its old name, too.)	2001-08-12 21:52:24 +00:00
Guido van Rossum	8e24818cf4	Make dynamic types work as intended. Or at least more so. XXX There are still some loose ends: repr(), str(), hash() and comparisons don't inherit a default implementation from object. This must be resolved similarly to the way it's resolved for classic instances.	2001-08-12 05:17:56 +00:00
Guido van Rossum	8de8680d07	Temporary stop-gap fix for dynamic classes, so they pass the test. XXX This is not sufficient: if a dynamic class has no __repr__ method (for instance), but later one is added, that doesn't add a tp_repr slot, so repr() doesn't call the __repr__ method. To make this work, I'll have to add default implementations of several slots to 'object'. XXX Also, dynamic types currently only inherit slots from their dominant base.	2001-08-12 03:43:35 +00:00
Guido van Rossum	13d52f0b32	- Big changes to fix SF bug #442833 (a nasty multiple inheritance problem). inherit_slots() is split in two parts: inherit_special() which inherits the flags and a few very special members from the dominant base; inherit_slots() which inherits only regular slots, and is now called for each base in the MRO in turn. These are now both void functions since they don't have error returns. - Added object.__setitem__() back -- for the same reason as object.__new__(): a subclass of object should be able to call object.__new__(). - add_wrappers() was moved around to be closer to where it is used (it was defined together with add_methods() etc., but has nothing to do with these).	2001-08-10 21:24:08 +00:00
Guido van Rossum	05ac6de2d5	Add PyDict_Merge(a, b, override): PyDict_Merge(a, b, 1) is the same as PyDict_Update(a, b). PyDict_Merge(a, b, 0) does something similar but leaves existing items unchanged.	2001-08-10 20:28:28 +00:00
Guido van Rossum	d614f97733	Change PyType_Ready() to use the READY and READYING flags. This makes it possible to detect recursive calls early (as opposed to when the stack overflows :-).	2001-08-10 17:39:49 +00:00
Tim Peters	772747b3f1	SF patch #438013 Remove 2-byte Py_UCS2 assumptions Removed all instances of Py_UCS2 from the codebase, and so also (I hope) the last remaining reliance on the platform having an integral type with exactly 16 bits. PyUnicode_DecodeUTF16() and PyUnicode_EncodeUTF16() now read and write one byte at a time.	2001-08-09 22:21:55 +00:00
Guido van Rossum	29687cd211	Sigh. Strengthen the resriction of the previous checkin: tp_new is inherited unless both: (a) the base type is 'object', and (b) the subtype is not a "heap" type.	2001-08-09 19:43:37 +00:00
Guido van Rossum	c11e192d41	Thinking back to the 2.22 revision, I didn't like what I did there one bit. For one, this class: class C(object): def __new__(myclass, ...): ... would have no way to call the __new__ method of its base class, and the workaround (to create an intermediate base class whose __new__ you can call) is ugly. So, I've come up with a better solution that restores object.__new__, but still solves the original problem, which is that built-in and extension types shouldn't inherit object.__new__. The solution is simple: only "heap types" inherit tp_new. Simpler, less code, perfect!	2001-08-09 19:38:15 +00:00
Guido van Rossum	29206bc8a3	Apply anonymous SF patch #441229 . Previously, f.read() and f.readlines() checked for errors on their file object and possibly raised an IOError, but f.readline() didn't. This patch makes f.readline() behave like the others. Note that I've added a call to clearerr() since the other calls to ferror() include that too. I have no way to test this code. :-)	2001-08-09 18:14:59 +00:00
Guido van Rossum	dc91b99f23	Proper support for binary operators, including true division and floor division. The basic binary operators now all correctly call the __rxxx__ variant when they should. In type_new(), I now make the new type a new-style number unless it inherits from an old-style number that has numeric methods. By way of cosmetics, I've changed the signatures of the SLOT<i> macros to take actual function names and operator names as strings, rather than rely on C preprocessor symbol manipulations. This makes the calls slightly more verbose, but greatly helps simple searches through the file: you can now find out where "__radd__" is used or where the function slot_nb_power() is defined and where it is used.	2001-08-08 22:26:22 +00:00
Jack Jansen	8e938b4257	Removed extraneous semicolons that caused a gazzilion "empty declaration" warnings in the MetroWerks compiler.	2001-08-08 15:29:49 +00:00
Guido van Rossum	4668b000a1	Implement PEP 238 in its (almost) full glory. This introduces: - A new operator // that means floor division (the kind of division where 1/2 is 0). - The "future division" statement ("from __future__ import division) which changes the meaning of the / operator to implement "true division" (where 1/2 is 0.5). - New overloadable operators __truediv__ and __floordiv__. - New slots in the PyNumberMethods struct for true and floor division, new abstract APIs for them, new opcodes, and so on. I emphasize that without the future division statement, the semantics of / will remain unchanged until Python 3.0. Not yet implemented are warnings (default off) when / is used with int or long arguments. This has been on display since 7/31 as SF patch #443474. Flames to /dev/null.	2001-08-08 05:00:18 +00:00
Guido van Rossum	528b7eb0b0	- Rename PyType_InitDict() to PyType_Ready(). - Add an explicit call to PyType_Ready(&PyList_Type) to pythonrun.c (just for the heck of it, really -- we should either explicitly ready all types, or none).	2001-08-07 17:24:28 +00:00
Guido van Rossum	f040ede6e8	Cosmetics: - Add comment blocks explaining add_operators() and override_slots(). (This file could use some more explaining, but this is all I had breath for today. :) - Renamed the argument 'base' of add_wrappers() to 'wraps' because it's not a base class (which is what the 'base' identifier is used for elsewhere). Small nits: - Fix add_tp_new_wrapper() to avoid overwriting an existing __new__ descriptor in tp_defined. - In add_operators(), check the return value of add_tp_new_wrapper(). Functional change: - Remove the tp_new functionality from PyBaseObject_Type; this means you can no longer instantiate the 'object' type. It's only useful as a base class. - To make up for the above loss, add tp_new to dynamic types. This has to be done in a hackish way (after override_slots() has been called, with an explicit call to add_tp_new_wrapper() at the very end) because otherwise I ran into recursive calls of slot_tp_new(). Sigh.	2001-08-07 16:40:56 +00:00
Guido van Rossum	63e0a64562	Remove spurious "closed" attribute definition from the memberlist table. (reported as an aside in SF #446049).	2001-08-06 18:51:38 +00:00
Guido van Rossum	0d231eda52	A totally new way to do the __new__ wrapper. This should address the problem brought up in SF bug #444229.	2001-08-06 16:50:37 +00:00
Guido van Rossum	2b8d7bdd77	Fix SF #442791 (revisited): No __delitem__ wrapper was defined.	2001-08-02 15:31:58 +00:00
Tim Peters	5962cbf5ba	Fix the test_weakref.py failure. Introduced by resolving "a conflict" (which didn't actually exist!) incorrectly.	2001-08-02 04:45:20 +00:00
Tim Peters	6d6c1a35e0	Merge of descr-branch back into trunk.	2001-08-02 04:15:00 +00:00
Jeremy Hylton	3ce45389bd	Add _PyUnicode_AsDefaultEncodedString to unicodeobject.h. And remove all the extern decls in the middle of .c files. Apparently, it was excluded from the header file because it is intended for internal use by the interpreter. It's still intended for internal use and documented as such in the header file.	2001-07-30 22:34:24 +00:00
Tim Peters	7321ec437b	SF bug #444510 : int() should guarantee truncation. It's guaranteed now, assuming the platform modf() works correctly.	2001-07-26 20:02:17 +00:00
Marc-André Lemburg	80d1dd5f3b	Fix for bug #444493 : u'\U00010001' segfaults with current CVS on wide builds.	2001-07-25 16:05:59 +00:00
Marc-André Lemburg	6c6bfb7c70	Make the unicode-escape and the UTF-16 codecs handle surrogates correctly and thus roundtrip-safe. Some minor cleanups of the code. Added tests for the roundtrip-safety.	2001-07-20 17:39:11 +00:00
Guido van Rossum	0d42e0c54a	#ifdef out generation of \U escapes unless Py_UNICODE_WIDE. This #caused warnings with the VMS C compiler. (SF bug #442998, in part.) On a narrow system the current code should never be executed since ch will always be < 0x10000. Marc-Andre: you may end up fixing this a different way, since I believe you have plans to generate \U for surrogate pairs. I'll leave that to you.	2001-07-20 16:36:21 +00:00
Fred Drake	1bc8fab0e7	Kill more warnings from the SGI compiler. Part of SF patch #434992.	2001-07-19 21:49:38 +00:00
Tim Peters	0d5dd68692	Python.h: Don't attempt to redefine NDEBUG if it's already defined. Others: Remove redundant includes of assert.h.	2001-07-15 18:38:47 +00:00
Tim Peters	586b2e3f3d	long_format: Simplify the overly elaborate base-is-a-power-of-2 code.	2001-07-15 09:11:14 +00:00
Guido van Rossum	3c29602d74	_Py_GetObjects(): GCC suggests to add () around && within \|\| for some code only compiled in debug mode, and I dutifully comply.	2001-07-14 17:58:00 +00:00
Tim Peters	212e614f60	divrem1 & long_format: found a clean way to factor divrem1 so that long_format can reuse a scratch area for its repeated divisions (instead of malloc/free for every digit produced); speeds str(long)/repr(long).	2001-07-14 12:23:19 +00:00
Tim Peters	c8a6b9b6d6	long_format(): Simplify new code a bit.	2001-07-14 11:01:28 +00:00
Tim Peters	fad225f1ba	long_format(): Easy speedup for output bases that aren't a power of 2 (in particular, str(long) and repr(long) use base 10, and that gets a factor of 4 speedup). Another factor of 2 can be gotten by refactoring divrem1 to support in-place division, but that started getting messy so I'm leaving that out.	2001-07-13 02:59:26 +00:00
Neil Schemenauer	10c6692774	GC for method objects.	2001-07-12 13:27:35 +00:00
Neil Schemenauer	7eac9b72d4	GC for iterator objects.	2001-07-12 13:27:25 +00:00
Neil Schemenauer	19cd292bbc	GC for frame objects.	2001-07-12 13:27:11 +00:00
Guido van Rossum	0ec9abaa2b	On long to the negative long power, let float handle it instead of raising an error. This was one of the two issues that the VPython folks were particularly problematic for their students. (The other one was integer division...) This implements (my) SF patch #440487.	2001-07-12 11:21:17 +00:00
Guido van Rossum	b82fedc7d8	On int to the negative integral power, let float handle it instead of raising an error. This was one of the two issues that the VPython folks were particularly problematic for their students. (The other one was integer division...) This implements (my) SF patch #440487.	2001-07-12 11:19:45 +00:00
Thomas Wouters	efafcea280	Re-add 'advanced' xrange features, adding DeprecationWarnings as discussed on python-dev. The features will still vanish, however, just one release later.	2001-07-09 12:30:54 +00:00
Tim Peters	6ee4234802	SF bug #439104 : Tuple richcompares has code-typo. Symptom: (1, 2, 3) <= (1, 2) returned 1. This was already fixed in CVS for tuples, but an isomorphic error was in the list richcompare code.	2001-07-06 17:45:43 +00:00
Guido van Rossum	3f56166b1a	Rip out the fancy behaviors of xrange that nobody uses: repeat, slice, contains, tolist(), and the start/stop/step attributes. This includes removing the 4th ('repeat') argument to PyRange_New().	2001-07-05 13:27:48 +00:00
Fredrik Lundh	72b068566a	removed "register const" from scalar arguments to the unicode predicates	2001-06-27 22:08:26 +00:00
Fredrik Lundh	8f4558583f	use Py_UNICODE_WIDE instead of USE_UCS4_STORAGE and Py_UNICODE_SIZE tests.	2001-06-27 18:59:43 +00:00
Martin v. Löwis	ce9b5a55e1	Encode surrogates in UTF-8 even for a wide Py_UNICODE. Implement sys.maxunicode. Explicitly wrap around upper/lower computations for wide Py_UNICODE. When decoding large characters with UTF-8, represent expected test results using the \U notation.	2001-06-27 06:28:56 +00:00
Martin v. Löwis	ac93bc2501	When decoding UTF-16, don't assume that the buffer is in native endianness when checking surrogates.	2001-06-26 22:43:40 +00:00
Martin v. Löwis	0ba70cc3c8	Support using UCS-4 as the Py_UNICODE type: Add configure option --enable-unicode. Add config.h macros Py_USING_UNICODE, PY_UNICODE_TYPE, Py_UNICODE_SIZE, SIZEOF_WCHAR_T. Define Py_UCS2. Encode and decode large UTF-8 characters into single Py_UNICODE values for wide Unicode types; likewise for UTF-16. Remove test whether sizeof Py_UNICODE is two.	2001-06-26 22:22:37 +00:00
Fredrik Lundh	ee13dba1aa	more unicode tweaks: fix unicodectype for sizeof(Py_UNICODE) > sizeof(int)	2001-06-26 20:36:12 +00:00
Barry Warsaw	66a0d1d9b9	dict_update(): Generalize this method so {}.update() accepts any "mapping" object, specifically one that supports PyMapping_Keys() and PyObject_GetItem(). This allows you to say e.g. {}.update(UserDict()) We keep the special case for concrete dict objects, although that seems moderately questionable. OTOH, the code exists and works, so why change that? .update()'s docstring already claims that D.update(E) implies calling E.keys() so it's appropriate not to transform AttributeErrors in PyMapping_Keys() to TypeErrors. Patch eyeballed by Tim.	2001-06-26 20:08:32 +00:00
Fredrik Lundh	1294ad0c59	experimental UCS-4 support: added USE_UCS4_STORAGE define to unicodeobject.h, which forces sizeof(Py_UNICODE) == sizeof(Py_UCS4). (this may be good enough for platforms that doesn't have a 16-bit type. the UTF-16 codecs don't work, though)	2001-06-26 17:17:07 +00:00
Fredrik Lundh	45714e9ecb	experimental UCS-4 support: made compare a bit more robust, in case sizeof(Py_UNICODE) >= sizeof(long). also changed surrogate expansion to work if sizeof(Py_UNICODE) > 2.	2001-06-26 16:39:36 +00:00
Fredrik Lundh	3083163dc1	experimental UCS-4 support: don't assume that MS_WIN32 implies HAVE_USABLE_WCHAR_T	2001-06-26 15:11:00 +00:00
Tim Peters	8c96369513	PyFrameObject: rename f_stackbottom to f_stacktop, since it points to the next free valuestack slot, not to the base (in America, stacks push and pop at the top -- they mutate at the bottom in Australia <winK>). eval_frame(): assert that f_stacktop isn't NULL upon entry. frame_delloc(): avoid ordered pointer comparisons involving f_stacktop when f_stacktop is NULL.	2001-06-23 05:26:56 +00:00
Tim Peters	5ca576ed0a	Merging the gen-branch into the main line, at Guido's direction. Yay! Bugfix candidate in inspect.py: it was referencing "self" outside of a method.	2001-06-18 22:08:13 +00:00
Tim Peters	1dad6a86de	SF bug 434186: 0x80000000/2 != 0x80000000>>1 i_divmod: New and simpler algorithm. Old one returned gibberish on most boxes when the numerator was -sys.maxint-1. Oddly enough, it worked in the release (not debug) build on Windows, because the compiler optimized away some tricky sign manipulations that were incorrect in this case. Makes you wonder <wink> ... Bugfix candidate.	2001-06-18 19:21:11 +00:00
Tim Peters	70128a1ba6	PyLong_{As, From}VoidPtr: cleanup, replacing assumptions in comments with #if/#error constructs.	2001-06-16 08:48:40 +00:00
Tim Peters	c605784174	dict_repr: Reuse one of the int vars (minor code simplification).	2001-06-16 07:52:53 +00:00
Tim Peters	52e155e31b	Reformat decl of new _PyString_Join. Add NEWS blurb about repr() speedup.	2001-06-16 05:42:57 +00:00
Tim Peters	a7259597f1	SF bug 433228: repr(list) woes when len(list) big. Gave Python linear-time repr() implementations for dicts, lists, strings. This means, e.g., that repr(range(50000)) is no longer 50x slower than pprint.pprint() in 2.2 <wink>. I don't consider this a bugfix candidate, as it's a performance boost. Added _PyString_Join() to the internal string API. If we want that in the public API, fine, but then it requires runtime error checks instead of asserts.	2001-06-16 05:11:17 +00:00
Tim Peters	cf37dfc3db	Change IS_LITTLE_ENDIAN macro -- a little faster now.	2001-06-14 18:42:50 +00:00
Guido van Rossum	ad98db1d9e	Fix a mis-indentation in _PyUnicode_New() that caused me to stare at some code for longer than needed.	2001-06-14 17:52:02 +00:00
Tim Peters	ede0509111	_PyLong_AsByteArray: simplify the logic for dealing with the most- significant digits sign bits. Again no change in semantics.	2001-06-14 08:53:38 +00:00
Tim Peters	ce9de2f79a	PyLong_From{Unsigned,}Long: count the # of digits first, so no more space is allocated than needed (used to allocate 80 bytes of digit space no matter how small the long input). This also runs faster, at least on 32- bit boxes.	2001-06-14 04:56:19 +00:00
Tim Peters	f251d06a66	_PyLong_FromByteArray: changed decl of "carry" to match "thisbyte". No semantic change, but a bit clearer and may help a really stupid compiler avoid pointless runtime length conversions.	2001-06-13 21:09:15 +00:00
Tim Peters	05607ad4fd	_PyLong_AsByteArray: Don't do the "delicate overflow" check unless it's truly needed; usually saves a little time, but no change in semantics.	2001-06-13 21:01:27 +00:00
Tim Peters	898cf85c25	_PyLong_AsByteArray: added assert that the input is normalized. This is outside the function's control, but is crucial to correct operation.	2001-06-13 20:50:08 +00:00
Tim Peters	9cb0c38fff	PyLong_As{Unsigned,}LongLong: fiddled final result casting.	2001-06-13 20:45:17 +00:00
Tim Peters	d1a7da6c0d	longobject.c: Replaced PyLong_{As,From}{Unsigned,}LongLong guts with calls to _PyLong_{As,From}ByteArray. _testcapimodule.c: Added strong tests of PyLong_{As,From}{Unsigned,}LongLong. Fixes SF bug #432552 PyLong_AsLongLong() problems. Possible bugfix candidate, but the fix relies on code added to longobject to support the new q/Q structmodule format codes.	2001-06-13 00:35:57 +00:00
Tim Peters	8bc84b4391	_PyLong_{As,From}ByteArray: Minor code rearrangement aimed at improving clarity. Should have no effect visible to callers.	2001-06-12 19:17:03 +00:00
Marc-André Lemburg	8c2133da7b	Fix for bug #432384 : Recursion in PyString_AsEncodedString?	2001-06-12 13:14:10 +00:00
Tim Peters	7a3bfc3a47	Added q/Q standard (x-platform 8-byte ints) mode in struct module. This completes the q/Q project. longobject.c _PyLong_AsByteArray: The original code had a gross bug: the most-significant Python digit doesn't necessarily have SHIFT significant bits, and you really need to count how many copies of the sign bit it has else spurious overflow errors result. test_struct.py: This now does exhaustive std q/Q testing at, and on both sides of, all relevant power-of-2 boundaries, both positive and negative. NEWS: Added brief dict news while I was at it.	2001-06-12 01:22:22 +00:00
Tim Peters	2a9b367385	Two new private longobject API functions, _PyLong_FromByteArray _PyLong_AsByteArray Untested and probably buggy -- they compile OK, but nothing calls them yet. Will soon be called by the struct module, to implement x-platform 'q' and 'Q'. If other people have uses for them, we could move them into the public API. See longobject.h for usage details.	2001-06-11 21:23:58 +00:00
Jack Jansen	fcc54cab10	Added a missing cast to the hashfunc initializer.	2001-06-10 21:43:28 +00:00
Martin v. Löwis	0163d6d6ef	Patch #424475 : Speed-up tp_compare usage, by special-casing the common case of objects with equal types which support tp_compare. Give type objects a tp_compare function. Also add c<0 tests before a few PyErr_Occurred tests.	2001-06-09 07:34:05 +00:00
Marc-André Lemburg	8879a33613	Fixes [ #430986 ] Buglet in PyUnicode_FromUnicode.	2001-06-07 12:26:56 +00:00
Tim Peters	afb6ae8452	Store the mask instead of the size in dictobjects. The mask is more frequently used, and in particular this allows to drop the last remaining obvious time-waster in the crucial lookdict() and lookdict_string() functions. Other changes consist mostly of changing "i < ma_size" to "i <= ma_mask" everywhere.	2001-06-04 21:00:21 +00:00
Tim Peters	453163d842	lookdict: stop more insane core-dump mutating comparison cases. Should be possible to provoke unbounded recursion now, but leaving that to someone else to provoke and repair. Bugfix candidate -- although this is getting harder to backstitch, and the cases it's protecting against are mondo contrived.	2001-06-03 04:54:32 +00:00
Tim Peters	7b5d0afb1e	lookdict: Reduce obfuscating code duplication with a judicious goto. This code is likely to get even hairier to squash core dumps due to mutating comparisons, and it's hard enough to follow without that.	2001-06-03 04:14:43 +00:00
Tim Peters	19b77cfc4b	Finish the dict->string coredump fix. Need sleep. Bugfix candidate.	2001-06-02 08:27:39 +00:00
Tim Peters	23cf6be23c	Coredumpers from Michael Hudson, mutating dicts while printing or converting to string. Critical bugfix candidate -- if you take this seriously <wink>.	2001-06-02 08:02:56 +00:00
Tim Peters	f4b33f61fb	dict_popitem(): Repaired last-second 2.1 comment, which misidentified the true reason for allocating the tuple before checking the dict size.	2001-06-02 05:42:29 +00:00
Tim Peters	eb28ef209e	New collision resolution scheme: no polynomials, simpler, faster, less code, less memory. Tests have uncovered no drawbacks. Christian and Vladimir are the other two people who have burned many brain cells on the dict code in recent years, and they like the approach too, so I'm checking it in without further ado.	2001-06-02 05:27:19 +00:00
Jeremy Hylton	9cea41c195	fix bogus indentation	2001-05-29 17:13:15 +00:00
Thomas Wouters	0dcea5973d	_PyTuple_Resize: guard against PyTuple_New() returning NULL, using Tim's suggestion (modulo style).	2001-05-29 07:58:45 +00:00
Tim Peters	4324aa3572	Cruft cleanup: Removed the unused last_is_sticky argument from the internal _PyTuple_Resize().	2001-05-28 22:30:08 +00:00
Thomas Wouters	6a922372ad	_PyTuple_Resize: take into account the empty tuple. There can be only one. Instead of raising a SystemError, just create a new tuple of the desired size. This fixes (at least) SF bug #420343.	2001-05-28 13:11:02 +00:00
Tim Peters	15d4929ae4	Implement an old idea of Christian Tismer's: use polynomial division instead of multiplication to generate the probe sequence. The idea is recorded in Python-Dev for Dec 2000, but that version is prone to rare infinite loops. The value is in getting all the bits of the hash code to participate; and, e.g., this speeds up querying every key in a dict with keys [i << 16 for i in range(20000)] by a factor of 500. Should be equally valuable in any bad case where the high-order hash bits were getting ignored. Also wrote up some of the motivations behind Python's ever-more-subtle hash table strategy.	2001-05-27 07:39:22 +00:00
Tim Peters	1af03e98d9	Change list.extend() error msgs and NEWS to reflect that list.extend() now takes any iterable argument, not only sequences. NEEDS DOC CHANGES -- but I don't think we settled on a concise way to say this stuff.	2001-05-26 19:37:54 +00:00
Tim Peters	442914d265	Cruft cleanup: removed the #ifdef'ery in support of compiling to allow multi-argument list.append(1, 2, 3) (as opposed to .append((1,2,3))).	2001-05-26 05:50:03 +00:00
Tim Peters	65b8b84839	roundupsize() and friends: fiddle over-allocation strategy for list resizing. Accurate timings are impossible on my Win98SE box, but this is obviously faster even on this box for reasonable list.append() cases. I give credit for this not to the resizing strategy but to getting rid of integer multiplication and divsion (in favor of shifting) when computing the rounded-up size. For unreasonable list.append() cases, Win98SE now displays linear behavior for one-at-time appends up to a list with about 35 million elements. Then it dies with a MemoryError, due to fatally fragmented address space (there's plenty of VM available, but by this point Win9X has broken user space into many distinct heaps none of which has enough contiguous space left to resize the list, and for whatever reason Win9x isn't coalescing the dead heaps). Before the patch it got a MemoryError for the same reason, but once the list reached about 2 million elements. Haven't yet tried on Win2K but have high hopes extreme list.append() will be much better behaved now (NT & Win2K didn't fragment address space, but suffered obvious quadratic-time behavior before as lists got large). For other systems I'm relying on common sense: replacing integer * and / by << and >> can't plausibly hurt, the number of function calls hasn't changed, and the total operation count for reasonably small lists is about the same (while the operations are cheaper now).	2001-05-26 05:28:40 +00:00
Martin v. Löwis	cd35306a25	Patch #424335 : Implement string_richcompare, remove string_compare. Use new _PyString_Eq in lookdict_string.	2001-05-24 16:56:35 +00:00
Tim Peters	f8a548c23c	dictresize(): Rebuild small tables if there are any dummies, not just if they're entirely full. Not a question of correctness, but of temporarily misplaced common sense.	2001-05-24 16:26:40 +00:00
Tim Peters	0c6010be75	Jack Jansen hit a bug in the new dict code, reported on python-dev. dictresize() was too aggressive about never ever resizing small dicts. If a small dict is entirely full, it needs to rebuild it despite that it won't actually resize it, in order to purge old dummy entries thus creating at least one virgin slot (lookdict assumes at least one such exists). Also took the opportunity to add some high-level comments to dictresize.	2001-05-23 23:33:57 +00:00
Fred Drake	0c23231f6e	Remove unused variable.	2001-05-22 22:36:52 +00:00
Tim Peters	dea48ec581	SF patch #425242 : Patch which "inlines" small dictionaries. The idea is Marc-Andre Lemburg's, the implementation is Tim's. Add a new ma_smalltable member to dictobjects, an embedded vector of MINSIZE (8) dictentry structs. Short course is that this lets us avoid additional malloc(s) for dicts with no more than 5 entries. The changes are widespread but mostly small. Long course: WRT speed, all scalar operations (getitem, setitem, delitem) on non-empty dicts benefit from no longer needing NULL-pointer checks (ma_table is never NULL anymore). Bulk operations (copy, update, resize, clearing slots during dealloc) benefit in some cases from now looping on the ma_fill count rather than on ma_size, but that was an unexpected benefit: the original reason to loop on ma_fill was to let bulk operations on empty dicts end quickly (since the NULL-pointer checks went away, empty dicts aren't special-cased any more). Special considerations: For dicts that remain empty, this change is a lose on two counts: the dict object contains 8 new dictentry slots now that weren't needed before, and dict object creation also spends time memset'ing these doomed-to-be-unsused slots to NULLs. For dicts with one or two entries that never get larger than 2, it's a mix: a malloc()/free() pair is no longer needed, and the 2-entry case gets to use 8 slots (instead of 4) thus decreasing the chance of collision. Against that, dict object creation spends time memset'ing 4 slots that aren't strictly needed in this case. For dicts with 3 through 5 entries that never get larger than 5, it's a pure win: the dict is created with all the space they need, and they never need to resize. Before they suffered two malloc()/free() calls, plus 1 dict resize, to get enough space. In addition, the 8-slot table they ended with consumed more memory overall, because of the hidden overhead due to the additional malloc. For dicts with 6 or more entries, the ma_smalltable member is wasted space, but then these are large(r) dicts so 8 slots more or less doesn't make much difference. They still benefit all the time from removing ubiquitous dynamic null-pointer checks, and get a small benefit (but relatively smaller the larger the dict) from not having to do two mallocs, two frees, and a resize on the way to getting their sixth entry. All in all it appears a small but definite general win, with larger benefits in specific cases. It's especially nice that it allowed to get rid of several branches, gotos and labels, and overall made the code smaller.	2001-05-22 20:40:22 +00:00
Guido van Rossum	5b021848ac	file_getiter(): make iter(file) be equivalent to file.xreadlines(). This should be faster. This means: (1) "for line in file:" won't work if the xreadlines module can't be imported. (2) The body of "for line in file:" shouldn't use the file directly; the effects (e.g. of file.readline(), file.seek() or even file.tell()) would be undefined because of the buffering that goes on in the xreadlines module.	2001-05-22 16:48:37 +00:00
Guido van Rossum	0ba9e3ac27	init_name_op(): add (void) to the argument list to make it a valid prototype, for gcc -Wstrict-prototypes.	2001-05-22 02:33:08 +00:00
Marc-André Lemburg	489b56e044	This patch changes the behaviour of the UTF-16 codec family. Only the UTF-16 codec will now interpret and remove a leading BOM mark. Sub- sequent BOM characters are no longer interpreted and removed. UTF-16-LE and -BE pass through all BOM mark characters. These changes should get the UTF-16 codec more in line with what the Unicode FAQ recommends w/r to BOM marks.	2001-05-21 20:30:15 +00:00
Tim Peters	91a364df17	Bugfix candidate. Two exceedingly unlikely errors in dictresize(): 1. The loop for finding the new size had an off-by-one error at the end (could over-index the polys[] vector). 2. The polys[] vector ended with a 0, apparently intended as a sentinel value but never used as such; i.e., it was never checked, so 0 could have been used as a polynomial. Neither bug could trigger unless a dict grew to 2**30 slots; since that would consume at least 12GB of memory just to hold the dict pointers, I'm betting it's not the cause of the bug Fred's tracking down <wink>.	2001-05-19 07:04:38 +00:00
Tim Peters	1928314ef4	Speed dictresize by collapsing its two passes into one; the reason given in the comments for using two passes was bogus, as the only object that can get decref'ed due to the copy is the dummy key, and decref'ing dummy can't have side effects (for one thing, dummy is immortal! for another, it's a string object, not a potentially dangerous user-defined object).	2001-05-17 22:25:34 +00:00
Tim Peters	d7ed3bf552	Speed tuple comparisons in two ways: 1. Omit the early-out EQ/NE "lengths different?" test. Was unable to find any real code where it triggered, but it always costs. The same is not true of list richcmps, where different-size lists appeared to get compared about half the time. 2. Because tuples are immutable, there's no need to refetch the lengths of both tuples from memory again on each loop trip. BUG ALERT: The tuple (and list) richcmp algorithm is arguably wrong, because it won't believe there's any difference unless Py_EQ returns false for some corresponding elements: >>> class C: ... def __lt__(x, y): return 1 ... __eq__ = __lt__ ... >>> C() < C() 1 >>> (C(),) < (C(),) 0 >>> That doesn't make sense -- provided you believe the defn. of C makes sense.	2001-05-15 20:12:59 +00:00
Marc-André Lemburg	2d9204199f	This patch changes the way the string .encode() method works slightly and introduces a new method .decode(). The major change is that strg.encode() will no longer try to convert Unicode returns from the codec into a string, but instead pass along the Unicode object as-is. The same is now true for all other codec return types. The underlying C APIs were changed accordingly. Note that even though this does have the potential of breaking existing code, the chances are low since conversion from Unicode previously took place using the default encoding which is normally set to ASCII rendering this auto-conversion mechanism useless for most Unicode encodings. The good news is that you can now use .encode() and .decode() with much greater ease and that the door was opened for better accessibility of the builtin codecs. As demonstration of the new feature, the patch includes a few new codecs which allow string to string encoding and decoding (rot13, hex, zip, uu, base64). Written by Marc-Andre Lemburg. Copyright assigned to the PSF.	2001-05-15 12:00:02 +00:00
Tim Peters	342c65e19a	Aggressive reordering of dict comparisons. In case of collision, it stands to reason that me_key is much more likely to match the key we're looking for than to match dummy, and if the key is absent me_key is much more likely to be NULL than dummy: most dicts don't even have a dummy entry. Running instrumented dict code over the test suite and some apps confirmed that matching dummy was 200-300x less frequent than matching key in practice. So this reorders the tests to try the common case first. It can lose if a large dict with many collisions is mostly deleted, not resized, and then frequently searched, but that's hardly a case we should be favoring.	2001-05-13 06:43:53 +00:00
Tim Peters	2f228e75e4	Get rid of the superstitious "~" in dict hashing's "i = (~hash) & mask". The comment following used to say: /* We use ~hash instead of hash, as degenerate hash functions, such as for ints <sigh>, can have lots of leading zeros. It's not really a performance risk, but better safe than sorry. 12-Dec-00 tim: so ~hash produces lots of leading ones instead -- what's the gain? / That is, there was never a good reason for doing it. And to the contrary, as explained on Python-Dev last December, it tended to make the sum* (i + incr) & mask (which is the first table index examined in case of collison) the same "too often" across distinct hashes. Changing to the simpler "i = hash & mask" reduced the number of string-dict collisions (== # number of times we go around the lookup for-loop) from about 6 million to 5 million during a full run of the test suite (these are approximate because the test suite does some random stuff from run to run). The number of collisions in non-string dicts also decreased, but not as dramatically. Note that this may, for a given dict, change the order (wrt previous releases) of entries exposed by .keys(), .values() and .items(). A number of std tests suffered bogus failures as a result. For dicts keyed by small ints, or (less so) by characters, the order is much more likely to be in increasing order of key now; e.g., >>> d = {} >>> for i in range(10): ... d[i] = i ... >>> d {0: 0, 1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9} >>> Unfortunately. people may latch on to that in small examples and draw a bogus conclusion. test_support.py Moved test_extcall's sortdict() into test_support, made it stronger, and imported sortdict into other std tests that needed it. test_unicode.py Excluced cp875 from the "roundtrip over range(128)" test, because cp875 doesn't have a well-defined inverse for unicode("?", "cp875"). See Python-Dev for excruciating details. Cookie.py Chaged various output functions to sort dicts before building strings from them. test_extcall Fiddled the expected-result file. This remains sensitive to native dict ordering, because, e.g., if there are multiple errors in a keyword-arg dict (and test_extcall sets up many cases like that), the specific error Python complains about first depends on native dict ordering.	2001-05-13 00:19:31 +00:00
Tim Peters	16cabc0a3d	Repair "module has no attribute xxx" error msg; bug introduced when switching from tp_getattr to tp_getattro.	2001-05-12 20:24:22 +00:00
Tim Peters	d85e102337	Variant of patch #423262 : Change module attribute get & set Allow module getattr and setattr to exploit string interning, via the previously null module object tp_getattro and tp_setattro slots. Yields a very nice speedup for things like random.random and os.path etc.	2001-05-11 21:51:48 +00:00
Jeremy Hylton	1b0feb4ada	Variant of SF patch 423181 For rich comparisons, use instance_getattr2() when possible to avoid the expense of setting an AttributeError. Also intern the name_op[] table and use the interned strings rather than creating a new string and interning it each time through.	2001-05-11 14:48:41 +00:00
Tim Peters	5acbfcc164	Cosmetic: code under "else" clause was missing indent.	2001-05-11 03:36:45 +00:00
Tim Peters	4fa58bfac2	Restore dicts' tp_compare slot, and change dict_richcompare to say it doesn't know how to do LE, LT, GE, GT. dict_richcompare can't do the latter any faster than dict_compare can. More importantly, for cmp(dict1, dict2), Python first tries rich compares with EQ, LT, and GT one at a time, even if the tp_compare slot is defined, and dict_richcompare called dict_compare for the latter two because it couldn't do them itself. The result was a lot of wasted calls to dict_compare. Now dict_richcompare gives up at once the times Python calls it with LT and GT from try_rich_to_3way_compare(), and dict_compare is called only once (when Python gets around to trying the tp_compare slot). Continued mystery: despite that this cut the number of calls to dict_compare approximately in half in test_mutants.py, the latter still runs amazingly slowly. Running under the debugger doesn't show excessive activity in the dict comparison code anymore, so I'm guessing the culprit is somewhere else -- but where? Perhaps in the element (key/value) comparison code? We clearly spend a lot of time figuring out how to compare things.	2001-05-10 21:45:19 +00:00
Tim Peters	3918fb2549	Repair typo in comment.	2001-05-10 18:58:31 +00:00
Tim Peters	95bf9390a4	SF bug #422121 Insecurities in dict comparison. Fixed a half dozen ways in which general dict comparison could crash Python (even cause Win98SE to reboot) in the presence of kay and/or value comparison routines that mutate the dict during dict comparison. Bugfix candidate.	2001-05-10 08:32:44 +00:00
Tim Peters	9c012af3c3	Heh. I need a break. After this: stropmodule & stringobject were more out of synch than I realized, and I managed to break replace's "count" argument when it was 0. All is well again. Maybe. Bugfix candidate.	2001-05-10 00:32:57 +00:00
Tim Peters	4cd44ef4bf	Fudge. stropmodule and stringobject both had copies of the buggy mymemXXX stuff, and they were already out of synch. Fix the remaining bugs in both and get them back in synch. Bugfix release candidate.	2001-05-10 00:05:33 +00:00
Tim Peters	1a97d5f098	SF patch #416247 2.1c1 stringobject: unused vrbl cleanup. Thanks to Mark Favas.	2001-05-09 20:06:00 +00:00
Tim Peters	4862ab7bf4	Sheesh -- repair the dodge around "cast isn't an lvalue" complaints to restore correct semantics.	2001-05-09 08:43:21 +00:00
Tim Peters	9e897f41db	Mark Favas reported that gcc caught me using casts as lvalues. Dodge it.	2001-05-09 07:37:07 +00:00
Tim Peters	b4bbcd76ea	Ack! Restore the COUNT_ALLOCS one_strings code.	2001-05-09 00:31:40 +00:00
Tim Peters	cf5ad5d6f6	My change to string_item() left an extra reference to each 1-character interned string created by "string"[i]. Since they're immortal anyway, this was hard to notice, but it was still wrong <wink>.	2001-05-09 00:24:55 +00:00
Tim Peters	5b4d477568	Intern 1-character strings as soon as they're created. As-is, they aren't interned when created, so the cached versions generally aren't ever interned. With the patch, the Py_INCREF(t); *p = t; Py_DECREF(s); return; indirection block in PyString_InternInPlace() is never executed during a full run of the test suite, but was executed very many times before. So I'm trading more work when creating one-character strings for doing less work later. Note that the "more work" here can happen at most 256 times per program run, so it's trivial. The same reasoning accounts for the patch's simplification of string_item (the new version can call PyString_FromStringAndSize() no more than 256 times per run, so there's no point to inlining that stuff -- if we were serious about saving time here, we'd pre-initialize the characters vector so that no runtime testing at all was needed!).	2001-05-08 22:33:50 +00:00
Tim Peters	72f98e9b83	SF bug #422177 : Results from .pyc differs from .py Store floats and doubles to full precision in marshal. Test that floats read from .pyc/.pyo closely match those read from .py. Declare PyFloat_AsString() in floatobject header file. Add new PyFloat_AsReprString() API function. Document the functions declared in floatobject.h.	2001-05-08 15:19:57 +00:00
Tim Peters	e63415ead8	SF patch #421922 : Implement rich comparison for dicts. d1 == d2 and d1 != d2 now work even if the keys and values in d1 and d2 don't support comparisons other than ==, and testing dicts for equality is faster now (especially when inequality obtains).	2001-05-08 04:38:29 +00:00
Jeremy Hylton	4c889011db	SF patch 419176 from MvL; fixed bug 418977 Two errors in dict_to_map() helper used by PyFrame_LocalsToFast().	2001-05-08 04:08:59 +00:00
Jeremy Hylton	d37292bb8d	Remove unused variable	2001-05-08 04:00:45 +00:00
Tim Peters	6d60b2e762	SF bug #422108 - Error in rich comparisons. 2.1.1 bugfix candidate too. Fix a bad (albeit unlikely) return value in try_rich_to_3way_compare(). Also document do_cmp()'s return values.	2001-05-07 20:53:51 +00:00
Tim Peters	cb8d368b82	Reimplement PySequence_Contains() and instance_contains(), so they work safely together and don't duplicate logic (the common logic was factored out into new private API function _PySequence_IterContains()). Visible change: some_complex_number in some_instance no longer blows up if some_instance has __getitem__ but neither __contains__ nor __iter__. test_iter changed to ensure that remains true.	2001-05-05 21:05:01 +00:00
Tim Peters	75f8e35ef4	Generalize PySequence_Count() (operator.countOf) to work with iterators.	2001-05-05 11:33:43 +00:00
Tim Peters	de9725f135	Make 'x in y' and 'x not in y' (PySequence_Contains) play nice w/ iterators. NEEDS DOC CHANGES A few more AttributeErrors turned into TypeErrors, but in test_contains this time. The full story for instance objects is pretty much unexplainable, because instance_contains() tries its own flavor of iteration-based containment testing first, and PySequence_Contains doesn't get a chance at it unless instance_contains() blows up. A consequence is that some_complex_number in some_instance dies with a TypeError unless some_instance.__class__ defines __iter__ but does not define __getitem__.	2001-05-05 10:06:17 +00:00
Tim Peters	2cfe368283	Make unicode.join() work nice with iterators. This also required a change to string.join(), so that when the latter figures out in midstream that it really needs unicode.join() instead, unicode.join() can actually get all the sequence elements (i.e., there's no guarantee that the sequence passed to string.join() can be iterated over again by unicode.join(), so string.join() must not pass on the original sequence object anymore).	2001-05-05 05:36:48 +00:00
Tim Peters	12d0a6c78a	Fix a tiny and unlikely memory leak. Was there before too, and actually several of these turned up and got fixed during the iteration crusade.	2001-05-05 04:10:25 +00:00
Tim Peters	6912d4ddf0	Generalize tuple() to work nicely with iterators. NEEDS DOC CHANGES. This one surprised me! While I expected tuple() to be a no-brainer, turns out it's actually dripping with consequences: 1. It will allow the popular PySequence_Fast() to work with any iterable object (code for that not yet checked in, but should be trivial). 2. It caused two std tests to fail. This because some places used PyTuple_Sequence() (the C spelling of tuple()) as an indirect way to test whether something is a sequence. But tuple() code only looked for the existence of sq->item to determine that, and e.g. an instance passed that test whether or not it supported the other operations tuple() needed (e.g., __len__). So some things the tests expected to fail with an AttributeError now fail with a TypeError instead. This looks like an improvement to me; e.g., test_coercion used to produce 559 TypeErrors and 2 AttributeErrors, and now they're all TypeErrors. The error details are more informative too, because the places calling this were looking for TypeErrors in order to replace the generic tuple() "not a sequence" msg with their own more specific text, and AttributeErrors snuck by that.	2001-05-05 03:56:37 +00:00
Tim Peters	f4848dac41	Make PyIter_Next() a little smarter (wrt its knowledge of iterator internals) so clients can be a lot dumber (wrt their knowledge).	2001-05-05 00:14:56 +00:00
Fred Drake	6aebded915	The weakref support in PyObject_InitVar() as well; this should have come out at the same time as it did from PyObject_Init() .	2001-05-03 20:04:33 +00:00
Fred Drake	ba40ec42c8	Remove unnecessary intialization for the case of weakly-referencable objects; the code necessary to accomplish this is simpler and faster if confined to the object implementations, so we only do this there. This causes no behaviorial changes beyond a (very slight) speedup.	2001-05-03 19:44:50 +00:00
Fred Drake	4dcb85b817	Since Py_TPFLAGS_HAVE_WEAKREFS is set in Py_TPFLAGS_DEFAULT, it does not need to be specified in the type structures independently. The flag exists only for binary compatibility. This is a "source cleanliness" issue and introduces no behavioral changes.	2001-05-03 16:04:13 +00:00
Guido van Rossum	b1f35bffe5	Mchael Hudson pointed out that the code for detecting changes in dictionary size was comparing ma_size, the hash table size, which is always a power of two, rather than ma_used, wich changes on each insertion or deletion. Fixed this.	2001-05-02 15:13:44 +00:00
Marc-André Lemburg	542fe56cb9	Fix for bug #417030 : "print '%*s' fails for unicode string"	2001-05-02 14:21:53 +00:00
Tim Peters	6ad22c41c2	Plug a memory leak in list(), when appending to the result list.	2001-05-02 07:12:39 +00:00
Tim Peters	f553f89d45	Generalize list(seq) to work with iterators. This also generalizes list() to no longer insist that len(seq) be defined. NEEDS DOC CHANGES. This is meant to be a model for how other functions of this ilk (max, filter, etc) can be generalized similarly. Feel encouraged to grab your favorite and convert it! Note some cute consequences: list(file) == file.readlines() == list(file.xreadlines()) list(dict) == dict.keys() list(dict.iteritems()) = dict.items() list(xrange(i, j, k)) == range(i, j, k)	2001-05-01 20:45:31 +00:00
Guido van Rossum	47668928e6	Discard a misleading comment about iter_iternext().	2001-05-01 17:01:25 +00:00
Guido van Rossum	4f288ab7d6	Printing objects to a real file still wasn't done right: if the object's type didn't define tp_print, there were still cases where the full "print uses str() which falls back to repr()" semantics weren't honored. This resulted in >>> print None <None object at 0x80bd674> >>> print type(u'') <type object at 0x80c0a80> Fixed this by always using the appropriate PyObject_Repr() or PyObject_Str() call, rather than trying to emulate what they would do. Also simplified PyObject_Str() to always fall back on PyObject_Repr() when tp_str is not defined (rather than making an extra check for instances with a __str__ method). And got rid of the special case for strings.	2001-05-01 16:53:37 +00:00
Guido van Rossum	189f1df301	Add a proper implementation for the tp_str slot (returning self, of course), so I can get rid of the special case for strings in PyObject_Str().	2001-05-01 16:51:53 +00:00
Guido van Rossum	09e563abb4	Add experimental iterkeys(), itervalues(), iteritems() to dict objects. Tests show that iteritems() is 5-10% faster than iterating over the dict and extracting the value with dict[key].	2001-05-01 12:10:21 +00:00
Guido van Rossum	82c690f11a	Well darnit! The innocuous fix I made to PyObject_Print() caused printing of instances not to look for __str__(). Fix this.	2001-04-30 14:39:18 +00:00
Tim Peters	b3d8d1f76c	A different approach to the problem reported in Patch #419651: Metrowerks on Mac adds 0x itself C std says %#x and %#X conversion of 0 do not add the 0x/0X base marker. Metrowerks apparently does. Mark Favas reported the same bug under a Compaq compiler on Tru64 Unix, but no other libc broken in this respect is known (known to be OK under MSVC and gcc). So just try the damn thing at runtime and see what the platform does. Note that we've always had bugs here, but never knew it before because a relevant test case didn't exist before 2.1.	2001-04-28 05:38:26 +00:00
Guido van Rossum	3a80c4a29c	(Adding this to the trunk as well.) Fix a very old flaw in PyObject_Print(). Amazing! When an object type defines tp_str but not tp_repr, 'print x' to a real file object would not call the tp_str slot but rather print a default style representation: <foo object at 0x....>. This even though 'print x' to a file-like-object would correctly call the tp_str slot.	2001-04-27 21:35:01 +00:00
Marc-André Lemburg	8155e0e541	This patch originated from an idea by Martin v. Loewis who submitted a patch for sharing single character Unicode objects. Martin's patch had to be reworked in a number of ways to take Unicode resizing into consideration as well. Here's what the updated patch implements: * Single character Unicode strings in the Latin-1 range are shared (not only ASCII chars as in Martin's original patch). * The ASCII and Latin-1 codecs make use of this optimization, providing a noticable speedup for single character strings. Most Unicode methods can use the optimization as well (by virtue of using PyUnicode_FromUnicode()). * Some code cleanup was done (replacing memcpy with Py_UNICODE_COPY) * The PyUnicode_Resize() can now also handle the case of resizing unicode_empty which previously resulted in an error. * Modified the internal API _PyUnicode_Resize() and the public PyUnicode_Resize() API to handle references to shared objects correctly. The _PyUnicode_Resize() signature changed due to this. * Callers of PyUnicode_FromUnicode() may now only modify the Unicode object contents of the returned object in case they called the API with NULL as content template. Note that even though this patch passes the regression tests, there may still be subtle bugs in the sharing code.	2001-04-23 14:44:21 +00:00
Guido van Rossum	213c7a6aa5	Mondo changes to the iterator stuff, without changing how Python code sees it (test_iter.py is unchanged). - Added a tp_iternext slot, which calls the iterator's next() method; this is much faster for built-in iterators over built-in types such as lists and dicts, speeding up pybench's ForLoop with about 25% compared to Python 2.1. (Now there's a good argument for iterators. ;-) - Renamed the built-in sequence iterator SeqIter, affecting the C API functions for it. (This frees up the PyIter prefix for generic iterator operations.) - Added PyIter_Check(obj), which checks that obj's type has a tp_iternext slot and that the proper feature flag is set. - Added PyIter_Next(obj) which calls the tp_iternext slot. It has a somewhat complex return condition due to the need for speed: when it returns NULL, it may not have set an exception condition, meaning the iterator is exhausted; when the exception StopIteration is set (or a derived exception class), it means the same thing; any other exception means some other error occurred.	2001-04-23 14:08:49 +00:00
Guido van Rossum	65967259f2	Oops, forgot to merge this from the iter-branch to the trunk. This adds "for line in file" iteration, as promised.	2001-04-21 13:20:18 +00:00
Tim Peters	cf96de052f	SF but #417587 : compiler warnings compiling 2.1. Repaired some of the SGI compiler warnings Sjoerd Mullender reported.	2001-04-21 02:46:11 +00:00
Guido van Rossum	05311481d4	Adding iterobject.[ch], which were accidentally not added. Sorry\!	2001-04-20 21:06:46 +00:00
Guido van Rossum	59d1d2b434	Iterators phase 1. This comprises: new slot tp_iter in type object, plus new flag Py_TPFLAGS_HAVE_ITER new C API PyObject_GetIter(), calls tp_iter new builtin iter(), with two forms: iter(obj), and iter(function, sentinel) new internal object types iterobject and calliterobject new exception StopIteration new opcodes for "for" loops, GET_ITER and FOR_ITER (also supported by dis.py) new magic number for .pyc files new special method for instances: __iter__() returns an iterator iteration over dictionaries: "for x in dict" iterates over the keys iteration over files: "for x in file" iterates over lines TODO: documentation test suite decide whether to use a different way to spell iter(function, sentinal) decide whether "for key in dict" is a good idea use iterators in map/filter/reduce, min/max, and elsewhere (in/not in?) speed tuning (make next() a slot tp_next???)	2001-04-20 19:13:02 +00:00
Guido van Rossum	55ad67d74d	Oops. Removed dictiter_new decl that wasn't supposed to go in yet.	2001-04-20 16:52:06 +00:00
Guido van Rossum	0dbb4fba4c	Implement, test and document "key in dict" and "key not in dict". I know some people don't like this -- if it's really controversial, I'll take it out again. (If it's only Alex Martelli who doesn't like it, that doesn't count as "real controversial" though. :-) That's why this is a separate checkin from the iterators stuff I'm about to check in next.	2001-04-20 16:50:40 +00:00
Tim Peters	78fe5308b4	CVS patch 416248: 2.1c1 unicodeobject: unused vrbl cleanup, from Mark Favas.	2001-04-19 21:55:14 +00:00
Jeremy Hylton	b8a93215c2	Revert previous checkin, which caused test_unicodedata to fail.	2001-04-19 16:43:49 +00:00
Martin v. Löwis	da3dc5b892	Patch #416953 : Cache ASCII characters to speed up ASCII decoding.	2001-04-18 12:49:15 +00:00
Guido van Rossum	e04eaec5b6	Tim pointed out a remaining vulnerability in popitem(): the PyTuple_New() could conceivably clear the dict, so move the test for an empty dict after the tuple allocation. It means that we waste time allocating and deallocating a 2-tuple when the dict is empty, but who cares. It also means that when the dict is empty and there's no memory to allocate a 2-tuple, we raise MemoryError, not KeyError -- but that may actually a good idea: if there's no room for a lousy 2-tuple, what are the chances that there's room for a KeyError instance?	2001-04-16 00:02:32 +00:00
Guido van Rossum	a4dd011259	Tentative fix for a problem that Tim discovered at the last moment, and reported to python-dev: because we were calling dict_resize() in PyDict_Next(), and because GC's dict_traverse() uses PyDict_Next(), and because PyTuple_New() can cause GC, and because dict_items() calls PyTuple_New(), it was possible for dict_items() to have the dict resized right under its nose. The solution is convoluted, and touches several places: keys(), values(), items(), popitem(), PyDict_Next(), and PyDict_SetItem(). There are two parts to it. First, we no longer call dict_resize() in PyDict_Next(), which seems to solve the immediate problem. But then PyDict_SetItem() must have a different policy about when it calls dict_resize(), because we want to guarantee (e.g. for an algorithm that Jeremy uses in the compiler) that you can loop over a dict using PyDict_Next() and make changes to the dict as long as those changes are only value replacements for existing keys using PyDict_SetItem(). This is done by resizing after the insertion instead of before, and by remembering the size before we insert the item, and if the size is still the same, we don't bother to even check if we might need to resize. An additional detail is that if the dict starts out empty, we must still resize it before the insertion. That was the first part. :-) The second part is to make keys(), values(), items(), and popitem() safe against side effects on the dict caused by allocations, under the assumption that if the GC can cause arbitrary Python code to run, it can cause other threads to run, and it's not inconceivable that our dict could be resized -- it would be insane to write code that relies on this, but not all code is sane. Now, I have this nagging feeling that the loops in lookdict probably are blissfully assuming that doing a simple key comparison does not change the dict's size. This is not necessarily true (the keys could be class instances after all). But that's a battle for another day.	2001-04-15 22:16:26 +00:00
Guido van Rossum	6b356e70b5	Make one more private symbol static.	2001-04-14 17:55:41 +00:00
Guido van Rossum	f68d8e52e7	Make some private symbols static.	2001-04-14 17:55:09 +00:00
Tim Peters	fff5325078	Bug 415514 reported that e.g. "%#x" % 0 blew up, at heart because C sprintf supplies a base marker if and only if the value is not 0. I then fixed that, by tolerating C's inconsistency when it does %#x, and taking away that Python produced 0x0 when formatting 0L (the "long" flavor of 0) under %#x itself. But after talking with Guido, we agreed it would be better to supply 0x for the short int case too, despite that it's inconsistent with C, because C is inconsistent with itself and with Python's hex(0) (plus, while "%#x" % 0 didn't work before, "%#x" % 0L did, and returned "0x0"). Similarly for %#X conversion.	2001-04-12 18:38:48 +00:00
Tim Peters	711088d9b8	Fix for SF bug #415514 : "%#x" % 0 caused assertion failure/abort. http://sourceforge.net/tracker/index.php?func=detail&aid=415514&group_id=5470&atid=105470 For short ints, Python defers to the platform C library to figure out what %#x should do. The code asserted that the platform C returned a string beginning with "0x". However, that's not true when-- and only when --the value being formatted is 0. Changed the code to live with C's inconsistency here. In the meantime, the problem does not arise if you format a long 0 (0L) instead. However, that's because the code we wrote to do %#x conversions on longs produces a leading "0x" regardless of value. That's probably wrong too: we should drop leading "0x", for consistency with C, when (& only when) formatting 0L. So I changed the long formatting code to do that too.	2001-04-12 00:35:51 +00:00
Marc-André Lemburg	ae605341e3	Fixed ref count bug. Patch #411191 . Found by Walter Dörwald.	2001-03-25 19:16:13 +00:00
Fred Drake	db81e8ddf8	Add support for weak references to the function and method types.	2001-03-23 04:19:27 +00:00
Fred Drake	4e262a9631	A small change to the C API for weakly-referencable types: Such types must now initialize the extra field used by the weak-ref machinery to NULL themselves, to avoid having to require PyObject_INIT() to check if the type supports weak references and do it there. This causes less work to be done for all objects (the type object does not need to be consulted to check for the Py_TPFLAGS_HAVE_WEAKREFS bit).	2001-03-22 18:26:47 +00:00
Tim Peters	6783070ebf	Make PyDict_Next safe to use for loops that merely modify the values associated with existing dict keys. This is a variant of part of Michael Hudson's patch #409864 "lazy fix for Pings bizarre scoping crash".	2001-03-21 19:23:56 +00:00
Guido van Rossum	823649d544	Move the code implementing isinstance() and issubclass() to new C APIs, PyObject_IsInstance() and PyObject_IsSubclass() -- both returning an int, or -1 for errors.	2001-03-21 18:40:58 +00:00
Jeremy Hylton	220ae7c0bf	Fix PyFrame_FastToLocals() and counterpart to deal with cells and frees. Note there doesn't seem to be any way to test LocalsToFast(), because the instructions that trigger it are illegal in nested scopes with free variables. Fix allocation strategy for cells that are also formal parameters. Instead of emitting LOAD_FAST / STORE_DEREF pairs for each parameter, have the argument handling code in eval_code2() do the right thing. A side-effect of this change is that cell variables that are also arguments are listed at the front of co_cellvars in the order they appear in the argument list.	2001-03-21 16:43:47 +00:00
Guido van Rossum	a1351fbd88	SF patch #408326 by Robin Thomas: slice objects comparable, not hashable This patch changes the behavior of slice objects in the following manner: - Slice objects are now comparable with other slice objects as though they were logically tuples of (start,stop,step). The tuple is not created in the comparison function, but the comparison behavior is logically equivalent. - Slice objects are not hashable. With the above change to being comparable, slice objects now cannot be used as keys in dictionaries. [I've edited the patch for style. Note that this fixes the problem that dict[i:j] seemed to work but was meaningless. --GvR]	2001-03-20 12:41:34 +00:00
Tim Peters	0f33604e17	SF bug [ #409448 ] Complex division is braindead http://sourceforge.net/tracker/?func=detail&aid=409448&group_id=5470&atid=105470 Now less braindead. Also added test_complex.py, which doesn't test much, but fails without this patch.	2001-03-18 08:21:57 +00:00
Jeremy Hylton	30c9f3991c	Variety of small INC/DECREF patches that fix reported memory leaks with free variables. Thanks to Martin v. Loewis for finding two of the problems. This fixes SF buf 405583. There is also a C API change: PyFrame_New() is reverting to its pre-2.1 signature. The change introduced by nested scopes was a mistake. XXX Is this okay between beta releases? cell_clear(), the GC helper, must decref its reference to break cycles. frame_dealloc() must dealloc all cell vars and free vars in addition to locals. eval_code2() setup code must INCREF cells it copies out of the closure. The STORE_DEREF opcode implementation must DECREF the object it passes to PyCell_Set().	2001-03-13 01:58:22 +00:00
Tim Peters	b2336529ef	Identifiers matching _[A-Z_]\w* are reserved for C implementations. May or may not be related to bug 407680 (obmalloc.c - looks like it's corrupted). This repairs the illegal vrbl names, but leaves a pile of illegal macro names (_THIS_xxx, _SYSTEM_xxx, _SET_HOOKS, _FETCH_HOOKS).	2001-03-11 18:36:13 +00:00
Tim Peters	7069512bd0	When 1.6 boosted the # of digits produced by repr(float), repr(complex) apparently forgot to play along. Make complex act like float.	2001-03-11 08:37:29 +00:00
Martin v. Löwis	01c6526c0e	Avoid giving prototypes on Solaris.	2001-03-06 12:14:54 +00:00
Martin v. Löwis	2b6727bd8a	Use Py_CHARMASK for ctype macros. Fixes bug #232787 .	2001-03-06 12:12:02 +00:00
Guido van Rossum	4f53da07bf	Two improvements to large file support: - In _portable_ftell(), try fgetpos() before ftello() and ftell64(). I ran into a situation on a 64-bit capable Linux where the C library's ftello() and ftell64() returned negative numbers despite fpos_t and off_t both being 64-bit types; fgetpos() did the right thing. - Define a new typedef, Py_off_t, which is either fpos_t or off_t, depending on which one is 64 bits. This removes the need for a lot of #ifdefs later on. (XXX Should this be moved to pyport.h? That file currently seems oblivious to large fille support, so for now I'll leave it here where it's needed.)	2001-03-01 18:26:53 +00:00

... 13 14 15 16 17 ...

2326 Commits