cpython

Commit Graph

Author	SHA1	Message	Date
Raymond Hettinger	97bc618229	list_inplace_concat() is now expressed in terms of list_extend() which avoids creating an intermediate tuple for iterable arguments other than lists or tuples. In other words, a+=b no longer requires extra memory when b is not a list or tuple. The list and tuple cases are unchanged.	2004-03-11 07:34:19 +00:00
Neil Schemenauer	4252a7a5d1	Make buffer objects based on mutable objects (like array) safe.	2004-03-11 02:42:45 +00:00
Neil Schemenauer	0eadcd9cbb	Document one of the many problems with the buffer object.	2004-03-11 01:00:44 +00:00
Neil Schemenauer	5e3a675b6d	Rename static functions, they should not have the _Py prefix.	2004-03-11 00:44:54 +00:00
Raymond Hettinger	66d31f8f38	Use memcpy() instead of memmove() when the buffers are known to be distinct.	2004-03-10 11:44:04 +00:00
Raymond Hettinger	ef9bf4031a	Tidied up the implementations of reversed (including the custom ones for xrange and list objects). * list.__reversed__ now checks the length of the sequence object before calling PyList_GET_ITEM() because the mutable could have changed length. * all three implementations are now transparent with respect to length and maintain the invariant len(it) == len(list(it)) even when the underlying sequence mutates. * __builtin__.reversed() now frees the underlying sequence as soon as the iterator is exhausted. * the code paths were rearranged so that the most common paths do not require a jump.	2004-03-10 10:10:42 +00:00
Raymond Hettinger	d2c36261a2	Eliminate the double reverse option. Its only use case was academic and it was potentially confusing to use.	2004-03-10 08:32:47 +00:00
Raymond Hettinger	a6366fe085	Optimize inner loops for subscript, repeat, and concat.	2004-03-09 13:05:22 +00:00
Raymond Hettinger	f889e10c19	Optimize slice assignments. * Replace sprintf message with a constant message string -- this error message ran on every invocation except straight deletions but it was only needed when the rhs was not iterable. The message was also out-of-date and did not reflect that iterable arguments were allowed. * For inner loops that do not make ref count adjustments, use memmove() for fast copying and better readability. * For inner loops that do make ref count adjustments, speed them up by factoring out the constant structure reference and using vitem[] instead.	2004-03-09 08:04:33 +00:00
Raymond Hettinger	3fd500b4a5	The copy module now handles sets directly. The __copy__ methods are no longer needed.	2004-03-08 18:31:10 +00:00
Raymond Hettinger	b7d05db0be	Optimize tuple_slice() and make further improvements to list_slice() and list.extend(). Factoring the inner loops to remove the constant structure references and fixed offsets gives speedups ranging from 20% to 30%.	2004-03-08 07:25:05 +00:00
Raymond Hettinger	99842b6534	Small optimizations for list_slice() and list_extend_internal(). * Using addition instead of subtraction on array indices allows the compiler to use a fast addressing mode. Saves about 10%. * Using PyTuple_GET_ITEM and PyList_SET_ITEM is about 7% faster than PySequenceFast_GET_ITEM which has to make a list check on every pass.	2004-03-08 05:56:15 +00:00
Raymond Hettinger	ebedb2f773	Factor out code common to PyDict_Copy() and PyDict_Merge().	2004-03-08 04:19:01 +00:00
Raymond Hettinger	31017aed36	SF #904720 : dict.update should take a 2-tuple sequence like dict.__init__ (Championed by Bob Ippolito.) The update() method for mappings now accepts all the same argument forms as the dict() constructor. This includes item lists and/or keyword arguments.	2004-03-04 08:25:44 +00:00
Michael W. Hudson	6bee23cdc3	Oops, didn't mean to commit the removal of float_compare!	2004-02-26 13:16:03 +00:00
Michael W. Hudson	957f9774b6	Pass a variable that actually exists to PyFPE_END_PROTECT in float_richcompare. Reported on c.l.py by Helmut Jarausch.	2004-02-26 12:33:09 +00:00
Michael W. Hudson	d3b33b5f6f	"Fix" (for certain configurations of the planets, including recent gcc on Linux/x86) [ 899109 ] 1==float('nan') by implementing rich comparisons for floats. Seems to make comparisons involving NaNs somewhat less surprising when the underlying C compiler actually implements C99 semantics.	2004-02-19 19:35:22 +00:00
Raymond Hettinger	fa6c6f8a73	Keep the list.pop() optimization while restoring the many possibilities for types other than PyInt being accepted for the optional argument. (Spotted by Neal Norwitz.)	2004-02-19 06:12:06 +00:00
Jeremy Hylton	7083bb744a	Oops. Return -1 to distinguish error from empty dict. This change probably isn't worth a bug fix. It's unlikely that anyone was calling this method without passing it a real dict.	2004-02-17 20:10:11 +00:00
Raymond Hettinger	9eb86b3c7c	Double the speed of list.pop() which was spending most of its time parsing arguments.	2004-02-17 11:36:16 +00:00
Raymond Hettinger	90a39bf12c	Refactor list_extend() and list_fill() for gains in code size, memory utilization, and speed: * Moved the responsibility for emptying the previous list from list_fill to list_init. * Replaced the code in list_extend with the superior code from list_fill. * Eliminated list_fill. Results: * list.extend() no longer creates an intermediate tuple except to handle the special case of x.extend(x). This saves memory and time. * list.extend(x) runs 5 to 10% faster when x is a list or tuple, 15% faster when x is an iterable not defining __len__, and twice as fast when x is an iterable defining __len__. * the code is about 15 lines shorter and no longer duplicates functionality.	2004-02-15 03:57:00 +00:00
Raymond Hettinger	ab517d2eac	Fine tune the speed/space trade-off for overallocating small lists. The Py2.3 approach overallocated small lists by up to 8 elements. The last checkin limited this to one but slowed down (by 20 to 30%) the creation of small lists between 3 and 8 elements. This tune-up balances the two, limiting overallocation to 3 elements (significantly reducing space consumption from Py2.3) and running faster than the previous checkin. The first part of the growth pattern (0, 4, 8, 16) neatly meshes with allocators that trigger data movement only when crossing a power of two boundary. Also, the even numbers mesh well with common data alignments.	2004-02-14 18:34:46 +00:00
Raymond Hettinger	2731ae4d6d	Fix missing return value. Spotted by Neal Norwitz	2004-02-14 03:07:21 +00:00
Raymond Hettinger	cb3e580ebc	Optimize list.pop() for the common special case of popping off the end. More than doubles its speed.	2004-02-13 18:36:31 +00:00
Raymond Hettinger	4bb9540dd6	* Optimized list appends and pops by making fewer calls to the underlying system realloc(). This is achieved by tracking the overallocation size in a new field and using that information to skip calls to realloc() whenever possible. * Simplified and tightened the amount of overallocation. For larger lists, this overallocates by 1/8th (compared to the previous scheme which ranged between 1/4th to 1/32nd over-allocation). For smaller lists (n<6), the maximum overallocation is one byte (formerly it could be up to eight bytes). This saves memory in applications with large numbers of small lists. * Eliminated the NRESIZE macro in favor of a new, static list_resize function that encapsulates the resizing logic. Converting this back to a macro would give a small (under 1%) speed-up. This was too small to warrant the loss of readability, maintainability, and de-coupling. * Some functions using NRESIZE had grown unnecessarily complex in their efforts to bend to the macro's calling pattern. With the new list_resize function in place, those other functions could be simplified. That is being saved for a separate patch. * The ob_item==NULL check could be eliminated from the new list_resize function. This would entail finding each piece of code that sets ob_item to NULL and adding a new line to invalidate the overallocation tracking field. Rather than impose a new requirement on other pieces of list code, it was preferred to leave the NULL check in place and retain the benefits of decoupling, maintainability and information hiding (only PyList_New() and list_sort() need to know about the new field). This approach also reduces the odds of breaking an extension module. (Collaborative effort by Raymond Hettinger, Hye-Shik Chang, Tim Peters, and Armin Rigo.)	2004-02-13 11:36:39 +00:00
Raymond Hettinger	029dba5a40	Make reversed() transparent with respect to length.	2004-02-10 09:33:39 +00:00
Raymond Hettinger	b32e640489	SF patch #875689 : >100k alloc wasted on startup (Contributed by Mike Pall.) Make sure fill_free_list() is called only once rather than 106 times when pre-allocating small ints.	2004-02-08 18:54:37 +00:00
Raymond Hettinger	06353f76be	Let reversed() work with itself.	2004-02-08 10:49:42 +00:00
Jim Fulton	8a1a594590	Fixed a bug in object.__reduce_ex__ (reduce_2) when using protocol 2. Failure to clear the error when attempts to get the __getstate__ attribute fail caused intermittent errors and odd behavior.	2004-02-08 04:21:26 +00:00
Skip Montanaro	db6080507d	Remove support for --without-universal-newlines (see PEP 11).	2004-02-07 13:53:46 +00:00
Raymond Hettinger	c058fd14a9	* Fix ref counting in extend() and extendleft(). * Let deques support reversed().	2004-02-07 02:45:22 +00:00
Walter Dörwald	cd736e71a3	Fix reallocation bug in unicode.translate(): The code was comparing characters instead of character pointers to determine space requirements.	2004-02-05 17:36:00 +00:00
Fred Drake	bc875f5a36	Allocating a new weakref object can cause existing weakref objects for the same object to be collected by the cyclic GC support if they are only referenced by a cycle. If the weakref being collected was one of the weakrefs without callbacks, some local variables for the constructor became invalid and have to be re-computed. The test caused a segfault under a debug build without the fix applied.	2004-02-04 23:14:14 +00:00
Fred Drake	6a2852cd48	Fix bug in interpretation of the "callback" argument in the constructors for weakref ref and proxy objects; None was not being treated as identical to NULL, though it was documented as equivalent.	2004-02-03 19:52:56 +00:00
Brett Cannon	fb5a4e33fb	Removed two unneeded lines from PyObject_Compare(). Closes bug #885293 (thanks, Josiah Carlson).	2004-01-27 20:17:54 +00:00
Armin Rigo	76beca957f	Two forgotten Py_DECREF() for two out-of-memory conditions.	2004-01-27 16:08:07 +00:00
Tim Peters	7049d816fb	Revert change accidentally checked in as part of a whitespace normalization patch.	2004-01-18 20:31:02 +00:00
Tim Peters	58eb11cf62	Whitespace normalization.	2004-01-18 20:29:55 +00:00
Skip Montanaro	ce59c04127	Remove support for SunOS 4. Remove BAD_EXEC_PROTOYPE (leftover from IRIX 4 demolition).	2004-01-17 14:19:44 +00:00
Raymond Hettinger	2fb702966c	SF Patch #871704 : Py_SequenceFast can mask errors (Contributed by Greg Chapman.) Since this only changes the error message, I doubt that it should be backported.	2004-01-11 23:26:51 +00:00
Hye-Shik Chang	75c00efcc7	[SF #866875 ] Add a specialized routine for one character separators on str.split() and str.rsplit().	2004-01-05 00:29:51 +00:00
Raymond Hettinger	b86269db45	Apply pre-sizing optimization to a broader class of objects. Formerly, the length was only fetched from sequence objects. Now, any object that reports its length can benefit from pre-sizing.	2004-01-04 11:00:08 +00:00
Raymond Hettinger	7832cd6141	Apply tuple/list pre-sizing optimization to a broader class of objects. Formerly, length data was fetched only from sequence objects. Now, any object that reports its length can benefit from pre-sizing. On one sample timing, it gave a threefold speedup for list(s) where s was a set object.	2004-01-04 06:08:16 +00:00
Hye-Shik Chang	1bc09b7c2a	Cosmetic fix for wrongly indented tabs with ts=4.	2004-01-03 19:35:43 +00:00
Raymond Hettinger	a3b11e7fb3	* Simplify and speedup logic for tp_print. * Speed-up intersection whenever PyDict_Next can be used.	2003-12-31 14:08:58 +00:00
Hye-Shik Chang	7db07e6972	Fix gcc 3.3 warnings related to Py_UNICODE_WIDE.	2003-12-29 01:36:01 +00:00
Andrew MacIntyre	f1ca7f561c	complete backout of listobject.c v2.171	2003-12-28 07:43:56 +00:00
Jeremy Hylton	30973414c5	Revert previous two checkins to repair test failure. The special-case code that was removed could return a value indicating success but leave an exception set. test_fileinput failed in a debug build as a result.	2003-12-26 19:05:04 +00:00
Andrew MacIntyre	694e3a4a9d	use the correct macro to access list size	2003-12-26 00:09:04 +00:00
Andrew MacIntyre	d57caed52c	Performance of list([]) in 2.3 came up in a thread on comp.lang.python, which can be reviewed via http://coding.derkeiler.com/Archive/Python/comp.lang.python/2003-12/1011.html Duncan Booth investigated, and discovered that an "optimisation" was in fact a pessimisation for small numbers of elements in a source list, compared to not having the optimisation, although with large numbers of elements in the source list the optimisation was quite beneficial. He posted his change to comp.lang.python (but not to SF). Further research has confirmed his assessment that the optimisation only becomes a net win when the source list has more than 100 elements. I also found that the optimisation could apply to tuples as well, but the gains only arrive with source tuples larger than about 320 elements and are nowhere near as significant as the gains with lists, (~95% gain @ 10000 elements for lists, ~20% gain @ 10000 elements for tuples) so I haven't proceeded with this. The code as it was applied the optimisation to list subclasses as well, and this also appears to be a net loss for all reasonable sized sources (~80-100% for up to 100 elements, ~20% for more than 500 elements; I tested up to 10000 elements). Duncan also suggested special casing empty lists, which I've extended to all empty sequences. On the basis that list_fill() is only ever called with a list for the result argument, testing for the source being the destination now happens before testing source types.	2003-12-25 13:28:48 +00:00
Hye-Shik Chang	7fc4cf57b8	Fix unicode.rsplit()'s bug that ignores separator on the end of string when using specialized splitter for 1 char sep.	2003-12-23 09:10:16 +00:00
Skip Montanaro	ac4ea13a3a	There are places in Python which assume bytes have 8-bits. Formalize that a bit by checking the value of UCHAR_MAX in Include/Python.h. There was a check in Objects/stringobject.c. Remove that. (Note that we don't define UCHAR_MAX if it's not defined as the old test did.)	2003-12-22 16:31:41 +00:00
Hye-Shik Chang	40e9509dc7	Fix broken xmlcharrefreplace by rev 2.204. (Pointy hat goes to perky)	2003-12-22 01:31:13 +00:00
Hye-Shik Chang	4a264fb054	SF #859573 : Reduce compiler warnings on gcc 3.2 and above.	2003-12-19 01:59:56 +00:00
Raymond Hettinger	64958a15d7	Guido grants a Christmas wish: sorted() becomes a regular function instead of a classmethod.	2003-12-17 20:43:33 +00:00
Raymond Hettinger	81ad32e435	Speedup set.update by using the override mode for PyDict_Merge().	2003-12-15 21:16:06 +00:00
Hye-Shik Chang	3ae811b57d	Add rsplit method for str and unicode builtin types. SF feature request #801847. Original patch is written by Sean Reifschneider.	2003-12-15 18:49:53 +00:00
Raymond Hettinger	fb4e33a8e2	Improve algorithm for set.difference when the input is not a set.	2003-12-15 13:23:55 +00:00
Raymond Hettinger	438e02dfc8	* Refactor set.__contains__() * Use Py_RETURN_NONE everywhere. * Fix-up the firstpass check for the tp_print slot.	2003-12-13 19:38:47 +00:00
Raymond Hettinger	0deab62704	Refactor set.discard() and set.remove().	2003-12-13 18:53:18 +00:00
Raymond Hettinger	6a8bbdbe7b	Improve argument checking speed.	2003-12-13 15:21:55 +00:00
Raymond Hettinger	dc5ae11abf	Use dictionary specific looping idiom where possible. Simplifies and speeds-up the code.	2003-12-13 14:46:46 +00:00
Raymond Hettinger	0c66967e3d	Simplify previous checkin -- a new function was not needed.	2003-12-13 13:31:55 +00:00
Raymond Hettinger	d3ae6729e7	Use PyDict_Contains() instead of PySequence_Contains().	2003-12-13 11:58:56 +00:00
Raymond Hettinger	8f5cdaa784	* Added a new method flag, METH_COEXIST. * Used the flag to optimize set.__contains__(), dict.__contains__(), dict.__getitem__(), and list.__getitem__().	2003-12-13 11:26:12 +00:00
Hye-Shik Chang	19cb193244	Fix memory error treatment correctly. Going to dsu_fail causes deallocating garbage pointers; saved_ob_item and empty_ob_item. (Reviewed by Raymond Hettinger)	2003-12-10 07:31:08 +00:00
Michael W. Hudson	1df0f654e8	Fixes and tests for various "holding pointers when arbitrary Python code can run" bugs as discussed in [ 848856 ] couple of new list.sort bugs	2003-12-04 11:25:46 +00:00
Guido van Rossum	6c9e130524	- Removed FutureWarnings related to hex/oct literals and conversions and left shifts. (Thanks to Kalle Svensson for SF patch 849227.) This addresses most of the remaining semantic changes promised by PEP 237, except for repr() of a long, which still shows the trailing 'L'. The PEP appears to promise warnings for operations that changed semantics compared to Python 2.3, but this is not implemented; we've suffered through enough warnings related to hex/oct literals and I think it's best to be silent now.	2003-11-29 23:52:13 +00:00
Raymond Hettinger	37e136373e	Make sure the list.sort's decorate step unwinds itself before returning an exception raised by the key function. (Suggested by Michael Hudson.)	2003-11-28 21:43:02 +00:00
Raymond Hettinger	4f8f976576	Add optional fillchar argument to ljust(), rjust(), and center() string methods.	2003-11-26 08:21:35 +00:00
Raymond Hettinger	bc0f2ab9bb	Expose dict_contains() and PyDict_Contains() which is about 10% faster than PySequence_Contains() and more clearly applicable to dicts. Apply the new function in setobject.c where __contains__ checking is ubiquitous.	2003-11-25 21:12:14 +00:00
Raymond Hettinger	a38123e2fa	Factor out more duplicate code.	2003-11-24 22:18:49 +00:00
Guido van Rossum	5f4e45d66f	Stop GCC warning about int literal that's so long that it becomes an unsigned int (on a 32-bit machine), by adding an explicit 'u' to the literal (a prime used to improve the hash function for frozenset).	2003-11-24 04:13:13 +00:00
Raymond Hettinger	f5f41bf087	* Checkin remaining documentation * Add more tests * Refactor and neaten the code a bit. * Rename union_update() to update(). * Improve the algorithms (making them closer to sets.py).	2003-11-24 02:57:33 +00:00
Raymond Hettinger	49ba4c39c4	* Simplify hash function and add test to show effectiveness of the hash function. * Add a better test for deepcopying. * Add tests to show the __init__() function works like it does for list and tuple. Add related test. * Have shallow copies of frozensets return self. Add related test. * Have frozenset(f) return f if f is already a frozenset. Add related test. * Beefed-up some existing tests.	2003-11-23 02:49:05 +00:00
Guido van Rossum	baf0f8f24d	- When method objects have an attribute that can be satisfied either by the function object or by the method object, the function object's attribute usually wins. Christian Tismer pointed out that this is really a mistake, because this only happens for special methods (like __reduce__) where the method object's version is really more appropriate than the function's attribute. So from now on, all method attributes will have precedence over function attributes with the same name.	2003-11-22 23:55:50 +00:00
Raymond Hettinger	bfd334a42d	Extend temporary hashability to remove() and discard(). Brings the functionality back in line with sets.py.	2003-11-22 03:55:23 +00:00
Raymond Hettinger	19c2d77842	Allow temporary hashability for the __contains__ test. (Requested by Alex Martelli.)	2003-11-21 18:36:54 +00:00
Raymond Hettinger	3fbec701ca	issubset() and issuperset() to work with general iterables	2003-11-21 07:56:36 +00:00
Raymond Hettinger	82d73dd459	Three minor performance improvements: * Improve the hash function to increase the chance that distinct sets will have distinct xor'd hash totals. * Use PyDict_Merge where possible (it is faster than an equivalent iter/set pair). * Don't rebuild dictionaries where the input already has one.	2003-11-20 22:54:33 +00:00
Tim Peters	403a203223	SF bug 839548: Bug in type's GC handling causes segfaults. Also SF patch 843455. This is a critical bugfix. I'll backport to 2.3 maint, but not beyond that. The bugs this fixes have been there since weakrefs were introduced.	2003-11-20 21:21:46 +00:00
Jack Jansen	eddc1449ba	Getting rid of all the code inside #ifdef macintosh too.	2003-11-20 01:44:59 +00:00
Jack Jansen	4bae2d5e46	Getting rid of code dependent on GUSI or the MetroWerks compiler.	2003-11-19 22:52:23 +00:00
Jack Jansen	fb2765666f	Getting rid of support for the ancient Apple MPW compiler.	2003-11-19 15:24:47 +00:00
Guido van Rossum	b61982bacb	Implement straightforward suggestions from gcc warnings (remove unused variable, add extra braces).	2003-11-18 19:27:19 +00:00
Raymond Hettinger	1b92fd5bca	Use PySequence_Contains() instead of direct access macro.	2003-11-18 14:15:31 +00:00
Raymond Hettinger	50a4bb325c	Various fixups (most suggested by Armin Rigo).	2003-11-17 16:42:33 +00:00
Raymond Hettinger	e2c277a69f	Fix output spacing typo	2003-11-16 16:36:58 +00:00
Raymond Hettinger	a690a9967e	* Migrate set() and frozenset() from the sandbox. * Install the unittests, docs, newsitem, include file, and makefile update. * Exercise the new functions wherever sets.py was being used. Includes the docs for libfuncs.tex. Separate docs for the types are forthcoming.	2003-11-16 16:17:49 +00:00
Tim Peters	0bd743cee1	subtype_dealloc(): Simplified overly contorted retracking logic. With this change, I think subtype_dealloc is actually a smidgen less obscure than it was in 2.3 -- we got rid of a negation in an "if" <wink>.	2003-11-13 22:50:00 +00:00
Tim Peters	f7f9e9966b	subtype_dealloc(): A more complete fix for critical bug 840829 + expanded the test case with a piece that needs the more-complete fix. I'll backport this to 2.3 maint.	2003-11-13 21:59:32 +00:00
Tim Peters	add09b4149	SF bug 840829: weakref callbacks and gc corrupt memory. subtype_dealloc(): This left the dying object exposed to gc, so that if cyclic gc triggered during the weakref callback, gc tried to delete the dying object a second time. That's a disaster. subtype_dealloc() had a (I hope!) unique problem here, as every normal dealloc routine untracks the object (from gc) before fiddling with weakrefs etc. But subtype_dealloc has obscure technical reasons for re-registering the dying object with gc (already explained in a large comment block at the bottom of the function). The fix amounts to simply refraining from reregistering the dying object with gc until after the weakref callback (if any) has been called. This is a critical bug (hard to predict, and causes seemingly random memory corruption when it occurs). I'll backport it to 2.3 later.	2003-11-12 20:43:28 +00:00
Raymond Hettinger	001f228f36	Improve the reverse list iterator to free memory as soon as the iterator is exhausted.	2003-11-08 11:58:44 +00:00
Raymond Hettinger	c24c9106e8	Minor code fixup. Make sure that len reflects the current list size.	2003-11-08 11:35:22 +00:00
Raymond Hettinger	1021c44b41	Optimize reversed(list) using a custom iterator.	2003-11-07 15:38:09 +00:00
Raymond Hettinger	85c20a41df	Implement and apply PEP 322, reverse iteration	2003-11-06 14:06:48 +00:00
Jeremy Hylton	ceac90aecb	Fix compiler warning about possible use of n without assignment. Also fix use of n for two different variables in two different blocks.	2003-11-03 20:58:28 +00:00
Raymond Hettinger	54a831bef7	Use PyTuple_Pack() to simplify enumerate().	2003-11-02 05:37:44 +00:00
Raymond Hettinger	0a9b9da0c3	Add list.sorted() classmethod.	2003-10-29 06:54:43 +00:00
Armin Rigo	2b3eb4062c	Deleting cyclic object comparison. SF patch 825639 http://mail.python.org/pipermail/python-dev/2003-October/039445.html	2003-10-28 12:05:48 +00:00

2144 Commits