cpython

Commit Graph

Author	SHA1	Message	Date
Guido van Rossum	e92e610a9e	Christian Tismer -- total rewrite on trashcan code. Improvements: - does no longer need any extra memory - has no relationship to tstate - works in debug mode - can easily be modified for free threading (hi Greg:) Side effects: Trashcan does change the order of object destruction. Prevending that would be quite an immense effort, as my attempts have shown. This version works always the same, with debug mode or not. The slightly changed destruction order should therefore be no problem. Algorithm: While the old idea of delaying the destruction of some obejcts at a certain recursion level was kept, we now no longer aloocate an object to hold these objects. The delayed objects are instead chained together via their ob_type field. The type is encoded via ob_refcnt. When it comes to the destruction of the chain of waiting objects, the topmost object is popped off the chain and revived with type and refcount 1, then it gets a normal Py_DECREF. I am confident that this solution is near optimum for minimizing side effects and code bloat.	2000-04-24 15:40:53 +00:00
Guido van Rossum	5ce78f8e4e	Patch by Charles G Waldman to avoid a sneaky memory leak in _PyTuple_Resize(). In addition, a change suggested by Jeremy Hylton to limit the size of the free lists is also merged into this patch. Charles wrote initially: """ Test Case: run the following code: class Nothing: def __len__(self): return 5 def __getitem__(self, i): if i < 3: return i else: raise IndexError, i def g(a,b,c): return for x in xrange(1000000): g(Nothing()) and watch Python's memory use go up and up. Diagnosis: The analysis begins with the call to PySequence_Tuple at line 1641 in ceval.c - the argument to g is seen to be a sequence but not a tuple, so it needs to be converted from an abstract sequence to a concrete tuple. PySequence_Tuple starts off by creating a new tuple of length 5 (line 1122 in abstract.c). Then at line 1149, since only 3 elements were assigned, _PyTuple_Resize is called to make the 5-tuple into a 3-tuple. When we're all done the 3-tuple is decrefed, but rather than being freed it is placed on the free_tuples cache. The basic problem is that the 3-tuples are being added to the cache but never picked up again, since _PyTuple_Resize doesn't make use of the free_tuples cache. If you are resizing a 5-tuple to a 3-tuple and there is already a 3-tuple in free_tuples[3], instead of using this tuple, _PyTuple_Resize will realloc the 5-tuple to a 3-tuple. It would more efficient to use the existing 3-tuple and cache the 5-tuple. By making _PyTuple_Resize aware of the free_tuples (just as PyTuple_New), we not only save a few calls to realloc, but also prevent this misbehavior whereby tuples are being added to the free_tuples list but never properly "recycled". """ And later: """ This patch replaces my submission of Sun, 16 Apr and addresses Jeremy Hylton's suggestions that we also limit the size of the free tuple list. I chose 2000 as the maximum number of tuples of any particular size to save. There was also a problem with the previous version of this patch causing a core dump if Python was built with Py_TRACE_REFS. This is fixed in the below version of the patch, which uses tupledealloc instead of _Py_Dealloc. """	2000-04-21 21:15:05 +00:00
Jeremy Hylton	4a3dd2dcc2	Fix PR#7 comparisons of recursive objects Note that comparisons of deeply nested objects can still dump core in extreme cases.	2000-04-14 19:13:24 +00:00
Guido van Rossum	f0b7b04ae8	Marc-Andre Lemburg: The maxsplit functionality in .splitlines() was replaced by the keepends functionality which allows keeping the line end markers together with the string. Added support for '%r' % obj: this inserts repr(obj) rather than str(obj).	2000-04-11 15:39:26 +00:00
Guido van Rossum	dc742b3184	Marc-Andre Lemburg: Added a few missing whitespace Unicode char mappings. Thanks to Brian Hooper.	2000-04-11 15:39:02 +00:00
Guido van Rossum	86662914be	Marc-Andre Lemburg: The maxsplit functionality in .splitlines() was replaced by the keepends functionality which allows keeping the line end markers together with the string.	2000-04-11 15:38:46 +00:00
Guido van Rossum	ba71a247ac	Simple optimization by Christian Tismer, who gives credit to Lenny Kneler for reporting this issue: long_mult() is faster when the smaller argument is on the left. Swap the arguments accordingly.	2000-04-10 17:31:58 +00:00
Guido van Rossum	fd4b957b06	Marc-Andre Lemburg: * New exported API PyUnicode_Resize() * The experimental Keep-Alive optimization was turned back on after some tweaks to the implementation. It should now work without causing core dumps... this has yet to tested though (switching it off is easy: see the unicodeobject.c file for details). * Fixed a memory leak in the Unicode freelist cleanup code. * Added tests to correctly process the return code from _PyUnicode_Resize(). * Fixed a bug in the 'ignore' error handling routines of some builtin codecs. Added test cases for these to test_unicode.py.	2000-04-10 13:51:10 +00:00
Guido van Rossum	90daa87569	Marc-Andre Lemburg: * string_contains now calls PyUnicode_Contains() only when the other operand is a Unicode string (not whenever it's not a string). * New format style '%r' inserts repr(arg) instead of str(arg). * '...%s...' % u"abc" now coerces to Unicode just like string methods. Care is taken not to reevaluate already formatted arguments -- only the first Unicode object appearing in the argument mapping is looked up twice. Added test cases for this to test_unicode.py.	2000-04-10 13:47:21 +00:00
Guido van Rossum	b244f6950b	Marc-Andre Lemburg: * TypeErrors during comparing of mixed type arguments including a Unicode object are now masked (just like they are for all other combinations).	2000-04-10 13:42:33 +00:00
Guido van Rossum	5f8b12f27e	Mark Hammond: In line with a similar checkin to object.c a while ago, this patch gives a more descriptive error message for an attribute error on a class instance. The message now looks like: AttributeError: 'Descriptor' instance has no attribute 'GetReturnType'	2000-04-10 13:03:19 +00:00
Guido van Rossum	5db862dd0c	Skip Montanaro: add string precisions to calls to PyErr_Format to prevent possible buffer overruns.	2000-04-10 12:46:51 +00:00
Guido van Rossum	ba47704943	Conrad Huang points out that "if (0 < ch < 256)", while legal C, doesn't mean what the Python programmer thought...	2000-04-06 18:18:10 +00:00
Guido van Rossum	34888ed689	Fredrik Lundh: eliminate a MSVC compiler warning.	2000-04-05 21:29:50 +00:00
Guido van Rossum	9e896b37c7	Marc-Andre's third try at this bulk patch seems to work (except that his copy of test_contains.py seems to be broken -- the lines he deleted were already absent). Checkin messages: New Unicode support for int(), float(), complex() and long(). - new APIs PyInt_FromUnicode() and PyLong_FromUnicode() - added support for Unicode to PyFloat_FromString() - new encoding API PyUnicode_EncodeDecimal() which converts Unicode to a decimal char* string (used in the above new APIs) - shortcuts for calls like int(<int object>) and float(<float obj>) - tests for all of the above Unicode compares and contains checks: - comparing Unicode and non-string types now works; TypeErrors are masked, all other errors such as ValueError during Unicode coercion are passed through (note that PyUnicode_Compare does not implement the masking -- PyObject_Compare does this) - contains now works for non-string types too; TypeErrors are masked and 0 returned; all other errors are passed through Better testing support for the standard codecs. Misc minor enhancements, such as an alias dbcs for the mbcs codec. Changes: - PyLong_FromString() now applies the same error checks as does PyInt_FromString(): trailing garbage is reported as error and not longer silently ignored. The only characters which may be trailing the digits are 'L' and 'l' -- these are still silently ignored. - string.ato?() now directly interface to int(), long() and float(). The error strings are now a little different, but the type still remains the same. These functions are now ready to get declared obsolete ;-) - PyNumber_Int() now also does a check for embedded NULL chars in the input string; PyNumber_Long() already did this (and still does) Followed by: Looks like I've gone a step too far there... (and test_contains.py seem to have a bug too). I've changed back to reporting all errors in PyUnicode_Contains() and added a few more test cases to test_contains.py (plus corrected the join() NameError).	2000-04-05 20:11:21 +00:00
Guido van Rossum	2ea3e143f0	Some blank lines.	2000-03-31 17:24:09 +00:00
Jeremy Hylton	a12c7a7620	Add PyDict_Copy() function to C API for dicts. It returns a new dictionary that contains the same key/value pairs as p.	2000-03-30 22:27:31 +00:00
Guido van Rossum	b7a40ba8d3	MBCS codecs. (Win32 only.) By Mark Hammond.	2000-03-28 02:01:52 +00:00
Guido van Rossum	13ff8eb493	Christian Tismer: Added "better safe than sorry" patch to the new trashcan code in object.c, to ensure that tstate is not touched when it might be undefined.	2000-03-25 18:39:19 +00:00
Barry Warsaw	51ac58039f	On 17-Mar-2000, Marc-Andre Lemburg said: Attached you find an update of the Unicode implementation. The patch is against the current CVS version. I would appreciate if someone with CVS checkin permissions could check the changes in. The patch contains all bugs and patches sent this week and also fixes a leak in the codecs code and a bug in the free list code for Unicode objects (which only shows up when compiling Python with Py_DEBUG; thanks to MarkH for spotting this one).	2000-03-20 16:36:48 +00:00
Guido van Rossum	ee70ad1e52	Checking in the new, improve file.writelines() code. This (1) avoids thread unsafety whereby another thread could zap the list while we were using it, and (2) now supports writing arbitrary sequences of strings.	2000-03-13 16:27:06 +00:00
Guido van Rossum	d724b23420	Christian Tismer's "trashcan" patch: Added wrapping macros to dictobject.c, listobject.c, tupleobject.c, frameobject.c, traceback.c that safely prevends core dumps on stack overflow. Macros and functions in object.c, object.h. The method is an "elevator destructor" that turns cascading deletes into tail recursive behavior when some limit is hit.	2000-03-13 16:01:29 +00:00
Guido van Rossum	96a45adf80	Fix typo in replace() detected by Mark Hammond and fixed by Marc-Andre.	2000-03-13 15:56:08 +00:00
Guido van Rossum	403d68b484	Add sq_contains implementation.	2000-03-13 15:55:09 +00:00
Guido van Rossum	ef93b87f1c	Added Christian Tismer's patch to allow list.append(a,b,c) back -- with a twist: you have to define NO_STRICT_LIST_APPEND manually to enable multi-arg append().	2000-03-13 15:41:59 +00:00
Guido van Rossum	4aa1e63e4c	Marc-AAndre Lemburg: add new unicode files	2000-03-10 22:55:40 +00:00
Guido van Rossum	4c08d554b9	Many changes for Unicode, by Marc-Andre Lemburg.	2000-03-10 22:55:18 +00:00
Guido van Rossum	d57fd91488	Unicode implementation by Marc-Andre Lemburg based on original code by Fredrik Lundh.	2000-03-10 22:53:23 +00:00
Guido van Rossum	603484d759	Unicode character type helpers, written by Marc-Andre Lemburg.	2000-03-10 22:52:46 +00:00
Guido van Rossum	9611e0b462	Patch by Moshe Zadka: remove the string special case in PySequence_Contains() now that string objects have this code in their tp_contains.	2000-03-07 15:54:45 +00:00
Guido van Rossum	9284a572bc	Patch by Moshe Zadka: move the string special case from abstract.c here. [Patch modified by GvR to keep the original exception.]	2000-03-07 15:53:43 +00:00
Barry Warsaw	bf32583084	string_join(): Fix memory leaks discovered by Charles Waldman (and a few other paths through the function that leaked).	2000-03-06 14:52:18 +00:00
Guido van Rossum	43713e5a28	Massive patch by Skip Montanaro to add ":name" to as many PyArg_ParseTuple() format string arguments as possible.	2000-02-29 13:59:29 +00:00
Guido van Rossum	ee28c3a5ea	Patch by Mozhe Zadka, for __contains__ (overloading 'in'). This adds an instance method instance_contains as sq_contains. It looks for __contains__ and if not found falls back to previous behaviour. Done.	2000-02-28 15:03:15 +00:00
Guido van Rossum	46c6b20392	Patch by Mozhe Zadka, for __contains__ (overloading 'in'). This patches PySequence_Contains() to check for a valid sq_contains field. More to follow.	2000-02-28 15:01:46 +00:00
Guido van Rossum	c00a938be8	OKOK, Greg's right, I should've used the :name option in the argument format strings.	2000-02-24 21:48:29 +00:00
Guido van Rossum	4aa24f9979	Made all list methods use PyArg_ParseTuple(), for more accurate diagnostics. * INCOMPATIBLE CHANGE: This changes append(), remove(), index(), and * count() to require exactly one argument -- previously, multiple *** arguments were silently assumed to be a tuple.	2000-02-24 15:23:03 +00:00
Andrew M. Kuchling	0f223d2418	Allow using long integers as arguments to PyObject_GetItem(), _SetItem(), and _DelItem(). In sequence multiplication by a long, only call PyErr_Occurred() when the value returned is -1.	2000-02-23 22:21:50 +00:00
Fred Drake	52fccfda5b	dict_has_key(): Accept only one parameter. PR#210 reported by Andreas Jung <ajung@sz-sb.de>.	2000-02-23 15:47:16 +00:00
Guido van Rossum	fb4574e320	In response to one particular complaint on edu-sig, change some error messages from "OverflowError: integer pow()" to "OverflowError: integer exponentiation". (Not that this takes care of the complaint in general that the error messages could be greatly improved. :-)	2000-02-15 14:51:46 +00:00
Andrew M. Kuchling	1991ddc3e1	Make multiplying a sequence by a long integer (5L * 'b') legal	2000-02-14 22:22:04 +00:00
Guido van Rossum	bffd683f73	The rest of the changes by Trent Mick and Dale Nagata for warning-free compilation on NT Alpha. Mostly added casts etc.	2000-01-20 22:32:56 +00:00
Guido van Rossum	687ef6e70b	On Linux, one sometimes sees spurious errors after interrupting previous output. Call clearerr() to prevent past errors affecting our ferror() test later, in PyObject_Print(). Suggested by Marc Lemburg.	2000-01-12 16:28:58 +00:00
Guido van Rossum	57072eb79f	Implement the other easy thing: repr() of a float now uses %.17g, while str() uses %.12g as before.	1999-12-23 19:00:28 +00:00
Fred Drake	121ee2722e	long_format(): Now takes a third parameter, addL; iff true, a trailing 'L' is appended to the representation, otherwise not. All existing call sites are modified to pass true for addL. Remove incorrect statement about external use of this function from elsewhere; it's static! long_str(): Handler for the tp_str slot in the type object. Identical to long_repr(), but passes false as the addL parameter of long_format().	1999-12-23 15:41:28 +00:00
Barry Warsaw	153a27ceb2	do_strip(): Fixed cut-and-paste error; this function should check for zero arguments (found by Marc Lemburg).	1999-12-15 02:22:52 +00:00
Barry Warsaw	226ae6ca12	Mainlining the string_methods branch. See branch revision log messages for specific changes.	1999-10-12 19:54:53 +00:00
Guido van Rossum	2c7b8fe618	Fix PR#66. Solution: add error checking around l_divmod() calls in long_pow().	1999-10-11 22:34:41 +00:00
Guido van Rossum	42636dc64d	Fix for PR#98 (Adrian Eyre) -- in instancemethod_repr, the funcname object is DECREFed too early.	1999-10-11 14:03:12 +00:00
Guido van Rossum	e13ff2e2d6	Patch by Tim Peters fixing PR#88: Integer division can crash under Windows.	1999-09-27 17:12:47 +00:00

1 2 3 4 5 ...

585 Commits