cpython

Commit Graph

Author	SHA1	Message	Date
Ka-Ping Yee	fa004ad36c	Show '\011', '\012', and '\015' as '\t', '\n', '\r' in strings. Switch from octal escapes to hex escapes for other nonprintable characters.	2001-01-24 17:19:08 +00:00
Tim Peters	19fe14e76a	Derivative of patch #102549 , "simpler, faster(!) implementation of string.join". Also fixes two long-standing bugs (present in 2.0): 1. .join() didn't check that the result size fit in an int. 2. string.join(s) when len(s)==1 returned s[0] regardless of s[0]'s type; e.g., "".join([3]) returned 3 (overly optimistic optimization). I resisted a keen temptation to make .join() apply str() automagically.	2001-01-19 03:03:47 +00:00
Marc-André Lemburg	3a645e4dd4	Added checks to prevent PyUnicode_Count() from dumping core in case the parameters are out of bounds and fixes error handling for .count(), .startswith() and .endswith() for the case of mixed string/Unicode objects. This patch adds Python style index semantics to PyUnicode_Count() indices (including the special handling of negative indices). The patch is an extended version of patch #103249 submitted by Michael Hudson (mwh) on SF. It also includes new test cases.	2001-01-16 11:54:12 +00:00
Andrew M. Kuchling	6ca8917758	[ Patch #102852 ] Make % error a bit more informative by indicates the index at which an unknown %-escape was found	2000-12-15 13:07:46 +00:00
Fred Drake	49312a52ec	Jeffrey D. Collins <tokeneater@users.sourceforge.net>: Fix type of the self parameter to some string object methods. This closes patch #102670.	2000-12-06 14:27:49 +00:00
Tim Peters	a3a3a030af	Fox for SF bug #123859 : %[duxXo] long formats inconsistent.	2000-11-30 05:22:44 +00:00
Guido van Rossum	2ccda8a7c4	SF patch #102548 , fix for bug #121013 , by mwh@users.sourceforge.net. Fixes a typo that caused "".join(u"this is a test") to dump core.	2000-11-27 18:46:26 +00:00
Fred Drake	661ea26b3d	Ka-Ping Yee <ping@lfw.org>: Changes to error messages to increase consistency & clarity. This (mostly) closes SourceForge patch #101839.	2000-10-24 19:57:45 +00:00
Marc-André Lemburg	53f3d4ac74	[ Bug #116174 ] using %% in cstrings sometimes fails with unicode paramsFix for the bug reported in Bug #116174 : "%% %s" % u"abc" failed due to the way string formatting delegated work to the Unicode formatting function.	2000-10-07 08:54:09 +00:00
Fred Drake	d5fadf75e4	Rationalize use of limits.h, moving the inclusion to Python.h. Add definitions of INT_MAX and LONG_MAX to pyport.h. Remove includes of limits.h and conditional definitions of INT_MAX and LONG_MAX elsewhere. This closes SourceForge patch #101659 and bug #115323.	2000-09-26 05:46:01 +00:00
Tim Peters	38fd5b6413	Derived from Martin's SF patch 110609: support unbounded ints in %d,i,u,x,X,o formats. Note a curious extension to the std C rules: x, X and o formatting can never produce a sign character in C, so the '+' and ' ' flags are meaningless for them. But unbounded ints can produce a sign character under these conversions (no fixed- width bitstring is wide enough to hold all negative values in 2's-comp form). So these flags become meaningful in Python when formatting a Python long which is too big to fit in a C long. This required shuffling around existing code, which hacked x and X conversions to death when both the '#' and '0' flags were specified: the hacks weren't strong enough to deal with the simultaneous possibility of the ' ' or '+' flags too, since signs were always meaningless before for x and X conversions. Isomorphic shuffling was required in unicodeobject.c. Also added dozens of non-trivial new unbounded-int test cases to test_format.py.	2000-09-21 05:43:11 +00:00
Marc-André Lemburg	d1ba443206	This patch adds a new Python C API called PyString_AsStringAndSize() which implements the automatic conversion from Unicode to a string object using the default encoding. The new API is then put to use to have eval() and exec accept Unicode objects as code parameter. This closes bugs #110924 and #113890. As side-effect, the traditional C APIs PyString_Size() and PyString_AsString() will also accept Unicode objects as parameters.	2000-09-19 21:04:18 +00:00
Tim Peters	8f422461b4	Fix for bug 113934. stringn and unicoden did no overflow checking at all, either to see whether the # of chars fit in an int, or that the amount of memory needed fit in a size_t. Checking these is expensive, but the alternative is silently wrong answers (as in the bug report) or core dumps (which were easy to provoke using Unicode strings).	2000-09-09 06:13:41 +00:00
Guido van Rossum	8586991099	REMOVED all CWI, CNRI and BeOpen copyright markings. This should match the situation in the 1.6b1 tree.	2000-09-01 23:29:29 +00:00
Barry Warsaw	4df762ff98	Insure properly identifies the `interned' dictionary as leaking at shutdown time, but CVS log entry for revision 2.45 explains why this is so. Simply include a comment so we don't have to re-figure it out again 5 years from now.	2000-08-16 23:41:01 +00:00
Peter Schneider-Kamp	7e01890986	merge Include/my.h into Include/pyport.h marked my.h as obsolete	2000-07-31 15:28:04 +00:00
Thomas Wouters	7e47402264	Spelling fixes supplied by Rob W. W. Hooft. All these are fixes in either comments, docstrings or error messages. I fixed two minor things in test_winreg.py ("didn't" -> "Didn't" and "Didnt" -> "Didn't"). There is a minor style issue involved: Guido seems to have preferred English grammar (behaviour, honour) in a couple places. This patch changes that to American, which is the more prominent style in the source. I prefer English myself, so if English is preferred, I'd be happy to supply a patch myself ;)	2000-07-16 12:04:32 +00:00
Jeremy Hylton	03657cfdb0	replace PyXXX_Length calls with PyXXX_Size calls	2000-07-12 13:05:33 +00:00
Andrew M. Kuchling	bd9848d02f	Fix typo in error message	2000-07-12 02:58:28 +00:00
Jeremy Hylton	88887aa38e	small updates to string_join: use PyString_AS_STRING macro on local string object when resizing string, make sure resized string will always be big enough split string containing error message across two lines add test to string_tests that causes resizing	2000-07-11 20:55:38 +00:00
Barry Warsaw	771d0675b6	string_join(): Some cleaning up of reference counting. In the seqlen==1 clause, before returning item, we need to DECREF seq. In the res=PyString... failure clause, we need to goto finally to also decref seq (and the DECREF of res in finally is changed to a XDECREF). Also, we need to DECREF seq just before the PyUnicode_Join() return.	2000-07-11 04:58:12 +00:00
Jeremy Hylton	4904829dbf	fix two refcount bugs in new string_join implementation: 1. PySequence_Fast_GET_ITEM is a macro and borrows a reference 2. The seq returned from PySequence_Fast must be decref'd	2000-07-11 03:28:17 +00:00
Jeremy Hylton	194e43e953	two changes to string_join: implementation -- use PySequence_Fast interface to iterate over elements interface -- if instance object reports wrong length, ignore it; previous version raised an IndexError if reported length was too high	2000-07-10 21:30:28 +00:00
Tim Peters	c2e7da9859	Somebody started playing with const, so of course the outcome was cascades of warnings about mismatching const decls. Overall, I think const creates lots of headaches and solves almost nothing. Added enough consts to shut up the warnings, but this did require casting away const in one spot too (another usual outcome of starting down this path): the function mymemreplace can't return const char, but sometimes wants to return its first argument as-is, which latter must be declared const char in order to avoid const warnings at mymemreplace's call sites. So, in the case the function wants to return the first arg, that arg's declared constness must be subverted.	2000-07-09 08:02:21 +00:00
Fred Drake	ba09633e1e	ANSI-fication of the sources.	2000-07-09 07:04:36 +00:00
Marc-André Lemburg	63f3d17418	Added new codec APIs and a new interface method .encode() which works just like the Unicode one. The C APIs match the ones in the Unicode implementation, but were extended to be able to reuse the existing Unicode codecs for string purposes too. Conversions from string to Unicode and back are done using the default encoding.	2000-07-06 11:29:01 +00:00
Marc-André Lemburg	4027f8f4b3	Added new .isalpha() and .isalnum() methods to match the same ones on the Unicode objects. Note that the string versions use the (locale aware) C lib APIs isalpha() and isalnum().	2000-07-05 09:47:46 +00:00
Guido van Rossum	ffcc3813d8	Change copyright notice - 2nd try.	2000-06-30 23:58:06 +00:00
Guido van Rossum	fd71b9e9d4	Change copyright notice.	2000-06-30 23:50:40 +00:00
Marc-André Lemburg	f28dd83b86	Marc-Andre Lemburg <mal@lemburg.com>: New buffer overflow checks for formatting strings. By Trent Mick.	2000-06-30 10:29:57 +00:00
Fred Drake	396f6e0d6a	Fredrik Lundh <effbot@telia.com>: Simplify find code; this is a performance improvement on at least some platforms.	2000-06-20 15:47:54 +00:00
Marc-André Lemburg	60bc809d9a	Marc-Andre Lemburg <mal@lemburg.com>: Added code so that .isXXX() testing returns 0 for emtpy strings.	2000-06-14 09:18:32 +00:00
Andrew M. Kuchling	cb95a1470a	Patch from Michael Hudson: improve unclear error message	2000-06-09 14:04:53 +00:00
Fred Drake	b6a9ada757	Michael Hudson <mwh21@cam.ac.uk>: Removed PyErr_BadArgument() calls and replaced them with more useful error messages.	2000-06-01 03:12:13 +00:00
Guido van Rossum	c682140de7	Trent Mick: Fix the string methods that implement slice-like semantics with optional args (count, find, endswith, etc.) to properly handle indeces outside [INT_MIN, INT_MAX]. Previously the "i" formatter for PyArg_ParseTuple was used to get the indices. These could overflow. This patch changes the string methods to use the "O&" formatter with the slice_index() function from ceval.c which is used to do the same job for Python code slices (e.g. 'abcabcabc'[0:1000000000L]). slice_index() is renamed _PyEval_SliceIndex() and is now exported. As well, the return values for success/fail were changed to make slice_index directly usable as required by the "O&" formatter. [GvR: shouldn't a similar patch be applied to unicodeobject.c?]	2000-05-08 14:08:05 +00:00
Guido van Rossum	b8f820c5a9	The methods islower(), isupper(), isspace(), isdigit() and istitle() gave bogus results for chars in the range 128-255, because their implementation was using signed characters. Fixed this by using unsigned character pointers (as opposed to using Py_CHARMASK()).	2000-05-05 20:44:24 +00:00
Guido van Rossum	b18618dab7	Vladimir Marangozov's long-awaited malloc restructuring. For more comments, read the patches@python.org archives. For documentation read the comments in mymalloc.h and objimpl.h. (This is not exactly what Vladimir posted to the patches list; I've made a few changes, and Vladimir sent me a fix in private email for a problem that only occurs in debug mode. I'm also holding back on his change to main.c, which seems unnecessary to me.)	2000-05-03 23:44:39 +00:00
Guido van Rossum	f0b7b04ae8	Marc-Andre Lemburg: The maxsplit functionality in .splitlines() was replaced by the keepends functionality which allows keeping the line end markers together with the string. Added support for '%r' % obj: this inserts repr(obj) rather than str(obj).	2000-04-11 15:39:26 +00:00
Guido van Rossum	90daa87569	Marc-Andre Lemburg: * string_contains now calls PyUnicode_Contains() only when the other operand is a Unicode string (not whenever it's not a string). * New format style '%r' inserts repr(arg) instead of str(arg). * '...%s...' % u"abc" now coerces to Unicode just like string methods. Care is taken not to reevaluate already formatted arguments -- only the first Unicode object appearing in the argument mapping is looked up twice. Added test cases for this to test_unicode.py.	2000-04-10 13:47:21 +00:00
Barry Warsaw	51ac58039f	On 17-Mar-2000, Marc-Andre Lemburg said: Attached you find an update of the Unicode implementation. The patch is against the current CVS version. I would appreciate if someone with CVS checkin permissions could check the changes in. The patch contains all bugs and patches sent this week and also fixes a leak in the codecs code and a bug in the free list code for Unicode objects (which only shows up when compiling Python with Py_DEBUG; thanks to MarkH for spotting this one).	2000-03-20 16:36:48 +00:00
Guido van Rossum	96a45adf80	Fix typo in replace() detected by Mark Hammond and fixed by Marc-Andre.	2000-03-13 15:56:08 +00:00
Guido van Rossum	4c08d554b9	Many changes for Unicode, by Marc-Andre Lemburg.	2000-03-10 22:55:18 +00:00
Guido van Rossum	9284a572bc	Patch by Moshe Zadka: move the string special case from abstract.c here. [Patch modified by GvR to keep the original exception.]	2000-03-07 15:53:43 +00:00
Barry Warsaw	bf32583084	string_join(): Fix memory leaks discovered by Charles Waldman (and a few other paths through the function that leaked).	2000-03-06 14:52:18 +00:00
Guido van Rossum	43713e5a28	Massive patch by Skip Montanaro to add ":name" to as many PyArg_ParseTuple() format string arguments as possible.	2000-02-29 13:59:29 +00:00
Guido van Rossum	bffd683f73	The rest of the changes by Trent Mick and Dale Nagata for warning-free compilation on NT Alpha. Mostly added casts etc.	2000-01-20 22:32:56 +00:00
Barry Warsaw	153a27ceb2	do_strip(): Fixed cut-and-paste error; this function should check for zero arguments (found by Marc Lemburg).	1999-12-15 02:22:52 +00:00
Barry Warsaw	226ae6ca12	Mainlining the string_methods branch. See branch revision log messages for specific changes.	1999-10-12 19:54:53 +00:00
Guido van Rossum	98c9eba945	Fix bug discovered by John W. Shipman -- when the width of a format specifier came from an int expression instead of a constant in the format, a negative width was truncated to zero instead of taken to mean the same as that negative constant plugged into the format. E.g. "(%*s)" % (-5, "foo") yielded "(foo)" while "(%-5s)" yields "(foo )". Now both yield the latter -- like sprintf() in C.	1999-06-07 15:12:32 +00:00
Guido van Rossum	1db7070217	Greg Stein: Implement the new bf_getcharbuffer function, indicating that (as far as the data type is concerned!) this is character data.	1998-10-08 02:18:52 +00:00
Guido van Rossum	07d780089d	Typo reported by Greg Stein: "modifiable" is the correct spelling.	1998-10-01 15:59:48 +00:00
Guido van Rossum	4a0144c0de	Should check that PyObject_Str() really returned a string!	1998-06-09 15:08:41 +00:00
Guido van Rossum	1109fbca76	Make new gcc -Wall happy	1998-04-10 22:16:39 +00:00
Guido van Rossum	045e688f6f	Patch submitted by Brad Howes (with one bug fixed by me): allow arbitrary nested parens in a %(...)X style format. #Also folded two lines and added more detail to the error message for #unsupported format character.	1997-09-08 18:30:11 +00:00
Guido van Rossum	971a7aaeac	Change the Fini function to only remove otherwise unreferenced strings from the interned table. There are references in hard-to-find static variables all over the interpreter, and it's not worth trying to get rid of all those; but "uninterning" isn't fair either and may cause subtle failures later -- so we have to keep them in the interned table. Also get rid of no-longer-needed insert of None in interned dict.	1997-08-05 02:15:12 +00:00
Guido van Rossum	8cf0476474	Added internal routine PyString_Fini() which deletes all interned strings. For use in Py_Finalize() only.	1997-08-02 02:57:45 +00:00
Guido van Rossum	71160aaffe	Use #include "mymath.h" instead of declaring fabs() explicitly. This should solve a weird problem on the Mac for Jack.	1997-06-03 18:03:18 +00:00
Guido van Rossum	fdf95dd525	Checkin of Jack's buffer mods. Not really checked, but didn't fail any tests either...	1997-05-05 22:15:02 +00:00
Guido van Rossum	c0b618a2cc	Quickly renamed the last directory.	1997-05-02 03:12:38 +00:00
Guido van Rossum	2095d24842	Tweaks to keep the Microsoft compiler quiet.	1997-04-09 19:41:24 +00:00
Guido van Rossum	36b9f7908a	Slight tweak: in string_hash(), if the hash hasn't been computed yet, and if there's a pointer to an interned version of the string, use its hash and store its hash in this object, rather than recomputing it.	1997-02-14 16:29:22 +00:00
Guido van Rossum	4acdc2327f	Fix bug reported by Per Lindqvist: "%#06x" % 1 stuck the 0 padding in front of the 0x, like such: "0000x1".	1997-01-29 06:00:24 +00:00
Guido van Rossum	a04d47b319	Don't use static buffers internally for formatstring().	1997-01-21 16:12:09 +00:00
Guido van Rossum	2a61e7428d	String interning.	1997-01-18 07:55:05 +00:00
Guido van Rossum	067998f35e	Add const to error and newstring functions	1996-12-10 15:33:34 +00:00
Guido van Rossum	da9c2710c7	Make gcc -Wall happy	1996-12-05 21:58:58 +00:00
Guido van Rossum	d266eb460e	New permission notice, includes CNRI.	1996-10-25 14:44:06 +00:00
Guido van Rossum	fde7a75b78	Fixed compare function to do first char comparison in unsigned mode, for consistency with the way other characters are compared.	1996-10-23 14:19:40 +00:00
Guido van Rossum	eddcb3bae1	Multiply by 1000003 instead of 3 in string hach	1996-09-11 20:22:48 +00:00
Guido van Rossum	441e4ab802	new debugger symbol names	1996-05-23 22:46:51 +00:00
Guido van Rossum	f97632639e	Plug memory leak in the previous fix :-(	1996-05-21 23:44:17 +00:00
Guido van Rossum	993952bfb2	Fix obscure bug in string%mapping where the mapping creates its items on the fly -- there was an unsafe DECREF. Actually save some lines of code by using abstract.c:PyObject_GetItem().	1996-05-21 22:44:20 +00:00
Guido van Rossum	6f9e433ab3	fix dusty debugging macros	1995-03-29 16:57:48 +00:00
Guido van Rossum	5fe605889a	a few peephole optimizations	1995-03-09 12:12:50 +00:00
Guido van Rossum	caeaafccf7	don't complain about too many args if arg is a dict	1995-02-27 10:13:23 +00:00
Guido van Rossum	9fa2c11613	use Py_CHARMASK; and don't check for neg. float to the float power here	1995-02-10 17:00:37 +00:00
Guido van Rossum	6610ad9d6b	Added 1995 to copyright message. floatobject.c: fix hash(). methodobject.c: support METH_FREENAME flag bit.	1995-01-04 19:07:38 +00:00
Guido van Rossum	d7047b395e	Lots of minor changes. Note for mappingobject.c: the hash table pointer can now be NULL.	1995-01-02 19:07:15 +00:00
Guido van Rossum	03093a248d	* Include/classobject.h, Objects/classobject.c, Python/ceval.c: entirely redone operator overloading. The rules for class instances are now much more relaxed than for other built-in types (whose coerce must still return two objects of the same type) * Objects/floatobject.c: add overflow check when converting float to int and implement truncation towards zero using ceil/float * Objects/longobject.c: change ValueError to OverflowError when converting to int * Objects/rangeobject.c: modernized * Objects/stringobject.c: use HAVE_LIMITS instead of __STDC__ * Objects/xxobject.c: changed to use new style (not finished?)	1994-09-28 15:51:32 +00:00
Guido van Rossum	013142a95f	fix nasty bug in resizing (formatstring)	1994-08-30 08:19:36 +00:00
Guido van Rossum	71e57d090d	Fix the fix :-(	1993-11-11 15:03:51 +00:00
Guido van Rossum	6938a297da	Three micro fixes to formatstring	1993-11-11 14:51:57 +00:00
Sjoerd Mullender	615194a352	Fixed bugs in resizetuple and extended the interface. Added ifdefs in stringobject.c for shared strings of length 1. Renamed free_list in tupleobject.c to free_tuples.	1993-11-01 13:46:50 +00:00
Guido van Rossum	444fc7c90c	Add some necessary casts; use double quotes to represent strings in some cases.	1993-10-26 15:25:16 +00:00
Sjoerd Mullender	3bb8a05947	Several optimizations and speed improvements. cstubs: Use Matrix type instead of float[4][4].	1993-10-22 12:04:32 +00:00
Sjoerd Mullender	a9c3c22c33	* Extended X interface: pixmap objects, colormap objects visual objects, image objects, and lots of new methods. * Added counting of allocations and deallocations of builtin types if COUNT_ALLOCS is defined. Had to move calls to NEWREF down in some files. * Bug fix in sorting lists.	1993-10-11 12:54:31 +00:00
Guido van Rossum	234f942aef	* Added gmtime/localtime/mktime and SYSV timezone globals to timemodule.c. Added $(SYSDEF) to its build rule in Makefile. * cgensupport.[ch], modsupport.[ch]: removed some old stuff. Also changed files that still used it... And made several things static that weren't but should have been... And other minor cleanups... * listobject.[ch]: add external interfaces {set,get}listslice * socketmodule.c: fix bugs in new send() argument parsing. * sunaudiodevmodule.c: added flush() and close().	1993-06-17 12:35:49 +00:00
Guido van Rossum	6ac258d381	* pythonrun.c: Print exception type+arg after stack trace instead of before it. * ceval.c, object.c: moved testbool() to object.c (now extern visible) * stringobject.c: fix bugs in and rationalize string resize in formatstring() * tokenizer.[ch]: fix non-working code for lines longer than BUFSIZ	1993-05-12 08:24:20 +00:00
Guido van Rossum	9bfef44d97	* Changed all copyright messages to include 1993. * Stubs for faster implementation of local variables (not yet finished) * Added function name to code object. Print it for code and function objects. THIS MAKES THE .PYC FILE FORMAT INCOMPATIBLE (the version number has changed accordingly) * Print address of self for built-in methods * New internal functions getattro and setattro (getattr/setattr with string object arg) * Replaced "dictobject" with more powerful "mappingobject" * New per-type functio tp_hash to implement arbitrary object hashing, and hashobject() to interface to it * Added built-in functions hash(v) and hasattr(v, 'name') * classobject: made some functions static that accidentally weren't; added __hash__ special instance method to implement hash() * Added proper comparison for built-in methods and functions	1993-03-29 10:43:31 +00:00
Guido van Rossum	e537240c25	* Changed many files to use mkvalue() instead of newtupleobject(). * Fixcprt.py: added [-y file] option, do only files younger than file. * modsupport.[ch]: added vmkvalue(). * intobject.c: use mkvalue(). * stringobject.c: added "formatstring"; renamed string* to string_; ceval.c: call formatstring for string % value. longobject.c: close memory leak in divmod. * parsetok.c: set result node to NULL when returning an error.	1993-03-16 12:15:04 +00:00
Guido van Rossum	67daef567f	Remove bogus type-and-refcnt setting from newsizedstringobject().	1992-09-03 20:44:02 +00:00
Guido van Rossum	bab9d03855	Copyright for 1992 added	1992-04-05 14:26:55 +00:00
Guido van Rossum	719f5fa86a	lint fix	1992-03-27 17:31:02 +00:00
Guido van Rossum	bcaa31c411	printobject now returns an error code Remove superfluous err_nomem() call	1991-06-07 22:58:57 +00:00
Guido van Rossum	f380e66c0f	Fix comments in string_as_sequence	1991-06-04 19:36:32 +00:00
Guido van Rossum	daa8bb334d	Optimized single-character strings gotten from s[i].	1991-04-04 10:48:33 +00:00
Guido van Rossum	b6a6bdc7db	Optimized stringitem.	1991-03-06 13:15:02 +00:00
Guido van Rossum	f70e43a073	Added copyright notice.	1991-02-19 12:39:46 +00:00
Guido van Rossum	253919f3b7	Fix stringcompare when strings contain null bytes.	1991-02-13 23:18:39 +00:00
Guido van Rossum	3f5da24ea3	"Compiling" version	1990-12-20 15:06:42 +00:00
Guido van Rossum	392ab32859	Fix wrong #ifdef.	1990-11-18 17:41:19 +00:00
Guido van Rossum	921842f2c2	Fixed resizestring() to work if reference tracing is turned on. The realloc() call would move the list head without fixing the pointers to in the the chain of allocated objects...	1990-11-18 17:30:23 +00:00
Guido van Rossum	2a9096b5f9	New errors.	1990-10-21 22:15:08 +00:00
Guido van Rossum	85a5fbbdfe	Initial revision	1990-10-14 12:07:46 +00:00

... 3 4 5 6 7

304 Commits