cpython

Commit Graph

Author	SHA1	Message	Date
Victor Stinner	170ca6f84b	Fix bug in Unicode decoders related to _PyUnicodeWriter Bug introduced by changesets 7ed9993d53b4 and edf029fc9591.	2013-04-18 00:25:28 +02:00
Victor Stinner	376cfa122d	Fix typo in unicode_decode_call_errorhandler_writer() Bug introduced by changeset 7ed9993d53b4.	2013-04-17 23:58:16 +02:00
Victor Stinner	8f674ccd64	Close #17694 : Add minimum length to _PyUnicodeWriter * Add also min_char attribute to _PyUnicodeWriter structure (currently unused) * _PyUnicodeWriter_Init() has no more argument (except the writer itself): min_length and overallocate must be set explicitly * In error handlers, only enable overallocation if the replacement string is longer than 1 character * CJK decoders don't use overallocation anymore * Set min_length, instead of preallocating memory using _PyUnicodeWriter_Prepare(), in many decoders * _PyUnicode_DecodeUnicodeInternal() checks for integer overflow	2013-04-17 23:02:17 +02:00
Victor Stinner	77282cb4f8	Cleanup PyUnicode_Contains() * No need to double-check that strings are ready: test already done by PyUnicode_FromObject() * Remove useless kind variable (use kind1 instead)	2013-04-14 19:22:47 +02:00
Victor Stinner	d92e078c8d	Minor change: fix character in do_strip() for the ASCII case	2013-04-14 19:17:42 +02:00
Victor Stinner	f033510fee	Cleanup PyUnicode_Append() * Check also that right is a Unicode object * call directly resize_compact() instead of unicode_resize() for a more explicit error handling, and to avoid testing some properties twice (ex: unicode_modifiable())	2013-04-14 19:13:03 +02:00
Victor Stinner	4560f9c63f	PyUnicode_Join(): move use_memcpy test out of the loop to cleanup and optimize the code	2013-04-14 18:56:46 +02:00
Victor Stinner	55c08781e8	Optimize repr(str): use _PyUnicode_FastCopyCharacters() when no character is escaped	2013-04-14 18:45:39 +02:00
Victor Stinner	af03757d20	Optimize ascii(str): don't encode/decode repr if repr is already ASCII	2013-04-14 18:44:10 +02:00
Victor Stinner	76b3b2726c	stringlib: remove unused STRINGLIB_RESIZE macro	2013-04-14 16:29:09 +02:00
Victor Stinner	8a1a6cffd6	Add _PyUnicodeWriter_WriteCharInline()	2013-04-14 02:35:33 +02:00
Serhiy Storchaka	e2cef885a2	Issue #16061 : Speed up str.replace() for replacing 1-character strings.	2013-04-13 22:45:04 +03:00
Mark Dickinson	93196eb44f	Issue #17715 : Merge fix from 3.3.	2013-04-13 17:46:04 +01:00
Mark Dickinson	c9734484ca	Issue #17715 : Add missing NULL Check to PyNumber_Long.	2013-04-13 17:44:44 +01:00
Mark Dickinson	556e94b8fe	Issue #17643 : Add __callback__ attribute to weakref.ref.	2013-04-13 15:45:44 +01:00
Mark Dickinson	548677bb8c	Issue #16447 : Merge fix from 3.3.	2013-04-13 15:30:16 +01:00
Mark Dickinson	64aafeb4de	Issue #16447 : Fix potential segfault when setting __name__ on a class.	2013-04-13 15:26:58 +01:00
Victor Stinner	a0dd0213cc	Close #17693 : Rewrite CJK decoders to use the _PyUnicodeWriter API instead of the legacy Py_UNICODE API. Add also a new _PyUnicodeWriter_WriteChar() function.	2013-04-11 22:09:04 +02:00
Antoine Pitrou	dc040f099d	Fix supernumerary 's' in sys._debugmallocstats() output.	2013-04-11 21:02:20 +02:00
Antoine Pitrou	36b045f4db	Fix supernumerary 's' in sys._debugmallocstats() output.	2013-04-11 21:01:40 +02:00
Benjamin Peterson	34ad84d80a	merge 3.3 (#17669 )	2013-04-10 17:01:38 -04:00
Benjamin Peterson	c9314d9e08	don't run frame if it has no stack (closes #17669 )	2013-04-10 17:00:56 -04:00
Victor Stinner	247109e74d	Issue #17615 : On Windows (VS2010), Performances of wmemcmp() to compare Unicode strings are not convincing. For UCS2 (16-bit wchar_t type), use a dummy loop instead of wmemcmp(). The dummy loop is as fast, or a little bit faster. wchar_t is only 16-bit long on Windows. wmemcmp() is still used for 32-bit wchar_t.	2013-04-09 23:53:26 +02:00
Victor Stinner	0cff4b16d9	replace(): only call PyUnicode_DATA(u) once	2013-04-09 22:52:48 +02:00
Victor Stinner	cc7af72192	Write super-fast version of str.strip(), str.lstrip() and str.rstrip() for pure ASCII	2013-04-09 22:39:24 +02:00
Victor Stinner	f50a4e9bc9	Don't calls macros in PyUnicode_WRITE() parameters PyUnicode_WRITE() expands some parameters twice or more.	2013-04-09 22:38:52 +02:00
Victor Stinner	9c79e41fc5	Fix do_strip(): don't call PyUnicode_READ() in Py_UNICODE_ISSPACE() to not call it twice	2013-04-09 22:21:08 +02:00
Victor Stinner	b3a6014504	Fix _PyUnicode_XStrip() Inline the BLOOM_MEMBER() to only call PyUnicode_READ() only once (per loop iteration). Store also the length of the seperator in a variable to avoid calls to PyUnicode_GET_LENGTH().	2013-04-09 22:19:21 +02:00
Victor Stinner	63d5c1a14a	Optimize PyUnicode_DecodeCharmap() Avoid expensive PyUnicode_READ() and PyUnicode_WRITE(), manipulate pointers instead.	2013-04-09 22:13:33 +02:00
Victor Stinner	a85af502a4	Optimize make_bloom_mask(), used by str.strip(), str.lstrip() and str.rstrip() Write specialized functions per Unicode kind to avoid the expensive PyUnicode_READ() macro.	2013-04-09 21:53:54 +02:00
Victor Stinner	69ed0f4c86	Use PyUnicode_READ() instead of PyUnicode_READ_CHAR() "PyUnicode_READ_CHAR() is less efficient than PyUnicode_READ() because it calls PyUnicode_KIND() and might call it twice." according to its documentation.	2013-04-09 21:48:24 +02:00
Victor Stinner	03c3e35d42	Add fast-path in PyUnicode_DecodeCharmap() for pure 8 bit encodings: cp037, cp500 and iso8859_1 codecs	2013-04-09 21:53:09 +02:00
Victor Stinner	cd777eaf53	Issue #17615 : Comparing two Unicode strings now uses wmemcmp() when possible wmemcmp() is twice faster than a dummy loop (342 usec vs 744 usec) on Fedora 18/x86_64, GCC 4.7.2.	2013-04-08 22:43:44 +02:00
Victor Stinner	c1302bba4c	Issue #17615 : Expand expensive PyUnicode_READ() macro in unicode_compare(): write specialized functions for each combination of Unicode kinds.	2013-04-08 21:50:54 +02:00
Victor Stinner	7efa3b8242	Close #13126 : "Simplify" FASTSEARCH() code to help the compiler to emit more efficient machine code. Patch written by Antoine Pitrou. Without this change, str.find() was 10% slower than str.rfind() in the worst case.	2013-04-08 00:26:43 +02:00
Serhiy Storchaka	ee57f159af	Revert a premature patch for issue #14010 (changeset 846bd418aee5).	2013-04-06 22:55:12 +03:00
Serhiy Storchaka	278d03bd66	Revert a premature patch for issue #14010 (changeset aaaf36026511).	2013-04-06 22:52:34 +03:00
Serhiy Storchaka	aac81e2780	Issue #14010 : Fix a crash when iterating or deleting deeply nested filters (builting and in itertools module, i.e. map(), itertools.chain(), etc).	2013-04-06 21:20:30 +03:00
Serhiy Storchaka	e8f706eda7	Issue #14010 : Fix a crash when iterating or deleting deeply nested filters (builting and in itertools module, i.e. map(), itertools.chain(), etc).	2013-04-06 21:14:43 +03:00
Antoine Pitrou	0aaaa62200	Issue #17469 : Fix _Py_GetAllocatedBlocks() and sys.getallocatedblocks() when running on valgrind.	2013-04-06 01:15:30 +02:00
Victor Stinner	207dd38726	fix unused variable	2013-04-03 03:14:58 +02:00
Victor Stinner	eb4b5ac8af	Close #16757 : Avoid calling the expensive _PyUnicode_FindMaxChar() function when possible	2013-04-03 02:02:33 +02:00
Victor Stinner	cfc4c13b04	Add _PyUnicodeWriter_WriteSubstring() function Write a function to enable more optimizations: * If the substring is the whole string and overallocation is disabled, just keep a reference to the string, don't copy characters * Avoid a call to the expensive _PyUnicode_FindMaxChar() function when possible	2013-04-03 01:48:39 +02:00
Benjamin Peterson	d3f41fe121	merge 3.3 (#17610 )	2013-04-01 17:43:30 -04:00
Benjamin Peterson	6395241471	list slotdefs in offset order rather than sorting them (closes #17610 ) This means we can remove our usage of qsort() than relied on undefined behavior.	2013-04-01 17:41:41 -04:00
Antoine Pitrou	7faf70512a	Issue #17591 : Use lowercase filenames when including Windows header files. Patch by Roumen Petrov.	2013-03-31 22:48:04 +02:00
Raymond Hettinger	51612fd803	merge	2013-03-23 08:21:52 -07:00
Raymond Hettinger	378170d5d9	Issue 17447: Clarify that str.isidentifier doesn't check for reserved keywords.	2013-03-23 08:21:12 -07:00
Benjamin Peterson	5589850c14	fix warning (closes #17327 )	2013-03-08 08:36:49 -05:00
Benjamin Peterson	00e9886bd9	Add PyDict_SetDefault. (closes #17327 ) Patch by Stefan Behnel and I.	2013-03-07 22:16:29 -05:00
Victor Stinner	fb84b5d48d	(Merge 3.3) _PyUnicode_Writer() now also reuses Unicode singletons: empty string and latin1 single character	2013-03-06 19:29:09 +01:00
Victor Stinner	2cb16aa3cb	_PyUnicode_Writer() now also reuses Unicode singletons: empty string and latin1 single character	2013-03-06 19:28:37 +01:00
Victor Stinner	cf77da9fb5	Backed out changeset b9f7b1bf36aa	2013-03-06 01:09:24 +01:00
Victor Stinner	313cac88c5	Issue #17223 : Fix PyUnicode_FromUnicode() on Windows (16-bit wchar_t type) to reject invalid UTF-16 surrogate.	2013-03-06 00:41:50 +01:00
Benjamin Peterson	42f382facd	merge 3.3 (#17328 )	2013-03-04 09:48:30 -05:00
Benjamin Peterson	b1efa53662	fix possible setdefault refleak (closes #17328 )	2013-03-04 09:47:50 -05:00
Victor Stinner	36025478bf	(Merge 3.3) Issue #17223 : Fix PyUnicode_FromUnicode() for string of 1 character outside the range U+0000-U+10ffff.	2013-02-26 00:16:57 +01:00
Victor Stinner	d21b58c05d	Issue #17223 : Fix PyUnicode_FromUnicode() for string of 1 character outside the range U+0000-U+10ffff.	2013-02-26 00:15:54 +01:00
Serhiy Storchaka	06b16f879f	Remove unused defines.	2013-02-23 14:49:09 +02:00
Serhiy Storchaka	18809fa94e	Remove unused defines.	2013-02-23 14:48:16 +02:00
Benjamin Peterson	abe40c2528	merge 3.3 (#17228 )	2013-02-20 16:56:06 -05:00
Benjamin Peterson	2dba1ee3e6	fix building without pymalloc (closes #17228 )	2013-02-20 16:54:30 -05:00
Stefan Krah	5e06d1d0b9	Merge 3.3.	2013-02-19 14:02:59 +01:00
Stefan Krah	674a42b114	Fix error messages.	2013-02-19 13:44:49 +01:00
R David Murray	aaf16b9cfb	Merge: #7963 : fix error message when 'object' called with arguments.	2013-02-18 21:44:03 -05:00
R David Murray	702a5dc1ed	#7963 : fix error message when 'object' called with arguments.	2013-02-18 21:39:18 -05:00
R David Murray	6b30759022	#7963 : fix error message when 'object' called with arguments. Patch by Alexander Belopolsky.	2013-02-18 21:20:08 -05:00
Eric Snow	9d05c8c0e0	Issue #15022 : Ensure all pickle protocols are supported.	2013-02-16 18:20:32 -07:00
Eric Snow	b5c8f92782	Issue #15022 : Add pickle and comparison support to types.SimpleNamespace.	2013-02-16 16:32:39 -07:00
Serhiy Storchaka	b8cbba5877	Issue #12983 : Bytes literals with invalid \x escape now raise a SyntaxError and a full traceback including line number.	2013-02-10 17:43:25 +02:00
Serhiy Storchaka	801d955f04	Issue #12983 : Bytes literals with invalid \x escape now raise a SyntaxError and a full traceback including line number.	2013-02-10 17:42:01 +02:00
Serhiy Storchaka	5e61f14c6d	Issue #12983 : Bytes literals with invalid \x escape now raise a SyntaxError and a full traceback including line number.	2013-02-10 17:36:00 +02:00
Antoine Pitrou	8ad5b07ccb	Issue #17173 : Remove uses of locale-dependent C functions (isalpha() etc.) in the interpreter. I've left a couple of them in: zlib (third-party lib), getaddrinfo.c (doesn't include Python.h, and probably obsolete), _sre.c (legitimate use for the re.LOCALE flag), mpdecimal (needs to build without Python.h).	2013-02-09 23:16:51 +01:00
Antoine Pitrou	c73c561181	Issue #17173 : Remove uses of locale-dependent C functions (isalpha() etc.) in the interpreter. I've left a couple of them in: zlib (third-party lib), getaddrinfo.c (doesn't include Python.h, and probably obsolete), _sre.c (legitimate use for the re.LOCALE flag), mpdecimal (needs to build without Python.h).	2013-02-09 23:14:42 +01:00
Antoine Pitrou	4de7457009	Issue #17173 : Remove uses of locale-dependent C functions (isalpha() etc.) in the interpreter. I've left a couple of them in: zlib (third-party lib), getaddrinfo.c (doesn't include Python.h, and probably obsolete), _sre.c (legitimate use for the re.LOCALE flag).	2013-02-09 23:11:27 +01:00
Victor Stinner	cfd2c1b4cc	(Merge 3.3) Issue #17137 : When an Unicode string is resized, the internal wide character string (wstr) format is now cleared.	2013-02-07 23:17:34 +01:00
Victor Stinner	bbbac2ec34	Issue #17137 : When an Unicode string is resized, the internal wide character string (wstr) format is now cleared.	2013-02-07 23:12:46 +01:00
Serhiy Storchaka	d0c79dcda5	Issue #17043 : The unicode-internal decoder no longer read past the end of input buffer.	2013-02-07 16:26:55 +02:00
Serhiy Storchaka	03ee12ed72	Issue #17043 : The unicode-internal decoder no longer read past the end of input buffer.	2013-02-07 16:25:25 +02:00
Serhiy Storchaka	3fd4ab356d	Issue #17043 : The unicode-internal decoder no longer read past the end of input buffer.	2013-02-07 16:23:21 +02:00
Serhiy Storchaka	8911ef5b6d	Issue #17034 : Use Py_CLEAR() in bytesobject.c.	2013-02-02 18:46:19 +02:00
Serhiy Storchaka	d357a3f841	Issue #17034 : Use Py_CLEAR() in bytesobject.c.	2013-02-02 18:45:54 +02:00
Serhiy Storchaka	f458a03617	Issue #17034 : Use Py_CLEAR() in bytesobject.c.	2013-02-02 18:45:22 +02:00
Gregory P. Smith	ce9e3c3af9	Silence a -Wformat-extra-argument warning when compiling.	2013-02-01 16:14:00 -08:00
Serhiy Storchaka	2aee6a6460	Issue #16971 : Fix a refleak in the charmap decoder.	2013-01-29 12:16:57 +02:00
Serhiy Storchaka	afb1cb5579	Issue #16971 : Fix a refleak in the charmap decoder.	2013-01-29 12:13:22 +02:00
Serhiy Storchaka	8fe5a9f9c3	Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.	2013-01-29 10:37:39 +02:00
Serhiy Storchaka	24193debd4	Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.	2013-01-29 10:28:07 +02:00
Serhiy Storchaka	d679377be7	Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.	2013-01-29 10:20:44 +02:00
Mark Dickinson	07c7136524	Issue #16772 : in int(x, base), non-integer bases must have an __index__ method.	2013-01-27 10:17:52 +00:00
Ezio Melotti	3a62e45b97	Merge typo fixes from 3.3.	2013-01-27 06:20:51 +02:00
Ezio Melotti	3f5db3940f	Fix a few typos and a double semicolon. Patch by Eitan Adler.	2013-01-27 06:20:14 +02:00
Serhiy Storchaka	ed3c4128c0	Issue #10156 : In the interpreter's initialization phase, unicode globals are now initialized dynamically as needed.	2013-01-26 12:18:17 +02:00
Serhiy Storchaka	678db84b37	Issue #10156 : In the interpreter's initialization phase, unicode globals are now initialized dynamically as needed.	2013-01-26 12:16:36 +02:00
Serhiy Storchaka	059972535f	Issue #10156 : In the interpreter's initialization phase, unicode globals are now initialized dynamically as needed.	2013-01-26 12:14:02 +02:00
Serhiy Storchaka	570c5b2354	Issue #16980 : Fix processing of escaped non-ascii bytes in the unicode-escape-decode decoder.	2013-01-25 23:53:29 +02:00
Serhiy Storchaka	73e38809e0	Issue #16980 : Fix processing of escaped non-ascii bytes in the unicode-escape-decode decoder.	2013-01-25 23:52:21 +02:00
Serhiy Storchaka	f584aba3a5	Issue #16975 : Fix error handling bug in the escape-decode bytes decoder.	2013-01-25 23:33:22 +02:00
Serhiy Storchaka	e58785b200	Issue #16975 : Fix error handling bug in the escape-decode bytes decoder.	2013-01-25 23:32:41 +02:00
Serhiy Storchaka	ace3ad3bf7	Issue #16975 : Fix error handling bug in the escape-decode bytes decoder.	2013-01-25 23:31:43 +02:00
Serhiy Storchaka	6481bfb2b5	Issue #16335 : Fix integer overflow in unicode-escape decoder.	2013-01-21 11:44:40 +02:00
Serhiy Storchaka	c35f3a9f61	Issue #16335 : Fix integer overflow in unicode-escape decoder.	2013-01-21 11:42:57 +02:00
Serhiy Storchaka	4f5f0e54e0	Issue #16335 : Fix integer overflow in unicode-escape decoder.	2013-01-21 11:38:00 +02:00
Serhiy Storchaka	441d30fac7	Issue #15989 : Fix several occurrences of integer overflow when result of PyLong_AsLong() narrowed to int without checks. This is a backport of changesets 13e2e44db99d and 525407d89277.	2013-01-19 12:26:26 +02:00
Serhiy Storchaka	9101e23ff6	Issue #15989 : Fix several occurrences of integer overflow when result of PyLong_AsLong() narrowed to int without checks. This is a backport of changesets 13e2e44db99d and 525407d89277.	2013-01-19 12:41:45 +02:00
Serhiy Storchaka	55e2cb497b	Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping" in any mapping, not only in an unicode string.	2013-01-15 15:30:04 +02:00
Serhiy Storchaka	45d16d9924	Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping" in any mapping, not only in an unicode string.	2013-01-15 15:01:20 +02:00
Serhiy Storchaka	4fb8caee87	Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping" in any mapping, not only in an unicode string.	2013-01-15 14:43:21 +02:00
Serhiy Storchaka	b946af5897	Check for NULL before the pointer aligning in fastsearch_memchr_1char. There is no guarantee that NULL is aligned.	2013-01-15 13:32:41 +02:00
Serhiy Storchaka	18ba40b945	Check for NULL before the pointer aligning in fastsearch_memchr_1char. There is no guarantee that NULL is aligned.	2013-01-15 13:27:28 +02:00
Serhiy Storchaka	7898043868	Issue #15989 : Fix several occurrences of integer overflow when result of PyLong_AsLong() narrowed to int without checks.	2013-01-15 01:12:17 +02:00
Benjamin Peterson	0b32a480bd	merge 3.3 (#16906 )	2013-01-09 09:52:22 -06:00
Benjamin Peterson	0c270a8bb7	correct static string clearing loop (closes #16906 )	2013-01-09 09:52:01 -06:00
Serhiy Storchaka	24a3ef6999	Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP characters.	2013-01-08 23:41:55 +02:00
Serhiy Storchaka	ae3b32ad6b	Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP characters.	2013-01-08 23:40:52 +02:00
Serhiy Storchaka	48e188e573	Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP characters.	2013-01-08 23:14:24 +02:00
Serhiy Storchaka	dec798eb46	Fix out of bound read in UTF-32 decoder on "narrow Unicode" builds.	2013-01-08 22:45:42 +02:00
Christian Heimes	34bdeb5d81	Add a comment about not caching the hash value. Issue #9685 suggested to memorize the hash value, but the feature request was rejected because no speed ups were found.	2013-01-07 21:24:18 +01:00
Serhiy Storchaka	4e02538bf3	Issue #16856 : Fix a segmentation fault from calling repr() on a dict with a key whose repr raise an exception.	2013-01-04 12:40:35 +02:00
Serhiy Storchaka	6c83e739d7	Issue #16856 : Fix a segmentation fault from calling repr() on a dict with a key whose repr raise an exception.	2013-01-04 12:39:34 +02:00
Victor Stinner	18aa4477d3	Close #16281 : handle tailmatch() failure and remove useless comment "honor direction and do a forward or backwards search": the runtime speed may be different, but I consider that it doesn't really matter in practice. The direction was never honored before: Python 2.7 uses memcmp() for the str type for example.	2013-01-03 03:18:09 +01:00
Victor Stinner	7ae320d667	(Merge 3.2) Issue #16455 : On FreeBSD and Solaris, if the locale is C, the ASCII/surrogateescape codec is now used, instead of the locale encoding, to decode the command line arguments. This change fixes inconsistencies with os.fsencode() and os.fsdecode() because these operating systems announces an ASCII locale encoding, whereas the ISO-8859-1 encoding is used in practice.	2013-01-03 01:21:07 +01:00
Victor Stinner	20b654acb5	Issue #16455 : On FreeBSD and Solaris, if the locale is C, the ASCII/surrogateescape codec is now used, instead of the locale encoding, to decode the command line arguments. This change fixes inconsistencies with os.fsencode() and os.fsdecode() because these operating systems announces an ASCII locale encoding, whereas the ISO-8859-1 encoding is used in practice.	2013-01-03 01:08:58 +01:00
Antoine Pitrou	a2678f3eb6	Fix the advertised size of PyCFunctionObjects in sys._debugmallocstats().	2012-12-30 22:46:56 +01:00
Antoine Pitrou	0811f98e10	Fix the advertised size of PyCFunctionObjects in sys._debugmallocstats().	2012-12-30 22:46:04 +01:00
Serhiy Storchaka	c819b077bb	Issue #16761 : Raise TypeError when int() called with base argument only.	2012-12-28 10:09:54 +02:00
Serhiy Storchaka	00e2843115	Issue #16761 : Raise TypeError when int() called with base argument only.	2012-12-28 10:02:42 +02:00
Serhiy Storchaka	0b386d5247	Issue #16761 : Raise TypeError when int() called with base argument only.	2012-12-28 09:42:11 +02:00
Benjamin Peterson	513762fe9c	use more specific type	2012-12-26 16:43:33 -06:00
Andrew Svetlov	4de2924dab	Fix compilation error for #15422	2012-12-26 23:08:54 +02:00
Gregory P. Smith	a689e524e7	Test for issue16772 and redoes the previous fix to accept __index__-aware objects as the base by using PyNumber_AsSsize_t similar to round().	2012-12-25 22:38:32 -08:00
Gregory P. Smith	4fbbf8c0a3	Fixes issue #16772 : int() constructor second argument (base) must be an int. Consistent with the behavior in Python 2.	2012-12-25 13:05:31 -08:00
Andrew Svetlov	3ba3a3ee56	Issue #15422 : get rid of PyCFunction_New macro	2012-12-25 13:32:35 +02:00
Andrew Svetlov	2cd8ce4690	Issue #9856 : Replace deprecation warinigs to raising TypeError in object.__format__ Patch by Florent Xicluna.	2012-12-23 14:27:17 +02:00
Benjamin Peterson	7643c92cdd	merge 3.3 (#16722 )	2012-12-19 15:28:46 -06:00
Benjamin Peterson	5ff3f73d94	try to call __bytes__ before __index__ (closes #16722 )	2012-12-19 15:27:41 -06:00
Andrew Svetlov	2606a6f197	Issue #16719 : Get rid of WindowsError. Use OSError instead Patch by Serhiy Storchaka.	2012-12-19 14:33:35 +02:00
Antoine Pitrou	928405303d	Following issue #13390 , fix compilation --without-pymalloc, and make sys.getallocatedblocks() return 0 in that situation.	2012-12-17 23:05:59 +01:00
Gregory P. Smith	27dc02e8c5	Fix the internals of our hash functions to used unsigned values during hash computation as the overflow behavior of signed integers is undefined. NOTE: This change is smaller compared to 3.2 as much of this cleanup had already been done. I added the comment that my change in 3.2 added so that the code would match up. Otherwise this just adds or synchronizes appropriate UL designations on some constants to be pedantic. In practice we require compiling everything with -fwrapv which forces overflow to be defined as twos compliment but this keeps the code cleaner for checkers or in the case where someone has compiled it without -fwrapv or their compiler's equivalent. We could work to get rid of the -fwrapv requirement in 3.4 but that requires more planning. Found by Clang trunk's Undefined Behavior Sanitizer (UBSan). Cleanup only - no functionality or hash values change.	2012-12-10 19:51:29 -08:00
Gregory P. Smith	a6be61ec71	Keep y a Py_hash_t instead of Py_uhash_t as it is compared with == -1 and the compiler logic will do the right thing with just x as a Py_uhash_t. This matches what was already done in the 3.3 version. cleanup only - no functionality or hash values change.	2012-12-10 18:34:09 -08:00
Gregory P. Smith	c2176e46d7	Fix the internals of our hash functions to used unsigned values during hash computation as the overflow behavior of signed integers is undefined. NOTE: This change is smaller compared to 3.2 as much of this cleanup had already been done. I added the comment that my change in 3.2 added so that the code would match up. Otherwise this just adds or synchronizes appropriate UL designations on some constants to be pedantic. In practice we require compiling everything with -fwrapv which forces overflow to be defined as twos compliment but this keeps the code cleaner for checkers or in the case where someone has compiled it without -fwrapv or their compiler's equivalent. Found by Clang trunk's Undefined Behavior Sanitizer (UBSan). Cleanup only - no functionality or hash values change.	2012-12-10 18:32:53 -08:00
Gregory P. Smith	27cbcd6241	Fix the internals of our hash functions to used unsigned values during hash computation as the overflow behavior of signed integers is undefined. In practice we require compiling everything with -fwrapv which forces overflow to be defined as twos compliment but this keeps the code cleaner for checkers or in the case where someone has compiled it without -fwrapv or their compiler's equivalent. Found by Clang trunk's Undefined Behavior Sanitizer (UBSan). Cleanup only - no functionality or hash values change.	2012-12-10 18:15:46 -08:00
Antoine Pitrou	f9d0b1256f	Issue #13390 : New function :func:`sys.getallocatedblocks()` returns the number of memory blocks currently allocated. Also, the ``-R`` option to regrtest uses this function to guard against memory allocation leaks.	2012-12-09 14:28:26 +01:00
Antoine Pitrou	53f604c794	Issue #16602 : When a weakref's target was part of a long deallocation chain, the object could remain reachable through its weakref even though its refcount had dropped to zero. Thanks to Eugene Toder for diagnosing and reporting the issue.	2012-12-08 21:18:50 +01:00
Antoine Pitrou	f93ed3fa67	Issue #16602 : When a weakref's target was part of a long deallocation chain, the object could remain reachable through its weakref even though its refcount had dropped to zero. Thanks to Eugene Toder for diagnosing and reporting the issue.	2012-12-08 21:17:03 +01:00
Antoine Pitrou	62a0d6ea40	Issue #16602 : When a weakref's target was part of a long deallocation chain, the object could remain reachable through its weakref even though its refcount had dropped to zero. Thanks to Eugene Toder for diagnosing and reporting the issue.	2012-12-08 21:15:26 +01:00
Chris Jerdonek	e7f2186f99	Issue #16495 : remove extraneous NULL encoding check from bytes_decode(). The NULL encoding check in bytes_decode() was unnecessary because this case is already taken care of by the call to _Py_normalize_encoding() inside PyUnicode_Decode().	2012-12-07 15:51:53 -08:00
Victor Stinner	8dbd421b4d	Cleanup unicodeobject.c * Remove micro-optization: (errors == "surrogateescape" \|\| strcmp(errors, "surrogateescape") == 0). Only use strcmp() * Initialize 'arg' members in unicode_format_arg() to help the compiler to diagnose real bugs and also make the code simpler to read	2012-12-04 09:30:24 +01:00
Victor Stinner	d45c7f8d74	Issue #16455 : On FreeBSD and Solaris, if the locale is C, the ASCII/surrogateescape codec is now used, instead of the locale encoding, to decode the command line arguments. This change fixes inconsistencies with os.fsencode() and os.fsdecode() because these operating systems announces an ASCII locale encoding, whereas the ISO-8859-1 encoding is used in practice.	2012-12-04 01:34:47 +01:00
Victor Stinner	2660e427d1	(Merge 3.2) Issue #16416 : On Mac OS X, operating system data are now always encoded/decoded to/from UTF-8/surrogateescape, instead of the locale encoding (which may be ASCII if no locale environment variable is set), to avoid inconsistencies with os.fsencode() and os.fsdecode() functions which are already using UTF-8/surrogateescape.	2012-12-03 12:48:53 +01:00
Victor Stinner	27b1ca29cc	Issue #16416 : On Mac OS X, operating system data are now always encoded/decoded to/from UTF-8/surrogateescape, instead of the locale encoding (which may be ASCII if no locale environment variable is set), to avoid inconsistencies with os.fsencode() and os.fsdecode() functions which are already using UTF-8/surrogateescape.	2012-12-03 12:47:59 +01:00
Antoine Pitrou	0e9958b543	Issue #16562 : Optimize dict equality testing. Patch by Serhiy Storchaka (reviewed by Martin and Raymond).	2012-12-02 19:10:07 +01:00
Christian Heimes	5f7e8dab11	Issue #16592 : stringlib_bytes_join doesn't raise MemoryError on allocation failure	2012-12-02 07:56:42 +01:00
Antoine Pitrou	5439458a2a	Issue #16215 : Fix potential double memory free in str.replace(). Patch by Serhiy Storchaka.	2012-11-17 23:29:28 +01:00
Antoine Pitrou	6d5ad227a5	Issue #16215 : Fix potential double memory free in str.replace(). Patch by Serhiy Storchaka.	2012-11-17 23:28:17 +01:00
Mark Dickinson	ffdb2c21b3	Issue #16451 : Refactor to remove duplication between range and slice in slice index computations.	2012-11-17 19:18:10 +00:00
Mark Dickinson	d20fb82195	Issue #16290 : __complex__ must now always return an instance of complex.	2012-11-14 17:08:31 +00:00
Victor Stinner	0d92c4f667	Issue #16416 : Fix error handling in _Py_wchar2char() _Py_char2wchar() functions	2012-11-12 23:32:21 +01:00
Antoine Pitrou	898347056a	Issue #16453 : Fix equality testing of dead weakref objects. Also add tests for ordering and hashing.	2012-11-11 19:39:35 +01:00
Antoine Pitrou	f6a50cfa07	Issue #16453 : Fix equality testing of dead weakref objects. Also add tests for ordering and hashing.	2012-11-11 19:37:41 +01:00
Antoine Pitrou	e11fecb5a9	Issue #16453 : Fix equality testing of dead weakref objects. Also add tests for ordering and hashing.	2012-11-11 19:36:51 +01:00
Mark Dickinson	c8a6967ea8	Issue #14794 : slice.indices no longer returns OverflowError for out-of-range start, stop, step or length.	2012-11-10 14:52:10 +00:00
Victor Stinner	fc009eff9e	Close #16311 : Use the _PyUnicodeWriter API in text decoders * Remove unicode_widen(): replaced with _PyUnicodeWriter_Prepare() * Remove unicode_putchar(): replaced with PyUnicodeWriter_Prepare() + PyUnicode_WRITER() * When handling an decoding error, only overallocate the buffer by +25% instead of +100%	2012-11-07 00:36:38 +01:00
Victor Stinner	6caa6fb535	(Merge 3.3) Issue #8271 : Fix compilation on Windows	2012-11-05 00:00:50 +01:00
Victor Stinner	ab60de478d	Issue #8271 : Fix compilation on Windows	2012-11-04 23:59:15 +01:00
Ezio Melotti	cfa9636404	#8271 : merge with 3.3.	2012-11-04 23:23:09 +02:00
Ezio Melotti	f7ed5d111b	#8271 : the utf-8 decoder now outputs the correct number of U+FFFD characters when used with the "replace" error handler on invalid utf-8 sequences. Patch by Serhiy Storchaka, tests by Ezio Melotti.	2012-11-04 23:21:38 +02:00
Mark Dickinson	c992fafddc	Issue #16402 : Merge fix from 3.3	2012-11-04 11:47:47 +00:00
Mark Dickinson	1321edaa55	Issue #16402 : Merge fix from 3.2	2012-11-04 11:47:05 +00:00
Mark Dickinson	8cd1c7681d	Issue #16402 : In range slicing, fix shadowing of exceptions from __index__ method.	2012-11-04 11:46:17 +00:00
Christian Heimes	e9d08cf450	Fix compilation on Windows	2012-11-03 23:08:27 +01:00
Christian Heimes	d081fbba58	Fix compilation on Windows	2012-11-03 23:08:18 +01:00
Christian Heimes	6d26ade920	Fix compilation on Windows	2012-11-03 23:07:59 +01:00
Ezio Melotti	212843b29f	#8401 : merge with 3.3.	2012-11-03 21:24:47 +02:00
Ezio Melotti	7376801f61	#8401 : merge with 3.2.	2012-11-03 21:22:41 +02:00
Ezio Melotti	c64bcbec4b	#8401 : assigning an int to a bytearray slice (e.g. b[3:4] = 5) now raises an error.	2012-11-03 21:19:06 +02:00
Stefan Krah	c38c816ea1	Merge 3.3.	2012-11-02 17:55:11 +01:00
Stefan Krah	4af77a0276	Issue #15814 : Use hash function that is compatible with the equality definition from #15573.	2012-11-02 17:49:22 +01:00
Benjamin Peterson	8781d4a84c	merge 3.3	2012-10-31 14:22:31 -04:00
Benjamin Peterson	591c921411	merge 3.2	2012-10-31 14:22:25 -04:00
Benjamin Peterson	9892f52145	avoid a function call with redundant checks for dict size	2012-10-31 14:22:12 -04:00
Benjamin Peterson	7503e08588	merge 3.3 (#16345 )	2012-10-31 14:10:04 -04:00
Benjamin Peterson	d97eb0d338	merge 3.2 (#16345 )	2012-10-31 14:09:11 -04:00
Benjamin Peterson	d1f2cb37a2	only fast-path fromkeys() when the constructor returns a empty dict (closes #16345 )	2012-10-31 14:05:55 -04:00
Benjamin Peterson	3cb90241fc	merge 3.3	2012-10-31 00:04:42 -04:00
Benjamin Peterson	2c05a2e01b	do safety checks on __qualname__ assignment	2012-10-31 00:01:15 -04:00
Benjamin Peterson	8afa7fa510	don't shadow the __qualname__ descriptor with __qualname__ in the class's __dict__ (closes #16271 )	2012-10-30 23:51:03 -04:00
Benjamin Peterson	42124a727d	initialize map/filter/zip in _PyBuiltin_Init rather than the catch-all function	2012-10-30 23:41:54 -04:00
Benjamin Peterson	7ff2094bc7	merge 3.3 (#16369 )	2012-10-30 23:31:12 -04:00
Benjamin Peterson	e8ea97fffb	merge 3.2 (#16369 )	2012-10-30 23:27:52 -04:00
Benjamin Peterson	c43112823b	initialize more global type objects (closes #16369 )	2012-10-30 23:21:10 -04:00
Victor Stinner	7a6d7cf3db	Issue #9566 : Use the right type to fix a compiler warnings on Win64	2012-10-31 00:37:41 +01:00
Victor Stinner	4ca1cf35fb	Issue #16086 : PyTypeObject.tp_flags and PyType_Spec.flags are now unsigned ... (unsigned long and unsigned int) to avoid an undefined behaviour with Py_TPFLAGS_TYPE_SUBCLASS ((1 << 31). PyType_GetFlags() result type is now unsigned too (unsigned long, instead of long).	2012-10-30 23:40:45 +01:00
Victor Stinner	e64322e034	Close #14625 : Rewrite the UTF-32 decoder. It is now 3x to 4x faster Patch written by Serhiy Storchaka.	2012-10-30 23:12:47 +01:00
Victor Stinner	76df43de30	Issue #16330 : Use surrogate-related macros Patch written by Serhiy Storchaka.	2012-10-30 01:42:39 +01:00
Mark Dickinson	fb90c0934c	Issue #14700 : Fix buggy overflow checks for large precision and width in new-style and old-style formatting.	2012-10-28 10:18:03 +00:00
Victor Stinner	c6cf1ba29e	Replace usage of the deprecated Py_UNICODE_COPY() with Py_MEMCPY() in resize_copy()	2012-10-23 02:54:47 +02:00
Victor Stinner	fe75fb4b3e	Optimize _PyUnicode_HasNULChars(): use findchar() instead of PyUnicode_Contains()	2012-10-23 02:52:18 +02:00
Victor Stinner	6fa627578a	Inline raise_translate_exception(): it is only used once	2012-10-23 02:51:50 +02:00
Victor Stinner	e5567ad236	Optimize PyUnicode_RichCompare() for Py_EQ and Py_NE: always use memcmp()	2012-10-23 02:48:49 +02:00
Antoine Pitrou	6f7b0da6bc	Issue #12805 : Make bytes.join and bytearray.join faster when the separator is empty. Patch by Serhiy Storchaka.	2012-10-20 23:08:34 +02:00
Mark Dickinson	e453e4c007	Issue 16280: Drop questionable special-casing of null pointer in PyLong_FromVoidPtr.	2012-10-18 22:18:42 +01:00
Mark Dickinson	5cb65917e1	Issue #16277 : merge fix from 3.3	2012-10-18 19:53:45 +01:00
Mark Dickinson	44362a88ad	Issue #16277 : merge fix from 3.2	2012-10-18 19:53:28 +01:00
Mark Dickinson	91044799f7	Issue #16277 : in PyLong_FromVoidPtr, add missing branch for sizeof(void*) <= sizeof(long).	2012-10-18 19:21:43 +01:00
Christian Heimes	743e0cd6b5	Issue #16166 : Add PY_LITTLE_ENDIAN and PY_BIG_ENDIAN macros and unified endianess detection and handling.	2012-10-17 23:52:17 +02:00
Eric Snow	42da889fec	merge for issue #16160 : Subclass support now works for types.SimpleNamespace.	2012-10-16 22:45:49 -07:00
Eric Snow	547298c94c	Close #16160 : Subclass support now works for types.SimpleNamespace. Thanks to RDM for noticing.	2012-10-16 22:35:38 -07:00
Antoine Pitrou	cfc22b4a9b	Issue #15958 : bytes.join and bytearray.join now accept arbitrary buffer objects.	2012-10-16 21:07:23 +02:00
Chris Jerdonek	4a7df9aba9	Issue #14783 : Merge changes from 3.3.	2012-10-07 15:02:16 -07:00
Chris Jerdonek	042fa653ab	Issue #14783 : Merge changes from 3.2.	2012-10-07 14:56:27 -07:00
Chris Jerdonek	83fe2e1c22	Issue #14783 : Improve int() docstring and also str(), range(), and slice(). This commit rewrites the docstring for int() to incorporate the documentation changes made in issue #16036. It also switches the docstrings for int(), str(), range(), and slice() to use multi-line signatures.	2012-10-07 14:48:36 -07:00
Armin Ronacher	74b38b190f	Issue #16148 : Small improvements and cleanup. Added version information to docs.	2012-10-07 10:29:32 +02:00
Victor Stinner	4c63a972d1	Cleanup PyUnicode_FromFormatV() for zero padding Skip the "0" instead of parsing it twice: detect zero padding and then parsed as a digit of the width.	2012-10-06 23:55:33 +02:00
Victor Stinner	15a1136547	Issue #16147 : PyUnicode_FromFormatV() doesn't need anymore to allocate a buffer on the heap to format numbers.	2012-10-06 23:48:20 +02:00
Victor Stinner	ff5a848db5	Issue #16147 : PyUnicode_FromFormatV() now raises an error if the argument of '%c' is not in the range(0x110000).	2012-10-06 23:05:45 +02:00
Victor Stinner	3921e90c5a	Issue #16147 : PyUnicode_FromFormatV() now detects integer overflow when parsing width and precision	2012-10-06 23:05:00 +02:00
Victor Stinner	e215d960be	Issue #16147 : Rewrite PyUnicode_FromFormatV() to use _PyUnicodeWriter API * Simplify the code: replace 4 steps with one unique step using the _PyUnicodeWriter API. PyUnicode_Format() has the same design. It avoids to store intermediate results which require to allocate an array of pointers on the heap. * Use the _PyUnicodeWriter API for speed (and its convinient API): overallocate the buffer to reduce the number of "realloc()" * Implement "width" and "precision" in Python, don't rely on sprintf(). It avoids to need of a temporary buffer allocated on the heap: only use a small buffer allocated in the stack. * Add _PyUnicodeWriter_WriteCstr() function * Split PyUnicode_FromFormatV() into two functions: add unicode_fromformat_arg(). * Inline parse_format_flags(): the format of an argument is now only parsed once, it's no more needed to have a subfunction. * Optimize PyUnicode_FromFormatV() for characters between two "%" arguments: search the next "%" and copy the substring in one chunk, instead of copying character per character.	2012-10-06 23:03:36 +02:00
Mark Dickinson	cf46d62fcb	Issue #16096 : port fix from 3.3	2012-10-06 18:50:19 +01:00
Mark Dickinson	fc9adb62fb	Issue #16096 : Fix signed overflow in Objects/longobject.c. Thanks Serhiy Storchaka.	2012-10-06 18:50:02 +01:00
Mark Dickinson	ff9c54aca2	Issue #16096 : Merge fixes from 3.3.	2012-10-06 18:05:14 +01:00
Mark Dickinson	c04ddff290	Issue #16096 : Fix several occurrences of potential signed integer overflow. Thanks Serhiy Storchaka.	2012-10-06 18:04:49 +01:00
Christian Heimes	b70e8a1958	and another one	2012-10-06 17:16:39 +02:00
Christian Heimes	6314d164c9	move var declaration to top of block to fix compilation on Windows, fixes a7ec0a1b0f7c	2012-10-06 17:13:29 +02:00
Armin Ronacher	23c5bb4030	Fixed a missing incref introduced by a7ec0a1b0f7c	2012-10-06 14:30:32 +02:00
Armin Ronacher	226b1db0e2	Added notimplemented_dealloc for better error reporting	2012-10-06 14:28:58 +02:00
Armin Ronacher	aa9a79d279	Issue #16148 : implemented PEP 424	2012-10-06 14:03:24 +02:00
Victor Stinner	8c6db45d3e	In debug mode, unicode_write_cstr() now checks that non-ASCII characters are not written into an ASCII string	2012-10-06 00:40:45 +02:00
Ezio Melotti	080a2c087e	#16127 : merge with 3.3.	2012-10-05 03:34:02 +03:00
Ezio Melotti	e7f90375b1	#16127 : remove outdated references to narrow builds. Patch by Serhiy Storchaka.	2012-10-05 03:33:31 +03:00
Victor Stinner	1929407406	Fix PyUnicode_Format(): return NULL if PyUnicode_READY(uformat) failed This error cannot occur in practice: PyUnicode_FromObject() always return a "ready" string.	2012-10-05 00:09:33 +02:00
Victor Stinner	770e19e0cc	Optimize unicode_compare(): use memcmp() when comparing two UCS1 strings	2012-10-04 22:59:45 +02:00
Victor Stinner	90db9c47dc	Enable also ptr==ptr optimization in PyUnicode_Compare() It was already implemented in PyUnicode_RichCompare()	2012-10-04 21:53:50 +02:00
Victor Stinner	9cc98c93a7	long_to_decimal_string_internal() doesn't need to write the final NULL character	2012-10-04 02:43:02 +02:00
Victor Stinner	aa7712711d	unicode_result_wchar(): move the assert() to the "#ifdef Py_DEBUG" block	2012-10-04 02:32:58 +02:00
Victor Stinner	a4708231e6	Split the huge PyUnicode_Format() function (+540 lines) into subfunctions	2012-10-04 02:19:54 +02:00
Victor Stinner	a049443fab	PyUnicode_Format(): disable overallocation when we are writing the last part of the output string	2012-10-03 23:03:46 +02:00
Victor Stinner	afffce489b	Unicode: resize_compact() and resize_inplace() fills also the Unicode strings with invalid bytes in debug mode, as done by PyUnicode_New()	2012-10-03 23:03:17 +02:00
Victor Stinner	c89d28fdfc	Issue #15609 : Fix refleak introduced by my last optimization	2012-10-02 12:54:07 +02:00
Victor Stinner	621ef3d84f	Issue #15609 : Optimize str%args for integer argument - Use _PyLong_FormatWriter() instead of formatlong() when possible, to avoid a temporary buffer - Enable the fast path when width is smaller or equals to the length, and when the precision is bigger or equals to the length - Add unit tests! - formatlong() uses PyUnicode_Resize() instead of _PyUnicode_FromASCII() to resize the output string	2012-10-02 00:33:47 +02:00
Benjamin Peterson	b8350f1c7d	upgrade to UCD 6.2	2012-09-29 13:47:39 -04:00
Ezio Melotti	0e1af282b8	Fix typo.	2012-09-28 16:43:40 +03:00
Mark Dickinson	7c95bb35e4	Issue #16060 : Fix a double DECREF in int() implementation. Thanks Serhiy Storchaka.	2012-09-27 19:38:59 +01:00
Antoine Pitrou	a1f7655fa7	Issue #15379 : Fix passing of non-BMP characters as integers for the charmap decoder (already working as unicode strings). Patch by Serhiy Storchaka.	2012-09-23 20:00:04 +02:00
Antoine Pitrou	6f80f5d444	Issue #15379 : Fix passing of non-BMP characters as integers for the charmap decoder (already working as unicode strings). Patch by Serhiy Storchaka.	2012-09-23 19:55:21 +02:00
Mark Dickinson	5710c2a3e8	Issue 15959: Merge from 3.2.	2012-09-20 21:30:34 +01:00
Mark Dickinson	c286e58044	Issue 15959: Fix type mismatch for quick{_neg}_int_allocs. Thanks Serhiy Storchaka.	2012-09-20 21:29:28 +01:00
Antoine Pitrou	ca8aa4acf6	Issue #15144 : Fix possible integer overflow when handling pointers as integer values, by using Py_uintptr_t instead of size_t. Patch by Serhiy Storchaka.	2012-09-20 20:56:47 +02:00
Trent Nelson	da064d0745	Silence compiler warnings on Solaris 10 via explicit (void *) casts.	2012-09-18 22:00:25 -04:00
Trent Nelson	ab02db23b1	Silence compiler warnings on Solaris 10 via explicit (void *) casts. (Compiler: Solaris Studio 12.3)	2012-09-18 21:58:03 -04:00

... 3 4 5 6 7 ...

4789 Commits