cpython

Commit Graph

Author	SHA1	Message	Date
Benjamin Peterson	736b8012b4	prevent overflow in unicode_repr (closes #22520 )	2014-09-29 23:02:15 -04:00
Benjamin Peterson	a1c1be4e03	cleanup overflowing handling in unicode_decode_call_errorhandler and unicode_encode_ucs1 (closes #22518 )	2014-09-29 18:18:57 -04:00
Kristján Valur Jónsson	25dded041f	Make the various iterators' "setstate" sliently and consistently clip the index. This avoids the possibility of setting an iterator to an invalid state.	2014-03-05 13:47:57 +00:00
Serhiy Storchaka	94ee389308	Issue #19619 : Blacklist non-text codecs in method API str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type. The latter mechanism remains in place for third party non-text encodings. Backported changeset d68df99d7a57.	2014-02-24 14:43:03 +02:00
Benjamin Peterson	9743b2c2b5	give non-iterable TypeError a message (closes #20507 )	2014-02-15 13:02:52 -05:00
Serhiy Storchaka	505ff755d7	Issue #20437 : Fixed 21 potential bugs when deleting objects references.	2014-02-09 13:33:53 +02:00
Serhiy Storchaka	016a3f33a5	Issue #20538 : UTF-7 incremental decoder produced inconsistant string when input was truncated in BASE64 section.	2014-02-08 14:01:29 +02:00
Serhiy Storchaka	35804e4c63	Issue #19279 : UTF-7 decoder no more produces illegal strings.	2013-10-19 20:38:19 +03:00
Raymond Hettinger	e56666d17f	Silence compiler warning about an uninitialized variable	2013-08-04 11:51:03 -07:00
Christian Heimes	26532f7519	Check return value of PyType_Ready(&EncodingMapType) CID 486654	2013-07-20 14:57:16 +02:00
Serhiy Storchaka	8eeae2126c	Issue #18184 : PyUnicode_FromFormat() and PyUnicode_FromFormatV() now raise OverflowError when an argument of %c format is out of range.	2013-06-23 20:12:14 +03:00
Benjamin Peterson	7e30373126	remove MAX_MAXCHAR because it's unsafe for computing maximum codepoitn value (see #18183 )	2013-06-10 09:19:46 -07:00
Antoine Pitrou	8b0e98426d	Issue #17237 : Fix crash in the ASCII decoder on m68k.	2013-05-11 15:58:34 +02:00
Raymond Hettinger	378170d5d9	Issue 17447: Clarify that str.isidentifier doesn't check for reserved keywords.	2013-03-23 08:21:12 -07:00
Victor Stinner	2cb16aa3cb	_PyUnicode_Writer() now also reuses Unicode singletons: empty string and latin1 single character	2013-03-06 19:28:37 +01:00
Victor Stinner	d21b58c05d	Issue #17223 : Fix PyUnicode_FromUnicode() for string of 1 character outside the range U+0000-U+10ffff.	2013-02-26 00:15:54 +01:00
Victor Stinner	bbbac2ec34	Issue #17137 : When an Unicode string is resized, the internal wide character string (wstr) format is now cleared.	2013-02-07 23:12:46 +01:00
Serhiy Storchaka	03ee12ed72	Issue #17043 : The unicode-internal decoder no longer read past the end of input buffer.	2013-02-07 16:25:25 +02:00
Serhiy Storchaka	3fd4ab356d	Issue #17043 : The unicode-internal decoder no longer read past the end of input buffer.	2013-02-07 16:23:21 +02:00
Serhiy Storchaka	afb1cb5579	Issue #16971 : Fix a refleak in the charmap decoder.	2013-01-29 12:13:22 +02:00
Serhiy Storchaka	24193debd4	Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.	2013-01-29 10:28:07 +02:00
Serhiy Storchaka	d679377be7	Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.	2013-01-29 10:20:44 +02:00
Serhiy Storchaka	678db84b37	Issue #10156 : In the interpreter's initialization phase, unicode globals are now initialized dynamically as needed.	2013-01-26 12:16:36 +02:00
Serhiy Storchaka	059972535f	Issue #10156 : In the interpreter's initialization phase, unicode globals are now initialized dynamically as needed.	2013-01-26 12:14:02 +02:00
Serhiy Storchaka	73e38809e0	Issue #16980 : Fix processing of escaped non-ascii bytes in the unicode-escape-decode decoder.	2013-01-25 23:52:21 +02:00
Serhiy Storchaka	c35f3a9f61	Issue #16335 : Fix integer overflow in unicode-escape decoder.	2013-01-21 11:42:57 +02:00
Serhiy Storchaka	4f5f0e54e0	Issue #16335 : Fix integer overflow in unicode-escape decoder.	2013-01-21 11:38:00 +02:00
Serhiy Storchaka	441d30fac7	Issue #15989 : Fix several occurrences of integer overflow when result of PyLong_AsLong() narrowed to int without checks. This is a backport of changesets 13e2e44db99d and 525407d89277.	2013-01-19 12:26:26 +02:00
Serhiy Storchaka	9101e23ff6	Issue #15989 : Fix several occurrences of integer overflow when result of PyLong_AsLong() narrowed to int without checks. This is a backport of changesets 13e2e44db99d and 525407d89277.	2013-01-19 12:41:45 +02:00
Serhiy Storchaka	45d16d9924	Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping" in any mapping, not only in an unicode string.	2013-01-15 15:01:20 +02:00
Serhiy Storchaka	4fb8caee87	Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping" in any mapping, not only in an unicode string.	2013-01-15 14:43:21 +02:00
Benjamin Peterson	0c270a8bb7	correct static string clearing loop (closes #16906 )	2013-01-09 09:52:01 -06:00
Serhiy Storchaka	ae3b32ad6b	Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP characters.	2013-01-08 23:40:52 +02:00
Serhiy Storchaka	48e188e573	Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP characters.	2013-01-08 23:14:24 +02:00
Serhiy Storchaka	dec798eb46	Fix out of bound read in UTF-32 decoder on "narrow Unicode" builds.	2013-01-08 22:45:42 +02:00
Serhiy Storchaka	6c83e739d7	Issue #16856 : Fix a segmentation fault from calling repr() on a dict with a key whose repr raise an exception.	2013-01-04 12:39:34 +02:00
Victor Stinner	7ae320d667	(Merge 3.2) Issue #16455 : On FreeBSD and Solaris, if the locale is C, the ASCII/surrogateescape codec is now used, instead of the locale encoding, to decode the command line arguments. This change fixes inconsistencies with os.fsencode() and os.fsdecode() because these operating systems announces an ASCII locale encoding, whereas the ISO-8859-1 encoding is used in practice.	2013-01-03 01:21:07 +01:00
Victor Stinner	20b654acb5	Issue #16455 : On FreeBSD and Solaris, if the locale is C, the ASCII/surrogateescape codec is now used, instead of the locale encoding, to decode the command line arguments. This change fixes inconsistencies with os.fsencode() and os.fsdecode() because these operating systems announces an ASCII locale encoding, whereas the ISO-8859-1 encoding is used in practice.	2013-01-03 01:08:58 +01:00
Gregory P. Smith	c2176e46d7	Fix the internals of our hash functions to used unsigned values during hash computation as the overflow behavior of signed integers is undefined. NOTE: This change is smaller compared to 3.2 as much of this cleanup had already been done. I added the comment that my change in 3.2 added so that the code would match up. Otherwise this just adds or synchronizes appropriate UL designations on some constants to be pedantic. In practice we require compiling everything with -fwrapv which forces overflow to be defined as twos compliment but this keeps the code cleaner for checkers or in the case where someone has compiled it without -fwrapv or their compiler's equivalent. Found by Clang trunk's Undefined Behavior Sanitizer (UBSan). Cleanup only - no functionality or hash values change.	2012-12-10 18:32:53 -08:00
Gregory P. Smith	27cbcd6241	Fix the internals of our hash functions to used unsigned values during hash computation as the overflow behavior of signed integers is undefined. In practice we require compiling everything with -fwrapv which forces overflow to be defined as twos compliment but this keeps the code cleaner for checkers or in the case where someone has compiled it without -fwrapv or their compiler's equivalent. Found by Clang trunk's Undefined Behavior Sanitizer (UBSan). Cleanup only - no functionality or hash values change.	2012-12-10 18:15:46 -08:00
Victor Stinner	2660e427d1	(Merge 3.2) Issue #16416 : On Mac OS X, operating system data are now always encoded/decoded to/from UTF-8/surrogateescape, instead of the locale encoding (which may be ASCII if no locale environment variable is set), to avoid inconsistencies with os.fsencode() and os.fsdecode() functions which are already using UTF-8/surrogateescape.	2012-12-03 12:48:53 +01:00
Victor Stinner	27b1ca29cc	Issue #16416 : On Mac OS X, operating system data are now always encoded/decoded to/from UTF-8/surrogateescape, instead of the locale encoding (which may be ASCII if no locale environment variable is set), to avoid inconsistencies with os.fsencode() and os.fsdecode() functions which are already using UTF-8/surrogateescape.	2012-12-03 12:47:59 +01:00
Antoine Pitrou	6d5ad227a5	Issue #16215 : Fix potential double memory free in str.replace(). Patch by Serhiy Storchaka.	2012-11-17 23:28:17 +01:00
Ezio Melotti	f7ed5d111b	#8271 : the utf-8 decoder now outputs the correct number of U+FFFD characters when used with the "replace" error handler on invalid utf-8 sequences. Patch by Serhiy Storchaka, tests by Ezio Melotti.	2012-11-04 23:21:38 +02:00
Benjamin Peterson	e8ea97fffb	merge 3.2 (#16369 )	2012-10-30 23:27:52 -04:00
Benjamin Peterson	c43112823b	initialize more global type objects (closes #16369 )	2012-10-30 23:21:10 -04:00
Mark Dickinson	fb90c0934c	Issue #14700 : Fix buggy overflow checks for large precision and width in new-style and old-style formatting.	2012-10-28 10:18:03 +00:00
Chris Jerdonek	042fa653ab	Issue #14783 : Merge changes from 3.2.	2012-10-07 14:56:27 -07:00
Chris Jerdonek	83fe2e1c22	Issue #14783 : Improve int() docstring and also str(), range(), and slice(). This commit rewrites the docstring for int() to incorporate the documentation changes made in issue #16036. It also switches the docstrings for int(), str(), range(), and slice() to use multi-line signatures.	2012-10-07 14:48:36 -07:00
Mark Dickinson	c04ddff290	Issue #16096 : Fix several occurrences of potential signed integer overflow. Thanks Serhiy Storchaka.	2012-10-06 18:04:49 +01:00

1 2 3 4 5 ...

1004 Commits