cpython

Commit Graph

Author	SHA1	Message	Date
Victor Stinner	cfd2c1b4cc	(Merge 3.3) Issue #17137 : When an Unicode string is resized, the internal wide character string (wstr) format is now cleared.	2013-02-07 23:17:34 +01:00
Victor Stinner	bbbac2ec34	Issue #17137 : When an Unicode string is resized, the internal wide character string (wstr) format is now cleared.	2013-02-07 23:12:46 +01:00
Ezio Melotti	5b1acc0dff	#16910 : merge with 3.3.	2013-01-10 07:46:29 +02:00
Ezio Melotti	0dceb560b6	#16910 : test_bytes, test_unicode, and test_userstring now work with unittest test discovery. Patch by Zachary Ware.	2013-01-10 07:43:26 +02:00
Andrew Svetlov	2cd8ce4690	Issue #9856 : Replace deprecation warnings with raising TypeError in object.__format__. Patch by Florent Xicluna.	2012-12-23 14:27:17 +02:00
Chris Jerdonek	d675a2c48a	Merge from 3.3: Improve str() and object.__str__() docs (issue #13538 ).	2012-11-20 17:53:17 -08:00
Chris Jerdonek	5fae0e5854	Improve str() and object.__str__() documentation (issue #13538 ).	2012-11-20 17:45:51 -08:00
Ezio Melotti	cfa9636404	#8271 : merge with 3.3.	2012-11-04 23:23:09 +02:00
Ezio Melotti	f7ed5d111b	#8271 : the utf-8 decoder now outputs the correct number of U+FFFD characters when used with the "replace" error handler on invalid utf-8 sequences. Patch by Serhiy Storchaka, tests by Ezio Melotti.	2012-11-04 23:21:38 +02:00
Mark Dickinson	61254b9391	Issue #14700 : merge tests from 3.3.	2012-10-28 10:23:08 +00:00
Mark Dickinson	2a83f16e5e	Issue #14700 : merge tests from 3.2.	2012-10-28 10:22:22 +00:00
Mark Dickinson	fb90c0934c	Issue #14700 : Fix buggy overflow checks for large precision and width in new-style and old-style formatting.	2012-10-28 10:18:03 +00:00
Victor Stinner	15a1136547	Issue #16147 : PyUnicode_FromFormatV() doesn't need anymore to allocate a buffer on the heap to format numbers.	2012-10-06 23:48:20 +02:00
Victor Stinner	e215d960be	Issue #16147 : Rewrite PyUnicode_FromFormatV() to use _PyUnicodeWriter API * Simplify the code: replace 4 steps with one unique step using the _PyUnicodeWriter API. PyUnicode_Format() has the same design. It avoids storing intermediate results, which requires allocating an array of pointers on the heap. * Use the _PyUnicodeWriter API for speed (and its convenient API): overallocate the buffer to reduce the number of "realloc()" * Implement "width" and "precision" in Python, don't rely on sprintf(). It avoids the need for a temporary buffer allocated on the heap: only use a small buffer allocated on the stack. * Add _PyUnicodeWriter_WriteCstr() function * Split PyUnicode_FromFormatV() into two functions: add unicode_fromformat_arg(). * Inline parse_format_flags(): the format of an argument is now only parsed once, so a subfunction is no longer needed. * Optimize PyUnicode_FromFormatV() for characters between two "%" arguments: search the next "%" and copy the substring in one chunk, instead of copying character per character.	2012-10-06 23:03:36 +02:00
Benjamin Peterson	4eda93723e	add another testcase	2012-08-05 15:05:34 -07:00
Brett Cannon	acc0c181a8	Remove a now worthless test.	2012-05-12 17:40:28 -04:00
Victor Stinner	f59c28c930	unicode_writer_finish() checks string consistency	2012-05-09 03:24:14 +02:00
Victor Stinner	ece58deb9f	Close #14648 : Compute correctly maxchar in str.format() for substrin	2012-04-23 23:36:38 +02:00
Benjamin Peterson	80d07f8251	inherit maxchar of field value where needed (closes #14648 )	2012-04-23 10:55:29 -04:00
Eric V. Smith	97722c4132	str.format_map tests don't do what they say: fix to actually implement the intent of the test. Closes #13450 . Patch by Akira Li.	2012-03-12 15:26:21 -07:00
Eric V. Smith	edbb6ca084	str.format_map tests don't do what they say: fix to actually implement the intent of the test. Closes #13450 .	2012-03-12 15:16:22 -07:00
Benjamin Peterson	d5890c8db5	add str.casefold() (closes #13752 )	2012-01-14 13:23:30 -05:00
Benjamin Peterson	b2bf01d824	use full unicode mappings for upper/lower/title case (#12736 ) Also broaden the category of characters that count as lowercase/uppercase.	2012-01-11 18:17:06 -05:00
Victor Stinner	6345be9a14	Close #13093 : PyUnicode_EncodeDecimal() doesn't support error handlers different than "strict" anymore. The caller was unable to compute the size of the output buffer: it depends on the error handler.	2011-11-25 20:09:01 +01:00
Victor Stinner	b84d723509	(Merge 3.2) Issue #13093 : Fix error handling on PyUnicode_EncodeDecimal()	2011-11-22 01:50:07 +01:00
Victor Stinner	c814a38f3f	Add a test on str.__getnewargs__() It tests indirectly PyUnicode_Copy(): ensure that the string is a copy.	2011-11-22 01:06:15 +01:00
Victor Stinner	42bf77537e	Rewrite PyUnicode_EncodeDecimal() to use the new Unicode API Add tests for PyUnicode_EncodeDecimal() and PyUnicode_TransformDecimalToASCII().	2011-11-21 22:52:58 +01:00
Victor Stinner	040e16e3e8	"unicode_internal" codec has been deprecated: fix related tests	2011-11-15 22:44:05 +01:00
Antoine Pitrou	78edf7576e	Issue #13333 : The UTF-7 decoder now accepts lone surrogates (the encoder already accepts them).	2011-11-15 01:44:16 +01:00
Antoine Pitrou	5418ee0b9a	Issue #13333 : The UTF-7 decoder now accepts lone surrogates (the encoder already accepts them).	2011-11-15 01:42:21 +01:00
Ezio Melotti	40dc919b0d	Fix range in test.	2011-11-11 17:00:46 +02:00
Antoine Pitrou	51f6648a31	Make test more inclusive	2011-11-11 13:35:44 +01:00
Antoine Pitrou	dffab19218	Enable commented out test	2011-11-11 13:31:59 +01:00
Antoine Pitrou	2c3b2302ad	Issue #13134 : optimize finding single-character strings using memchr	2011-10-11 20:29:21 +02:00
Antoine Pitrou	798b4df812	test_unicode was forgetting to run the common string tests for str.find()	2011-10-08 22:42:00 +02:00
Antoine Pitrou	c0bbe7d38a	test_unicode was forgetting to run the common string tests for str.find()	2011-10-08 22:41:35 +02:00
Victor Stinner	1d972ad12a	Mark 'abc'.expandtab() optimization as specific to CPython Improve also str.replace(a, a) test	2011-10-07 13:31:46 +02:00
Victor Stinner	59de0ee9e0	str.replace(a, a) is now returning str unchanged if a is a	2011-10-07 10:01:28 +02:00
Ezio Melotti	a9860aeb08	#13054 : fix usage of sys.maxunicode after PEP-393.	2011-10-04 19:06:00 +03:00
Antoine Pitrou	e19aa388e8	When expandtabs() would be a no-op, don't create a duplicate string	2011-10-04 16:04:01 +02:00
Victor Stinner	07ac3ebd7b	Optimize unicode_subtype_new(): don't encode to wchar_t and decode from wchar_t Rewrite unicode_subtype_new(): allocate directly the right type.	2011-10-01 16:16:43 +02:00
Benjamin Peterson	811c2f1369	remove "fast-path" for (i)adding strings These were just an artifact of the old unicode concatenation hack and likely just penalized other kinds of adding. Also, this fixes __(i)add__ on string subclasses.	2011-09-30 21:31:21 -04:00
Martin v. Löwis	287eca658d	Fix struct sizes. Drop -1, since the resulting string was actually the largest one that could be allocated.	2011-09-28 10:03:28 +02:00
Martin v. Löwis	d63a3b8beb	Implement PEP 393.	2011-09-28 07:41:54 +02:00
Ezio Melotti	a3fbde3504	Merge indentation fix and skip decorator with 3.2.	2011-08-23 00:40:09 +03:00
Ezio Melotti	a5c92b4714	Fix indentation and add a skip decorator.	2011-08-23 00:37:08 +03:00
Ezio Melotti	6f2a683a0c	#9200 : merge with 3.2.	2011-08-22 20:31:11 +03:00
Ezio Melotti	93e7afc5d9	#9200 : The str.is* methods now work with strings that contain non-BMP characters even in narrow Unicode builds.	2011-08-22 14:08:38 +03:00
Benjamin Peterson	f8e7543df9	merge 3.2 (#12732 )	2011-08-12 22:18:19 -05:00
Benjamin Peterson	f413b80806	in narrow builds, make sure to test codepoints as identifier characters (closes #12732 ) This fixes the use of Unicode identifiers outside the BMP in narrow builds.	2011-08-12 22:17:18 -05:00
Victor Stinner	ab1d16b456	Issue #13093 : Fix error handling on PyUnicode_EncodeDecimal() * Add tests for PyUnicode_EncodeDecimal() and PyUnicode_TransformDecimalToASCII() * Remove the unused "e" variable in replace()	2011-11-22 01:45:37 +01:00
Eric V. Smith	c12469df22	Merge from 3.2.	2011-07-18 14:08:55 -04:00
Eric V. Smith	12ebefc9d3	Closes #12579 . Positional fields with str.format_map() now raise a ValueError instead of SystemError.	2011-07-18 14:03:41 -04:00
Senthil Kumaran	bc9d8f838b	merge from 3.2	2011-07-03 21:05:25 -07:00
Senthil Kumaran	9ebe08d2f6	Fix closes issue12471 - wrong TypeError message when '%i' format spec was used.	2011-07-03 21:03:16 -07:00
Ezio Melotti	bf1253b25a	#6780 : merge with 3.2.	2011-04-26 06:45:24 +03:00
Ezio Melotti	f2b3f780a1	#6780 : merge with 3.1.	2011-04-26 06:40:59 +03:00
Ezio Melotti	ba42fd5801	#6780 : fix starts/endswith error message to mention that tuples are accepted too.	2011-04-26 06:09:45 +03:00
Eric V. Smith	b9cd3531c4	Issue 9856: Change object.__format__ with a non-empty format string from a PendingDeprecationWarning to a DeprecationWarning.	2011-03-12 10:08:48 -05:00
Victor Stinner	6d970f4713	Issue #10831 : PyUnicode_FromFormat() supports %li, %lli and %zi formats	2011-03-02 00:04:25 +00:00
Victor Stinner	968654515f	Issue #10829 : Refactor PyUnicode_FromFormat() * Use the same function to parse the format string in the 3 steps * Fix crashes on invalid format strings	2011-03-01 23:44:09 +00:00
Victor Stinner	2b574a2332	Merged revisions 88697 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r88697 \| victor.stinner \| 2011-03-01 23:46:52 +0100 (mar., 01 mars 2011) \| 4 lines Issue #11246: Fix PyUnicode_FromFormat("%V") Decode the byte string from UTF-8 (with replace error handler) instead of ISO-8859-1 (in strict mode). Patch written by Ray Allen. ........	2011-03-01 22:48:49 +00:00
Victor Stinner	2512a8b62e	Issue #11246 : Fix PyUnicode_FromFormat("%V") Decode the byte string from UTF-8 (with replace error handler) instead of ISO-8859-1 (in strict mode). Patch written by Ray Allen.	2011-03-01 22:46:52 +00:00
Marc-André Lemburg	8f36af7a4c	Normalize the encoding names for Latin-1 and UTF-8 to 'latin-1' and 'utf-8'. These are optimized in the Python Unicode implementation to result in more direct processing, bypassing the codec registry. Also see issue11303.	2011-02-25 15:42:01 +00:00
Victor Stinner	659eb84457	Merged revisions 88481 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r88481 \| victor.stinner \| 2011-02-21 22:13:44 +0100 (lun., 21 févr. 2011) \| 4 lines Fix PyUnicode_FromFormatV("%c") for non-BMP char Issue #10830: Fix PyUnicode_FromFormatV("%c") for non-BMP characters on narrow build. ........	2011-02-23 12:14:22 +00:00
Victor Stinner	5ed8b2c737	Fix PyUnicode_FromFormatV("%c") for non-BMP char Issue #10830: Fix PyUnicode_FromFormatV("%c") for non-BMP characters on narrow build.	2011-02-21 21:13:44 +00:00
Eric Smith	a1eac7218b	Issue #11302 : missing type check on _string.formatter_field_name_split and _string.formatter_parser caused crash. Original patch by haypo, reviewed by me, okayed by Georg.	2011-01-29 11:15:35 +00:00
Victor Stinner	ca1e7ec344	test_unicode: use ctypes to test PyUnicode_FromFormat() Instead of _testcapi.format_unicode() because it has a limited API: it requires exactly one argument of type unicode.	2011-01-05 00:19:28 +00:00
Alexander Belopolsky	942af5a9a4	Issue #10557 : Fixed error messages from float() and other numeric types. Added a new API function, PyUnicode_TransformDecimalToASCII(), which transforms non-ASCII decimal digits in a Unicode string to their ASCII equivalents.	2010-12-04 03:38:46 +00:00
Ezio Melotti	ed3a7d2d60	#10273 : Rename assertRegexpMatches and assertRaisesRegexp to assertRegex and assertRaisesRegex.	2010-12-01 02:32:32 +00:00
Antoine Pitrou	0662bc297a	Fix tests when ctypes isn't available	2010-11-22 16:19:04 +00:00
Ezio Melotti	19f2aeba67	Merged revisions 86596 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r86596 \| ezio.melotti \| 2010-11-20 21:04:17 +0200 (Sat, 20 Nov 2010) \| 1 line #9424: Replace deprecated assert* methods in the Python test suite. ........	2010-11-21 01:30:29 +00:00
Ezio Melotti	b3aedd4862	#9424 : Replace deprecated assert* methods in the Python test suite.	2010-11-20 19:04:17 +00:00
Eric Smith	72f6620859	Removed unused test classes from test_format_map().	2010-11-06 14:43:26 +00:00
Eric Smith	27bbca6f79	Issue #6081 : Add str.format_map. str.format_map(mapping) is similar to str.format(**mapping), except mapping does not get converted to a dict.	2010-11-04 17:06:58 +00:00
Antoine Pitrou	43ffd5c013	Merged revisions 85861 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r85861 \| antoine.pitrou \| 2010-10-27 20:52:48 +0200 (mer., 27 oct. 2010) \| 3 lines Recode modules from latin-1 to utf-8 ........	2010-10-27 18:54:06 +00:00
Antoine Pitrou	d72402effc	Recode modules from latin-1 to utf-8	2010-10-27 18:52:48 +00:00
Victor Stinner	9a90900da5	PyUnicode_FromFormatV(): Fix %A format It was not completely implemented. Add a test.	2010-10-18 20:59:24 +00:00
Martin v. Löwis	baecd7243a	Upgrade to Unicode 6.0.0. makeunicodedata.py: download all data files from unicode.org, switch to extracting Unihan data from zip file. Read linebreakprops and derivednormalizationprops even for old versions, even though they are not used in delta records. test:unicode.py: U+11000 is now assigned, use U+14000 instead.	2010-10-11 22:42:28 +00:00
Victor Stinner	46c7b3b283	Issue #8670 : Rename testcapi unicode test methods * test_aswidechar() => unicode_aswidechar() * test_aswidecharstring() => unicode_aswidecharstring()	2010-10-02 11:49:31 +00:00
Victor Stinner	ea3f305a25	Oops, revert unwanted _testcapi changes of r85174	2010-10-02 11:46:20 +00:00
Victor Stinner	749261e241	Issue #8670 : ctypes.c_wchar supports non-BMP characters with 32 bits wchar_t	2010-10-02 11:25:35 +00:00
Victor Stinner	5593d8aeb4	Issue #8670 : PyUnicode_AsWideChar() and PyUnicode_AsWideCharString() replace UTF-16 surrogate pairs by single non-BMP characters for 16 bits Py_UNICODE and 32 bits wchar_t (eg. Linux in narrow build).	2010-10-02 11:11:27 +00:00
Victor Stinner	1c24bd0252	Issue #8870 : PyUnicode_AsWideCharString() doesn't count the trailing nul character And write unit tests for PyUnicode_AsWideChar() and PyUnicode_AsWideCharString().	2010-10-02 11:03:13 +00:00
Eric Smith	e4d6317c87	Issue 7994: Make object.__format__() raise a PendingDeprecationWarning if the format string is not empty. Manually merge r79596 and r84772 from 2.x. Also, apparently test_format() from test_builtin never made it into 3.x. I've added it as well. It tests the basic format() infrastructure.	2010-09-13 20:48:43 +00:00
Florent Xicluna	a87b383ac1	Reenable test_ucs4 and remove some duplicated lines.	2010-09-13 02:28:18 +00:00
Victor Stinner	4c7db315df	Issue #9738 , #9836 : Fix refleak introduced by r84704	2010-09-12 07:51:18 +00:00
Victor Stinner	1205f2774e	Issue #9738 : PyUnicode_FromFormat() and PyErr_Format() raise an error on a non-ASCII byte in the format string. Document also the encoding.	2010-09-11 00:54:47 +00:00
Amaury Forgeot d'Arc	324ac65ceb	#5127 : Even on narrow unicode builds, the C functions that access the Unicode Database (Py_UNICODE_TOLOWER, Py_UNICODE_ISDECIMAL, and others) now accept and return characters from the full Unicode range (Py_UCS4). The differences from Python code are few: - unicodedata.numeric(), unicodedata.decimal() and unicodedata.digit() now return the correct value for large code points - repr() may consider more characters as printable.	2010-08-18 20:44:58 +00:00
Eric Smith	06124c0df8	Merged revisions 83966 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r83966 \| eric.smith \| 2010-08-12 17:55:30 -0400 (Thu, 12 Aug 2010) \| 1 line Remove unused test class. ........	2010-08-13 00:12:59 +00:00
Eric Smith	994addc414	Remove unused test class.	2010-08-12 21:55:30 +00:00
Stefan Krah	aebd6f4c29	Merged revisions 82978 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r82978 \| stefan.krah \| 2010-07-19 19:58:26 +0200 (Mon, 19 Jul 2010) \| 3 lines Sub-issue of #9036: Fix incorrect use of Py_CHARMASK. ........	2010-07-19 18:01:13 +00:00
Stefan Krah	99212f61db	Sub-issue of #9036 : Fix incorrect use of Py_CHARMASK.	2010-07-19 17:58:26 +00:00
Ezio Melotti	25bc019d46	Merged revisions 82413,82468 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ........ r82413 \| ezio.melotti \| 2010-07-01 10:32:02 +0300 (Thu, 01 Jul 2010) \| 13 lines Update PyUnicode_DecodeUTF8 from RFC 2279 to RFC 3629. 1) #8271: when a byte sequence is invalid, only the start byte and all the valid continuation bytes are now replaced by U+FFFD, instead of replacing the number of bytes specified by the start byte. See http://www.unicode.org/versions/Unicode5.2.0/ch03.pdf (pages 94-95); 2) 5- and 6-bytes-long UTF-8 sequences are now considered invalid (no changes in behavior); 3) Change the error messages "unexpected code byte" to "invalid start byte" and "invalid data" to "invalid continuation byte"; 4) Add an extensive set of tests in test_unicode; 5) Fix test_codeccallbacks because it was failing after this change. ........ r82468 \| ezio.melotti \| 2010-07-03 07:52:19 +0300 (Sat, 03 Jul 2010) \| 1 line Update comment about surrogates. ........	2010-07-03 05:18:50 +00:00
Ezio Melotti	57221d02ba	Update PyUnicode_DecodeUTF8 from RFC 2279 to RFC 3629. 1) #8271: when a byte sequence is invalid, only the start byte and all the valid continuation bytes are now replaced by U+FFFD, instead of replacing the number of bytes specified by the start byte. See http://www.unicode.org/versions/Unicode5.2.0/ch03.pdf (pages 94-95); 2) 5- and 6-bytes-long UTF-8 sequences are now considered invalid (no changes in behavior); 3) Change the error messages "unexpected code byte" to "invalid start byte" and "invalid data" to "invalid continuation byte"; 4) Add an extensive set of tests in test_unicode; 5) Fix test_codeccallbacks because it was failing after this change.	2010-07-01 07:32:02 +00:00
Benjamin Peterson	5a6214afe2	Merged revisions 81499,81506 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81499 \| georg.brandl \| 2010-05-24 16:29:07 -0500 (Mon, 24 May 2010) \| 1 line #8016: add the CP858 codec (approved by Benjamin). (Also add CP720 to the tests, it was missing there.) ........ r81506 \| benjamin.peterson \| 2010-05-24 17:04:53 -0500 (Mon, 24 May 2010) \| 1 line set svn:eol-style ........	2010-06-27 22:41:29 +00:00
Benjamin Peterson	99bcf5ce08	Merged revisions 81823,81835 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r81823 \| benjamin.peterson \| 2010-06-07 17:31:26 -0500 (Mon, 07 Jun 2010) \| 9 lines Merged revisions 81820 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81820 \| benjamin.peterson \| 2010-06-07 17:23:23 -0500 (Mon, 07 Jun 2010) \| 1 line correctly overflow when indexes are too large ........ ................ r81835 \| benjamin.peterson \| 2010-06-08 09:57:22 -0500 (Tue, 08 Jun 2010) \| 9 lines Merged revisions 81834 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81834 \| benjamin.peterson \| 2010-06-08 09:53:29 -0500 (Tue, 08 Jun 2010) \| 1 line kill extra word ........ ................	2010-06-08 15:12:17 +00:00
Benjamin Peterson	59a1b2f732	Merged revisions 81820 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81820 \| benjamin.peterson \| 2010-06-07 17:23:23 -0500 (Mon, 07 Jun 2010) \| 1 line correctly overflow when indexes are too large ........	2010-06-07 22:31:26 +00:00
Victor Stinner	abdb21a3a8	Merged revisions 79281 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/py3k ................ r79281 \| victor.stinner \| 2010-03-22 13:50:40 +0100 (lun., 22 mars 2010) \| 16 lines Merged revisions 79278,79280 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r79278 \| victor.stinner \| 2010-03-22 13:24:37 +0100 (lun., 22 mars 2010) \| 2 lines Issue #1583863: An unicode subclass can now override the __str__ method ........ r79280 \| victor.stinner \| 2010-03-22 13:36:28 +0100 (lun., 22 mars 2010) \| 5 lines Fix the NEWS about my last commit: an unicode subclass can now override the __unicode__ method (and not the __str__ method). Simplify also the testcase. ........ ................	2010-03-22 12:53:14 +00:00
Victor Stinner	808fc0a0ee	Merged revisions 79278,79280 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r79278 \| victor.stinner \| 2010-03-22 13:24:37 +0100 (lun., 22 mars 2010) \| 2 lines Issue #1583863: An unicode subclass can now override the __str__ method ........ r79280 \| victor.stinner \| 2010-03-22 13:36:28 +0100 (lun., 22 mars 2010) \| 5 lines Fix the NEWS about my last commit: an unicode subclass can now override the __unicode__ method (and not the __str__ method). Simplify also the testcase. ........	2010-03-22 12:50:40 +00:00
Brett Cannon	226b2303f4	Clean up the warnings filter use in test_unicode.	2010-03-20 22:22:22 +00:00
Benjamin Peterson	577473fe68	use assert[Not]In where appropriate A patch from Dave Malcolm.	2010-01-19 00:09:57 +00:00
Benjamin Peterson	308d637c94	Merged revisions 74929 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r74929 \| benjamin.peterson \| 2009-09-18 16:14:55 -0500 (Fri, 18 Sep 2009) \| 1 line add keyword arguments support to str/unicode encode and decode #6300 ........	2009-09-18 21:42:35 +00:00
Georg Brandl	ab91fdef1f	Merged revisions 73715 via svnmerge from svn+ssh://svn.python.org/python/branches/py3k ........ r73715 \| benjamin.peterson \| 2009-07-01 01:06:06 +0200 (Mi, 01 Jul 2009) \| 1 line convert old fail* assertions to assert* ........	2009-08-13 08:51:18 +00:00
Benjamin Peterson	c9c0f201fe	convert old fail* assertions to assert*	2009-06-30 23:06:06 +00:00
Martin v. Löwis	74b7e44d7d	Issue #6150 : Fix test_unicode on wide-unicode builds.	2009-06-01 04:23:07 +00:00
Eric Smith	41669caebc	Merged revisions 72848 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r72848 \| eric.smith \| 2009-05-23 09:56:13 -0400 (Sat, 23 May 2009) \| 1 line Issue 6089: str.format raises SystemError. ........	2009-05-23 14:23:22 +00:00
Martin v. Löwis	e0a2b72e61	Rename the surrogates error handler to surrogatepass.	2009-05-10 08:08:56 +00:00
Eric Smith	741191f17a	Issue #3382 . float 'F' formatting no longer maps to 'f'. This only affects nan and inf.	2009-05-06 13:08:15 +00:00
Antoine Pitrou	244651aa2f	Merged revisions 72283-72284 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r72283 \| antoine.pitrou \| 2009-05-04 20:32:32 +0200 (lun., 04 mai 2009) \| 4 lines Issue #4426: The UTF-7 decoder was too strict and didn't accept some legal sequences. Patch by Nick Barnes and Victor Stinner. ........ r72284 \| antoine.pitrou \| 2009-05-04 20:32:50 +0200 (lun., 04 mai 2009) \| 3 lines Add Nick Barnes to ACKS. ........	2009-05-04 18:56:13 +00:00
Martin v. Löwis	db12d454e6	Issue #3672 : Reject surrogates in utf-8 codec; add surrogates error handler.	2009-05-02 18:52:14 +00:00
Benjamin Peterson	09832740d1	fix isprintable() on space characters #5126	2009-03-26 17:15:46 +00:00
Eric Smith	8ec90443f5	Merged revisions 70364 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r70364 \| eric.smith \| 2009-03-14 07:57:26 -0400 (Sat, 14 Mar 2009) \| 17 lines Issue 5237, Allow auto-numbered replacement fields in str.format() strings. For simple uses for str.format(), this makes the typing easier. Hopefully this will help in the adoption of str.format(). For example: 'The {} is {}'.format('sky', 'blue') You can mix and match auto-numbering and named replacement fields: 'The {} is {color}'.format('sky', color='blue') But you can't mix and match auto-numbering and specified numbering: 'The {0} is {}'.format('sky', 'blue') ValueError: cannot switch from manual field specification to automatic field numbering Will port to 3.1. ........	2009-03-14 12:29:34 +00:00
Amaury Forgeot d'Arc	a083f1eb5f	The Unicode database was updated to 5.1, and some characters have become printable. Change the tests and use another code point.	2008-09-10 23:51:42 +00:00
Antoine Pitrou	b305aeb1dd	Merged revisions 66235 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r66235 \| antoine.pitrou \| 2008-09-06 00:04:54 +0200 (sam., 06 sept. 2008) \| 6 lines #3601: test_unicode.test_raiseMemError fails in UCS4 Reviewed by Benjamin Peterson on IRC. ........	2008-09-05 22:13:06 +00:00
Antoine Pitrou	3db3e87434	Merged revisions 65773 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r65773 \| antoine.pitrou \| 2008-08-17 19:01:49 +0200 (dim., 17 août 2008) \| 3 lines #3556: test_raiseMemError consumes an insane amount of memory ........	2008-08-17 17:06:51 +00:00
Amaury Forgeot d'Arc	7888d0803d	Merged revisions 65339-65340,65342 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r65339 \| amaury.forgeotdarc \| 2008-07-31 23:28:03 +0200 (jeu., 31 juil. 2008) \| 5 lines #3479: unichr(2**32) used to return u'\x00'. The argument was fetched in a long, but PyUnicode_FromOrdinal takes an int. (why doesn't gcc issue a truncation warning in this case?) ........ r65340 \| amaury.forgeotdarc \| 2008-07-31 23:35:03 +0200 (jeu., 31 juil. 2008) \| 2 lines Remove a dummy test that was checked in by mistake ........ r65342 \| amaury.forgeotdarc \| 2008-08-01 01:39:05 +0200 (ven., 01 août 2008) \| 8 lines Correct a crash when two successive unicode allocations fail with a MemoryError: the freelist contained half-initialized objects with freed pointers. The comment /* XXX UNREF/NEWREF interface should be more symmetrical */ was copied from tupleobject.c, and appears in some other places. I sign the petition. ........	2008-08-01 01:06:32 +00:00
Antoine Pitrou	5ffd9e9cc9	Merged revisions 65227 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r65227 \| antoine.pitrou \| 2008-07-25 19:45:59 +0200 (ven., 25 juil. 2008) \| 3 lines #2242: utf7 decoding crashes on bogus input on some Windows/MSVC versions ........	2008-07-25 18:05:24 +00:00
Eric Smith	b1ebcc6b0b	Forward port of r64958. Added '#' formatting to integers. This adds the 0b, 0o, or 0x prefix for bin, oct, hex. There's still one failing case, and I need to finish the docs. I hope to finish those today.	2008-07-15 13:02:41 +00:00
Amaury Forgeot d'Arc	a4db68622c	Issue #3280 : like chr() already does, the "%c" format now accepts the full unicode range even on "narrow Unicode" builds; the result is a pair of UTF-16 surrogates.	2008-07-04 21:26:43 +00:00
Georg Brandl	d52429fb49	Issue #3282 : str.isprintable() should return False for undefined Unicode characters.	2008-07-04 15:55:02 +00:00
Georg Brandl	559e5d7f4d	#2630 : Implement PEP 3138. The repr() of a string now contains printable Unicode characters unescaped. The new ascii() builtin can be used to get a repr() with only ASCII characters in it. PEP and patch were written by Atsuo Ishimoto.	2008-06-11 18:37:52 +00:00
Georg Brandl	a26f8ca668	Revert r63934 -- it was mixing two patches.	2008-06-04 13:01:30 +00:00
Georg Brandl	f954c4b9fb	Remove meaning of -ttt, but still accept -t option on cmdline for compatibility.	2008-06-04 11:41:32 +00:00
Benjamin Peterson	ee8712cda4	#2621 rename test.test_support to test.support	2008-05-20 21:35:26 +00:00
Benjamin Peterson	cd76c274c6	Added a test to make sure raw strings don't get unicode escapes	2008-04-05 15:09:30 +00:00
Benjamin Peterson	8dbca06b22	Reverted r62128 on Guido's orders	2008-04-05 14:49:54 +00:00
Benjamin Peterson	7afb766c5d	#2541 Allow unicode escapes in raw strings	2008-04-03 16:27:27 +00:00
Christian Heimes	fe337bfd0d	Merged revisions 61724-61725,61731-61735,61737,61739,61741,61743-61744,61753,61761,61765-61767,61769,61773,61776-61778,61780-61783,61788,61793,61796,61807,61813 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ................ r61724 \| martin.v.loewis \| 2008-03-22 01:01:12 +0100 (Sat, 22 Mar 2008) \| 49 lines Merged revisions 61602-61723 via svnmerge from svn+ssh://pythondev@svn.python.org/sandbox/trunk/2to3/lib2to3 ........ r61626 \| david.wolever \| 2008-03-19 17:19:16 +0100 (Mi, 19 Mär 2008) \| 1 line Added fixer for implicit local imports. See #2414. ........ r61628 \| david.wolever \| 2008-03-19 17:57:43 +0100 (Mi, 19 Mär 2008) \| 1 line Added a class for tests which should not run if a particular import is found. ........ r61629 \| collin.winter \| 2008-03-19 17:58:19 +0100 (Mi, 19 Mär 2008) \| 1 line Two more relative import fixes in pgen2. ........ r61635 \| david.wolever \| 2008-03-19 20:16:03 +0100 (Mi, 19 Mär 2008) \| 1 line Fixed print fixer so it will do the Right Thing when it encounters __future__.print_function. 2to3 gets upset, though, so the tests have been commented out. ........ r61637 \| david.wolever \| 2008-03-19 21:37:17 +0100 (Mi, 19 Mär 2008) \| 3 lines Added a fixer for itertools imports (from itertools import imap, ifilterfalse --> from itertools import filterfalse) ........ r61645 \| david.wolever \| 2008-03-19 23:22:35 +0100 (Mi, 19 Mär 2008) \| 1 line SVN is happier when you add the files you create... -_-' ........ r61654 \| david.wolever \| 2008-03-20 01:09:56 +0100 (Do, 20 Mär 2008) \| 1 line Added an explicit sort order to fixers -- fixes problems like #2427 ........ r61664 \| david.wolever \| 2008-03-20 04:32:40 +0100 (Do, 20 Mär 2008) \| 3 lines Fixes #2428 -- comments are no longer eaten by __future__ fixer. ........ 
r61673 \| david.wolever \| 2008-03-20 17:22:40 +0100 (Do, 20 Mär 2008) \| 1 line Added 2to3 node pretty-printer ........ r61679 \| david.wolever \| 2008-03-20 20:50:42 +0100 (Do, 20 Mär 2008) \| 1 line Made node printing a little bit prettier ........ r61723 \| martin.v.loewis \| 2008-03-22 00:59:27 +0100 (Sa, 22 Mär 2008) \| 2 lines Fix whitespace. ........ ................ r61725 \| martin.v.loewis \| 2008-03-22 01:02:41 +0100 (Sat, 22 Mar 2008) \| 2 lines Install lib2to3. ................ r61731 \| facundo.batista \| 2008-03-22 03:45:37 +0100 (Sat, 22 Mar 2008) \| 4 lines Small fix that complicated the test actually when that test failed. ................ r61732 \| alexandre.vassalotti \| 2008-03-22 05:08:44 +0100 (Sat, 22 Mar 2008) \| 2 lines Added warning for the removal of 'hotshot' in Py3k. ................ r61733 \| georg.brandl \| 2008-03-22 11:07:29 +0100 (Sat, 22 Mar 2008) \| 4 lines #1918: document that weak references to an object are cleared before the object's __del__ is called, to ensure that the weak reference callback (if any) finds the object healthy. ................ r61734 \| georg.brandl \| 2008-03-22 11:56:23 +0100 (Sat, 22 Mar 2008) \| 2 lines Activate the Sphinx doctest extension and convert howto/functional to use it. ................ r61735 \| georg.brandl \| 2008-03-22 11:58:38 +0100 (Sat, 22 Mar 2008) \| 2 lines Allow giving source names on the cmdline. ................ r61737 \| georg.brandl \| 2008-03-22 12:00:48 +0100 (Sat, 22 Mar 2008) \| 2 lines Fixup this HOWTO's doctest blocks so that they can be run with sphinx' doctest builder. ................ r61739 \| georg.brandl \| 2008-03-22 12:47:10 +0100 (Sat, 22 Mar 2008) \| 2 lines Test decimal.rst doctests as far as possible with sphinx doctest. ................ r61741 \| georg.brandl \| 2008-03-22 13:04:26 +0100 (Sat, 22 Mar 2008) \| 2 lines Make doctests in re docs usable with sphinx' doctest. ................ 
r61743 \| georg.brandl \| 2008-03-22 13:59:37 +0100 (Sat, 22 Mar 2008) \| 2 lines Make more doctests in pprint docs testable. ................ r61744 \| georg.brandl \| 2008-03-22 14:07:06 +0100 (Sat, 22 Mar 2008) \| 2 lines No need to specify explicit "doctest_block" anymore. ................ r61753 \| georg.brandl \| 2008-03-22 21:08:43 +0100 (Sat, 22 Mar 2008) \| 2 lines Fix-up syntax problems. ................ r61761 \| georg.brandl \| 2008-03-22 22:06:20 +0100 (Sat, 22 Mar 2008) \| 4 lines Make collections' doctests executable. (The <BLANKLINE>s will be stripped from presentation output.) ................ r61765 \| georg.brandl \| 2008-03-22 22:21:57 +0100 (Sat, 22 Mar 2008) \| 2 lines Test doctests in datetime docs. ................ r61766 \| georg.brandl \| 2008-03-22 22:26:44 +0100 (Sat, 22 Mar 2008) \| 2 lines Test doctests in operator docs. ................ r61767 \| georg.brandl \| 2008-03-22 22:38:33 +0100 (Sat, 22 Mar 2008) \| 2 lines Enable doctests in functions.rst. Already found two errors :) ................ r61769 \| georg.brandl \| 2008-03-22 23:04:10 +0100 (Sat, 22 Mar 2008) \| 3 lines Enable doctest running for several other documents. We have now over 640 doctests that are run with "make doctest". ................ r61773 \| raymond.hettinger \| 2008-03-23 01:55:46 +0100 (Sun, 23 Mar 2008) \| 1 line Simplify demo code. ................ r61776 \| neal.norwitz \| 2008-03-23 04:43:33 +0100 (Sun, 23 Mar 2008) \| 7 lines Try to make this test a little more robust and not fail with: timeout (10.0025) is more than 2 seconds more than expected (0.001) I'm assuming this problem is caused by DNS lookup. This change does a DNS lookup of the hostname before trying to connect, so the time is not included. ................ r61777 \| neal.norwitz \| 2008-03-23 05:08:30 +0100 (Sun, 23 Mar 2008) \| 1 line Speed up the test by avoiding socket timeouts. ................ 
r61778 \| neal.norwitz \| 2008-03-23 05:43:09 +0100 (Sun, 23 Mar 2008) \| 1 line Skip the epoll test if epoll() does not work ................ r61780 \| neal.norwitz \| 2008-03-23 06:47:20 +0100 (Sun, 23 Mar 2008) \| 1 line Suppress failure (to avoid a flaky test) if we cannot connect to svn.python.org ................ r61781 \| neal.norwitz \| 2008-03-23 07:13:25 +0100 (Sun, 23 Mar 2008) \| 4 lines Move itertools before future_builtins since the latter depends on the former. From a clean build importing future_builtins would fail since itertools wasn't built yet. ................ r61782 \| neal.norwitz \| 2008-03-23 07:16:04 +0100 (Sun, 23 Mar 2008) \| 1 line Try to prevent the alarm going off early in tearDown ................ r61783 \| neal.norwitz \| 2008-03-23 07:19:57 +0100 (Sun, 23 Mar 2008) \| 4 lines Remove compiler warnings (on Alpha at least) about using chars as array subscripts. Using chars are dangerous b/c they are signed on some platforms and unsigned on others. ................ r61788 \| georg.brandl \| 2008-03-23 09:05:30 +0100 (Sun, 23 Mar 2008) \| 2 lines Make the doctests presentation-friendlier. ................ r61793 \| amaury.forgeotdarc \| 2008-03-23 10:55:29 +0100 (Sun, 23 Mar 2008) \| 4 lines #1477: ur'\U0010FFFF' raised in narrow unicode builds. Corrected the raw-unicode-escape codec to use UTF-16 surrogates in this case, just like the unicode-escape codec. ................ r61796 \| raymond.hettinger \| 2008-03-23 14:32:32 +0100 (Sun, 23 Mar 2008) \| 1 line Issue 1681432: Add triangular distribution the random module. ................ r61807 \| raymond.hettinger \| 2008-03-23 20:37:53 +0100 (Sun, 23 Mar 2008) \| 4 lines Adopt Nick's suggestion for useful default arguments. Clean-up floating point issues by adding true division and float constants. ................ 
r61813 \| gregory.p.smith \| 2008-03-23 22:04:43 +0100 (Sun, 23 Mar 2008) \| 6 lines Fix gzip to deal with CRC's being signed values in Python 2.x properly and to read 32bit values as unsigned to start with rather than applying signedness fixups all over the place afterwards. This hopefully fixes the test_tarfile failure on the alpha/tru64 buildbot. ................	2008-03-23 21:54:12 +00:00
Christian Heimes	a37d4c693a	Removed PyInt_GetMax and sys.maxint I replaced sys.maxint with sys.maxsize in Lib/*.py. Does anybody see a problem with the change on Win 64bit platforms? Win 64's long is just 32bit but the sys.maxsize is now 2**63-1 on every 64bit platform. Also added docs for sys.maxsize.	2007-12-04 23:02:19 +00:00
Georg Brandl	ceee0773d2	#1496 : revert str.translate() to the old version, and add str.maketrans() to make a table in a more comfortable way.	2007-11-27 23:48:05 +00:00
Guido van Rossum	254348e201	Rename buffer -> bytearray.	2007-11-21 19:29:53 +00:00
Guido van Rossum	98297ee781	Merging the py3k-pep3137 branch back into the py3k branch. No detailed change log; just check out the change log for the py3k-pep3137 branch. The most obvious changes: - str8 renamed to bytes (PyString at the C level); - bytes renamed to buffer (PyBytes at the C level); - PyString and PyUnicode are no longer compatible. I.e. we now have an immutable bytes type and a mutable bytes type. The behavior of PyString was modified quite a bit, to make it more bytes-like. Some changes are still on the to-do list.	2007-11-06 21:34:58 +00:00
Georg Brandl	bd1c68c94f	Patch #1303 : Adapt str8 constructor to bytes (now buffer) one.	2007-10-24 18:55:37 +00:00
Georg Brandl	94c2c75b5e	Patch #1071 : Improve unicode.translate() so that you can pass unicode characters as mapping keys and invalid mapping keys are recognized and raise an error.	2007-10-23 06:52:59 +00:00
Brett Cannon	4043001f5d	Make str/str8 comparisons return True/False for !=/==. Code that has been returning str8 becomes much more apparent thanks to this (e.g., struct module returning str8 for all string-related formats or sqlite3 passing in str8 instances when converting objects that had a __conform__ method). One also has to watch out in C code when making a key from char * using PyString in the C code but a str instance in Python code as that will no longer compare equal. Once str8 gains a constructor like the current bytes type then test_modulefinder needs a cleanup as the fix is a little messy in that file. Thanks goes to Thomas Lee for writing the patch for the change giving an initial run-down of why most of the tests were failing.	2007-10-22 20:24:51 +00:00
Guido van Rossum	bae07c9baf	Breaking ground for PEP 3137 implementation: Get rid of buffer(). Use memoryview() in its place where possible. In a few places, do things a bit different, because memoryview() can't slice (yet).	2007-10-08 02:46:15 +00:00
Guido van Rossum	f1044293fa	Patch # 1145 by Thomas Lee: str.join(...) now applies str() to the sequence elements if they're not strings already, except for bytes, which still raise TypeError (for the same reasons why ""==b"" raises it).	2007-09-27 18:01:22 +00:00
Eric Smith	11529195ca	Changed some ValueError's to KeyError and IndexError. Corrected code for invalid conversion specifier. Added tests to verify. Modified string.Formatter to correctly expand format_spec's, and added a limit to recursion depth. Added _vformat() method to support both of these.	2007-09-04 23:04:22 +00:00
Eric Smith	4cb4e4e882	Fix segfault discovered by Ron Adam. Not checking for terminating right bracket in "'{0[}'.format(())". Fixed, and tests added.	2007-09-03 08:40:29 +00:00
Eric Smith	37f10386f1	Changed to use 'U' argument to PyArg_ParseTuple, instead of manually checking for unicode objects.	2007-09-01 10:56:01 +00:00
Eric Smith	185e30cdf3	Added format tests. Fixed bug in alignment of negative numbers. Whitespace normalization.	2007-08-30 22:23:08 +00:00
Eric Smith	739e2ad64b	Additional test for formatting code.	2007-08-27 19:07:22 +00:00
Guido van Rossum	9c62772d5e	Changes in anticipation of stricter str vs. bytes enforcement.	2007-08-27 18:31:48 +00:00
Guido van Rossum	39478e8528	Changes in anticipation of stricter str vs. bytes enforcement.	2007-08-27 17:23:59 +00:00
Eric Smith	7ade6485ab	PEP 3101: Completed string.Formatter class. Reimplemented field_name to object transformation.	2007-08-26 22:27:13 +00:00
Eric Smith	8c66326368	Implementation of PEP 3101, Advanced String Formatting. Known issues: The string.Formatter class, as discussed in the PEP, is incomplete. Error handling needs to conform to the PEP. Need to fix this warning that I introduced in Python/formatter_unicode.c: Objects/stringlib/unicodedefs.h:26: warning: `STRINGLIB_CMP' defined but not used Need to make sure sign formatting is correct, more tests needed. Need to remove '()' sign formatting, left over from an earlier version of the PEP.	2007-08-25 02:26:07 +00:00
Martin v. Löwis	47383403a0	Implement PEP 3131. Add isidentifier to str.	2007-08-15 07:32:56 +00:00
Guido van Rossum	36e0a92442	Merged revisions 56443-56466 via svnmerge from svn+ssh://pythondev@svn.python.org/python/branches/p3yk ................ r56454 \| kurt.kaiser \| 2007-07-18 22:26:14 -0700 (Wed, 18 Jul 2007) \| 2 lines Make relative imports explicit for py3k ................ r56455 \| kurt.kaiser \| 2007-07-18 23:12:15 -0700 (Wed, 18 Jul 2007) \| 2 lines Was modifying dict during iteration. ................ r56457 \| guido.van.rossum \| 2007-07-19 07:33:19 -0700 (Thu, 19 Jul 2007) \| 2 lines Fix failing test. ................ r56466 \| guido.van.rossum \| 2007-07-19 20:58:16 -0700 (Thu, 19 Jul 2007) \| 35 lines Merged revisions 56413-56465 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r56439 \| georg.brandl \| 2007-07-17 23:37:55 -0700 (Tue, 17 Jul 2007) \| 2 lines Use "Unix" as platform name, not "UNIX". ........ r56441 \| guido.van.rossum \| 2007-07-18 10:19:14 -0700 (Wed, 18 Jul 2007) \| 3 lines SF patch# 1755885 by Kurt Kaiser: show location of Unicode escape errors. (Slightly tweaked for style and refcounts.) ........ r56444 \| kurt.kaiser \| 2007-07-18 12:58:42 -0700 (Wed, 18 Jul 2007) \| 2 lines Fix failing unicode test caused by change to ast.c at r56441 ........ r56451 \| georg.brandl \| 2007-07-18 15:36:53 -0700 (Wed, 18 Jul 2007) \| 2 lines Add description for wave.setcomptype() values ........ r56456 \| walter.doerwald \| 2007-07-19 06:04:38 -0700 (Thu, 19 Jul 2007) \| 3 lines Document that codecs.lookup() returns a CodecInfo object. (fixes SF bug #1754453). ........ r56463 \| facundo.batista \| 2007-07-19 16:57:38 -0700 (Thu, 19 Jul 2007) \| 6 lines Added a select.select call in the test server loop to make sure the socket is ready to be read from before attempting a read (this prevents an error 10035 on some Windows platforms). [GSoC - Alan McIntyre] ........ ................	2007-07-20 04:05:57 +00:00
Guido van Rossum	697a84b16c	Make test_unicode pass after the lexer was fixed to turn unicode errors into syntax errors.	2007-07-18 21:02:47 +00:00

365 Commits