cpython

Commit Graph

Author	SHA1	Message	Date
Antoine Pitrou	187ac1bda4	#3601 : test_unicode.test_raiseMemError fails in UCS4 Reviewed by Benjamin Peterson on IRC.	2008-09-05 22:04:54 +00:00
Antoine Pitrou	fd7c43e7be	#3556 : test_raiseMemError consumes an insane amount of memory	2008-08-17 17:01:49 +00:00
Amaury Forgeot d'Arc	06847b13ca	Correct a crash when two successive unicode allocations fail with a MemoryError: the freelist contained half-initialized objects with freed pointers. The comment /* XXX UNREF/NEWREF interface should be more symmetrical */ was copied from tupleobject.c, and appears in some other places. I sign the petition.	2008-07-31 23:39:05 +00:00
Antoine Pitrou	4982d5d04a	#2242 : utf7 decoding crashes on bogus input on some Windows/MSVC versions	2008-07-25 17:45:59 +00:00
Amaury Forgeot d'Arc	9a0d3462fc	#1477 : ur'\U0010FFFF' raised in narrow unicode builds. Corrected the raw-unicode-escape codec to use UTF-16 surrogates in this case, just like the unicode-escape codec.	2008-03-23 09:55:29 +00:00
Christian Heimes	c5f05e45cf	Patch #2167 from calvin: Remove unused imports	2008-02-23 17:40:11 +00:00
Eric Smith	bc32fee029	Added code to correct combining str and unicode in ''.format(). Added test case.	2008-02-18 18:02:34 +00:00
Eric Smith	a9f7d62480	Backport of PEP 3101, Advanced String Formatting, from py3k. Highlights: - Adding PyObject_Format. - Adding string.Format class. - Adding __format__ for str, unicode, int, long, float, datetime. - Adding builtin format. - Adding ''.format and u''.format. - str/unicode fixups for formatters. The files in Objects/stringlib that implement PEP 3101 (stringdefs.h, unicodedefs.h, formatter.h, string_format.h) are identical in trunk and py3k. Any changes from here on should be made to trunk, and changes will propogate to py3k).	2008-02-17 19:46:49 +00:00
Kurt B. Kaiser	db98f3632a	Fix failing unicode test caused by change to ast.c at r56441	2007-07-18 19:58:42 +00:00
Neal Norwitz	ba965deea8	Prevent these tests from running on Win64 since they don\'t apply there either	2007-06-11 02:14:39 +00:00
Neal Norwitz	7dbd2a3720	Prevent expandtabs() on string and unicode objects from causing a segfault when a large width is passed on 32-bit platforms. Found by Google. It would be good for people to review this especially carefully and verify I don't have an off by one error and there is no other way to cause overflow.	2007-06-09 03:36:34 +00:00
Collin Winter	c2898c5a67	Standardize on test.test_support.run_unittest() (as opposed to a mix of run_unittest() and run_suite()). Also, add functionality to run_unittest() that admits usage of unittest.TestLoader.loadTestsFromModule().	2007-04-25 17:29:52 +00:00
Neal Norwitz	17753ecbfa	Patch #1541585 : fix buffer overrun when performing repr() on a unicode string in a build with wide unicode (UCS-4) support. This code could be improved, so add an XXX comment.	2006-08-21 22:21:19 +00:00
Tim Peters	4511a713d5	Whitespace normalization.	2006-05-03 04:46:14 +00:00
Georg Brandl	de9b624fb9	Bug #1473625 : stop cPickle making float dumps locale dependent in protocol 0. On the way, add a decorator to test_support to facilitate running single test functions in different locales with automatic cleanup.	2006-04-30 11:13:56 +00:00
Anthony Baxter	67b6d516ce	Fixed bug #1459029 - unicode reprs were double-escaped.	2006-03-30 10:54:07 +00:00
Georg Brandl	da6b107745	Checkin the test of patch #1400181 .	2006-01-20 17:48:54 +00:00
Hye-Shik Chang	835b243c71	Bug #1379994 : Fix *unicode_escape codecs to encode r'\' as r'\\' just like string codecs.	2005-12-17 04:38:31 +00:00
Neal Norwitz	430f68b447	Move registration of the codec search function to the module scope so it is only executed once. Otherwise the same search function is repeated added to the codec search path when regrtest is run with -R and leaks are reported.	2005-11-24 22:00:56 +00:00
Neil Schemenauer	cf52c07843	Change the %s format specifier for str objects so that it returns a unicode instance if the argument is not an instance of basestring and calling __str__ on the argument returns a unicode instance.	2005-08-12 17:34:58 +00:00
Brett Cannon	c3647ac93e	Make subclasses of int, long, complex, float, and unicode perform type conversion using the proper magic slot (e.g., __int__()). Also move conversion code out of PyNumber_() functions in the C API into the nb_ function. Applied patch #1109424. Thanks Walter Doewald.	2005-04-26 03:45:26 +00:00
Walter Dörwald	57d88e5abd	Move test_bug1001011() to string_tests.MixinStrUnicodeTest so that it can be used for str and unicode. Drop the test for "".join([s]) is s because this is an implementation detail (and doesn't work for unicode)	2004-08-26 16:53:04 +00:00
Hye-Shik Chang	e9ddfbb412	SF #989185 : Drop unicode.iswide() and unicode.width() and add unicodedata.east_asian_width(). You can still implement your own simple width() function using it like this: def width(u): w = 0 for c in unicodedata.normalize('NFC', u): cwidth = unicodedata.east_asian_width(c) if cwidth in ('W', 'F'): w += 2 else: w += 1 return w	2004-08-04 07:38:35 +00:00
Marc-André Lemburg	d25c650461	Let u'%s' % obj try obj.__unicode__() first and fallback to obj.__str__().	2004-07-23 16:13:25 +00:00
Hye-Shik Chang	3c145449da	Reuse width/iswide tests from strings_test. (Suggested by Walter DÃ¶rwald)	2004-06-04 04:24:54 +00:00
Hye-Shik Chang	7bd860655f	Fix typo.	2004-06-04 03:19:17 +00:00
Hye-Shik Chang	974ed7cfa5	- SF #962502 : Add two more methods for unicode type; width() and iswide() for east asian width manipulation. (Inspired by David Goodger, Reviewed by Martin v. Loewis) - Move _PyUnicode_TypeRecord.flags to the end of the struct so that no padding is added for UCS-4 builds. (Suggested by Martin v. Loewis)	2004-06-02 16:49:17 +00:00
Walter Dörwald	cd736e71a3	Fix reallocation bug in unicode.translate(): The code was comparing characters instead of character pointers to determine space requirements.	2004-02-05 17:36:00 +00:00
Jeremy Hylton	504de6bd2c	Fix for SF bug [ 817156 ] invalid \U escape gives 0=length unistr.	2003-10-06 05:08:26 +00:00
Martin v. Löwis	0d8e16c7ad	Support trailing dots in DNS names. Fixes #782510 . Will backport to 2.3.	2003-08-05 06:19:47 +00:00
Martin v. Löwis	9a3a9f7791	Consider \U-escapes in raw-unicode-escape. Fixes #444514 .	2003-05-18 12:31:09 +00:00
Walter Dörwald	21d3a32b99	Combine the functionality of test_support.run_unittest() and test_support.run_classtests() into run_unittest() and use it wherever possible. Also don't use "from test.test_support import ...", but "from test import test_support" in a few spots. From SF patch #662807.	2003-05-01 17:45:56 +00:00
Walter Dörwald	44f527fea4	Change formatchar(), so that u"%c" % 0xffffffff now raises an OverflowError instead of a TypeError to be consistent with "%c" % 256. See SF patch #710127.	2003-04-02 16:37:24 +00:00
Walter Dörwald	56fbcb525b	Remove duplicate test.	2003-03-31 18:18:41 +00:00
Walter Dörwald	43440a621e	Fix PyString_Format() so that '%c' % u'a' returns u'a' instead of raising a TypeError. (From SF patch #710127) Add tests to verify this is fixed. Add various tests for '%c' % int.	2003-03-31 18:07:50 +00:00
Walter Dörwald	0fd583ce4d	Port all string tests to PyUnit and share as much tests between str, unicode, UserString and the string module as possible. This increases code coverage in stringobject.c from 83% to 86% and should help keep the string classes in sync in the future. From SF patch #662807	2003-02-21 12:53:50 +00:00
Walter Dörwald	4f046e2e21	Add a few tests to test_count() to increase coverage in Object/unicodeobject.c::unicode_count().	2003-02-10 17:51:03 +00:00
Walter Dörwald	74640247d4	Fix copy&paste error: call title instead of count	2003-02-10 17:44:16 +00:00
Walter Dörwald	28256f276e	Port test_unicode.py to PyUnit and add tests for error cases and a few methods. This increases code coverage in Objects/unicodeobject.c from 81% to 85%. (From SF patch #662807)	2003-01-19 16:59:20 +00:00
Walter Dörwald	395bb49555	Add a test that exercises the error handling part of PyUnicode_EncodeDecimal().	2003-01-08 23:02:34 +00:00
Marc-André Lemburg	79f57833f3	Patch for bug #659709 : bogus computation of float length Python 2.2.x backport candidate. (This bug has been around since Python 1.6.)	2002-12-29 19:44:06 +00:00
Neil Schemenauer	ab9e4b76c2	check for unicode.__mod__	2002-11-18 16:11:34 +00:00
Marc-André Lemburg	9cd87aaa54	Fix for bug #626172 : crash using unicode latin1 single char Python 2.2.3 candidate.	2002-10-23 09:02:46 +00:00
Martin v. Löwis	1ce4ae3268	Don't test whether surrogate sequences round-trip in UTF-8. 2.2.2 candidate.	2002-09-14 09:19:53 +00:00
Martin v. Löwis	766e300eaa	Use integer above sys.maxunicode for range test. Fixes #608884 . 2.2.2 candidate.	2002-09-14 09:10:04 +00:00
Walter Dörwald	5c1ee17742	Change the unicode.translate docstring to document that Unicode strings (with arbitrary length) are allowed as entries in the unicode.translate mapping. Add a test case for multicharacter replacements. (Multicharacter replacements were enabled by the PEP 293 patch)	2002-09-04 20:31:32 +00:00
Guido van Rossum	2023c9b84a	Fix SF bug 599128, submitted by Inyeol Lee: .replace() would do the wrong thing for a unicode subclass when there were zero string replacements. The example given in the SF bug report was only one way to trigger this; replacing a string of length >= 2 that's not found is another. The code would actually write outside allocated memory if replacement string was longer than the search string. (I wonder how many more of these are lurking? The unicode code base is full of wonders.) Bugfix candidate; this same bug is present in 2.2.1.	2002-08-23 18:50:21 +00:00
Guido van Rossum	8b1a6d694f	Code by Inyeol Lee, submitted to SF bug 595350, to implement the string/unicode method .replace() with a zero-lengt first argument. Inyeol contributed tests for this too.	2002-08-23 18:21:28 +00:00
Guido van Rossum	76afbd9aa4	Fix some endcase bugs in unicode rfind()/rindex() and endswith(). These were reported and fixed by Inyeol Lee in SF bug 595350. The endswith() bug was already fixed in 2.3, but this adds some more test cases.	2002-08-20 17:29:29 +00:00
Marc-André Lemburg	cc8764ca9d	Add C API PyUnicode_FromOrdinal() which exposes unichr() at C level. u'%c' will now raise a ValueError in case the argument is an integer outside the valid range of Unicode code point ordinals. Closes SF bug #593581.	2002-08-11 12:23:04 +00:00

1 2 3

114 Commits