cpython

Commit Graph

Author	SHA1	Message	Date
Neal Norwitz	11c5275c61	Backport 55873: Prevent these tests from running on Win64 since they don't apply there either	2007-06-11 04:31:25 +00:00
Neal Norwitz	66e64e2b6a	Prevent expandtabs() on string and unicode objects from causing a segfault when a large width is passed on 32-bit platforms. Found by Google. It would be good for people to review this especially carefully and verify I don't have an off by one error and there is no other way to cause overflow.	2007-06-09 04:06:30 +00:00
Neal Norwitz	19c35bba5d	- Patch #1541585 : fix buffer overrun when performing repr() on a unicode string in a build with wide unicode (UCS-4) support. I will forward port to 2.6. Can someone backport to 2.4?	2006-08-21 22:13:11 +00:00
Tim Peters	4511a713d5	Whitespace normalization.	2006-05-03 04:46:14 +00:00
Georg Brandl	de9b624fb9	Bug #1473625 : stop cPickle making float dumps locale dependent in protocol 0. On the way, add a decorator to test_support to facilitate running single test functions in different locales with automatic cleanup.	2006-04-30 11:13:56 +00:00
Anthony Baxter	67b6d516ce	Fixed bug #1459029 - unicode reprs were double-escaped.	2006-03-30 10:54:07 +00:00
Georg Brandl	da6b107745	Checkin the test of patch #1400181 .	2006-01-20 17:48:54 +00:00
Hye-Shik Chang	835b243c71	Bug #1379994 : Fix *unicode_escape codecs to encode r'\' as r'\\' just like string codecs.	2005-12-17 04:38:31 +00:00
Neal Norwitz	430f68b447	Move registration of the codec search function to the module scope so it is only executed once. Otherwise the same search function is repeated added to the codec search path when regrtest is run with -R and leaks are reported.	2005-11-24 22:00:56 +00:00
Neil Schemenauer	cf52c07843	Change the %s format specifier for str objects so that it returns a unicode instance if the argument is not an instance of basestring and calling __str__ on the argument returns a unicode instance.	2005-08-12 17:34:58 +00:00
Brett Cannon	c3647ac93e	Make subclasses of int, long, complex, float, and unicode perform type conversion using the proper magic slot (e.g., __int__()). Also move conversion code out of PyNumber_() functions in the C API into the nb_ function. Applied patch #1109424. Thanks Walter Doewald.	2005-04-26 03:45:26 +00:00
Walter Dörwald	57d88e5abd	Move test_bug1001011() to string_tests.MixinStrUnicodeTest so that it can be used for str and unicode. Drop the test for "".join([s]) is s because this is an implementation detail (and doesn't work for unicode)	2004-08-26 16:53:04 +00:00
Hye-Shik Chang	e9ddfbb412	SF #989185 : Drop unicode.iswide() and unicode.width() and add unicodedata.east_asian_width(). You can still implement your own simple width() function using it like this: def width(u): w = 0 for c in unicodedata.normalize('NFC', u): cwidth = unicodedata.east_asian_width(c) if cwidth in ('W', 'F'): w += 2 else: w += 1 return w	2004-08-04 07:38:35 +00:00
Marc-André Lemburg	d25c650461	Let u'%s' % obj try obj.__unicode__() first and fallback to obj.__str__().	2004-07-23 16:13:25 +00:00
Hye-Shik Chang	3c145449da	Reuse width/iswide tests from strings_test. (Suggested by Walter DÃ¶rwald)	2004-06-04 04:24:54 +00:00
Hye-Shik Chang	7bd860655f	Fix typo.	2004-06-04 03:19:17 +00:00
Hye-Shik Chang	974ed7cfa5	- SF #962502 : Add two more methods for unicode type; width() and iswide() for east asian width manipulation. (Inspired by David Goodger, Reviewed by Martin v. Loewis) - Move _PyUnicode_TypeRecord.flags to the end of the struct so that no padding is added for UCS-4 builds. (Suggested by Martin v. Loewis)	2004-06-02 16:49:17 +00:00
Walter Dörwald	cd736e71a3	Fix reallocation bug in unicode.translate(): The code was comparing characters instead of character pointers to determine space requirements.	2004-02-05 17:36:00 +00:00
Jeremy Hylton	504de6bd2c	Fix for SF bug [ 817156 ] invalid \U escape gives 0=length unistr.	2003-10-06 05:08:26 +00:00
Martin v. Löwis	0d8e16c7ad	Support trailing dots in DNS names. Fixes #782510 . Will backport to 2.3.	2003-08-05 06:19:47 +00:00
Martin v. Löwis	9a3a9f7791	Consider \U-escapes in raw-unicode-escape. Fixes #444514 .	2003-05-18 12:31:09 +00:00
Walter Dörwald	21d3a32b99	Combine the functionality of test_support.run_unittest() and test_support.run_classtests() into run_unittest() and use it wherever possible. Also don't use "from test.test_support import ...", but "from test import test_support" in a few spots. From SF patch #662807.	2003-05-01 17:45:56 +00:00
Walter Dörwald	44f527fea4	Change formatchar(), so that u"%c" % 0xffffffff now raises an OverflowError instead of a TypeError to be consistent with "%c" % 256. See SF patch #710127.	2003-04-02 16:37:24 +00:00
Walter Dörwald	56fbcb525b	Remove duplicate test.	2003-03-31 18:18:41 +00:00
Walter Dörwald	43440a621e	Fix PyString_Format() so that '%c' % u'a' returns u'a' instead of raising a TypeError. (From SF patch #710127) Add tests to verify this is fixed. Add various tests for '%c' % int.	2003-03-31 18:07:50 +00:00
Walter Dörwald	0fd583ce4d	Port all string tests to PyUnit and share as much tests between str, unicode, UserString and the string module as possible. This increases code coverage in stringobject.c from 83% to 86% and should help keep the string classes in sync in the future. From SF patch #662807	2003-02-21 12:53:50 +00:00
Walter Dörwald	4f046e2e21	Add a few tests to test_count() to increase coverage in Object/unicodeobject.c::unicode_count().	2003-02-10 17:51:03 +00:00
Walter Dörwald	74640247d4	Fix copy&paste error: call title instead of count	2003-02-10 17:44:16 +00:00
Walter Dörwald	28256f276e	Port test_unicode.py to PyUnit and add tests for error cases and a few methods. This increases code coverage in Objects/unicodeobject.c from 81% to 85%. (From SF patch #662807)	2003-01-19 16:59:20 +00:00
Walter Dörwald	395bb49555	Add a test that exercises the error handling part of PyUnicode_EncodeDecimal().	2003-01-08 23:02:34 +00:00
Marc-André Lemburg	79f57833f3	Patch for bug #659709 : bogus computation of float length Python 2.2.x backport candidate. (This bug has been around since Python 1.6.)	2002-12-29 19:44:06 +00:00
Neil Schemenauer	ab9e4b76c2	check for unicode.__mod__	2002-11-18 16:11:34 +00:00
Marc-André Lemburg	9cd87aaa54	Fix for bug #626172 : crash using unicode latin1 single char Python 2.2.3 candidate.	2002-10-23 09:02:46 +00:00
Martin v. Löwis	1ce4ae3268	Don't test whether surrogate sequences round-trip in UTF-8. 2.2.2 candidate.	2002-09-14 09:19:53 +00:00
Martin v. Löwis	766e300eaa	Use integer above sys.maxunicode for range test. Fixes #608884 . 2.2.2 candidate.	2002-09-14 09:10:04 +00:00
Walter Dörwald	5c1ee17742	Change the unicode.translate docstring to document that Unicode strings (with arbitrary length) are allowed as entries in the unicode.translate mapping. Add a test case for multicharacter replacements. (Multicharacter replacements were enabled by the PEP 293 patch)	2002-09-04 20:31:32 +00:00
Guido van Rossum	2023c9b84a	Fix SF bug 599128, submitted by Inyeol Lee: .replace() would do the wrong thing for a unicode subclass when there were zero string replacements. The example given in the SF bug report was only one way to trigger this; replacing a string of length >= 2 that's not found is another. The code would actually write outside allocated memory if replacement string was longer than the search string. (I wonder how many more of these are lurking? The unicode code base is full of wonders.) Bugfix candidate; this same bug is present in 2.2.1.	2002-08-23 18:50:21 +00:00
Guido van Rossum	8b1a6d694f	Code by Inyeol Lee, submitted to SF bug 595350, to implement the string/unicode method .replace() with a zero-lengt first argument. Inyeol contributed tests for this too.	2002-08-23 18:21:28 +00:00
Guido van Rossum	76afbd9aa4	Fix some endcase bugs in unicode rfind()/rindex() and endswith(). These were reported and fixed by Inyeol Lee in SF bug 595350. The endswith() bug was already fixed in 2.3, but this adds some more test cases.	2002-08-20 17:29:29 +00:00
Marc-André Lemburg	cc8764ca9d	Add C API PyUnicode_FromOrdinal() which exposes unichr() at C level. u'%c' will now raise a ValueError in case the argument is an integer outside the valid range of Unicode code point ordinals. Closes SF bug #593581.	2002-08-11 12:23:04 +00:00
Guido van Rossum	f36921c4b0	Unicode replace() method with empty pattern argument should fail, like it does for 8-bit strings.	2002-08-09 15:36:48 +00:00
Raymond Hettinger	ca84d65ca7	Expanded the unittests for the new width sensitive PyUnicode_Contains().	2002-08-06 23:08:51 +00:00
Barry Warsaw	e06741704e	Added a test for PyUnicode_Contains() taking into account the width of Py_UNICODE.	2002-08-06 19:03:56 +00:00
Barry Warsaw	817918cc3c	Committing patch #591250 which provides "str1 in str2" when str1 is a string of longer than 1 character.	2002-08-06 16:58:21 +00:00
Martin v. Löwis	a729daf2e4	Add encoding declaration.	2002-08-04 17:28:33 +00:00
Barry Warsaw	04f357cffe	Get rid of relative imports in all unittests. Now anything that imports e.g. test_support must do so using an absolute package name such as "import test.test_support" or "from test import test_support". This also updates the README in Lib/test, and gets rid of the duplicate data dirctory in Lib/test/data (replaced by Lib/email/test/data). Now Tim and Jack can have at it. :)	2002-07-23 19:04:11 +00:00
Tim Peters	8ac1495a6a	Whitespace normalization.	2002-05-23 15:15:30 +00:00
Walter Dörwald	de02bcb265	Apply patch diff.txt from SF feature request http://www.python.org/sf/444708 This adds the optional argument for str.strip to unicode.strip too and makes it possible to call str.strip with a unicode argument and unicode.strip with a str argument.	2002-04-22 17:42:37 +00:00
Walter Dörwald	2ee4be0775	Apply diff3.txt from SF patch http://www.python.org/sf/536241 If a str or unicode method returns the original object, make sure that for str and unicode subclasses the original will not be returned. This should prevent SF bug http://www.python.org/sf/460020 from reappearing.	2002-04-17 21:34:05 +00:00
Tim Peters	863ac44b74	Whitespace normalization.	2002-04-16 01:38:40 +00:00

1 2 3

104 Commits