cpython

Commit Graph

Author	SHA1	Message	Date
Neil Schemenauer	cf52c07843	Change the %s format specifier for str objects so that it returns a unicode instance if the argument is not an instance of basestring and calling __str__ on the argument returns a unicode instance.	2005-08-12 17:34:58 +00:00
Brett Cannon	c3647ac93e	Make subclasses of int, long, complex, float, and unicode perform type conversion using the proper magic slot (e.g., __int__()). Also move conversion code out of PyNumber_() functions in the C API into the nb_ function. Applied patch #1109424. Thanks Walter Doewald.	2005-04-26 03:45:26 +00:00
Walter Dörwald	57d88e5abd	Move test_bug1001011() to string_tests.MixinStrUnicodeTest so that it can be used for str and unicode. Drop the test for "".join([s]) is s because this is an implementation detail (and doesn't work for unicode)	2004-08-26 16:53:04 +00:00
Hye-Shik Chang	e9ddfbb412	SF #989185 : Drop unicode.iswide() and unicode.width() and add unicodedata.east_asian_width(). You can still implement your own simple width() function using it like this: def width(u): w = 0 for c in unicodedata.normalize('NFC', u): cwidth = unicodedata.east_asian_width(c) if cwidth in ('W', 'F'): w += 2 else: w += 1 return w	2004-08-04 07:38:35 +00:00
Marc-André Lemburg	d25c650461	Let u'%s' % obj try obj.__unicode__() first and fallback to obj.__str__().	2004-07-23 16:13:25 +00:00
Hye-Shik Chang	3c145449da	Reuse width/iswide tests from strings_test. (Suggested by Walter DÃ¶rwald)	2004-06-04 04:24:54 +00:00
Hye-Shik Chang	7bd860655f	Fix typo.	2004-06-04 03:19:17 +00:00
Hye-Shik Chang	974ed7cfa5	- SF #962502 : Add two more methods for unicode type; width() and iswide() for east asian width manipulation. (Inspired by David Goodger, Reviewed by Martin v. Loewis) - Move _PyUnicode_TypeRecord.flags to the end of the struct so that no padding is added for UCS-4 builds. (Suggested by Martin v. Loewis)	2004-06-02 16:49:17 +00:00
Walter Dörwald	cd736e71a3	Fix reallocation bug in unicode.translate(): The code was comparing characters instead of character pointers to determine space requirements.	2004-02-05 17:36:00 +00:00
Jeremy Hylton	504de6bd2c	Fix for SF bug [ 817156 ] invalid \U escape gives 0=length unistr.	2003-10-06 05:08:26 +00:00
Martin v. Löwis	0d8e16c7ad	Support trailing dots in DNS names. Fixes #782510 . Will backport to 2.3.	2003-08-05 06:19:47 +00:00
Martin v. Löwis	9a3a9f7791	Consider \U-escapes in raw-unicode-escape. Fixes #444514 .	2003-05-18 12:31:09 +00:00
Walter Dörwald	21d3a32b99	Combine the functionality of test_support.run_unittest() and test_support.run_classtests() into run_unittest() and use it wherever possible. Also don't use "from test.test_support import ...", but "from test import test_support" in a few spots. From SF patch #662807.	2003-05-01 17:45:56 +00:00
Walter Dörwald	44f527fea4	Change formatchar(), so that u"%c" % 0xffffffff now raises an OverflowError instead of a TypeError to be consistent with "%c" % 256. See SF patch #710127.	2003-04-02 16:37:24 +00:00
Walter Dörwald	56fbcb525b	Remove duplicate test.	2003-03-31 18:18:41 +00:00
Walter Dörwald	43440a621e	Fix PyString_Format() so that '%c' % u'a' returns u'a' instead of raising a TypeError. (From SF patch #710127) Add tests to verify this is fixed. Add various tests for '%c' % int.	2003-03-31 18:07:50 +00:00
Walter Dörwald	0fd583ce4d	Port all string tests to PyUnit and share as much tests between str, unicode, UserString and the string module as possible. This increases code coverage in stringobject.c from 83% to 86% and should help keep the string classes in sync in the future. From SF patch #662807	2003-02-21 12:53:50 +00:00
Walter Dörwald	4f046e2e21	Add a few tests to test_count() to increase coverage in Object/unicodeobject.c::unicode_count().	2003-02-10 17:51:03 +00:00
Walter Dörwald	74640247d4	Fix copy&paste error: call title instead of count	2003-02-10 17:44:16 +00:00
Walter Dörwald	28256f276e	Port test_unicode.py to PyUnit and add tests for error cases and a few methods. This increases code coverage in Objects/unicodeobject.c from 81% to 85%. (From SF patch #662807)	2003-01-19 16:59:20 +00:00
Walter Dörwald	395bb49555	Add a test that exercises the error handling part of PyUnicode_EncodeDecimal().	2003-01-08 23:02:34 +00:00
Marc-André Lemburg	79f57833f3	Patch for bug #659709 : bogus computation of float length Python 2.2.x backport candidate. (This bug has been around since Python 1.6.)	2002-12-29 19:44:06 +00:00
Neil Schemenauer	ab9e4b76c2	check for unicode.__mod__	2002-11-18 16:11:34 +00:00
Marc-André Lemburg	9cd87aaa54	Fix for bug #626172 : crash using unicode latin1 single char Python 2.2.3 candidate.	2002-10-23 09:02:46 +00:00
Martin v. Löwis	1ce4ae3268	Don't test whether surrogate sequences round-trip in UTF-8. 2.2.2 candidate.	2002-09-14 09:19:53 +00:00
Martin v. Löwis	766e300eaa	Use integer above sys.maxunicode for range test. Fixes #608884 . 2.2.2 candidate.	2002-09-14 09:10:04 +00:00
Walter Dörwald	5c1ee17742	Change the unicode.translate docstring to document that Unicode strings (with arbitrary length) are allowed as entries in the unicode.translate mapping. Add a test case for multicharacter replacements. (Multicharacter replacements were enabled by the PEP 293 patch)	2002-09-04 20:31:32 +00:00
Guido van Rossum	2023c9b84a	Fix SF bug 599128, submitted by Inyeol Lee: .replace() would do the wrong thing for a unicode subclass when there were zero string replacements. The example given in the SF bug report was only one way to trigger this; replacing a string of length >= 2 that's not found is another. The code would actually write outside allocated memory if replacement string was longer than the search string. (I wonder how many more of these are lurking? The unicode code base is full of wonders.) Bugfix candidate; this same bug is present in 2.2.1.	2002-08-23 18:50:21 +00:00
Guido van Rossum	8b1a6d694f	Code by Inyeol Lee, submitted to SF bug 595350, to implement the string/unicode method .replace() with a zero-lengt first argument. Inyeol contributed tests for this too.	2002-08-23 18:21:28 +00:00
Guido van Rossum	76afbd9aa4	Fix some endcase bugs in unicode rfind()/rindex() and endswith(). These were reported and fixed by Inyeol Lee in SF bug 595350. The endswith() bug was already fixed in 2.3, but this adds some more test cases.	2002-08-20 17:29:29 +00:00
Marc-André Lemburg	cc8764ca9d	Add C API PyUnicode_FromOrdinal() which exposes unichr() at C level. u'%c' will now raise a ValueError in case the argument is an integer outside the valid range of Unicode code point ordinals. Closes SF bug #593581.	2002-08-11 12:23:04 +00:00
Guido van Rossum	f36921c4b0	Unicode replace() method with empty pattern argument should fail, like it does for 8-bit strings.	2002-08-09 15:36:48 +00:00
Raymond Hettinger	ca84d65ca7	Expanded the unittests for the new width sensitive PyUnicode_Contains().	2002-08-06 23:08:51 +00:00
Barry Warsaw	e06741704e	Added a test for PyUnicode_Contains() taking into account the width of Py_UNICODE.	2002-08-06 19:03:56 +00:00
Barry Warsaw	817918cc3c	Committing patch #591250 which provides "str1 in str2" when str1 is a string of longer than 1 character.	2002-08-06 16:58:21 +00:00
Martin v. Löwis	a729daf2e4	Add encoding declaration.	2002-08-04 17:28:33 +00:00
Barry Warsaw	04f357cffe	Get rid of relative imports in all unittests. Now anything that imports e.g. test_support must do so using an absolute package name such as "import test.test_support" or "from test import test_support". This also updates the README in Lib/test, and gets rid of the duplicate data dirctory in Lib/test/data (replaced by Lib/email/test/data). Now Tim and Jack can have at it. :)	2002-07-23 19:04:11 +00:00
Tim Peters	8ac1495a6a	Whitespace normalization.	2002-05-23 15:15:30 +00:00
Walter Dörwald	de02bcb265	Apply patch diff.txt from SF feature request http://www.python.org/sf/444708 This adds the optional argument for str.strip to unicode.strip too and makes it possible to call str.strip with a unicode argument and unicode.strip with a str argument.	2002-04-22 17:42:37 +00:00
Walter Dörwald	2ee4be0775	Apply diff3.txt from SF patch http://www.python.org/sf/536241 If a str or unicode method returns the original object, make sure that for str and unicode subclasses the original will not be returned. This should prevent SF bug http://www.python.org/sf/460020 from reappearing.	2002-04-17 21:34:05 +00:00
Tim Peters	863ac44b74	Whitespace normalization.	2002-04-16 01:38:40 +00:00
Walter Dörwald	068325ef92	Apply the second version of SF patch http://www.python.org/sf/536241 Add a method zfill to str, unicode and UserString and change Lib/string.py accordingly. This activates the zfill version in unicodeobject.c that was commented out and implements the same in stringobject.c. It also adds the test for unicode support in Lib/string.py back in and uses repr() instead() of str() (as it was before Lib/string.py 1.62)	2002-04-15 13:36:47 +00:00
Marc-André Lemburg	ce0b664af2	Added test case for UTF-8 encoding bug #541828 .	2002-04-10 17:18:02 +00:00
Guido van Rossum	77f6a65eb0	Add the 'bool' type and its values 'False' and 'True', as described in PEP 285. Everything described in the PEP is here, and there is even some documentation. I had to fix 12 unit tests; all but one of these were printing Boolean outcomes that changed from 0/1 to False/True. (The exception is test_unicode.py, which did a type(x) == type(y) style comparison. I could've fixed that with a single line using issubtype(x, type(y)), but instead chose to be explicit about those places where a bool is expected. Still to do: perhaps more documentation; change standard library modules to return False/True from predicates.	2002-04-03 22:41:51 +00:00
Andrew M. Kuchling	eddd68d56c	As part of fixing bug #536241 , add a test case for string.zfill() with Unicode	2002-03-29 16:21:44 +00:00
Martin v. Löwis	047c05ebc4	Do not insert characters for unicode-escape decoders if the error mode is "ignore". Fixes #529104.	2002-03-21 08:55:28 +00:00
Marc-André Lemburg	bd3be8f0ca	Fix to the UTF-8 encoder: it failed on 0-length input strings. Fix for the UTF-8 decoder: it will now accept isolated surrogates (previously it raised an exception which causes round-trips to fail). Added new tests for UTF-8 round-trip safety (we rely on UTF-8 for marshalling Unicode objects, so we better make sure it works for all Unicode code points, including isolated surrogates). Bumped the PYC magic in a non-standard way -- please review. This was needed because the old PYC format used illegal UTF-8 sequences for isolated high surrogates which now raise an exception.	2002-02-07 11:33:49 +00:00
Marc-André Lemburg	3688a882d3	Fix for the UTF-8 memory allocation bug and the UTF-8 encoding bug related to lone high surrogates.	2002-02-06 18:09:02 +00:00
Finn Bock	2b29cb2593	Skipping some tests by adding the usual jython conditional test around: - the repr of unicode. Jython only add the u'' if the string contains char values > 255. - A unicode arg to unicode() is perfectly valid in jython. - A test buffer() test. No buffer() on Jython This closes patch "[ #490920 ] Jython and test_unicode".	2001-12-10 20:57:34 +00:00
Tim Peters	82285dad8e	Whitespace normalization.	2001-12-01 04:11:16 +00:00

1 2

95 Commits