cpython

Commit Graph

Author	SHA1	Message	Date
Marc-André Lemburg	3508e30861	Fix Unicode .join() method to raise a TypeError for sequence elements which are not Unicode objects or strings. (This matches the string.join() behaviour.) Fix a memory leak in the .join() method which occurs in case the Unicode resize fails. Restore the test_unicode output.	2001-09-20 17:22:58 +00:00
Marc-André Lemburg	5e89bd656f	Update test output after the unicode() change.	2001-09-20 16:37:23 +00:00
Marc-André Lemburg	e5034378cc	Removing UTF-16 aware Unicode comparison code. This kind of compare function (together with other locale aware ones) should into a new collation support module. See python-dev for a discussion of this removal. Note: This patch should also be applied to the 1.6 branch.	2000-08-08 08:04:29 +00:00
Guido van Rossum	1bbddd085c	Added the line 'Testing UTF-16 code point order comparisons... done." to match addition to test_unicode.py.	2000-07-10 15:06:06 +00:00
Marc-André Lemburg	5f2e75e87c	Marc-Andre Lemburg <mal@lemburg.com>: Updated test output (the ucn tests are now in test_ucn).	2000-06-30 09:14:13 +00:00
Marc-André Lemburg	4a9188c557	Marc-Andre Lemburg <mal@lemburg.com>: Updated test output.	2000-06-28 16:41:46 +00:00
Fred Drake	afe73a4687	M.-A. Lemburg <mal@lemburg.com>: Added test output for Unicode string concatenation test.	2000-04-13 14:10:04 +00:00
Guido van Rossum	9e896b37c7	Marc-Andre's third try at this bulk patch seems to work (except that his copy of test_contains.py seems to be broken -- the lines he deleted were already absent). Checkin messages: New Unicode support for int(), float(), complex() and long(). - new APIs PyInt_FromUnicode() and PyLong_FromUnicode() - added support for Unicode to PyFloat_FromString() - new encoding API PyUnicode_EncodeDecimal() which converts Unicode to a decimal char* string (used in the above new APIs) - shortcuts for calls like int(<int object>) and float(<float obj>) - tests for all of the above Unicode compares and contains checks: - comparing Unicode and non-string types now works; TypeErrors are masked, all other errors such as ValueError during Unicode coercion are passed through (note that PyUnicode_Compare does not implement the masking -- PyObject_Compare does this) - contains now works for non-string types too; TypeErrors are masked and 0 returned; all other errors are passed through Better testing support for the standard codecs. Misc minor enhancements, such as an alias dbcs for the mbcs codec. Changes: - PyLong_FromString() now applies the same error checks as does PyInt_FromString(): trailing garbage is reported as error and not longer silently ignored. The only characters which may be trailing the digits are 'L' and 'l' -- these are still silently ignored. - string.ato?() now directly interface to int(), long() and float(). The error strings are now a little different, but the type still remains the same. These functions are now ready to get declared obsolete ;-) - PyNumber_Int() now also does a check for embedded NULL chars in the input string; PyNumber_Long() already did this (and still does) Followed by: Looks like I've gone a step too far there... (and test_contains.py seem to have a bug too). I've changed back to reporting all errors in PyUnicode_Contains() and added a few more test cases to test_contains.py (plus corrected the join() NameError).	2000-04-05 20:11:21 +00:00
Guido van Rossum	24bdb0474f	Marc-Andre Lemburg: The attached patch set includes a workaround to get Python with Unicode compile on BSDI 4.x (courtesy Thomas Wouters; the cause is a bug in the BSDI wchar.h header file) and Python interfaces for the MBCS codec donated by Mark Hammond. Also included are some minor corrections w/r to the docs of the new "es" and "es#" parser markers (use PyMem_Free() instead of free(); thanks to Mark Hammond for finding these). The unicodedata tests are now in a separate file (test_unicodedata.py) to avoid problems if the module cannot be found.	2000-03-28 20:29:59 +00:00
Guido van Rossum	d8855fde88	Marc-Andre Lemburg: Attached you find the latest update of the Unicode implementation. The patch is against the current CVS version. It includes the fix I posted yesterday for the core dump problem in codecs.c (was introduced by my previous patch set -- sorry), adds more tests for the codecs and two new parser markers "es" and "es#".	2000-03-24 22:14:19 +00:00
Guido van Rossum	d8fbcc95d9	Regenerated with test for 'contains'.	2000-03-24 20:42:39 +00:00
Guido van Rossum	a831cac7a8	Marc-Andre Lemburg: test script for Unicode implementation.	2000-03-10 23:23:21 +00:00

12 Commits