cpython

Commit Graph

Author	SHA1	Message	Date
Victor Stinner	7ed7aead95	bpo-29240: Fix locale encodings in UTF-8 Mode (#5170 ) Modify locale.localeconv(), time.tzname, os.strerror() and other functions to ignore the UTF-8 Mode: always use the current locale encoding. Changes: * Add _Py_DecodeLocaleEx() and _Py_EncodeLocaleEx(). On decoding or encoding error, they return the position of the error and an error message which are used to raise Unicode errors in PyUnicode_DecodeLocale() and PyUnicode_EncodeLocale(). * Replace _Py_DecodeCurrentLocale() with _Py_DecodeLocaleEx(). * PyUnicode_DecodeLocale() now uses _Py_DecodeLocaleEx() for all cases, especially for the strict error handler. * Add _Py_DecodeUTF8Ex(): return more information on decoding error and supports the strict error handler. * Rename _Py_EncodeUTF8_surrogateescape() to _Py_EncodeUTF8Ex(). * Replace _Py_EncodeCurrentLocale() with _Py_EncodeLocaleEx(). * Ignore the UTF-8 mode to encode/decode localeconv(), strerror() and time zone name. * Remove PyUnicode_DecodeLocale(), PyUnicode_DecodeLocaleAndSize() and PyUnicode_EncodeLocale() now ignore the UTF-8 mode: always use the "current" locale. * Remove _PyUnicode_DecodeCurrentLocale(), _PyUnicode_DecodeCurrentLocaleAndSize() and _PyUnicode_EncodeCurrentLocale().	2018-01-15 10:45:49 +01:00
Victor Stinner	cb3ae5588b	bpo-29240: Ignore UTF-8 Mode in time module (#5148 ) time.strftime() must use the current LC_CTYPE encoding, not UTF-8 if the UTF-8 mode is enabled. Add _PyUnicode_DecodeCurrentLocale() function.	2018-01-11 10:37:59 +01:00
Victor Stinner	2cba6b8579	bpo-29240: readline now ignores the UTF-8 Mode (#5145 ) Add new fuctions ignoring the UTF-8 mode: * _Py_DecodeCurrentLocale() * _Py_EncodeCurrentLocale() * _PyUnicode_DecodeCurrentLocaleAndSize() * _PyUnicode_EncodeCurrentLocale() Modify the readline module to use these functions. Re-enable test_readline.test_nonascii().	2018-01-10 22:46:15 +01:00
Victor Stinner	9dd762013f	bpo-32030: Add _Py_EncodeLocaleRaw() (#4961 ) Replace Py_EncodeLocale() with _Py_EncodeLocaleRaw() in: * _Py_wfopen() * _Py_wreadlink() * _Py_wrealpath() * _Py_wstat() * pymain_open_filename() These functions are called early during Python intialization, only the RAW memory allocator must be used.	2017-12-21 16:20:32 +01:00
Victor Stinner	e47e698da6	bpo-32030: Add _Py_EncodeUTF8_surrogateescape() (#4960 ) Py_EncodeLocale() now uses _Py_EncodeUTF8_surrogateescape(), instead of using temporary unicode and bytes objects. So Py_EncodeLocale() doesn't use the Python C API anymore.	2017-12-21 15:45:16 +01:00
Serhiy Storchaka	a5552f023e	bpo-32240: Add the const qualifier to declarations of PyObject* array arguments. (#4746 )	2017-12-15 13:11:11 +02:00
Victor Stinner	91106cd9ff	bpo-29240: PEP 540: Add a new UTF-8 Mode (#855 ) * Add -X utf8 command line option, PYTHONUTF8 environment variable and a new sys.flags.utf8_mode flag. * If the LC_CTYPE locale is "C" at startup: enable automatically the UTF-8 mode. * Add _winapi.GetACP(). encodings._alias_mbcs() now calls _winapi.GetACP() to get the ANSI code page * locale.getpreferredencoding() now returns 'UTF-8' in the UTF-8 mode. As a side effect, open() now uses the UTF-8 encoding by default in this mode. * Py_DecodeLocale() and Py_EncodeLocale() now use the UTF-8 encoding in the UTF-8 Mode. * Update subprocess._args_from_interpreter_flags() to handle -X utf8 * Skip some tests relying on the current locale if the UTF-8 mode is enabled. * Add test_utf8mode.py. * _Py_DecodeUTF8_surrogateescape() gets a new optional parameter to return also the length (number of wide characters). * pymain_get_global_config() and pymain_set_global_config() now always copy flag values, rather than only copying if the new value is greater than the old value.	2017-12-13 12:29:09 +01:00
Victor Stinner	6a54c676e6	bpo-31979: Remove unused align_maxchar() function (#4527 )	2017-11-23 19:02:23 +01:00
Serhiy Storchaka	9b6c60cbce	bpo-31979: Simplify transforming decimals to ASCII (#4336 ) in int(), float() and complex() parsers. This also speeds up parsing non-ASCII numbers by around 20%.	2017-11-13 21:23:48 +02:00
Serhiy Storchaka	e2f92de6a9	Add the const qualifier to "char *" variables that refer to literal strings. (#4370 )	2017-11-11 13:06:26 +02:00
stratakis	e8b1965639	bpo-23699: Use a macro to reduce boilerplate code in rich comparison functions (GH-793)	2017-11-02 20:32:54 +10:00
Serhiy Storchaka	a2314283ff	bpo-20047: Make bytearray methods partition() and rpartition() rejecting (#4158 ) separators that are not bytes-like objects.	2017-10-29 02:11:54 +03:00
Serhiy Storchaka	56cb465cc9	bpo-31825: Fixed OverflowError in the 'unicode-escape' codec (#4058 ) and in codecs.escape_decode() when decode an escaped non-ascii byte.	2017-10-20 17:08:15 +03:00
Barry Warsaw	b2e5794870	bpo-31338 (#3374 ) * Add Py_UNREACHABLE() as an alias to abort(). * Use Py_UNREACHABLE() instead of assert(0) * Convert more unreachable code to use Py_UNREACHABLE() * Document Py_UNREACHABLE() and a few other macros.	2017-09-14 18:13:16 -07:00
Serhiy Storchaka	e3b2b4b8d9	bpo-31393: Fix the use of PyUnicode_READY(). (#3451 )	2017-09-08 09:58:51 +03:00
Eric Snow	2ebc5ce42a	bpo-30860: Consolidate stateful runtime globals. (#3397 ) * group the (stateful) runtime globals into various topical structs * consolidate the topical structs under a single top-level _PyRuntimeState struct * add a check-c-globals.py script that helps identify runtime globals Other globals are excluded (see globals.txt and check-c-globals.py).	2017-09-07 23:51:28 -06:00
Stefan Krah	f432a3234f	bpo-30923: Silence fall-through warnings included in -Wextra since gcc-7.0. (#3157 )	2017-08-21 13:09:59 +02:00
Serhiy Storchaka	64e461be09	bpo-22207: Add checks for possible integer overflows in unicodeobject.c. (#2623 ) Based on patch by Victor Stinner.	2017-07-11 06:55:25 +03:00
Serhiy Storchaka	f7eae0adfc	[security] bpo-13617: Reject embedded null characters in wchar* strings. (#2302 ) Based on patch by Victor Stinner. Add private C API function _PyUnicode_AsUnicode() which is similar to PyUnicode_AsUnicode(), but checks for null characters.	2017-06-28 08:30:06 +03:00
Serhiy Storchaka	e613e6add5	bpo-30708: Check for null characters in PyUnicode_AsWideCharString(). (#2285 ) Raise a ValueError if the second argument is NULL and the wchar_t\* string contains null characters.	2017-06-27 16:03:14 +03:00
Serhiy Storchaka	40db90c1ce	bpo-29802: Fix reference counting in module-level struct functions (#1213 ) when pass arguments of wrong type.	2017-04-20 21:19:31 +03:00
Serhiy Storchaka	b879fe82e7	Expand the PySlice_GetIndicesEx macro. (#1023 )	2017-04-08 09:53:51 +03:00
Lisa Roach	43ba8861e0	bpo-29549: Fixes docstring for str.index (#256 ) * Updates B.index documentation. * Updates str.index documentation, makes it Argument Clinic compatible. * Removes ArgumentClinic code. * Finishes string.index documentation. * Updates string.rindex documentation. * Documents B.rindex.	2017-04-04 22:36:22 -07:00
Serhiy Storchaka	fff9a31a91	bpo-29865: Use PyXXX_GET_SIZE macros rather than Py_SIZE for concrete types. (#748 )	2017-03-21 08:53:25 +02:00
Serhiy Storchaka	004e03fb0c	bpo-29116: Improve error message for concatenating str with non-str. (#710 )	2017-03-19 19:38:42 +02:00
Serhiy Storchaka	202fda55c2	bpo-24037: Add Argument Clinic converter `bool(accept={int})`. (#485 )	2017-03-12 10:10:47 +02:00
Serhiy Storchaka	370fd202f1	Use Py_RETURN_FALSE/Py_RETURN_TRUE rather than PyBool_FromLong(0)/PyBool_FromLong(1). (#567 )	2017-03-08 20:47:48 +02:00
Serhiy Storchaka	9f8ad3f39e	bpo-29568: Disable any characters between two percents for escaped percent "%%" in the format string for classic string formatting. (GH-513)	2017-03-08 11:51:19 +08:00
Martin Panter	91a8866dc1	Fix grammar in doc string, RST markup	2017-01-24 00:30:06 +00:00
Serhiy Storchaka	228b12edcc	Issue #28999 : Use Py_RETURN_NONE, Py_RETURN_TRUE and Py_RETURN_FALSE wherever possible. Patch is writen with Coccinelle.	2017-01-23 09:47:21 +02:00
Serhiy Storchaka	2a404b63d4	Issue #28769 : The result of PyUnicode_AsUTF8AndSize() and PyUnicode_AsUTF8() is now of type "const char " rather of "char ".	2017-01-22 23:07:07 +02:00
Victor Stinner	0c4a828cad	Run Argument Clinic: METH_VARARGS=>METH_FASTCALL Issue #29286. Run Argument Clinic to get the new faster METH_FASTCALL calling convention for functions using "boring" positional arguments. Manually fix _elementtree: _elementtree_XMLParser_doctype() must remain consistent with the clinic code.	2017-01-17 02:21:47 +01:00
INADA Naoki	15f94596b6	Issue #20180 : forgot to update AC output.	2017-01-16 21:49:13 +09:00
INADA Naoki	3ae2056512	Issue #20180 : convert unicode methods to AC.	2017-01-16 20:41:20 +09:00
Xiang Zhang	7a4da324dc	Issue #29145 : Merge 3.6.	2017-01-10 10:56:38 +08:00
Xiang Zhang	95403d74d7	Issue #29145 : Merge 3.5.	2017-01-10 10:54:19 +08:00
Xiang Zhang	b0541f4cdf	Issue #29145 : Fix overflow checks in str.replace() and str.join(). Based on patch by Martin Panter.	2017-01-10 10:52:00 +08:00
Xiang Zhang	62497d52d9	Issue #29044 : Merge 3.6.	2016-12-22 15:31:55 +08:00
Xiang Zhang	437a5d2c25	Issue #29044 : Merge 3.5.	2016-12-22 15:31:22 +08:00
Xiang Zhang	ea1cf87030	Issue #29044 : Fix a use-after-free in string '%c' formatter.	2016-12-22 15:30:47 +08:00
Xiang Zhang	b211068f5c	Issue #28822 : Adjust indices handling of PyUnicode_FindChar().	2016-12-20 22:52:33 +08:00
Xavier de Gaye	31eaf49ed9	Merge 3.6.	2016-12-15 21:01:52 +01:00
Xavier de Gaye	76febd0792	Issue #26919 : On Android, operating system data is now always encoded/decoded to/from UTF-8, instead of the locale encoding to avoid inconsistencies with os.fsencode() and os.fsdecode() which are already using UTF-8.	2016-12-15 20:59:58 +01:00
Serhiy Storchaka	fb3134f4d4	Issue #28808 : PyUnicode_CompareWithASCIIString() now never raises exceptions.	2016-12-06 00:20:26 +02:00
Serhiy Storchaka	9a953dbb34	Issue #28808 : PyUnicode_CompareWithASCIIString() now never raises exceptions.	2016-12-06 00:17:45 +02:00
Serhiy Storchaka	419967b832	Issue #28808 : PyUnicode_CompareWithASCIIString() now never raises exceptions.	2016-12-06 00:13:34 +02:00
Victor Stinner	de4ae3d486	Backed out changeset b9c9691c72c5 Issue #28858: The change b9c9691c72c5 introduced a regression. It seems like _PyObject_CallArg1() uses more stack memory than PyObject_CallFunctionObjArgs().	2016-12-04 22:59:09 +01:00
Victor Stinner	27580c1fb5	Replace PyObject_CallFunctionObjArgs() with fastcall * PyObject_CallFunctionObjArgs(func, NULL) => _PyObject_CallNoArg(func) * PyObject_CallFunctionObjArgs(func, arg, NULL) => _PyObject_CallArg1(func, arg) PyObject_CallFunctionObjArgs() allocates 40 bytes on the C stack and requires extra work to "parse" C arguments to build a C array of PyObject*. _PyObject_CallNoArg() and _PyObject_CallArg1() are simpler and don't allocate memory on the C stack. This change is part of the fastcall project. The change on listsort() is related to the issue #23507.	2016-12-01 14:43:22 +01:00
Serhiy Storchaka	99250d5c63	Issue #28774 : Simplified encoding a str result of an error handler in ASCII and Latin1 encoders.	2016-11-23 15:13:00 +02:00
Xiang Zhang	d04d8474df	Issue #28774 : Fix start/end pos in unicode_encode_ucs1(). Fix error position of the unicode error in ASCII and Latin1 encoders when a string returned by the error handler contains multiple non-encodable characters (non-ASCII for the ASCII codec, characters out of the U+0000-U+00FF range for Latin1).	2016-11-23 19:34:01 +08:00

1 2 3 4 5 ...

1403 Commits