cpython/Objects/stringlib
Victor Stinner 3d4226a832
bpo-34523: Support surrogatepass in locale codecs (GH-8995)
Add support for the "surrogatepass" error handler in
PyUnicode_DecodeFSDefault() and PyUnicode_EncodeFSDefault()
for the UTF-8 encoding.

Changes:

* _Py_DecodeUTF8Ex() and _Py_EncodeUTF8Ex() now support the
  surrogatepass error handler (_Py_ERROR_SURROGATEPASS).
* _Py_DecodeLocaleEx() and _Py_EncodeLocaleEx() now use
  the _Py_error_handler enum instead of "int surrogateescape" to pass
  the error handler. These functions now return -3 if the error
  handler is unknown.
* Add unit tests on _Py_DecodeLocaleEx() and _Py_EncodeLocaleEx()
  in test_codecs.
* Rename get_error_handler() to _Py_GetErrorHandler() and expose it
  as a private function.
* _freeze_importlib doesn't need config.filesystem_errors="strict"
  workaround anymore.
2018-08-29 22:21:32 +02:00
..
clinic bpo-20180: complete AC conversion of Objects/stringlib/transmogrify.h (GH-8039) 2018-07-06 13:17:38 +03:00
README.txt
asciilib.h
codecs.h bpo-34523: Support surrogatepass in locale codecs (GH-8995) 2018-08-29 22:21:32 +02:00
count.h
ctype.h bpo-33012: Fix invalid function cast warnings with gcc 8 for METH_NOARGS. (GH-6030) 2018-04-29 21:59:33 +03:00
eq.h bpo-31338 (#3374) 2017-09-14 18:13:16 -07:00
fastsearch.h bpo-24821: Fixed the slowing down to 25 times in the searching of some (#505) 2017-03-30 09:11:10 +03:00
find.h Issue #26765: Moved common code and docstrings for bytes and bytearray methods 2016-05-04 22:23:26 +03:00
find_max_char.h Issue #26765: Ensure that bytes- and unicode-specific stringlib files are used 2016-05-16 09:42:29 +03:00
join.h Issue #28126: Replace Py_MEMCPY with memcpy(). Visual Studio can properly optimize memcpy(). 2016-09-13 20:22:02 +02:00
localeutil.h Issue #26765: Ensure that bytes- and unicode-specific stringlib files are used 2016-05-16 09:42:29 +03:00
partition.h
replace.h
split.h
stringdefs.h
transmogrify.h bpo-20180: complete AC conversion of Objects/stringlib/transmogrify.h (GH-8039) 2018-07-06 13:17:38 +03:00
ucs1lib.h
ucs2lib.h
ucs4lib.h
undef.h
unicode_format.h bpo-30978: str.format_map() now passes key lookup exceptions through. (#2790) 2017-08-03 11:45:23 +03:00
unicodedefs.h

README.txt

bits shared by the bytesobject and unicodeobject implementations (and
possibly other modules, in a not too distant future).

the stuff in here is included into relevant places; see the individual
source files for details.

--------------------------------------------------------------------
the following defines used by the different modules:

STRINGLIB_CHAR

    the type used to hold a character (char or Py_UNICODE)

STRINGLIB_EMPTY

    a PyObject representing the empty string, only to be used if
    STRINGLIB_MUTABLE is 0

Py_ssize_t STRINGLIB_LEN(PyObject*)

    returns the length of the given string object (which must be of the
    right type)

PyObject* STRINGLIB_NEW(STRINGLIB_CHAR*, Py_ssize_t)

    creates a new string object

STRINGLIB_CHAR* STRINGLIB_STR(PyObject*)

    returns the pointer to the character data for the given string
    object (which must be of the right type)

int STRINGLIB_CHECK_EXACT(PyObject *)

    returns true if the object is an instance of our type, not a subclass

STRINGLIB_MUTABLE

    must be 0 or 1 to tell the cpp macros in stringlib code if the object
    being operated on is mutable or not