cpython/Objects/stringlib
Miss Islington (bot) c755ca89c7 [3.7] bpo-24214: Fixed the UTF-8 and UTF-16 incremental decoders. (GH-14304) (GH-14369)
* bpo-24214: Fixed the UTF-8 and UTF-16 incremental decoders. (GH-14304)

* The UTF-8 incremental decoders fails now fast if encounter
  a sequence that can't be handled by the error handler.
* The UTF-16 incremental decoders with the surrogatepass error
  handler decodes now a lone low surrogate with final=False.
(cherry picked from commit 894263ba80)

Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>
2019-06-25 12:29:18 +02:00
..
README.txt s/stringobject/bytesobject/ (closes #22036) 2014-07-23 21:39:37 -07:00
asciilib.h stringlib: remove unused STRINGLIB_RESIZE macro 2013-04-14 16:29:09 +02:00
codecs.h [3.7] bpo-24214: Fixed the UTF-8 and UTF-16 incremental decoders. (GH-14304) (GH-14369) 2019-06-25 12:29:18 +02:00
count.h
ctype.h bpo-32677: Add .isascii() to str, bytes and bytearray (GH-5342) 2018-01-27 14:06:21 +09:00
eq.h bpo-31338 (#3374) 2017-09-14 18:13:16 -07:00
fastsearch.h bpo-24821: Fixed the slowing down to 25 times in the searching of some (#505) 2017-03-30 09:11:10 +03:00
find.h Issue #26765: Moved common code and docstrings for bytes and bytearray methods 2016-05-04 22:23:26 +03:00
find_max_char.h Issue #26765: Ensure that bytes- and unicode-specific stringlib files are used 2016-05-16 09:42:29 +03:00
join.h Issue #28126: Replace Py_MEMCPY with memcpy(). Visual Studio can properly optimize memcpy(). 2016-09-13 20:22:02 +02:00
localeutil.h bpo-33954: Fix _PyUnicode_InsertThousandsGrouping() (GH-10623) (GH-10718) 2018-11-26 14:17:01 +01:00
partition.h Issue #18408: Fix bytearrayiter.partition()/rpartition(), handle 2013-10-29 03:15:37 +01:00
replace.h Issue #16061: Speed up str.replace() for replacing 1-character strings. 2013-04-13 22:45:04 +03:00
split.h Issue #18722: Remove uses of the "register" keyword in C code. 2013-08-13 20:18:52 +02:00
stringdefs.h stringlib: remove unused STRINGLIB_RESIZE macro 2013-04-14 16:29:09 +02:00
transmogrify.h Issue #29145: Merge 3.6. 2017-01-10 10:56:38 +08:00
ucs1lib.h stringlib: remove unused STRINGLIB_RESIZE macro 2013-04-14 16:29:09 +02:00
ucs2lib.h stringlib: remove unused STRINGLIB_RESIZE macro 2013-04-14 16:29:09 +02:00
ucs4lib.h stringlib: remove unused STRINGLIB_RESIZE macro 2013-04-14 16:29:09 +02:00
undef.h stringlib: remove unused STRINGLIB_RESIZE macro 2013-04-14 16:29:09 +02:00
unicode_format.h bpo-30978: str.format_map() now passes key lookup exceptions through. (#2790) 2017-08-03 11:45:23 +03:00
unicodedefs.h Issue #18701: Remove support of old CPython versions (<3.0) from C code. 2013-08-17 00:48:02 +03:00

README.txt

bits shared by the bytesobject and unicodeobject implementations (and
possibly other modules, in a not too distant future).

the stuff in here is included into relevant places; see the individual
source files for details.

--------------------------------------------------------------------
the following defines used by the different modules:

STRINGLIB_CHAR

    the type used to hold a character (char or Py_UNICODE)

STRINGLIB_EMPTY

    a PyObject representing the empty string, only to be used if
    STRINGLIB_MUTABLE is 0

Py_ssize_t STRINGLIB_LEN(PyObject*)

    returns the length of the given string object (which must be of the
    right type)

PyObject* STRINGLIB_NEW(STRINGLIB_CHAR*, Py_ssize_t)

    creates a new string object

STRINGLIB_CHAR* STRINGLIB_STR(PyObject*)

    returns the pointer to the character data for the given string
    object (which must be of the right type)

int STRINGLIB_CHECK_EXACT(PyObject *)

    returns true if the object is an instance of our type, not a subclass

STRINGLIB_MUTABLE

    must be 0 or 1 to tell the cpp macros in stringlib code if the object
    being operated on is mutable or not