cpython

Commit Graph

Author	SHA1	Message	Date
Kumar Aditya	6dab8c95bd	GH-96458: Statically initialize utf8 representation of static strings (#96481)	2022-09-02 23:43:08 -07:00
Kumar Aditya	129998bd7b	GH-96075: move interned dict under runtime state (GH-96077)	2022-08-22 12:05:21 -07:00
Petr Viktorin	71c3d649b5	gh-95504: Fix negative numbers in PyUnicode_FromFormat (GH-95848) Co-authored-by: philg314 <110174000+philg314@users.noreply.github.com>	2022-08-10 13:12:40 +02:00
Serhiy Storchaka	62f06508e7	gh-95781: More strict format string checking in PyUnicode_FromFormatV() (GH-95784) An unrecognized format character in PyUnicode_FromFormat() and PyUnicode_FromFormatV() now sets a SystemError. In previous versions it caused all the rest of the format string to be copied as-is to the result string, and any extra arguments discarded.	2022-08-08 19:21:07 +03:00
Dong-hee Na	fb75d015f4	gh-91146: More reduce allocation size of list from str.split/rsplit (gh-95493) Co-authored-by: Inada Naoki <songofacandy@gmail.com>	2022-08-01 22:15:07 +09:00
Dong-hee Na	50b2261bda	gh-91146: Reduce allocation size of list from str.split()/rsplit() (gh-95473)	2022-07-31 12:14:53 +09:00
Pamela Fox	70068b9336	Fix Unicode doc and replace use of macro with PyMem_New function (GH-94088)	2022-07-28 23:32:16 +01:00
Eric Snow	4a1dd73431	gh-94673: Add _PyStaticType_InitBuiltin() (#95152) This is the first of several precursors to storing tp_subclasses (and tp_weaklist) on the interpreter state for static builtin types. We do the following: * add `_PyStaticType_InitBuiltin()` * add `_Py_TPFLAGS_STATIC_BUILTIN` * set it on all static builtin types in `_PyStaticType_InitBuiltin()` * shuffle some code around to be able to use _PyStaticType_InitBuiltin() * rename `_PyStructSequence_InitType()` to `_PyStructSequence_InitBuiltinWithFlags()` * add `_PyStructSequence_InitBuiltin()`.	2022-07-25 12:47:31 -06:00
Kumar Aditya	9dff9f4814	GH-90699: Intern statically allocated strings (GH-93597) This is similar to how strings are interned for deepfreeze.	2022-07-08 10:47:37 -07:00
Eric Snow	caa279d6fd	bpo-40514: Drop EXPERIMENTAL_ISOLATED_SUBINTERPRETERS (gh-93185) This was added for bpo-40514 (gh-84694) to test out a per-interpreter GIL. However, it has since proven unnecessary to keep the experiment in the repo. (It can be done as a branch in a fork like normal.) So here we are removing: * the configure option * the macro * the code enabled by the macro	2022-05-27 17:38:01 -06:00
Kumar Aditya	cb04a09d2d	GH-93207: Remove HAVE_STDARG_PROTOTYPES configure check for stdarg.h (#93215)	2022-05-27 13:30:45 +02:00
Victor Stinner	5f8c3fb997	gh-91924: Optimize unicode_check_encoding_errors() (#93200) Avoid _PyCodec_Lookup() and PyCodec_LookupError() for most common built-in encodings and error handlers to avoid creating a temporary Unicode string object, whereas these encodings and error handlers are known to be valid.	2022-05-27 00:39:49 +02:00
Victor Stinner	059b5baf98	gh-85858: Remove PyUnicode_InternImmortal() function (#92579) Remove the PyUnicode_InternImmortal() function and the SSTATE_INTERNED_IMMORTAL macro. The PyUnicode_InternImmortal() function is still exported in the stable ABI. The function is removed from the API. PyASCIIObject.state.interned size is now a single bit, rather than 2 bits. Keep SSTATE_NOT_INTERNED and SSTATE_INTERNED_MORTAL macros for backward compatibility, but no longer use them internally since the interned member is now a single bit and so can only have two values (interned or not interned). Update stats of _PyUnicode_ClearInterned().	2022-05-13 13:40:22 +02:00
Victor Stinner	f62ad4f2c4	gh-89653: Use int type for Unicode kind (#92704) Use the same type that PyUnicode_FromKindAndData() kind parameter type (public C API): int.	2022-05-13 12:41:05 +02:00
Inada Naoki	f9c9354a7a	gh-92536: PEP 623: Remove wstr and legacy APIs from Unicode (GH-92537)	2022-05-12 14:48:38 +09:00
Victor Stinner	804f2529d8	gh-91320: Use _PyCFunction_CAST() (#92251) Replace "(PyCFunction)(void(*)(void))func" cast with _PyCFunction_CAST(func). Change generated by the command: sed -i -e 's!(PyCFunction)(void(\*)(void)) *\([A-Za-z0-9_]\+\)!_PyCFunction_CAST(\1)!g' $(find -name "*.c")	2022-05-03 21:42:14 +02:00
Serhiy Storchaka	18b07d773e	bpo-36819: Fix crashes in built-in encoders with weird error handlers (GH-28593) If the error handler returns position less or equal than the starting position of non-encodable characters, most of built-in encoders didn't properly re-size the output buffer. This led to out-of-bounds writes, and segfaults.	2022-05-02 12:37:48 +03:00
Serhiy Storchaka	3483299a24	gh-81548: Deprecate octal escape sequences with value larger than 0o377 (GH-91668)	2022-04-30 13:16:27 +03:00
Dennis Sweeney	da6c78584b	gh-90667: Add specializations of Py_DECREF when types are known (GH-30872)	2022-04-19 19:02:19 +01:00
Kumar Aditya	ab0d35d70d	bpo-46712: share more global strings in deepfreeze (gh-32152) (for gh-90868)	2022-04-19 11:41:36 -06:00
Oleg Iarygin	2f0fc521f4	gh-91102: Use Argument Clinic for EncodingMap (#31725) Co-authored-by: Jelle Zijlstra <jelle.zijlstra@gmail.com>	2022-04-18 13:43:56 -07:00
Kumar Aditya	8c54c3dacc	gh-91576: Speed up iteration of strings (#91574)	2022-04-18 07:18:27 -07:00
Tobias Stoeckmann	0859368335	gh-91421: Use constant value check during runtime (GH-91422) The left-hand side expression of the if-check can be converted to a constant by the compiler, but the addition on the right-hand side is performed during runtime. Move the addition from the right-hand side to the left-hand side by turning it into a subtraction there. Since the values are known to be large enough to not turn negative, this is a safe operation. Prevents a very unlikely integer overflow on 32 bit systems. Fixes GH-91421.	2022-04-12 20:01:02 -07:00
John Belmonte	b0b836b20c	bpo-45995: add "z" format specifer to coerce negative 0 to zero (GH-30049) Add "z" format specifier to coerce negative 0 to zero. See https://github.com/python/cpython/issues/90153 (originally https://bugs.python.org/issue45995) for discussion. This covers `str.format()` and f-strings. Old-style string interpolation is not supported. Co-authored-by: Mark Dickinson <dickinsm@gmail.com>	2022-04-11 15:34:18 +01:00
Raymond Hettinger	d6fb104690	Fix bad grammar and import docstring for split/rsplit (GH-32381)	2022-04-08 08:36:20 -05:00
Christian Heimes	44e915028d	bpo-47182: Fix crash by named unicode characters after interpreter reinitialization (GH-32212) Automerge-Triggered-By: GH:tiran	2022-03-31 08:14:50 -07:00
Victor Stinner	c14d7e4b81	bpo-47164: Add _PyASCIIObject_CAST() macro (GH-32191) Add macros to cast objects to PyASCIIObject*, PyCompactUnicodeObject* and PyUnicodeObject*: _PyASCIIObject_CAST(), _PyCompactUnicodeObject_CAST() and _PyUnicodeObject_CAST(). Using these new macros make the code more readable and check their argument with: assert(PyUnicode_Check(op)). Remove redundant assert(PyUnicode_Check(op)) in macros using directly or indirectly these new CAST macros. Replacing existing casts with these macros.	2022-03-31 09:59:27 +02:00
Pieter Eendebak	850687df47	bpo-47070: Add _PyBytes_Repeat() (GH-31999) Use it where appropriate: the repeat functions of `array.array`, `bytes`, `bytearray`, and `str`.	2022-03-28 04:43:45 -04:00
Jeremy Kloth	88872a29f1	bpo-47084: Clear Unicode cached representations on finalization (GH-32032)	2022-03-22 13:53:51 +01:00
Oleg Iarygin	a52f82baf2	bpo-46920: Remove disabled debug code added decades ago and likely unnecessary (GH-31812)	2022-03-14 17:03:21 +01:00
Jelle Zijlstra	54ab9ad312	bpo-46881: Fix refleak from GH-31616 (GH-31805)	2022-03-11 17:05:08 +08:00
Kumar Aditya	8714b6fa27	bpo-46881: Statically allocate and initialize the latin1 characters. (GH-31616)	2022-03-09 15:02:00 -08:00
Eric Snow	81c72044a1	bpo-46541: Replace core use of _Py_IDENTIFIER() with statically initialized global objects. (gh-30928) We're no longer using _Py_IDENTIFIER() (or _Py_static_string()) in any core CPython code. It is still used in a number of non-builtin stdlib modules. The replacement is: PyUnicodeObject (not pointer) fields under _PyRuntimeState, statically initialized as part of _PyRuntime. A new _Py_GET_GLOBAL_IDENTIFIER() macro facilitates lookup of the fields (along with _Py_GET_GLOBAL_STRING() for non-identifier strings). https://bugs.python.org/issue46541#msg411799 explains the rationale for this change. The core of the change is in: * (new) Include/internal/pycore_global_strings.h - the declarations for the global strings, along with the macros * Include/internal/pycore_runtime_init.h - added the static initializers for the global strings * Include/internal/pycore_global_objects.h - where the struct in pycore_global_strings.h is hooked into _PyRuntimeState * Tools/scripts/generate_global_objects.py - added generation of the global string declarations and static initializers I've also added a --check flag to generate_global_objects.py (along with make check-global-objects) to check for unused global strings. That check is added to the PR CI config. The remainder of this change updates the core code to use _Py_GET_GLOBAL_IDENTIFIER() instead of _Py_IDENTIFIER() and the related _PyId functions (likewise for _Py_GET_GLOBAL_STRING() instead of _Py_static_string()). This includes adding a few functions where there wasn't already an alternative to _PyId(), replacing the _Py_Identifier * parameter with PyObject *. The following are not changed (yet): * stop using _Py_IDENTIFIER() in the stdlib modules * (maybe) get rid of _Py_IDENTIFIER(), etc. entirely -- this may not be doable as at least one package on PyPI using this (private) API * (maybe) intern the strings during runtime init https://bugs.python.org/issue46541	2022-02-08 13:39:07 -07:00
Victor Stinner	b556f53785	bpo-46670: Test if a macro is defined, not its value (GH-31178) * audioop.c: #ifdef WORDS_BIGENDIAN * ctypes.h: #ifdef USING_MALLOC_CLOSURE_DOT_C * _ctypes/malloc_closure.c: #ifdef HAVE_FFI_CLOSURE_ALLOC and #ifdef USING_APPLE_OS_LIBFFI * pytime.c: #ifdef __APPLE__ * unicodeobject.c: #ifdef HAVE_NON_UNICODE_WCHAR_T_REPRESENTATION	2022-02-07 01:46:51 +01:00
Victor Stinner	1626bf4ac7	bpo-46417: Clear Unicode static types at exit (GH-30806) Add _PyUnicode_FiniTypes() function, called by finalize_interp_types(). It clears these static types: * EncodingMapType * PyFieldNameIter_Type * PyFormatterIter_Type _PyStaticType_Dealloc() now does nothing if tp_subclasses is not NULL.	2022-01-22 22:55:39 +01:00
Victor Stinner	a1bf329bca	bpo-46417: Add missing types of _PyTypes_InitTypes() (GH-30749) Add types removed by mistake by the commit adding _PyTypes_FiniTypes(). Move also PyBool_Type at the end, since it depends on PyLong_Type. PyBytes_Type and PyUnicode_Type no longer depend explicitly on PyBaseObject_Type: it's the default of PyType_Ready().	2022-01-21 17:53:13 +01:00
Victor Stinner	35d6540c90	bpo-46006: Revert "bpo-40521: Per-interpreter interned strings (GH-20085)" (GH-30422) This reverts commit `ea251806b8`. Keep "assert(interned == NULL);" in _PyUnicode_Fini(), but only for the main interpreter. Keep _PyUnicode_ClearInterned() changes avoiding the creation of a temporary Python list object.	2022-01-06 08:53:44 +01:00
Eric Snow	c8749b5783	bpo-46008: Make runtime-global object/type lifecycle functions and state consistent. (gh-29998) This change is strictly renames and moving code around. It helps in the following ways: * ensures type-related init functions focus strictly on one of the three aspects (state, objects, types) * passes in PyInterpreterState * to all those functions, simplifying work on moving types/objects/state to the interpreter * consistent naming conventions help make what's going on more clear * keeping API related to a type in the corresponding header file makes it more obvious where to look for it https://bugs.python.org/issue46008	2021-12-09 12:59:26 -07:00
Dennis Sweeney	03768c4d13	bpo-45885: Specialize COMPARE_OP (GH-29734) * Add COMPARE_OP_ADAPTIVE adaptive instruction. * Add COMPARE_OP_FLOAT_JUMP, COMPARE_OP_INT_JUMP and COMPARE_OP_STR_JUMP specialized instructions. * Introduce and use _PyUnicode_Equal	2021-12-03 11:29:12 +00:00
Victor Stinner	5f09bb021a	bpo-35134: Add Include/cpython/longobject.h (GH-29044) Move Include/longobject.h non-limited API to a new Include/cpython/longobject.h header file. Move the following definitions to the internal C API: * _PyLong_DigitValue * _PyLong_FormatAdvancedWriter() * _PyLong_FormatWriter()	2021-10-19 02:04:52 +02:00
Serhiy Storchaka	39aa98346d	bpo-45467: Fix IncrementalDecoder and StreamReader in the "raw-unicode-escape" codec (GH-28944) They support now splitting escape sequences between input chunks. Add the third parameter "final" in codecs.raw_unicode_escape_decode(). It is True by default to match the former behavior.	2021-10-14 20:04:19 +03:00
Serhiy Storchaka	c96d1546b1	bpo-45461: Fix IncrementalDecoder and StreamReader in the "unicode-escape" codec (GH-28939) They support now splitting escape sequences between input chunks. Add the third parameter "final" in codecs.unicode_escape_decode(). It is True by default to match the former behavior.	2021-10-14 13:17:00 +03:00
Christian Clauss	5f401f1040	Fix typos in the Objects directory (GH-28766)	2021-10-06 16:57:10 -07:00
Victor Stinner	8620be99da	bpo-45061: Revert unicode_is_singleton() change (GH-28516) Don't use a loop over 256 items, only checks for a single singleton.	2021-09-22 12:16:53 +02:00
Mohamad Mansour	8f943ca257	[codemod] Fix non-matching bracket pairs (GH-28473) Co-authored-by: Terry Jan Reedy <tjreedy@udel.edu> Co-authored-by: Serhiy Storchaka <storchaka@gmail.com> Co-authored-by: Łukasz Langa <lukasz@langa.pl>	2021-09-22 01:09:00 +02:00
Victor Stinner	86f28372b1	bpo-45061: Detect refcount bug on empty string singleton (GH-28504) Detect refcount bugs in C extensions when the empty Unicode string singleton is destroyed by mistake. * Move forward declarations to the top of unicodeobject.c. * Simplifiy unicode_is_singleton().	2021-09-21 23:43:09 +02:00
Miguel Brito	ed1076428c	bpo-44110: Improve string's __getitem__ error message (GH-26042)	2021-06-27 15:04:57 +03:00
Serhiy Storchaka	be8b631b7a	Add more const modifiers. (GH-26691)	2021-06-12 16:11:59 +03:00
Inada Naoki	9ad8f109ac	bpo-44029: Remove Py_UNICODE APIs (GH-25881) Remove deprecated `Py_UNICODE` APIs: `PyUnicode_Encode`, `PyUnicode_EncodeUTF7`, `PyUnicode_EncodeUTF8`, `PyUnicode_EncodeUTF16`, `PyUnicode_EncodeUTF32`, `PyUnicode_EncodeLatin1`, `PyUnicode_EncodeMBCS`, `PyUnicode_EncodeDecimal`, `PyUnicode_EncodeRawUnicodeEscape`, `PyUnicode_EncodeCharmap`, `PyUnicode_EncodeUnicodeEscape`, `PyUnicode_TransformDecimalToASCII`, `PyUnicode_TranslateCharmap`, `PyUnicodeEncodeError_Create`, `PyUnicodeTranslateError_Create`. See :pep:`393` and :pep:`624` for reference.	2021-05-07 15:58:29 +09:00
Jakub Kulík	9032cf5cb1	bpo-43667: Fix broken Unicode encoding in non-UTF locales on Solaris (GH-25096)	2021-04-30 15:21:42 +02:00

1599 Commits