cpython

Commit Graph

Author	SHA1	Message	Date
Eric Snow	aa8591e9ca	gh-90111: Minor Cleanup for Runtime-Global Objects (gh-100254) * move _PyRuntime.global_objects.interned to _PyRuntime.cached_objects.interned_strings (and use _Py_CACHED_OBJECT()) * rename _PyRuntime.global_objects to _PyRuntime.static_objects (This also relates to gh-96075.) https://github.com/python/cpython/issues/90111	2022-12-14 11:53:57 -07:00
Eric Snow	91a8e002c2	gh-81057: Move More Globals to _PyRuntimeState (gh-100092) https://github.com/python/cpython/issues/81057	2022-12-07 15:56:31 -07:00
Serhiy Storchaka	a87c46eab3	bpo-15999: Accept arbitrary values for boolean parameters. (#15609 ) builtins and extension module functions and methods that expect boolean values for parameters now accept any Python object rather than just a bool or int type. This is more consistent with how native Python code itself behaves.	2022-12-03 11:52:21 -08:00
Serhiy Storchaka	f08e52ccb0	gh-99612: Fix PyUnicode_DecodeUTF8Stateful() for ASCII-only data (GH-99613) Previously *consumed was not set in this case.	2022-12-01 14:54:51 +02:00
Victor Stinner	135ec7cefb	gh-99537: Use Py_SETREF() function in C code (#99657 ) Fix potential race condition in code patterns: * Replace "Py_DECREF(var); var = new;" with "Py_SETREF(var, new);" * Replace "Py_XDECREF(var); var = new;" with "Py_XSETREF(var, new);" * Replace "Py_CLEAR(var); var = new;" with "Py_XSETREF(var, new);" Other changes: * Replace "old = var; var = new; Py_DECREF(var)" with "Py_SETREF(var, new);" * Replace "old = var; var = new; Py_XDECREF(var)" with "Py_XSETREF(var, new);" * And remove the "old" variable.	2022-11-22 13:39:11 +01:00
Eric Snow	5f55067e23	gh-81057: Move More Globals in Core Code to _PyRuntimeState (gh-99516) https://github.com/python/cpython/issues/81057	2022-11-16 09:37:14 -07:00
Victor Stinner	1960eb005e	gh-99300: Use Py_NewRef() in Objects/ directory (#99351 ) Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in C files of the Objects/ directory.	2022-11-10 23:40:31 +01:00
Eric Snow	52f91c642b	gh-90868: Adjust the Generated Objects (gh-99223) We do the following: * move the generated _PyUnicode_InitStaticStrings() to its own file * move the generated _PyStaticObjects_CheckRefcnt() to its own file * include pycore_global_objects.h in extension modules instead of pycore_runtime_init.h These changes help us avoid including things that aren't needed. https://github.com/python/cpython/issues/90868	2022-11-08 10:03:03 -07:00
Nikita Sobolev	76f989dc3e	gh-98783: Fix crashes when `str` subclasses are used in `_PyUnicode_Equal` (#98806 )	2022-10-30 02:23:20 -04:00
Victor Stinner	db03c8066a	gh-98393: os module reject bytes-like, only accept bytes (#98394 ) The os module and the PyUnicode_FSDecoder() function no longer accept bytes-like paths, like bytearray and memoryview types: only the exact bytes type is accepted for bytes strings.	2022-10-18 17:52:31 +02:00
Nikita Sobolev	ccab67ba79	gh-97982: Factorize PyUnicode_Count() and unicode_count() code (#98025 ) Add unicode_count_impl() to factorize PyUnicode_Count() and unicode_count() code.	2022-10-12 18:27:53 +02:00
Victor Stinner	df3a6d9beb	gh-97982: Remove asciilib_count() (#98164 ) asciilib_count() is the same than ucs1lib_count(): the code is not specialized for ASCII strings, so it's not worth it to have a separated function. Remove asciilib_count() function.	2022-10-11 17:59:58 +02:00
Kumar Aditya	6dab8c95bd	GH-96458: Statically initialize utf8 representation of static strings (#96481 )	2022-09-02 23:43:08 -07:00
Kumar Aditya	129998bd7b	GH-96075: move interned dict under runtime state (GH-96077)	2022-08-22 12:05:21 -07:00
Petr Viktorin	71c3d649b5	gh-95504: Fix negative numbers in PyUnicode_FromFormat (GH-95848) Co-authored-by: philg314 <110174000+philg314@users.noreply.github.com>	2022-08-10 13:12:40 +02:00
Serhiy Storchaka	62f06508e7	gh-95781: More strict format string checking in PyUnicode_FromFormatV() (GH-95784) An unrecognized format character in PyUnicode_FromFormat() and PyUnicode_FromFormatV() now sets a SystemError. In previous versions it caused all the rest of the format string to be copied as-is to the result string, and any extra arguments discarded.	2022-08-08 19:21:07 +03:00
Dong-hee Na	fb75d015f4	gh-91146: More reduce allocation size of list from str.split/rsplit (gh-95493) Co-authored-by: Inada Naoki <songofacandy@gmail.com>	2022-08-01 22:15:07 +09:00
Dong-hee Na	50b2261bda	gh-91146: Reduce allocation size of list from str.split()/rsplit() (gh-95473)	2022-07-31 12:14:53 +09:00
Pamela Fox	70068b9336	Fix Unicode doc and replace use of macro with PyMem_New function (GH-94088)	2022-07-28 23:32:16 +01:00
Eric Snow	4a1dd73431	gh-94673: Add _PyStaticType_InitBuiltin() (#95152 ) This is the first of several precursors to storing tp_subclasses (and tp_weaklist) on the interpreter state for static builtin types. We do the following: * add `_PyStaticType_InitBuiltin()` * add `_Py_TPFLAGS_STATIC_BUILTIN` * set it on all static builtin types in `_PyStaticType_InitBuiltin()` * shuffle some code around to be able to use _PyStaticType_InitBuiltin() * rename `_PyStructSequence_InitType()` to `_PyStructSequence_InitBuiltinWithFlags()` * add `_PyStructSequence_InitBuiltin()`.	2022-07-25 12:47:31 -06:00
Kumar Aditya	9dff9f4814	GH-90699: Intern statically allocated strings (GH-93597) This is similar to how strings are interned for deepfreeze.	2022-07-08 10:47:37 -07:00
Eric Snow	caa279d6fd	bpo-40514: Drop EXPERIMENTAL_ISOLATED_SUBINTERPRETERS (gh-93185) This was added for bpo-40514 (gh-84694) to test out a per-interpreter GIL. However, it has since proven unnecessary to keep the experiment in the repo. (It can be done as a branch in a fork like normal.) So here we are removing: * the configure option * the macro * the code enabled by the macro	2022-05-27 17:38:01 -06:00
Kumar Aditya	cb04a09d2d	GH-93207: Remove HAVE_STDARG_PROTOTYPES configure check for stdarg.h (#93215 )	2022-05-27 13:30:45 +02:00
Victor Stinner	5f8c3fb997	gh-91924: Optimize unicode_check_encoding_errors() (#93200 ) Avoid _PyCodec_Lookup() and PyCodec_LookupError() for most common built-in encodings and error handlers to avoid creating a temporary Unicode string object, whereas these encodings and error handlers are known to be valid.	2022-05-27 00:39:49 +02:00
Victor Stinner	059b5baf98	gh-85858: Remove PyUnicode_InternImmortal() function (#92579 ) Remove the PyUnicode_InternImmortal() function and the SSTATE_INTERNED_IMMORTAL macro. The PyUnicode_InternImmortal() function is still exported in the stable ABI. The function is removed from the API. PyASCIIObject.state.interned size is now a single bit, rather than 2 bits. Keep SSTATE_NOT_INTERNED and SSTATE_INTERNED_MORTAL macros for backward compatibility, but no longer use them internally since the interned member is now a single bit and so can only have two values (interned or not interned). Update stats of _PyUnicode_ClearInterned().	2022-05-13 13:40:22 +02:00
Victor Stinner	f62ad4f2c4	gh-89653: Use int type for Unicode kind (#92704 ) Use the same type that PyUnicode_FromKindAndData() kind parameter type (public C API): int.	2022-05-13 12:41:05 +02:00
Inada Naoki	f9c9354a7a	gh-92536: PEP 623: Remove wstr and legacy APIs from Unicode (GH-92537)	2022-05-12 14:48:38 +09:00
Victor Stinner	804f2529d8	gh-91320: Use _PyCFunction_CAST() (#92251 ) Replace "(PyCFunction)(void()(void))func" cast with _PyCFunction_CAST(func). Change generated by the command: sed -i -e \ 's!(PyCFunction)(void(\)(void)) $[A-Za-z0-9_]\+$!_PyCFunction_CAST(\1)!g' \ $(find -name ".c")	2022-05-03 21:42:14 +02:00
Serhiy Storchaka	18b07d773e	bpo-36819: Fix crashes in built-in encoders with weird error handlers (GH-28593) If the error handler returns position less or equal than the starting position of non-encodable characters, most of built-in encoders didn't properly re-size the output buffer. This led to out-of-bounds writes, and segfaults.	2022-05-02 12:37:48 +03:00
Serhiy Storchaka	3483299a24	gh-81548: Deprecate octal escape sequences with value larger than 0o377 (GH-91668)	2022-04-30 13:16:27 +03:00
Dennis Sweeney	da6c78584b	gh-90667: Add specializations of Py_DECREF when types are known (GH-30872)	2022-04-19 19:02:19 +01:00
Kumar Aditya	ab0d35d70d	bpo-46712: share more global strings in deepfreeze (gh-32152) (for gh-90868)	2022-04-19 11:41:36 -06:00
Oleg Iarygin	2f0fc521f4	gh-91102: Use Argument Clinic for EncodingMap (#31725 ) Co-authored-by: Jelle Zijlstra <jelle.zijlstra@gmail.com>	2022-04-18 13:43:56 -07:00
Kumar Aditya	8c54c3dacc	gh-91576: Speed up iteration of strings (#91574 )	2022-04-18 07:18:27 -07:00
Tobias Stoeckmann	0859368335	gh-91421: Use constant value check during runtime (GH-91422) The left-hand side expression of the if-check can be converted to a constant by the compiler, but the addition on the right-hand side is performed during runtime. Move the addition from the right-hand side to the left-hand side by turning it into a subtraction there. Since the values are known to be large enough to not turn negative, this is a safe operation. Prevents a very unlikely integer overflow on 32 bit systems. Fixes GH-91421.	2022-04-12 20:01:02 -07:00
John Belmonte	b0b836b20c	bpo-45995: add "z" format specifer to coerce negative 0 to zero (GH-30049) Add "z" format specifier to coerce negative 0 to zero. See https://github.com/python/cpython/issues/90153 (originally https://bugs.python.org/issue45995) for discussion. This covers `str.format()` and f-strings. Old-style string interpolation is not supported. Co-authored-by: Mark Dickinson <dickinsm@gmail.com>	2022-04-11 15:34:18 +01:00
Raymond Hettinger	d6fb104690	Fix bad grammar and import docstring for split/rsplit (GH-32381)	2022-04-08 08:36:20 -05:00
Christian Heimes	44e915028d	bpo-47182: Fix crash by named unicode characters after interpreter reinitialization (GH-32212) Automerge-Triggered-By: GH:tiran	2022-03-31 08:14:50 -07:00
Victor Stinner	c14d7e4b81	bpo-47164: Add _PyASCIIObject_CAST() macro (GH-32191) Add macros to cast objects to PyASCIIObject, PyCompactUnicodeObject and PyUnicodeObject*: _PyASCIIObject_CAST(), _PyCompactUnicodeObject_CAST() and _PyUnicodeObject_CAST(). Using these new macros make the code more readable and check their argument with: assert(PyUnicode_Check(op)). Remove redundant assert(PyUnicode_Check(op)) in macros using directly or indirectly these new CAST macros. Replacing existing casts with these macros.	2022-03-31 09:59:27 +02:00
Pieter Eendebak	850687df47	bpo-47070: Add _PyBytes_Repeat() (GH-31999) Use it where appropriate: the repeat functions of `array.array`, `bytes`, `bytearray`, and `str`.	2022-03-28 04:43:45 -04:00
Jeremy Kloth	88872a29f1	bpo-47084: Clear Unicode cached representations on finalization (GH-32032)	2022-03-22 13:53:51 +01:00
Oleg Iarygin	a52f82baf2	bpo-46920: Remove disabled debug code added decades ago and likely unnecessary (GH-31812)	2022-03-14 17:03:21 +01:00
Jelle Zijlstra	54ab9ad312	bpo-46881: Fix refleak from GH-31616 (GH-31805)	2022-03-11 17:05:08 +08:00
Kumar Aditya	8714b6fa27	bpo-46881: Statically allocate and initialize the latin1 characters. (GH-31616)	2022-03-09 15:02:00 -08:00
Eric Snow	81c72044a1	bpo-46541: Replace core use of _Py_IDENTIFIER() with statically initialized global objects. (gh-30928) We're no longer using _Py_IDENTIFIER() (or _Py_static_string()) in any core CPython code. It is still used in a number of non-builtin stdlib modules. The replacement is: PyUnicodeObject (not pointer) fields under _PyRuntimeState, statically initialized as part of _PyRuntime. A new _Py_GET_GLOBAL_IDENTIFIER() macro facilitates lookup of the fields (along with _Py_GET_GLOBAL_STRING() for non-identifier strings). https://bugs.python.org/issue46541#msg411799 explains the rationale for this change. The core of the change is in: * (new) Include/internal/pycore_global_strings.h - the declarations for the global strings, along with the macros * Include/internal/pycore_runtime_init.h - added the static initializers for the global strings * Include/internal/pycore_global_objects.h - where the struct in pycore_global_strings.h is hooked into _PyRuntimeState * Tools/scripts/generate_global_objects.py - added generation of the global string declarations and static initializers I've also added a --check flag to generate_global_objects.py (along with make check-global-objects) to check for unused global strings. That check is added to the PR CI config. The remainder of this change updates the core code to use _Py_GET_GLOBAL_IDENTIFIER() instead of _Py_IDENTIFIER() and the related _PyId functions (likewise for _Py_GET_GLOBAL_STRING() instead of _Py_static_string()). This includes adding a few functions where there wasn't already an alternative to _PyId(), replacing the _Py_Identifier * parameter with PyObject . The following are not changed (yet): stop using _Py_IDENTIFIER() in the stdlib modules * (maybe) get rid of _Py_IDENTIFIER(), etc. entirely -- this may not be doable as at least one package on PyPI using this (private) API * (maybe) intern the strings during runtime init https://bugs.python.org/issue46541	2022-02-08 13:39:07 -07:00
Victor Stinner	b556f53785	bpo-46670: Test if a macro is defined, not its value (GH-31178) * audioop.c: #ifdef WORDS_BIGENDIAN * ctypes.h: #ifdef USING_MALLOC_CLOSURE_DOT_C * _ctypes/malloc_closure.c: #ifdef HAVE_FFI_CLOSURE_ALLOC and #ifdef USING_APPLE_OS_LIBFFI * pytime.c: #ifdef __APPLE__ * unicodeobject.c: #ifdef HAVE_NON_UNICODE_WCHAR_T_REPRESENTATION	2022-02-07 01:46:51 +01:00
Victor Stinner	1626bf4ac7	bpo-46417: Clear Unicode static types at exit (GH-30806) Add _PyUnicode_FiniTypes() function, called by finalize_interp_types(). It clears these static types: * EncodingMapType * PyFieldNameIter_Type * PyFormatterIter_Type _PyStaticType_Dealloc() now does nothing if tp_subclasses is not NULL.	2022-01-22 22:55:39 +01:00
Victor Stinner	a1bf329bca	bpo-46417: Add missing types of _PyTypes_InitTypes() (GH-30749) Add types removed by mistake by the commit adding _PyTypes_FiniTypes(). Move also PyBool_Type at the end, since it depends on PyLong_Type. PyBytes_Type and PyUnicode_Type no longer depend explicitly on PyBaseObject_Type: it's the default of PyType_Ready().	2022-01-21 17:53:13 +01:00
Victor Stinner	35d6540c90	bpo-46006: Revert "bpo-40521: Per-interpreter interned strings (GH-20085)" (GH-30422) This reverts commit `ea251806b8`. Keep "assert(interned == NULL);" in _PyUnicode_Fini(), but only for the main interpreter. Keep _PyUnicode_ClearInterned() changes avoiding the creation of a temporary Python list object.	2022-01-06 08:53:44 +01:00
Eric Snow	c8749b5783	bpo-46008: Make runtime-global object/type lifecycle functions and state consistent. (gh-29998) This change is strictly renames and moving code around. It helps in the following ways: * ensures type-related init functions focus strictly on one of the three aspects (state, objects, types) * passes in PyInterpreterState * to all those functions, simplifying work on moving types/objects/state to the interpreter * consistent naming conventions help make what's going on more clear * keeping API related to a type in the corresponding header file makes it more obvious where to look for it https://bugs.python.org/issue46008	2021-12-09 12:59:26 -07:00

1 2 3 4 5 ...

1611 Commits