cpython

Commit Graph

Author	SHA1	Message	Date
Serhiy Storchaka	d08c788822	gh-123497: New limit for Python integers on 64-bit platforms (GH-123724) Instead of be limited just by the size of addressable memory (2**63 bytes), Python integers are now also limited by the number of bits, so the number of bit now always fit in a 64-bit integer. Both limits are much larger than what might be available in practice, so it doesn't affect users. _PyLong_NumBits() and _PyLong_Frexp() are now always successful.	2024-09-29 10:40:20 +03:00
Serhiy Storchaka	c0c2aa7644	gh-122213: Add notes for pickle serialization errors (GH-122214) This allows to identify the source of the error.	2024-09-09 21:28:55 +03:00
Serhiy Storchaka	b2a8c38bb2	gh-122311: Improve and unify pickle errors (GH-122771) * Raise PicklingError instead of UnicodeEncodeError, ValueError and AttributeError in both implementations. * Chain the original exception to the pickle-specific one as __context__. * Include the error message of ImportError and some AttributeError in the PicklingError error message. * Unify error messages between Python and C implementations. * Refer to documented __reduce__ and __newobj__ callables instead of internal methods (e.g. save_reduce()) or pickle opcodes (e.g. NEWOBJ). * Include more details in error messages (what expected, what got). * Avoid including a potentially long repr of an arbitrary object in error messages.	2024-09-09 15:04:51 +03:00
Serhiy Storchaka	32c7dbb2bc	gh-121485: Always use 64-bit integers for integers bits count (GH-121486) Use 64-bit integers instead of platform specific size_t or Py_ssize_t to represent the number of bits in Python integer.	2024-08-30 08:13:24 +03:00
Serhiy Storchaka	0c3ea30238	gh-123431: Harmonize extension code checks in pickle (GH-123434) This checks are redundant in normal circumstances and can only work if the extension registry was intentionally broken. * The Python implementation now raises exception for the extension code with false boolean value. * Simplify the C code. RuntimeError is now raised in explicit checks. * Add many tests.	2024-08-29 08:26:16 +03:00
Kirill Podoprigora	94a4bd79a7	gh-122704: Fix reference leak in Modules/_pickle.c (GH-122705)	2024-08-06 08:57:36 +03:00
Serhiy Storchaka	1bb955a2fe	gh-122459: Optimize pickling by name objects without __module__ (GH-122460)	2024-08-05 16:21:32 +03:00
Serhiy Storchaka	68840e91ac	gh-122311: Fix a refleak in pickle (GH-122411)	2024-07-29 21:52:48 +03:00
Serhiy Storchaka	3b034d26eb	gh-122311: Fix some error messages in pickle (GH-122386)	2024-07-29 11:49:13 +03:00
Serhiy Storchaka	dc07f65a53	gh-82951: Fix serializing by name in pickle protocols < 4 (GH-122149) Serializing objects with complex __qualname__ (such as unbound methods and nested classes) by name no longer involves serializing parent objects by value in pickle protocols < 4.	2024-07-25 08:45:19 +00:00
Rodrigo Oliveira	d66b06107b	gh-118830: Bump pickle.DEFAULT_PROTOCOL to 5 (GH-119340)	2024-07-19 16:47:10 +02:00
Justin Applegate	92893fd8dc	gh-121137: Add missing Py_DECREF calls for ADDITEMS opcode of _pickle.c (#121136 ) PyObject_GetAttr returns a new reference, but this reference is never decremented using Py_DECREF, so Py_DECREF calls to this referece are added	2024-06-28 14:43:45 -07:00
Petr Viktorin	6f1d448bc1	gh-113993: Allow interned strings to be mortal, and fix related issues (GH-120520) * Add an InternalDocs file describing how interning should work and how to use it. * Add internal functions to explicitly request what kind of interning is done: - `_PyUnicode_InternMortal` - `_PyUnicode_InternImmortal` - `_PyUnicode_InternStatic` * Switch uses of `PyUnicode_InternInPlace` to those. * Disallow using `_Py_SetImmortal` on strings directly. You should use `_PyUnicode_InternImmortal` instead: - Strings should be interned before immortalization, otherwise you're possibly interning a immortalizing copy. - `_Py_SetImmortal` doesn't handle the `SSTATE_INTERNED_MORTAL` to `SSTATE_INTERNED_IMMORTAL` update, and those flags can't be changed in backports, as they are now part of public API and version-specific ABI. * Add private `_only_immortal` argument for `sys.getunicodeinternedsize`, used in refleak test machinery. * Make sure the statically allocated string singletons are unique. This means these sets are now disjoint: - `_Py_ID` - `_Py_STR` (including the empty string) - one-character latin-1 singletons Now, when you intern a singleton, that exact singleton will be interned. * Add a `_Py_LATIN1_CHR` macro, use it instead of `_Py_ID`/`_Py_STR` for one-character latin-1 singletons everywhere (including Clinic). * Intern `_Py_STR` singletons at startup. * For free-threaded builds, intern `_Py_LATIN1_CHR` singletons at startup. * Beef up the tests. Cover internal details (marked with `@cpython_only`). * Add lots of assertions Co-Authored-By: Eric Snow <ericsnowcurrently@gmail.com>	2024-06-21 17:19:31 +02:00
Brett Simmers	c2627d6eea	gh-116322: Add Py_mod_gil module slot (#116882 ) This PR adds the ability to enable the GIL if it was disabled at interpreter startup, and modifies the multi-phase module initialization path to enable the GIL when loading a module, unless that module's spec includes a slot indicating it can run safely without the GIL. PEP 703 called the constant for the slot `Py_mod_gil_not_used`; I went with `Py_MOD_GIL_NOT_USED` for consistency with gh-104148. A warning will be issued up to once per interpreter for the first GIL-using module that is loaded. If `-v` is given, a shorter message will be printed to stderr every time a GIL-using module is loaded (including the first one that issues a warning).	2024-05-03 11:30:55 -04:00
Donghee Na	94444ea45a	gh-112069: Add _PySet_NextEntryRef to be thread-safe. (gh-117990)	2024-04-19 00:18:22 +09:00
Steve Dower	7861dfd26a	gh-111140: Adds PyLong_AsNativeBytes and PyLong_FromNative[Unsigned]Bytes functions (GH-114886)	2024-02-12 20:13:13 +00:00
Serhiy Storchaka	89cee94b31	gh-89850: Add default C implementations of persistent_id() and persistent_load() (GH-113579) Previously the C implementation of pickle.Pickler and pickle.Unpickler classes did not have such methods and they could only be used if they were overloaded in subclasses or set as instance attributes. Fixed calling super().persistent_id() and super().persistent_load() in subclasses of the C implementation of pickle.Pickler and pickle.Unpickler classes. It no longer causes an infinite recursion.	2024-01-10 15:30:37 +02:00
kale-smoothie	967f2a3052	bpo-41422: Visit the Pickler's and Unpickler's memo in tp_traverse (GH-21664) Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>	2023-11-27 18:09:41 +00:00
Serhiy Storchaka	add16f1a5e	gh-108511: Add C API functions which do not silently ignore errors (GH-109025) Add the following functions: * PyObject_HasAttrWithError() * PyObject_HasAttrStringWithError() * PyMapping_HasKeyWithError() * PyMapping_HasKeyStringWithError()	2023-09-17 14:23:31 +03:00
Victor Stinner	a071ecb4d1	gh-106320: Remove private _PySys functions (#108452 ) Move private functions to the internal C API (pycore_sysmodule.h): * _PySys_GetAttr() * _PySys_GetSizeOf() No longer export most of these functions. Fix also a typo in Include/cpython/optimizer.h: add a missing space.	2023-08-24 20:02:09 +00:00
Victor Stinner	c55e73112c	gh-106320: Remove private PyLong C API functions (#108429 ) Remove private PyLong C API functions: * _PyLong_AsByteArray() * _PyLong_DivmodNear() * _PyLong_Format() * _PyLong_Frexp() * _PyLong_FromByteArray() * _PyLong_FromBytes() * _PyLong_GCD() * _PyLong_Lshift() * _PyLong_Rshift() Move these functions to the internal C API. No longer export _PyLong_FromBytes() function.	2023-08-24 18:53:50 +02:00
Brandt Bucher	05a824f294	GH-84436: Skip refcounting for known immortals (GH-107605)	2023-08-04 16:24:50 -07:00
Victor Stinner	1a3faba9f1	gh-106869: Use new PyMemberDef constant names (#106871 ) * Remove '#include "structmember.h"'. * If needed, add <stddef.h> to get offsetof() function. * Update Parser/asdl_c.py to regenerate Python/Python-ast.c. * Replace: * T_SHORT => Py_T_SHORT * T_INT => Py_T_INT * T_LONG => Py_T_LONG * T_FLOAT => Py_T_FLOAT * T_DOUBLE => Py_T_DOUBLE * T_STRING => Py_T_STRING * T_OBJECT => _Py_T_OBJECT * T_CHAR => Py_T_CHAR * T_BYTE => Py_T_BYTE * T_UBYTE => Py_T_UBYTE * T_USHORT => Py_T_USHORT * T_UINT => Py_T_UINT * T_ULONG => Py_T_ULONG * T_STRING_INPLACE => Py_T_STRING_INPLACE * T_BOOL => Py_T_BOOL * T_OBJECT_EX => Py_T_OBJECT_EX * T_LONGLONG => Py_T_LONGLONG * T_ULONGLONG => Py_T_ULONGLONG * T_PYSSIZET => Py_T_PYSSIZET * T_NONE => _Py_T_NONE * READONLY => Py_READONLY * PY_AUDIT_READ => Py_AUDIT_READ * READ_RESTRICTED => Py_AUDIT_READ * PY_WRITE_RESTRICTED => _Py_WRITE_RESTRICTED * RESTRICTED => (READ_RESTRICTED \| _Py_WRITE_RESTRICTED)	2023-07-25 15:28:30 +02:00
Victor Stinner	5e4af2a3e9	gh-106320: Move private _PySet API to the internal API (#107041 ) * Add pycore_setobject.h header file. * Move the following API to the internal C API: * _PySet_Dummy * _PySet_NextEntry() * _PySet_Update()	2023-07-22 17:04:34 +02:00
Victor Stinner	eda9ce1487	gh-106320: Move _PyNone_Type to the internal C API (#107030 ) Move private types _PyNone_Type and _PyNotImplemented_Type to internal C API.	2023-07-22 14:12:17 +00:00
Serhiy Storchaka	be1b968dc1	gh-106521: Remove _PyObject_LookupAttr() function (GH-106642)	2023-07-12 08:57:10 +03:00
Serhiy Storchaka	4bf43710d1	gh-106307: C API: Add PyMapping_GetOptionalItem() function (GH-106308) Also add PyMapping_GetOptionalItemString() function.	2023-07-11 23:04:12 +03:00
Victor Stinner	ec931fc394	gh-106320: Remove _PyBytesWriter C API (#106399 ) Remove the _PyBytesWriter C API: move it to the internal C API (pycore_bytesobject.h).	2023-07-04 08:27:23 +00:00
Erlend E. Aasland	217589d4f3	gh-105375: Improve error handling in _Unpickler_SetInputStream() (#105667 ) Prevent exceptions from possibly being overwritten in case of multiple failures.	2023-06-13 10:38:01 +02:00
Erlend E. Aasland	ca3cc4b95d	gh-105375: Explicitly initialise all {Pickler,Unpickler}Object fields (#105686 ) All fields must be explicitly initialised to prevent manipulation of uninitialised fields in dealloc. Align initialisation order with the layout of the object structs.	2023-06-12 23:35:07 +02:00
Erlend E. Aasland	89aac6f6b7	gh-105375: Improve _pickle error handling (#105475 ) Error handling was deferred in some cases, which could potentially lead to exceptions being overwritten.	2023-06-09 19:09:53 +02:00
Victor Stinner	ef300937c2	gh-92536: Remove PyUnicode_READY() calls (#105210 ) Since Python 3.12, PyUnicode_READY() does nothing and always returns 0.	2023-06-02 01:33:17 +02:00
Eric Snow	a9c6e0618f	gh-99113: Add Py_MOD_PER_INTERPRETER_GIL_SUPPORTED (gh-104205) Here we are doing no more than adding the value for Py_mod_multiple_interpreters and using it for stdlib modules. We will start checking for it in gh-104206 (once PyInterpreterState.ceval.own_gil is added in gh-104204).	2023-05-05 21:11:27 +00:00
Erlend E. Aasland	c00dcf0e38	gh-103092: Isolate `_pickle` module (#102982 ) Co-authored-by: Mohamed Koubaa <koubaa.m@gmail.com> Co-authored-by: Kumar Aditya <59607654+kumaraditya303@users.noreply.github.com>	2023-04-04 15:38:54 +05:30
Max Bachmann	c6858d1e7f	gh-102255: Improve build support for Windows API partitions (GH-102256) Add `MS_WINDOWS_DESKTOP`, `MS_WINDOWS_APPS`, `MS_WINDOWS_SYSTEM` and `MS_WINDOWS_GAMES` preprocessor definitions to allow switching off functionality missing from particular API partitions ("partitions" are used in Windows to identify overlapping subsets of APIs). CPython only officially supports `MS_WINDOWS_DESKTOP` and `MS_WINDOWS_SYSTEM` (APPS is included by normal desktop builds, but APPS without DESKTOP is not covered). Other configurations are a convenience for people building their own runtimes. `MS_WINDOWS_GAMES` is for the Xbox subset of the Windows API, which is also available on client OS, but is restricted compared to `MS_WINDOWS_DESKTOP`. These restrictions may change over time, as they relate to the build headers rather than the OS support, and so we assume that Xbox builds will use the latest available version of the GDK.	2023-03-09 21:09:12 +00:00
Victor Stinner	85dd6cb6df	gh-99845: Use size_t type in __sizeof__() methods (#99846 ) The implementation of __sizeof__() methods using _PyObject_SIZE() now use an unsigned type (size_t) to compute the size, rather than a signed type (Py_ssize_t). Cast explicitly signed (Py_ssize_t) values to unsigned type (Py_ssize_t).	2022-11-30 17:22:52 +01:00
Victor Stinner	81f7359f67	gh-99537: Use Py_SETREF(var, NULL) in C code (#99687 ) Replace "Py_DECREF(var); var = NULL;" with "Py_SETREF(var, NULL);".	2022-11-23 14:57:50 +01:00
Victor Stinner	7e3f09cad9	gh-99537: Use Py_SETREF() function in C code (#99656 ) Fix potential race condition in code patterns: * Replace "Py_DECREF(var); var = new;" with "Py_SETREF(var, new);" * Replace "Py_XDECREF(var); var = new;" with "Py_XSETREF(var, new);" * Replace "Py_CLEAR(var); var = new;" with "Py_XSETREF(var, new);" Other changes: * Replace "old = var; var = new; Py_DECREF(var)" with "Py_SETREF(var, new);" * Replace "old = var; var = new; Py_XDECREF(var)" with "Py_XSETREF(var, new);" * And remove the "old" variable.	2022-11-22 14:22:22 +01:00
Victor Stinner	7e4dec02ac	gh-99300: Use Py_NewRef() in Modules/ directory (#99467 ) Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in test C files of the Modules/ directory.	2022-11-14 13:08:43 +01:00
Harshil	cfec5b18bf	remove new line in pickle exception message (GH-31782)	2022-11-07 07:43:39 +00:00
Shantanu	d3b82b4463	gh-83004: Clean up refleak in _pickle initialisation (#98841 )	2022-11-06 06:05:13 -08:00
Kumar Aditya	01ef1f95da	GH-89988: Fix memory leak in pickle.Pickler dispatch_table lookup (GH-94298)	2022-06-28 10:01:43 +03:00
Serhiy Storchaka	6fd4c8ec77	gh-93741: Add private C API _PyImport_GetModuleAttrString() (GH-93742) It combines PyImport_ImportModule() and PyObject_GetAttrString() and saves 4-6 lines of code on every use. Add also _PyImport_GetModuleAttr() which takes Python strings as arguments.	2022-06-14 07:15:26 +03:00
Dennis Sweeney	4c496f1f11	gh-92930: _pickle.c: Acquire strong references before calling save() (GH-92931)	2022-06-10 21:50:11 -04:00
Victor Stinner	f62ad4f2c4	gh-89653: Use int type for Unicode kind (#92704 ) Use the same type that PyUnicode_FromKindAndData() kind parameter type (public C API): int.	2022-05-13 12:41:05 +02:00
Victor Stinner	d716a0dfe2	Use static inline function Py_EnterRecursiveCall() (#91988 ) Currently, calling Py_EnterRecursiveCall() and Py_LeaveRecursiveCall() may use a function call or a static inline function call, depending if the internal pycore_ceval.h header file is included or not. Use a different name for the static inline function to ensure that the static inline function is always used in Python internals for best performance. Similar approach than PyThreadState_GET() (function call) and _PyThreadState_GET() (static inline function). * Rename _Py_EnterRecursiveCall() to _Py_EnterRecursiveCallTstate() * Rename _Py_LeaveRecursiveCall() to _Py_LeaveRecursiveCallTstate() * pycore_ceval.h: Rename Py_EnterRecursiveCall() to _Py_EnterRecursiveCall() and Py_LeaveRecursiveCall() and _Py_LeaveRecursiveCall()	2022-05-04 13:30:23 +02:00
Victor Stinner	7cdaf87ec5	gh-91731: Replace Py_BUILD_ASSERT() with static_assert() (#91730 ) Python 3.11 now uses C11 standard which adds static_assert() to <assert.h>. * In pytime.c, replace Py_BUILD_ASSERT() with preprocessor checks on SIZEOF_TIME_T with #error. * On macOS, py_mach_timebase_info() now accepts timebase members with the same size than _PyTime_t. * py_get_monotonic_clock() now saturates GetTickCount64() to _PyTime_MAX: GetTickCount64() is unsigned, whereas _PyTime_t is signed.	2022-04-20 19:26:40 +02:00
Kumar Aditya	ab0d35d70d	bpo-46712: share more global strings in deepfreeze (gh-32152) (for gh-90868)	2022-04-19 11:41:36 -06:00
Victor Stinner	882d8096c2	bpo-46906: Add PyFloat_Pack8() to the C API (GH-31657) Add new functions to pack and unpack C double (serialize and deserialize): * PyFloat_Pack2(), PyFloat_Pack4(), PyFloat_Pack8() * PyFloat_Unpack2(), PyFloat_Unpack4(), PyFloat_Unpack8() Document these functions and add unit tests. Rename private functions and move them from the internal C API to the public C API: * _PyFloat_Pack2() => PyFloat_Pack2() * _PyFloat_Pack4() => PyFloat_Pack4() * _PyFloat_Pack8() => PyFloat_Pack8() * _PyFloat_Unpack2() => PyFloat_Unpack2() * _PyFloat_Unpack4() => PyFloat_Unpack4() * _PyFloat_Unpack8() => PyFloat_Unpack8() Replace the "unsigned char" type with "char" which is more common and easy to use.	2022-03-12 00:10:02 +01:00
Eric Snow	81c72044a1	bpo-46541: Replace core use of _Py_IDENTIFIER() with statically initialized global objects. (gh-30928) We're no longer using _Py_IDENTIFIER() (or _Py_static_string()) in any core CPython code. It is still used in a number of non-builtin stdlib modules. The replacement is: PyUnicodeObject (not pointer) fields under _PyRuntimeState, statically initialized as part of _PyRuntime. A new _Py_GET_GLOBAL_IDENTIFIER() macro facilitates lookup of the fields (along with _Py_GET_GLOBAL_STRING() for non-identifier strings). https://bugs.python.org/issue46541#msg411799 explains the rationale for this change. The core of the change is in: * (new) Include/internal/pycore_global_strings.h - the declarations for the global strings, along with the macros * Include/internal/pycore_runtime_init.h - added the static initializers for the global strings * Include/internal/pycore_global_objects.h - where the struct in pycore_global_strings.h is hooked into _PyRuntimeState * Tools/scripts/generate_global_objects.py - added generation of the global string declarations and static initializers I've also added a --check flag to generate_global_objects.py (along with make check-global-objects) to check for unused global strings. That check is added to the PR CI config. The remainder of this change updates the core code to use _Py_GET_GLOBAL_IDENTIFIER() instead of _Py_IDENTIFIER() and the related _PyId functions (likewise for _Py_GET_GLOBAL_STRING() instead of _Py_static_string()). This includes adding a few functions where there wasn't already an alternative to _PyId(), replacing the _Py_Identifier * parameter with PyObject . The following are not changed (yet): stop using _Py_IDENTIFIER() in the stdlib modules * (maybe) get rid of _Py_IDENTIFIER(), etc. entirely -- this may not be doable as at least one package on PyPI using this (private) API * (maybe) intern the strings during runtime init https://bugs.python.org/issue46541	2022-02-08 13:39:07 -07:00

1 2 3 4 5 ...

372 Commits