cpython

Commit Graph

Author	SHA1	Message	Date
Tim Peters	8383915031	GH-116939: Rewrite binarysort() (#116940 ) Rewrote binarysort() for clarity. Also changed the signature to be more coherent (it was mixing sortslice with raw pointers). No change in method or functionality. However, I left some experiments in, disabled for now via `#if` tricks. Since this code was first written, some kinds of comparisons have gotten enormously faster (like for lists of floats), which changes the tradeoffs. For example, plain insertion sort's simpler innermost loop and highly predictable branches leave it very competitive (even beating, by a bit) binary insertion when comparisons are very cheap, despite that it can do many more compares. And it wins big on runs that are already sorted (moving the next one in takes only 1 compare then). So I left code for a plain insertion sort, to make future experimenting easier. Also made the maximum value of minrun a `#define` (``MAX_MINRUN`) to make experimenting with that easier too. And another bit of `#if``-disabled code rewrites binary insertion's innermost loop to remove its unpredictable branch. Surprisingly, this doesn't really seem to help overall. I'm unclear on why not. It certainly adds more instructions, but they're very simple, and it's hard to be believe they cost as much as a branch miss.	2024-03-21 22:27:25 -05:00
Donghee Na	a3cf0fada0	gh-116621: Specialize list.extend for dict items (gh-116888)	2024-03-19 12:18:07 +09:00
Victor Stinner	f6cdc6b4a1	Revert "gh-96844: Improve error message of list.remove (gh-106455)" (#116956 ) This reverts commit `217f47d6e5`.	2024-03-18 13:54:45 +00:00
Donghee Na	8da83f3386	gh-116621: Specialize list.extend for dict keys/values (gh-116816)	2024-03-15 23:48:34 +09:00
Tim Peters	bf121d6a69	GH-116554: Relax list.sort()'s notion of "descending" runs (#116578 ) * GH-116554: Relax list.sort()'s notion of "descending" run Rewrote `count_run()` so that sub-runs of equal elements no longer end a descending run. Both ascending and descending runs can have arbitrarily many sub-runs of arbitrarily many equal elements now. This is tricky, because we only use ``<`` comparisons, so checking for equality doesn't come "for free". Surprisingly, it turned out there's a very cheap (one comparison) way to determine whether an ascending run consisted of all-equal elements. That sealed the deal. In addition, after a descending run is reversed in-place, we now go on to see whether it can be extended by an ascending run that just happens to be adjacent. This succeeds in finding at least one additional element to append about half the time, and so appears to more than repay its cost (the savings come from getting to skip a binary search, when a short run is artificially forced to length MIINRUN later, for each new element `count_run()` can add to the initial run). While these have been in the back of my mind for years, a question on StackOverflow pushed it to action: https://stackoverflow.com/questions/78108792/ They were wondering why it took about 4x longer to sort a list like: [999_999, 999_999, ..., 2, 2, 1, 1, 0, 0] than "similar" lists. Of course that runs very much faster after this patch. Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com> Co-authored-by: Pieter Eendebak <pieter.eendebak@gmail.com>	2024-03-12 19:59:42 -05:00
Donghee Na	3325699ffa	gh-116621: Set manual critical section for list.extend (gh-116657)	2024-03-13 07:28:23 +09:00
Donghee Na	5b2f21faf3	gh-112087: Make list.sort to be thread-safe for PEP 703. (gh-116553)	2024-03-10 00:45:42 +00:00
Donghee Na	17d31bf384	gh-112087: Store memory allocation information into _PyListArray (gh-116529)	2024-03-09 23:50:28 +00:00
Ken Jin	41457c7fdb	gh-116381: Remove bad specializations, add fail stats (GH-116464) * Remove bad specializations, add fail stats	2024-03-08 00:21:21 +08:00
Ken Jin	7114cf20c0	gh-116381: Specialize CONTAINS_OP (GH-116385) * Specialize CONTAINS_OP * 📜🤖 Added by blurb_it. * Add PyAPI_FUNC for JIT --------- Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2024-03-07 03:30:11 +08:00
Donghee Na	d2f1b0eb49	gh-112087: Update list_get_item_ref to optimistically avoid locking (gh-116353) Co-authored-by: Sam Gross <colesbury@gmail.com>	2024-03-06 08:21:33 +09:00
Donghee Na	6cddc731fb	gh-112087: Make list_{slice, ass_slice, subscript} to be threadsafe (gh-116233)	2024-03-05 04:58:14 +00:00
Donghee Na	fb5e0344e4	gh-112087: Use QSBR technique for list_new/clear for free-thread build (gh-115875)	2024-03-02 08:30:35 +09:00
Donghee Na	259730bbb5	gh-112087: Make list_{concat, repeat, inplace_repeat, ass_item) to be thread-safe (gh-115605)	2024-02-21 01:38:09 +00:00
Sam Gross	520403ed4c	gh-115733: Fix crash involving exhausted list iterator (#115740 ) * gh-115733: Fix crash involving exhausted iterator * Add blurb	2024-02-21 05:18:44 +09:00
Donghee Na	321d13fd2b	gh-111968: Split _Py_dictkeys_freelist out of _Py_dict_freelist (gh-115505)	2024-02-16 01:01:36 +00:00
Donghee Na	a2d4281415	gh-112087: Make __sizeof__ and listiter_{len, next} to be threadsafe (gh-114843)	2024-02-15 02:00:50 +09:00
Donghee Na	f15795c9a0	gh-111968: Rename freelist related struct names to Eric's suggestion (gh-115329)	2024-02-14 00:32:51 +00:00
Donghee Na	d4d5bae147	gh-111968: Refactor _PyXXX_Fini to integrate with _PyObject_ClearFreeLists (gh-114899)	2024-02-10 00:57:04 +00:00
Donghee Na	f7a22a7055	gh-112087: Make list_{count, index, contains} to be thread-safe. (gh-114916)	2024-02-06 16:41:18 +00:00
Sam Gross	d0f1307580	gh-114329: Add `PyList_GetItemRef` function (GH-114504) The new `PyList_GetItemRef` is similar to `PyList_GetItem`, but returns a strong reference instead of a borrowed reference. Additionally, if the passed "list" object is not a list, the function sets a `TypeError` instead of calling `PyErr_BadInternalCall()`.	2024-02-02 14:03:15 +01:00
Donghee Na	13907968d7	gh-111968: Use per-thread freelists for dict in free-threading (gh-114323)	2024-02-01 20:53:53 +00:00
Donghee Na	7b9d406729	gh-112087: Make PyList_{Append,Size,GetSlice} to be thread-safe (gh-114651)	2024-02-01 08:58:08 +09:00
Donghee Na	f9c505698a	gh-112087: Make list_repr and list_length to be thread-safe (gh-114582)	2024-01-27 01:20:21 +09:00
Donghee Na	699779256e	gh-111968: Unify freelist naming schema to Eric's suggestion (gh-114581)	2024-01-27 00:25:16 +09:00
Donghee Na	d5442851a6	gh-112087: Remove duplicated critical_section (gh-114268)	2024-01-18 23:03:28 +00:00
Donghee Na	42b90cf0d6	gh-112087: Update list impl to be thread-safe with manual CS (gh-113863)	2024-01-16 09:11:14 +09:00
Donghee Na	efa738e862	gh-111968: Explicit handling for finalized freelist (gh-113929)	2024-01-12 00:31:28 +00:00
Donghee Na	c65ae26f2b	gh-111968: Unify naming scheme for freelist (gh-113919)	2024-01-11 08:51:51 +09:00
Donghee Na	57bdc6c30d	gh-111968: Introduce _PyFreeListState and _PyFreeListState_GET API (gh-113584)	2024-01-10 08:04:41 +09:00
Donghee Na	a023bc252d	gh-112087: Update list.{pop,clear,reverse,remove} to use CS (gh-113764)	2024-01-09 09:00:55 +09:00
Christopher Chavez	5d36a95e64	gh-111178: Avoid calling functions from incompatible pointer types in listobject.c (GH-112820) Fix undefined behavior warnings (UBSan -fsanitize=function), for example: Objects/object.c:674:11: runtime error: call to function list_repr through pointer to incorrect function type 'struct _object ()(struct _object *)' listobject.c:382: note: list_repr defined here SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior Objects/object.c:674:11 in	2024-01-02 14:41:32 +01:00
Victor Stinner	babb787047	gh-111138: Add PyList_Extend() and PyList_Clear() functions (#111862 ) * Split list_extend() into two sub-functions: list_extend_fast() and list_extend_iter(). * list_inplace_concat() no longer has to call Py_DECREF() on the list_extend() result, since list_extend() now returns an int.	2023-11-13 16:14:56 +00:00
scoder	24ddaee5ca	gh-106168: Revert the "size before item" setting (#111683 ) gh-106168: Update the size only after setting the item, to avoid temporary inconsistencies. Also remove the "what's new" sentence regarding the size setting since tuples cannot grow after allocation.	2023-11-03 11:02:39 +00:00
Victor Stinner	c494fb333b	gh-106320: Remove private _PyEval function (#108433 ) Move private _PyEval functions to the internal C API (pycore_ceval.h): * _PyEval_GetBuiltin() * _PyEval_GetBuiltinId() * _PyEval_GetSwitchInterval() * _PyEval_MakePendingCalls() * _PyEval_SetProfile() * _PyEval_SetSwitchInterval() * _PyEval_SetTrace() No longer export most of these functions.	2023-08-24 20:25:22 +02:00
Victor Stinner	0d6dfd68d2	gh-106320: Remove private _PyObject C API (#107147 ) Move private debug _PyObject functions to the internal C API (pycore_object.h): * _PyDebugAllocatorStats() * _PyObject_CheckConsistency() * _PyObject_DebugTypeStats() * _PyObject_IsFreed() No longer export most of these functions, except of _PyObject_IsFreed(). Move test functions using _PyObject_IsFreed() from _testcapi to _testinternalcapi. check_pyobject_is_freed() test no longer catch _testcapi.error: the tested function cannot raise _testcapi.error.	2023-07-23 20:09:08 +00:00
Dong-hee Na	217f47d6e5	gh-96844: Improve error message of list.remove (gh-106455)	2023-07-06 07:19:49 +09:00
Victor Stinner	35963da40f	gh-106320: Create pycore_modsupport.h header file (#106355 ) Remove the following functions from the C API, move them to the internal C API: add a new pycore_modsupport.h internal header file: * PyModule_CreateInitialized() * _PyArg_NoKwnames() * _Py_VaBuildStack() No longer export these functions.	2023-07-03 09:39:11 +00:00
Victor Stinner	3f8483cad2	gh-106168: PyTuple_SET_ITEM() now checks the index (#106164 ) PyTuple_SET_ITEM() and PyList_SET_ITEM() now check the index argument with an assertion if Python is built in debug mode or is built with assertions. * list_extend() and _PyList_AppendTakeRef() now set the list size before calling PyList_SET_ITEM(). * PyStructSequence_GetItem() and PyStructSequence_SetItem() now check the index argument: must be lesser than REAL_SIZE(op). * PyStructSequence_GET_ITEM() and PyStructSequence_SET_ITEM() are now aliases to PyStructSequence_GetItem() and PyStructSequence_SetItem().	2023-06-28 03:45:57 +02:00
Mark Shannon	7559f5fda9	GH-101291: Rearrange the size bits in PyLongObject (GH-102464) * Eliminate all remaining uses of Py_SIZE and Py_SET_SIZE on PyLongObject, adding asserts. * Change layout of size/sign bits in longobject to support future addition of immortal ints and tagged medium ints. * Add functions to hide some internals of long object, and for setting sign and digit count. * Replace uses of IS_MEDIUM_VALUE macro with _PyLong_IsCompact().	2023-03-22 14:49:51 +00:00
Jelle Zijlstra	d71edbd1b7	gh-101765: Fix refcount issues in list and unicode pickling (#102265 ) Followup from #101769.	2023-02-25 16:01:58 -08:00
Ionite	54dfa14c5a	gh-101765: Fix SystemError / segmentation fault in iter `__reduce__` when internal access of `builtins.__dict__` exhausts the iterator (#101769 )	2023-02-24 15:02:04 -08:00
Mark Shannon	c1b1f51cd1	GH-101291: Refactor the `PyLongObject` struct into object header and PyLongValue struct. (GH-101292)	2023-01-30 10:03:04 +00:00
L. A. F. Pereira	e6d4440782	gh-100146: Steal references from stack when building a list (#100147 ) When executing the BUILD_LIST opcode, steal the references from the stack, in a manner similar to the BUILD_TUPLE opcode. Implement this by offloading the logic to a new private API, _PyList_FromArraySteal(), that works similarly to _PyTuple_FromArraySteal(). This way, instead of performing multiple stack pointer adjustments while the list is being initialized, the stack is adjusted only once and a fast memory copy operation is performed in one fell swoop.	2023-01-03 10:49:49 -08:00
Pieter Eendebak	b3da698952	gh-94603: micro optimize list.pop (gh-94604)	2022-12-27 19:55:54 +09:00
Serhiy Storchaka	a87c46eab3	bpo-15999: Accept arbitrary values for boolean parameters. (#15609 ) builtins and extension module functions and methods that expect boolean values for parameters now accept any Python object rather than just a bool or int type. This is more consistent with how native Python code itself behaves.	2022-12-03 11:52:21 -08:00
Victor Stinner	85dd6cb6df	gh-99845: Use size_t type in __sizeof__() methods (#99846 ) The implementation of __sizeof__() methods using _PyObject_SIZE() now use an unsigned type (size_t) to compute the size, rather than a signed type (Py_ssize_t). Cast explicitly signed (Py_ssize_t) values to unsigned type (Py_ssize_t).	2022-11-30 17:22:52 +01:00
Victor Stinner	dbf8613a2e	gh-99300: Use Py_NewRef() in Objects/listobject.c (#99336 ) Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in Objects/listobject.c.	2022-11-10 22:09:18 +01:00
Victor Stinner	a5f092f3c4	gh-97616: list_resize() checks for integer overflow (#97617 ) Fix multiplying a list by an integer (list = int): detect the integer overflow when the new allocated length is close to the maximum size. Issue reported by Jordan Limor. list_resize() now checks for integer overflow before multiplying the new allocated length by the list item size (sizeof(PyObject)).	2022-09-29 00:07:07 +02:00
Nikita Sobolev	30cc1901ef	gh-96364: Fix text signatures of `__getitem__` for `list` and `dict` (GH-96365)	2022-09-09 17:37:02 +09:00

1 2 3 4 5 ...

505 Commits