cpython

Commit Graph

Author	SHA1	Message	Date
Mark Shannon	5b05d452cd	GH-118095: Add tier 2 support for YIELD_VALUE (GH-118380)	2024-04-30 11:33:13 +01:00
Eric Snow	529a160be6	gh-117953: Share More Machinery Code Between Builtin and Dynamic Extensions (gh-118204) This change will make some later changes simpler. It also brings more consistent behavior and lower maintenance costs.	2024-04-29 12:53:04 -06:00
Sam Gross	7ccacb220d	gh-117783: Immortalize objects that use deferred reference counting (#118112 ) Deferred reference counting is not fully implemented yet. As a temporary measure, we immortalize objects that would use deferred reference counting to avoid multi-threaded scaling bottlenecks. This is only performed in the free-threaded build once the first non-main thread is started. Additionally, some tests, including refleak tests, suppress this behavior.	2024-04-29 14:36:02 -04:00
Eric Snow	44f57a952e	gh-117953: Split Up _PyImport_LoadDynamicModuleWithSpec() (gh-118203) Basically, I've turned most of _PyImport_LoadDynamicModuleWithSpec() into two new functions (_PyImport_GetModInitFunc() and _PyImport_RunModInitFunc()) and moved the rest of it out into _imp_create_dynamic_impl(). There shouldn't be any changes in behavior. This change makes some future changes simpler. This is particularly relevant to potentially calling each module init function in the main interpreter first. Thus the critical part of the PR is the addition of _PyImport_RunModInitFunc(), which is strictly focused on running the init func and validating the result. A later PR will take it a step farther by capturing error information rather than raising exceptions. FWIW, this change also helps readers by clarifying a bit more about what happens when an extension/builtin module is imported.	2024-04-29 09:29:07 -06:00
Mark Shannon	ab6eda0ee5	GH-118095: Allow a variant of RESUME_CHECK in tier 2 (GH-118286)	2024-04-29 07:54:05 +01:00
Eric Snow	1d33925176	gh-110693: Use a Larger Queue for Per-Interpreter Pending Calls (gh-118302) This is an improvement over the status quo, reducing the likelihood of completely filling the pending calls queue. However, the problem won't go away completely unless we move to an unbounded linked list or add a mechanism for waiting until the queue isn't full.	2024-04-26 19:13:44 -06:00
Mark Shannon	3e06c7f719	GH-118095: Add dynamic exit support and FOR_ITER_GEN support to tier 2 (GH-118279)	2024-04-26 18:08:50 +01:00
Eric Snow	09c2947581	gh-110693: Pending Calls Machinery Cleanups (gh-118296) This does some cleanup in preparation for later changes.	2024-04-26 01:05:51 +00:00
Dino Viehland	5da0280648	gh-117657: Fixes a few small TSAN issues in dictobject (#118200 ) Fixup TSAN errors for dict	2024-04-25 08:53:29 -07:00
neonene	2c45148912	gh-117578: Introduce _PyType_GetModuleByDef2 private function (GH-117661) Co-authored-by: Erlend E. Aasland <erlend.aasland@protonmail.com> Co-authored-by: Petr Viktorin <encukou@gmail.com>	2024-04-25 13:51:31 +02:00
Mark Shannon	f180b31e76	GH-118095: Handle `RETURN_GENERATOR` in tier 2 (GH-118180)	2024-04-25 11:32:47 +01:00
Nice Zombies	10bb90ed49	gh-102511: Speed up os.path.splitroot() with native helpers (GH-118089)	2024-04-25 10:07:38 +01:00
Eric Snow	5865fa5f9b	gh-117953: Add Internal struct _Py_ext_module_loader_info (gh-118194) This helps with a later change that splits up _PyImport_LoadDynamicModuleWithSpec().	2024-04-24 17:42:01 +00:00
Eric Snow	03e3e31723	gh-76785: Rename _xxsubinterpreters to _interpreters (gh-117791) See https://discuss.python.org/t/pep-734-multiple-interpreters-in-the-stdlib/41147/26.	2024-04-24 16:18:24 +00:00
Eric Snow	af3c1d817d	gh-117953: Cleanups For fix_up_extension() in import.c (gh-118192) These are cleanups I've pulled out of gh-118116. Mostly, this change moves code around to align with some future changes and to improve clarity a little. There is one very small change in behavior: we now add the module to the per-interpreter caches after updating the global state, rather than before.	2024-04-24 09:55:48 -06:00
Mark Shannon	77cd0428b6	GH-118095: Convert DEOPT_IFs on likely side exits to EXIT_IFs (GH-118106) Convert DEOPT_IFs on likely side exits to EXIT_IFs	2024-04-24 14:37:55 +01:00
Irit Katriel	0aa0fc3d3c	gh-117901: Add option for compiler's codegen to save nested instruction sequences for introspection (#118007 )	2024-04-24 09:46:17 +00:00
Eric Snow	23950beff8	gh-117953: Small Cleanup of Extensions-Related Machinery Code (gh-118167) This is a collection of very basic cleanups I've pulled out of gh-118116. It is mostly renaming variables and moving a couple bits of code in functionally equivalent ways.	2024-04-23 08:25:50 -06:00
Shantanu	8e86579cae	gh-95754: Better error when script shadows a standard library or third party module (#113769 )	2024-04-22 18:24:21 -07:00
Guido van Rossum	4c7bfdff90	Remove more remnants of deepfreeze (#118159 )	2024-04-22 12:17:57 -07:00
Mark Shannon	a6647d16ab	GH-115480: Reduce guard strength for binary ops when type of one operand is known already (GH-118050)	2024-04-22 13:34:06 +01:00
Dino Viehland	8b541c017e	gh-112075: Make instance attributes stored in inline "dict" thread safe (#114742 ) Make instance attributes stored in inline "dict" thread safe on free-threaded builds	2024-04-21 22:57:05 -07:00
Dino Viehland	07525c9a85	gh-116818: Make `sys.settrace`, `sys.setprofile`, and monitoring thread-safe (#116775 ) Makes sys.settrace, sys.setprofile, and monitoring generally thread-safe. Mostly uses a stop-the-world approach and synchronization around the code object's _co_instrumentation_version. There may be a little bit of extra synchronization around the monitoring data that's required to be TSAN clean.	2024-04-19 14:47:42 -07:00
Mark Shannon	7e6fa5fced	GH-116202: Incorporate invalidation check into _START_EXECUTOR. (GH-118044)	2024-04-19 09:26:42 +01:00
Mark Shannon	d3bd6b5f3f	GH-115419: Improve list of escaping functions (GH-118054)	2024-04-19 09:25:07 +01:00
Donghee Na	94444ea45a	gh-112069: Add _PySet_NextEntryRef to be thread-safe. (gh-117990)	2024-04-19 00:18:22 +09:00
Irit Katriel	c179c0e6cb	gh-117680: make _PyInstructionSequence a PyObject and use it in tests (#117629 )	2024-04-17 16:42:04 +01:00
Mark Shannon	147cd0581e	GH-117760: Streamline the trashcan mechanism (GH-117763)	2024-04-17 11:08:05 +01:00
Jeff Glass	acf69e09c6	gh-115178: Add Counts of UOp Pairs to pystats (GH-115181)	2024-04-16 14:27:18 +01:00
Victor Stinner	2cc916e147	gh-117613: Enhance test_clinic @defining_class tests (#117896 )	2024-04-16 09:32:51 +02:00
Eric Snow	eca53620e3	gh-94673: Clarify About Runtime State Related to Static Builtin Types (gh-117761) Guido pointed out to me that some details about the per-interpreter state for the builtin types aren't especially clear. I'm addressing that by: * adding a comment explaining that state * adding some asserts to point out the relationship between each index and the interp/global runtime state	2024-04-12 16:39:27 -06:00
Sam Gross	4ad8f090cc	gh-117376: Partial implementation of deferred reference counting (#117696) This marks objects as using deferred reference counting using the `ob_gc_bits` field in the free-threaded build and collects those objects during GC.	2024-04-12 17:36:20 +00:00
Serhiy Storchaka	39a6b29756	gh-117764: Use Argument Clinic for signal.set_wakeup_fd() (GH-117777)	2024-04-12 11:21:00 +00:00
Erlend E. Aasland	deb921f851	gh-117431: Adapt bytes and bytearray .find() and friends to Argument Clinic (#117502 ) This change gives a significant speedup, as the METH_FASTCALL calling convention is now used. The following bytes and bytearray methods are adapted: - count() - find() - index() - rfind() - rindex() Co-authored-by: Inada Naoki <songofacandy@gmail.com>	2024-04-12 07:40:55 +00:00
Eric Snow	fd259fdabe	gh-76785: Handle Legacy Interpreters Properly (gh-117490) This is similar to the situation with threading._DummyThread. The methods (incl. __del__()) of interpreters.Interpreter objects must be careful with interpreters not created by interpreters.create(). The simplest thing to start with is to disable any method that modifies or runs in the interpreter. As part of this, the runtime keeps track of where an interpreter was created. We also handle interpreter "refcounts" properly.	2024-04-11 23:23:25 +00:00
Brett Simmers	f268e328ed	gh-116738: Make _abc module thread-safe (#117488 ) A collection of small changes aimed at making the `_abc` module safe to use in a free-threaded build.	2024-04-11 18:13:25 -04:00
Eric Snow	993c3cca16	gh-76785: Add More Tests to test_interpreters.test_api (gh-117662) In addition to the increased test coverage, this is a precursor to sorting out how we handle interpreters created directly via the C-API.	2024-04-10 18:37:01 -06:00
Sam Gross	1a6594f661	gh-117439: Make refleak checking thread-safe without the GIL (#117469 ) This keeps track of the per-thread total reference count operations in PyThreadState in the free-threaded builds. The count is merged into the interpreter's total when the thread exits.	2024-04-08 12:11:36 -04:00
mpage	df73179048	gh-111926: Make weakrefs thread-safe in free-threaded builds (#117168 ) Most mutable data is protected by a striped lock that is keyed on the referenced object's address. The weakref's hash is protected using the weakref's per-object lock. Note that this only affects free-threaded builds. Apart from some minor refactoring, the added code is all either gated by `ifdef`s or is a no-op (e.g. `Py_BEGIN_CRITICAL_SECTION`).	2024-04-08 10:58:38 -04:00
Ken Jin	375425abd1	Cases generator: Remove type_prop and passthrough (#117614 )	2024-04-08 06:26:52 +08:00
Michael Droettboom	b5e60918af	gh-117549: Match declaration order for _Py_BackoffCounter initializer (#117551 ) Otherwise it might not compile with C++ (or certain C compilers/flags?).	2024-04-04 14:14:35 -07:00
Dino Viehland	434bc593df	gh-112075: Make _PyDict_LoadGlobal thread safe (#117529 ) Make _PyDict_LoadGlobal threadsafe	2024-04-04 12:26:07 -07:00
Irit Katriel	04697bcfaf	gh-117494: extract the Instruction Sequence data structure into a separate file (#117496 )	2024-04-04 15:47:26 +00:00
Guido van Rossum	060a96f1a9	gh-116968: Reimplement Tier 2 counters (#117144 ) Introduce a unified 16-bit backoff counter type (``_Py_BackoffCounter``), shared between the Tier 1 adaptive specializer and the Tier 2 optimizer. The API used for adaptive specialization counters is changed but the behavior is (supposed to be) identical. The behavior of the Tier 2 counters is changed: - There are no longer dynamic thresholds (we never varied these). - All counters now use the same exponential backoff. - The counter for ``JUMP_BACKWARD`` starts counting down from 16. - The ``temperature`` in side exits starts counting down from 64.	2024-04-04 15:03:27 +00:00
Peter Lazorchak	1c43468886	gh-116168: Remove extra `_CHECK_STACK_SPACE` uops (#117242 ) This merges all `_CHECK_STACK_SPACE` uops in a trace into a single `_CHECK_STACK_SPACE_OPERAND` uop that checks whether there is enough stack space for all calls included in the entire trace.	2024-04-03 17:14:18 +00:00
Erlend E. Aasland	595bb496b0	gh-117431: Adapt bytes and bytearray .startswith() and .endswith() to Argument Clinic (#117495 ) This change gives a significant speedup, as the METH_FASTCALL calling convention is now used.	2024-04-03 13:11:14 +02:00
Eric Snow	f341d6017d	gh-76785: Add PyInterpreterConfig Helpers (gh-117170) These helpers make it easier to customize and inspect the config used to initialize interpreters. This is especially valuable in our tests. I found inspiration from the PyConfig API for the PyInterpreterConfig dict conversion stuff. As part of this PR I've also added a bunch of tests.	2024-04-02 20:35:52 +00:00
Mark Shannon	c32dc47aca	GH-115776: Embed the values array into the object, for "normal" Python objects. (GH-116115)	2024-04-02 11:59:21 +01:00
Irit Katriel	1d5479b236	gh-117411: move PyFutureFeatures to pycore_symtable.h and make it private (#117412 )	2024-04-02 10:34:49 +00:00
Sam Gross	19c1dd60c5	gh-117323: Make `cell` thread-safe in free-threaded builds (#117330 ) Use critical sections to lock around accesses to cell contents. The critical sections are no-ops in the default (with GIL) build.	2024-03-29 13:35:43 -04:00
Erlend E. Aasland	c1712ef066	gh-116664: Make module state Py_SETREF's in _warnings thread-safe (#116959 ) Mark the swap operations as critical sections. Add an internal Py_BEGIN_CRITICAL_SECTION_MUT API that takes a PyMutex pointer instead of a PyObject pointer.	2024-03-28 15:05:08 +00:00
Irit Katriel	262fb911ab	gh-117288: Allocate fewer label IDs in _PyCfg_ToInstructionSequence (#117290 )	2024-03-27 17:38:19 +00:00
Irit Katriel	79be75735c	gh-115775: Compiler adds __static_attributes__ field to classes (#115913 )	2024-03-26 15:18:17 +00:00
Mark Shannon	8bef34f625	GH-117108: Set the "old space bit" to "visited" for all young objects (#117213 ) Change old space bit of young objects from 0 to gcstate->visited_space. This ensures that any object created and collected during cycle GC has the bit set correctly.	2024-03-26 11:11:42 +00:00
Mark Shannon	bf82f77957	GH-116422: Tier2 hot/cold splitting (GH-116813) Splits the "cold" path, deopts and exits, from the "hot" path, reducing the size of most jitted instructions, at the cost of slower exits.	2024-03-26 09:35:11 +00:00
Mark Shannon	e28477f214	GH-117108: Change the size of the GC increment to about 1% of the total heap size. (GH-117120)	2024-03-22 18:43:25 +00:00
Guido van Rossum	570a82d46a	gh-117045: Add code object to function version cache (#117028 ) Changes to the function version cache: - In addition to the function object, also store the code object, and allow the latter to be retrieved even if the function has been evicted. - Stop assigning new function versions after a critical attribute (e.g. `__code__`) has been modified; the version is permanently reset to zero in this case. - Changes to `__annotations__` are no longer considered critical. (This fixes gh-109998.) Changes to the Tier 2 optimization machinery: - If we cannot map a function version to a function, but it is still mapped to a code object, we continue projecting the trace. The operand of the `_PUSH_FRAME` and `_POP_FRAME` opcodes can be either NULL, a function object, or a code object with the lowest bit set. This allows us to trace through code that calls an ephemeral function, i.e., a function that may not be alive when we are constructing the executor, e.g. a generator expression or certain nested functions. We will lose globals removal inside such functions, but we can still do other peephole operations (and even possibly [call inlining](https://github.com/python/cpython/pull/116290), if we decide to do it), which only need the code object. As before, if we cannot retrieve the code object from the cache, we stop projecting.	2024-03-21 12:37:41 -07:00
Sam Gross	1f72fb5447	gh-116522: Refactor `_PyThreadState_DeleteExcept` (#117131 ) Split `_PyThreadState_DeleteExcept` into two functions: - `_PyThreadState_RemoveExcept` removes all thread states other than one passed as an argument. It returns the removed thread states as a linked list. - `_PyThreadState_DeleteList` deletes those dead thread states. It may call destructors, so we want to "start the world" before calling `_PyThreadState_DeleteList` to avoid potential deadlocks.	2024-03-21 11:21:02 -07:00
Michael Droettboom	50369e6c34	gh-116996: Add pystats about _Py_uop_analyse_and_optimize (GH-116997)	2024-03-22 01:27:46 +08:00
Eric Snow	617158e078	gh-76785: Drop PyInterpreterID_Type (gh-117101) I added it quite a while ago as a strategy for managing interpreter lifetimes relative to the PEP 554 (now 734) implementation. Relatively recently I refactored that implementation to no longer rely on InterpreterID objects. Thus now I'm removing it.	2024-03-21 17:15:02 +00:00
Victor Stinner	8bea6c411d	gh-115754: Add Py_GetConstant() function (#116883) Add Py_GetConstant() and Py_GetConstantBorrowed() functions. In the limited C API version 3.13, getting Py_None, Py_False, Py_True, Py_Ellipsis and Py_NotImplemented singletons is now implemented as function calls at the stable ABI level to hide implementation details. Getting these constants still returns borrowed references. Add _testlimitedcapi/object.c and test_capi/test_object.py to test Py_GetConstant() and Py_GetConstantBorrowed() functions.	2024-03-21 16:07:00 +00:00
Eric Snow	5a76d1be8e	gh-105716: Update interp->threads.main After Fork (gh-117049) I missed this in gh-109921. We also update Py_Exit() to call _PyInterpreterState_SetNotRunningMain(), if necessary.	2024-03-21 10:06:35 -06:00
Eric Snow	bbee57fa8c	gh-76785: Clean Up Interpreter ID Conversions (gh-117048) Mostly we unify the two different implementations of the conversion code (from PyObject * to int64_t). We also drop the PyArg_ParseTuple()-style converter function, as well as rename and move PyInterpreterID_LookUp().	2024-03-21 09:56:12 -06:00
Mark Shannon	15309329b6	GH-108362: Incremental Cycle GC (GH-116206)	2024-03-20 08:54:42 +00:00
Guido van Rossum	7e1f38f2de	gh-116916: Remove separate next_func_version counter (#116918 ) Somehow we ended up with two separate counter variables tracking "the next function version". Most likely this was a historical accident where an old branch was updated incorrectly. This PR merges the two counters into a single one: `interp->func_state.next_version`.	2024-03-18 11:11:10 -07:00
Victor Stinner	5e0a070dfe	gh-116809: Restore removed _PyErr_ChainExceptions1() function (#116900 )	2024-03-16 21:37:11 +01:00
mpage	33da0e844c	gh-114271: Fix race in `Thread.join()` (#114839 ) There is a race between when `Thread._tstate_lock` is released[^1] in `Thread._wait_for_tstate_lock()` and when `Thread._stop()` asserts[^2] that it is unlocked. Consider the following execution involving threads A, B, and C: 1. A starts. 2. B joins A, blocking on its `_tstate_lock`. 3. C joins A, blocking on its `_tstate_lock`. 4. A finishes and releases its `_tstate_lock`. 5. B acquires A's `_tstate_lock` in `_wait_for_tstate_lock()`, releases it, but is swapped out before calling `_stop()`. 6. C is scheduled, acquires A's `_tstate_lock` in `_wait_for_tstate_lock()` but is swapped out before releasing it. 7. B is scheduled, calls `_stop()`, which asserts that A's `_tstate_lock` is not held. However, C holds it, so the assertion fails. The race can be reproduced[^3] by inserting sleeps at the appropriate points in the threading code. To do so, run the `repro_join_race.py` from the linked repo. There are two main parts to this PR: 1. `_tstate_lock` is replaced with an event that is attached to `PyThreadState`. The event is set by the runtime prior to the thread being cleared (in the same place that `_tstate_lock` was released). `Thread.join()` blocks waiting for the event to be set. 2. `_PyInterpreterState_WaitForThreads()` provides the ability to wait for all non-daemon threads to exit. To do so, an `is_daemon` predicate was added to `PyThreadState`. This field is set each time a thread is created. `threading._shutdown()` now calls into `_PyInterpreterState_WaitForThreads()` instead of waiting on `_tstate_lock`s. [^1]: `441affc9e7/Lib/threading.py (L1201)` [^2]: `441affc9e7/Lib/threading.py (L1115)` [^3]: `8194653279` --------- Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com> Co-authored-by: Antoine Pitrou <antoine@python.org>	2024-03-16 13:56:30 +01:00
Mark Shannon	2cf18a4430	GH-116422: Modify a few uops so that they can be supported by tier 2 with hot/cold splitting (GH-116832)	2024-03-15 10:48:00 +00:00
Victor Stinner	7bbb9b57e6	gh-111696, PEP 737: Add %T and %N to PyUnicode_FromFormat() (#116839 )	2024-03-14 22:23:00 +00:00
Victor Stinner	c432df6d56	gh-111696, PEP 737: Add PyType_GetModuleName() function (#116824 ) Co-authored-by: Eric Snow <ericsnowcurrently@gmail.com>	2024-03-14 18:17:43 +00:00
Mark Shannon	61e54bfcee	GH-116422: Factor out eval breaker checks at end of calls into its own micro-op. (GH-116817)	2024-03-14 16:31:47 +00:00
Matthias Diener	3265087c07	Fix code comment regarding DK_ENTRIES (GH-113960) fix code comment regarding dict entries	2024-03-12 15:05:30 +01:00
Victor Stinner	3cc5ae5c2c	gh-85283: Convert grp extension to the limited C API (#116611 ) posixmodule.h: remove check on the limited C API, since these helpers are not part of the public C API.	2024-03-12 00:46:53 +00:00
Victor Stinner	113053a070	gh-110850: Fix _PyTime_FromSecondsDouble() API (#116606 ) Return 0 on success. Set an exception and return -1 on error. Fix os.timerfd_settime(): properly report exceptions on _PyTime_FromSecondsDouble() failure. No longer export _PyTime_FromSecondsDouble().	2024-03-11 16:35:29 +00:00
Brett Simmers	2731913dd5	gh-116167: Allow disabling the GIL with `PYTHON_GIL=0` or `-X gil=0` (#116338 ) In free-threaded builds, running with `PYTHON_GIL=0` will now disable the GIL. Follow-up issues track work to re-enable the GIL when loading an incompatible extension, and to disable the GIL by default. In order to support re-enabling the GIL at runtime, all GIL-related data structures are initialized as usual, and disabling the GIL simply sets a flag that causes `take_gil()` and `drop_gil()` to return early.	2024-03-11 11:02:58 -04:00
Mark Shannon	b6ae6da1bd	GH-116596: Better determination of escaping uops. (GH-116597)	2024-03-11 13:37:48 +00:00
Donghee Na	6c4fc209e1	gh-112536: Define MI_TSAN to 1 for --with-mimalloc and --with-thread-sanitizer (gh-116558)	2024-03-11 22:25:55 +09:00
Mark Shannon	4e5df2013f	GH-116468: Use constants instead of `oparg` in stack effects when `oparg` is known to be a constant. (GH-116469)	2024-03-11 09:30:15 +00:00
Dino Viehland	7db871e4fa	gh-112075: Support freeing object memory via QSBR (#116344 ) Free objects with qsbr if shared	2024-03-08 09:56:36 -08:00
Ken Jin	41457c7fdb	gh-116381: Remove bad specializations, add fail stats (GH-116464) * Remove bad specializations, add fail stats	2024-03-08 00:21:21 +08:00
Ken Jin	7114cf20c0	gh-116381: Specialize CONTAINS_OP (GH-116385) * Specialize CONTAINS_OP * 📜🤖 Added by blurb_it. * Add PyAPI_FUNC for JIT --------- Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2024-03-07 03:30:11 +08:00
Sam Gross	c012c8ab7b	gh-115103: Delay reuse of mimalloc pages that store PyObjects (#115435) This implements the delayed reuse of mimalloc pages that contain Python objects in the free-threaded build. Allocations of the same size class are grouped in data structures called pages. These are different from operating system pages. For thread-safety, we want to ensure that memory used to store PyObjects remains valid as long as there may be concurrent lock-free readers; we want to delay using it for other size classes, in other heaps, or returning it to the operating system. When a mimalloc page becomes empty, instead of immediately freeing it, we tag it with a QSBR goal and insert it into a per-thread state linked list of pages to be freed. When mimalloc needs a fresh page, we process the queue and free any still empty pages that are now deemed safe to be freed. Pages waiting to be freed are still available for allocations of the same size class, and allocating from a page prevents it from being freed. There is additional logic to handle abandoned pages when threads exit.	2024-03-06 09:42:11 -05:00
Mark Shannon	27858e2a17	GH-113710: Tier 2 optimizer: check the function instead of checking globals. (GH-116410)	2024-03-06 13:12:23 +00:00
Sam Gross	72714c0266	gh-115103: Enable internal mimalloc assertions in debug builds (#116343 ) This sets `MI_DEBUG` to `2` in debug builds to enable `mi_assert_internal()` calls. Expensive internal assertions are not enabled. This also disables an assertion in free-threaded builds that would be triggered by the free-threaded GC because we traverse heaps that are not owned by the current thread.	2024-03-05 13:54:20 -05:00
cui fliter	e7ba6e9dbe	chore: fix typos (#116345 ) Signed-off-by: cui fliter <imcusg@gmail.com>	2024-03-05 09:05:52 -07:00
Mark Shannon	23db9c6227	GH-115685: Split `_TO_BOOL_ALWAYS_TRUE` into micro-ops (GH-116352)	2024-03-05 15:23:08 +00:00
Mark Shannon	0c81ce1360	GH-115819: Eliminate Boolean guards when value is known (GH-116355)	2024-03-05 15:06:00 +00:00
Mark Shannon	cbf3d38cbe	GH-115685: Optimize `TO_BOOL` and variants based on truthiness of input. (GH-116311)	2024-03-05 11:23:46 +00:00
Brett Cannon	90a1e9880f	GH-116226: include `pthread_stubs.h` in `pycore_pythreads.h` (#116227 )	2024-03-01 15:22:31 -08:00
mpage	9e88173d36	gh-114271: Make `_thread.ThreadHandle` thread-safe in free-threaded builds (GH-115190) Make `_thread.ThreadHandle` thread-safe in free-threaded builds We protect the mutable state of `ThreadHandle` using a `_PyOnceFlag`. Concurrent operations (i.e. `join` or `detach`) on `ThreadHandle` block until it is their turn to execute or an earlier operation succeeds. Once an operation has been applied successfully all future operations complete immediately. The `join()` method is now idempotent. It may be called multiple times but the underlying OS thread will only be joined once. After `join()` succeeds, any future calls to `join()` will succeed immediately. The internal thread handle `detach()` method has been removed.	2024-03-01 13:43:12 -08:00
Brett Simmers	339c8e1c13	gh-115999: Disable the specializing adaptive interpreter in free-threaded builds (#116013 ) For now, disable all specialization when the GIL might be disabled.	2024-02-29 21:53:32 -05:00
Ken Jin	d01886c5c9	gh-115685: Type/values propagate for TO_BOOL in tier 2 (GH-115686)	2024-03-01 06:13:38 +08:00
Guido van Rossum	0656509033	gh-116088: Insert bottom checks after all sym_set_...() calls (#116089 ) This changes the `sym_set_...()` functions to return a `bool` which is `false` when the symbol is `bottom` after the operation. All calls to such functions now check this result and go to `hit_bottom`, a special error label that prints a different message and then reports that it wasn't able to optimize the trace. No executor will be produced in this case.	2024-02-29 18:55:29 +00:00
Brandt Bucher	f0df35eeca	GH-115802: JIT "small" code for Windows (GH-115964)	2024-02-29 08:11:28 -08:00
Guido van Rossum	3409bc29c9	gh-115859: Re-enable T2 optimizer pass by default (#116062 ) This undoes the temporary default disabling of the T2 optimizer pass in gh-115860. - Add a new test that reproduces Brandt's example from gh-115859; it indeed crashes before gh-116028 with PYTHONUOPSOPTIMIZE=1 - Re-enable the optimizer pass in T2, stop checking PYTHONUOPSOPTIMIZE - Rename the env var to disable T2 entirely to PYTHON_UOPS_OPTIMIZE (must be explicitly set to 0 to disable) - Fix skipIf conditions on tests in test_opt.py accordingly - Export sym_is_bottom() (for debugging) - Fix various things in the `_BINARY_OP_` specializations in the abstract interpreter: - DECREF(temp) - out-of-space check after sym_new_const() - add sym_matches_type() checks, so even if we somehow reach a binary op with symbolic constants of the wrong type on the stack we won't trigger the type assert	2024-02-28 22:38:01 +00:00
Guido van Rossum	e2a3e4b748	gh-115816: Improve internal symbols API in optimizer (#116028) - Any `sym_set_...` call that attempts to set conflicting information causes the symbol to become `bottom` (contradiction). - All `sym_is...` and similar calls return false or NULL for `bottom`. - Everything's tested. - The tests still pass with `PYTHONUOPSOPTIMIZE=1`.	2024-02-28 17:55:56 +00:00
Pablo Galindo Salgado	1752b51012	gh-115773: Add tests to exercise the _Py_DebugOffsets structure (#115774 )	2024-02-28 10:17:34 +00:00
Jelle Zijlstra	d53560deb2	gh-105858: Expose some union-related objects as internal APIs (GH-116025) We now use these in the AST parsing code after gh-105880. A few comparable types (e.g., NoneType) are already exposed as internal APIs.	2024-02-28 09:56:40 +00:00
Jelle Zijlstra	ed4dfd8825	gh-105858: Improve AST node constructors (#105880) Demonstration: >>> ast.FunctionDef.__annotations__ {'name': <class 'str'>, 'args': <class 'ast.arguments'>, 'body': list[ast.stmt], 'decorator_list': list[ast.expr], 'returns': ast.expr | None, 'type_comment': str | None, 'type_params': list[ast.type_param]} >>> ast.FunctionDef() <stdin>:1: DeprecationWarning: FunctionDef.__init__ missing 1 required positional argument: 'name'. This will become an error in Python 3.15. <stdin>:1: DeprecationWarning: FunctionDef.__init__ missing 1 required positional argument: 'args'. This will become an error in Python 3.15. <ast.FunctionDef object at 0x101959460> >>> node = ast.FunctionDef(name="foo", args=ast.arguments()) >>> node.decorator_list [] >>> ast.FunctionDef(whatever="you want", name="x", args=ast.arguments()) <stdin>:1: DeprecationWarning: FunctionDef.__init__ got an unexpected keyword argument 'whatever'. Support for arbitrary keyword arguments is deprecated and will be removed in Python 3.15. <ast.FunctionDef object at 0x1019581f0>	2024-02-27 18:13:03 -08:00
Mark Shannon	6ecfcfe894	GH-115816: Assorted naming and formatting changes to improve maintainability. (GH-115987) * Rename _Py_UOpsAbstractInterpContext to _Py_UOpsContext and _Py_UOpsSymType to _Py_UopsSymbol. * #define shortened form of _Py_uop_... names for improved readability.	2024-02-27 13:25:02 +00:00
Mark Shannon	10fbcd6c5d	GH-115816: Make tier2 optimizer symbols testable, and add a few tests. (GH-115953)	2024-02-27 10:51:26 +00:00
Dino Viehland	1002fbe12e	gh-112075: Iterating a dict shouldn't require locks (#115108 ) Makes iteration of a dict be lock free for the forward iteration case.	2024-02-22 12:02:39 -08:00
AN Long	87a65a5bd4	gh-115304: Add doc for initializing PyMutex as a global variable (#115305 )	2024-02-21 12:35:53 -05:00
Victor Stinner	e4c34f04a1	gh-110850: Cleanup PyTime API: PyTime_t are nanoseconds (#115753 ) PyTime_t no longer uses an arbitrary unit, it's always a number of nanoseconds (64-bit signed integer). * Rename _PyTime_FromNanosecondsObject() to _PyTime_FromLong(). * Rename _PyTime_AsNanosecondsObject() to _PyTime_AsLong(). * Remove pytime_from_nanoseconds(). * Remove pytime_as_nanoseconds(). * Remove _PyTime_FromNanoseconds().	2024-02-21 11:46:00 +01:00
Victor Stinner	77430b6a32	gh-110850: Replace private _PyTime_MAX with public PyTime_MAX (#115751 ) Remove references to the old names _PyTime_MIN and _PyTime_MAX, now that PyTime_MIN and PyTime_MAX are public. Replace also _PyTime_MIN with PyTime_MIN.	2024-02-21 08:11:40 +00:00
Donghee Na	259730bbb5	gh-112087: Make list_{concat, repeat, inplace_repeat, ass_item} to be thread-safe (gh-115605)	2024-02-21 01:38:09 +00:00
Dino Viehland	54071460d7	gh-112075: Accessing a single element should optimistically avoid locking (#115109 ) Makes accessing a single element thread safe and typically lock free	2024-02-20 17:08:14 -08:00
Dino Viehland	176df09adb	gh-112075: Make PyDictKeysObject thread-safe (#114741 ) Adds locking for shared PyDictKeysObject's for dictionaries	2024-02-20 16:40:37 -08:00
Victor Stinner	145bc2d638	gh-110850: Use public PyTime functions (#115746 ) Replace private _PyTime functions with public PyTime functions. random_seed_time_pid() now reports errors to its caller.	2024-02-20 23:31:30 +00:00
Victor Stinner	52d1477566	gh-110850: Rename internal PyTime C API functions (#115734) Rename functions: * _PyTime_GetSystemClock() => _PyTime_TimeUnchecked() * _PyTime_GetPerfCounter() => _PyTime_PerfCounterUnchecked() * _PyTime_GetMonotonicClock() => _PyTime_MonotonicUnchecked() * _PyTime_GetSystemClockWithInfo() => _PyTime_TimeWithInfo() * _PyTime_GetMonotonicClockWithInfo() => _PyTime_MonotonicWithInfo() Changes: * Remove "typedef PyTime_t PyTime_t;" which was "typedef PyTime_t _PyTime_t;" before a previous rename. * Update comments of "Unchecked" functions. * Remove invalid PyTime_Time() comment.	2024-02-20 22:16:37 +00:00
Sam Gross	e3ad6ca56f	gh-115103: Implement delayed free mechanism for free-threaded builds (#115367 ) This adds `_PyMem_FreeDelayed()` and supporting functions. The `_PyMem_FreeDelayed()` function frees memory with the same allocator as `PyMem_Free()`, but after some delay to ensure that concurrent lock-free readers have finished.	2024-02-20 13:04:37 -05:00
Victor Stinner	d207c7cd5a	gh-110850: Cleanup pycore_time.h includes (#115724 ) <pycore_time.h> include is no longer needed to get the PyTime_t type in internal header files. This type is now provided by <Python.h> include. Add <pycore_time.h> includes to C files instead.	2024-02-20 16:50:43 +00:00
Sam Gross	cc82e33af9	gh-115491: Keep some fields valid across allocations (free-threading) (#115573 ) This avoids filling the memory occupied by ob_tid, ob_ref_local, and ob_ref_shared with debug bytes (e.g., 0xDD) in mimalloc in the free-threaded build.	2024-02-20 10:36:40 -05:00
Victor Stinner	9af80ec83d	gh-110850: Replace _PyTime_t with PyTime_t (#115719 ) Run command: sed -i -e 's!\<_PyTime_t\>!PyTime_t!g' $(find -name "*.c" -o -name "*.h")	2024-02-20 15:02:27 +00:00
Brett Simmers	0749244d13	gh-112175: Add `eval_breaker` to `PyThreadState` (#115194 ) This change adds an `eval_breaker` field to `PyThreadState`. The primary motivation is for performance in free-threaded builds: with thread-local eval breakers, we can stop a specific thread (e.g., for an async exception) without interrupting other threads. The source of truth for the global instrumentation version is stored in the `instrumentation_version` field in PyInterpreterState. Threads usually read the version from their local `eval_breaker`, where it continues to be colocated with the eval breaker bits.	2024-02-20 09:57:48 -05:00
Ken Jin	dcba21f905	gh-115687: Split up guards from COMPARE_OP (GH-115688)	2024-02-20 11:30:49 +00:00
Mark Shannon	626c414995	GH-115457: Support splitting and replication of micro ops. (GH-115558)	2024-02-20 10:50:59 +00:00
Mark Shannon	7b21403ccd	GH-112354: Initial implementation of warm up on exits and trace-stitching (GH-114142)	2024-02-20 09:39:55 +00:00
Donghee Na	8db8d7118e	gh-111968: Split _Py_async_gen_asend_freelist out of _Py_async_gen_fr… (gh-115546)	2024-02-17 10:03:10 +09:00
Sam Gross	5903190727	gh-115103: Implement delayed memory reclamation (QSBR) (#115180 ) This adds a safe memory reclamation scheme based on FreeBSD's "GUS" and quiescent state based reclamation (QSBR). The API provides a mechanism for callers to detect when it is safe to free memory that may be concurrently accessed by readers.	2024-02-16 15:25:19 -05:00
Sam Gross	b24c9161a6	gh-112529: Make the GC scheduling thread-safe (#114880 ) The GC keeps track of the number of allocations (less deallocations) since the last GC. This buffers the count in thread-local state and uses atomic operations to modify the per-interpreter count. The thread-local buffering avoids contention on shared state. A consequence is that the GC scheduling is not as precise, so "test_sneaky_frame_object" is skipped because it requires that the GC be run exactly after allocating a frame object.	2024-02-16 11:22:27 -05:00
Donghee Na	321d13fd2b	gh-111968: Split _Py_dictkeys_freelist out of _Py_dict_freelist (gh-115505)	2024-02-16 01:01:36 +00:00
Dino Viehland	454d7963e3	gh-113743: Use per-interpreter locks for types (#115541 ) Move type-lock to per-interpreter lock to avoid heavy contention in interpreters test	2024-02-15 16:28:31 -08:00
Dino Viehland	ae460d450a	gh-113743: Make the MRO cache thread-safe in free-threaded builds (#113930 ) Makes _PyType_Lookup thread safe, including thread safety of the underlying cache and thread-safe mutation of mro and type members. Also, _PyType_GetMRO and _PyType_GetBases currently return borrowed references, which aren't safe.	2024-02-15 10:54:57 -08:00
monkeyman192	298bcdc185	gh-112433: Add optional _align_ attribute to ctypes.Structure (GH-113790)	2024-02-15 16:40:20 +02:00
Sam Gross	ad4f909e0e	gh-115432: Add critical section variant that handles a NULL object (#115433 ) This adds `Py_XBEGIN_CRITICAL_SECTION` and `Py_XEND_CRITICAL_SECTION`, which accept a possibly NULL object as an argument. If the argument is NULL, then nothing is locked or unlocked. Otherwise, they behave like `Py_BEGIN/END_CRITICAL_SECTION`.	2024-02-15 08:37:54 -05:00
mpage	dc978f6ab6	gh-112050: Make collections.deque thread-safe in free-threaded builds (#113830 ) Use critical sections to make deque methods that operate on mutable state thread-safe when the GIL is disabled. This is mostly accomplished by using the @critical_section Argument Clinic directive, though there are a few places where this was not possible and critical sections had to be manually acquired/released.	2024-02-15 09:22:47 +01:00
Sam Gross	326119d373	gh-112529: Use _PyThread_Id() in mimalloc in free-threaded build (#115488 ) The free-threaded GC uses mimalloc's segment thread IDs to restore the overwritten `ob_tid` thread IDs in PyObjects. For that reason, it's important that PyObjects and mimalloc use the same identifiers.	2024-02-14 16:41:29 -05:00
mpage	a95b1a56bb	gh-115041: Add wrappers that are atomic only in free-threaded builds (#115046 ) These are intended to be used in places where atomics are required in free-threaded builds but not in the default build. We don't want to introduce the potential performance overhead of an atomic operation in the default build.	2024-02-14 15:15:05 -05:00
Sam Gross	17773fcb86	gh-115441: Fix missing braces warning (#115460 ) Removes `_py_object_state_INIT`. We want to initialize the `object_state` field to zero.	2024-02-14 12:27:39 -05:00
Donghee Na	f15795c9a0	gh-111968: Rename freelist related struct names to Eric's suggestion (gh-115329)	2024-02-14 00:32:51 +00:00
Eric Snow	514b1c91b8	gh-76785: Improved Subinterpreters Compatibility with 3.12 (gh-115424) For the most part, these changes make it substantially easier to backport subinterpreter-related code to 3.12, especially the related modules (e.g. _xxsubinterpreters). The main motivation is to support releasing a PyPI package with the 3.13 capabilities compiled for 3.12. A lot of the changes here involve either hiding details behind macros/functions or splitting up some files.	2024-02-13 14:56:49 -07:00
Mark Shannon	681778c56a	GH-113710: Improve `_SET_IP` and `_CHECK_VALIDITY` (GH-115248)	2024-02-13 16:28:19 +00:00
Mark Shannon	f9f6156c5a	GH-113710: Backedge counter improvements. (GH-115166)	2024-02-13 14:16:37 +00:00
Ken Jin	7cce857622	gh-114058: Foundations of the Tier2 redundancy eliminator (GH-115085) --------- Co-authored-by: Mark Shannon <9448417+markshannon@users.noreply.github.com> Co-authored-by: Jules <57632293+JuliaPoo@users.noreply.github.com> Co-authored-by: Guido van Rossum <gvanrossum@users.noreply.github.com>	2024-02-13 21:24:48 +08:00
Steve Dower	ea25f32d5f	gh-89240: Enable multiprocessing on Windows to use large process pools (GH-107873) We add _winapi.BatchedWaitForMultipleObjects to wait for larger numbers of handles. This is an internal module, hence undocumented, and should be used with caution. Check the docstring for info before using BatchedWaitForMultipleObjects.	2024-02-13 00:28:35 +00:00
mpage	de7d67b19b	gh-114271: Make `PyInterpreterState.threads.count` thread-safe in free-threaded builds (gh-115093) Use atomics to mutate PyInterpreterState.threads.count.	2024-02-12 10:44:00 -07:00
Petr Viktorin	879f4546bf	gh-110850: Add PyTime_t C API (GH-115215) Add PyTime_t API: * PyTime_t type. * PyTime_MIN and PyTime_MAX constants. * PyTime_AsSecondsDouble(), PyTime_Monotonic(), PyTime_PerfCounter() and PyTime_GetSystemClock() functions. Co-authored-by: Victor Stinner <vstinner@python.org>	2024-02-12 18:13:10 +01:00
Mark Shannon	8144661017	GH-113710: Fix updating of dict version tag and add watched dict stats (GH-115221)	2024-02-12 16:07:38 +00:00
Donghee Na	d4d5bae147	gh-111968: Refactor _PyXXX_Fini to integrate with _PyObject_ClearFreeLists (gh-114899)	2024-02-10 00:57:04 +00:00
Sam Gross	a3af3cb4f4	gh-110481: Implement inter-thread queue for biased reference counting (#114824 ) Biased reference counting maintains two refcount fields in each object: `ob_ref_local` and `ob_ref_shared`. The true refcount is the sum of these two fields. In some cases, when refcounting operations are split across threads, the ob_ref_shared field can be negative (although the total refcount must be at least zero). In this case, the thread that decremented the refcount requests that the owning thread give up ownership and merge the refcount fields.	2024-02-09 17:08:32 -05:00
Carl Meyer	8f0998e844	gh-114828: parenthesize non-atomic macro definitions in pycore_symtable.h (#115143 )	2024-02-07 13:19:47 -07:00
Mark Shannon	8a3c499ffe	GH-108362: Revert "GH-108362: Incremental GC implementation (GH-108038)" (#115132 ) Revert "GH-108362: Incremental GC implementation (GH-108038)" This reverts commit `36518e69d7`.	2024-02-07 12:38:34 +00:00
Dino Viehland	92abb01240	gh-112075: Add critical sections for most dict APIs (#114508 ) Starts adding thread safety to dict objects. Use @critical_section for APIs which are exposed via argument clinic and don't directly correlate with a public C API which needs to acquire the lock. Use a _lock_held suffix for keeping changes to complicated functions simple and just wrapping them with a critical section. Acquire and release the lock in an existing function where it won't be overly disruptive to the existing logic.	2024-02-06 14:03:43 -08:00
Sam Gross	b6228b521b	gh-115035: Mark ThreadHandles as non-joinable earlier after forking (#115042 ) This marks dead ThreadHandles as non-joinable earlier in `PyOS_AfterFork_Child()` before we execute any Python code. The handles are stored in a global linked list in `_PyRuntimeState` because `fork()` affects the entire process.	2024-02-06 14:45:04 -05:00
Mariusz Felisiak	1a10437a14	gh-91602: Add iterdump() support for filtering database objects (#114501 ) Add optional 'filter' parameter to iterdump() that allows a "LIKE" pattern for filtering database objects to dump. Co-authored-by: Erlend E. Aasland <erlend@python.org>	2024-02-06 12:34:56 +01:00
Dino Viehland	bcccf1fb63	gh-112075: Add gc shared bits (#114931 ) Add GC shared flags for objects to the GC bit states in free-threaded builds	2024-02-05 10:35:59 -08:00
Mark Shannon	36518e69d7	GH-108362: Incremental GC implementation (GH-108038)	2024-02-05 18:28:51 +00:00
Andrew Rogers	b3f0b698da	gh-104530: Enable native Win32 condition variables by default (GH-104531)	2024-02-02 13:50:51 +00:00
Mark Shannon	0e71a295e9	GH-113710: Add a "globals to constants" pass (GH-114592) Converts specializations of `LOAD_GLOBAL` into constants during tier 2 optimization.	2024-02-02 12:14:34 +00:00

1492 Commits