cpython

Commit Graph

Author	SHA1	Message	Date
Sam Gross	b2c3b70c71	gh-118332: Fix deadlock involving stop the world (#118412 ) Avoid detaching thread state when stopping the world. When re-attaching the thread state, the thread would attempt to resume the top-most critical section, which might now be held by a thread paused for our stop-the-world request.	2024-04-30 15:01:28 -04:00
Dino Viehland	4a1cf66c5c	gh-117657: Fix small issues with instrumentation and TSAN (#118064 ) Small TSAN fixups for instrumentation	2024-04-30 11:38:05 -07:00
Irit Katriel	1f16b4ce56	gh-118272: Clear generator frame's locals when the generator is closed (#118277 ) Co-authored-by: Thomas Grainger <tagrain@gmail.com>	2024-04-30 19:32:25 +01:00
Mark Shannon	5b05d452cd	GH-118095: Add tier 2 support for YIELD_VALUE (GH-118380)	2024-04-30 11:33:13 +01:00
Eric Snow	529a160be6	gh-117953: Share More Machinery Code Between Builtin and Dynamic Extensions (gh-118204) This change will make some later changes simpler. It also brings more consistent behavior and lower maintenance costs.	2024-04-29 12:53:04 -06:00
Sam Gross	7ccacb220d	gh-117783: Immortalize objects that use deferred reference counting (#118112 ) Deferred reference counting is not fully implemented yet. As a temporary measure, we immortalize objects that would use deferred reference counting to avoid multi-threaded scaling bottlenecks. This is only performed in the free-threaded build once the first non-main thread is started. Additionally, some tests, including refleak tests, suppress this behavior.	2024-04-29 14:36:02 -04:00
Eric Snow	44f57a952e	gh-117953: Split Up _PyImport_LoadDynamicModuleWithSpec() (gh-118203) Basically, I've turned most of _PyImport_LoadDynamicModuleWithSpec() into two new functions (_PyImport_GetModInitFunc() and _PyImport_RunModInitFunc()) and moved the rest of it out into _imp_create_dynamic_impl(). There shouldn't be any changes in behavior. This change makes some future changes simpler. This is particularly relevant to potentially calling each module init function in the main interpreter first. Thus the critical part of the PR is the addition of _PyImport_RunModInitFunc(), which is strictly focused on running the init func and validating the result. A later PR will take it a step farther by capturing error information rather than raising exceptions. FWIW, this change also helps readers by clarifying a bit more about what happens when an extension/builtin module is imported.	2024-04-29 09:29:07 -06:00
Mark Shannon	ab6eda0ee5	GH-118095: Allow a variant of RESUME_CHECK in tier 2 (GH-118286)	2024-04-29 07:54:05 +01:00
Eric Snow	1d33925176	gh-110693: Use a Larger Queue for Per-Interpreter Pending Calls (gh-118302) This is an improvement over the status quo, reducing the likelihood of completely filling the pending calls queue. However, the problem won't go away completely unless we move to an unbounded linked list or add a mechanism for waiting until the queue isn't full.	2024-04-26 19:13:44 -06:00
Mark Shannon	3e06c7f719	GH-118095: Add dynamic exit support and FOR_ITER_GEN support to tier 2 (GH-118279)	2024-04-26 18:08:50 +01:00
Eric Snow	09c2947581	gh-110693: Pending Calls Machinery Cleanups (gh-118296) This does some cleanup in preparation for later changes.	2024-04-26 01:05:51 +00:00
Dino Viehland	5da0280648	gh-117657: Fixes a few small TSAN issues in dictobject (#118200 ) Fixup TSAN errors for dict	2024-04-25 08:53:29 -07:00
neonene	2c45148912	gh-117578: Introduce _PyType_GetModuleByDef2 private function (GH-117661) Co-authored-by: Erlend E. Aasland <erlend.aasland@protonmail.com> Co-authored-by: Petr Viktorin <encukou@gmail.com>	2024-04-25 13:51:31 +02:00
Mark Shannon	f180b31e76	GH-118095: Handle `RETURN_GENERATOR` in tier 2 (GH-118180)	2024-04-25 11:32:47 +01:00
Nice Zombies	10bb90ed49	gh-102511: Speed up os.path.splitroot() with native helpers (GH-118089)	2024-04-25 10:07:38 +01:00
Eric Snow	5865fa5f9b	gh-117953: Add Internal struct _Py_ext_module_loader_info (gh-118194) This helps with a later change that splits up _PyImport_LoadDynamicModuleWithSpec().	2024-04-24 17:42:01 +00:00
Eric Snow	03e3e31723	gh-76785: Rename _xxsubinterpreters to _interpreters (gh-117791) See https://discuss.python.org/t/pep-734-multiple-interpreters-in-the-stdlib/41147/26.	2024-04-24 16:18:24 +00:00
Eric Snow	af3c1d817d	gh-117953: Cleanups For fix_up_extension() in import.c (gh-118192) These are cleanups I've pulled out of gh-118116. Mostly, this change moves code around to align with some future changes and to improve clarity a little. There is one very small change in behavior: we now add the module to the per-interpreter caches after updating the global state, rather than before.	2024-04-24 09:55:48 -06:00
Mark Shannon	77cd0428b6	GH-118095: Convert DEOPT_IFs on likely side exits to EXIT_IFs (GH-118106) Covert DEOPT_IFs on likely side exits to EXIT_IFs	2024-04-24 14:37:55 +01:00
Irit Katriel	0aa0fc3d3c	gh-117901: Add option for compiler's codegen to save nested instruction sequences for introspection (#118007 )	2024-04-24 09:46:17 +00:00
Eric Snow	23950beff8	gh-117953: Small Cleanup of Extensions-Related Machinery Code (gh-118167) This is a collection of very basic cleanups I've pulled out of gh-118116. It is mostly renaming variables and moving a couple bits of code in functionally equivalent ways.	2024-04-23 08:25:50 -06:00
Shantanu	8e86579cae	gh-95754: Better error when script shadows a standard library or third party module (#113769 )	2024-04-22 18:24:21 -07:00
Guido van Rossum	4c7bfdff90	Remove more remnants of deepfreeze (#118159 )	2024-04-22 12:17:57 -07:00
Mark Shannon	a6647d16ab	GH-115480: Reduce guard strength for binary ops when type of one operand is known already (GH-118050)	2024-04-22 13:34:06 +01:00
Dino Viehland	8b541c017e	gh-112075: Make instance attributes stored in inline "dict" thread safe (#114742 ) Make instance attributes stored in inline "dict" thread safe on free-threaded builds	2024-04-21 22:57:05 -07:00
Dino Viehland	07525c9a85	gh-116818: Make `sys.settrace`, `sys.setprofile`, and monitoring thread-safe (#116775 ) Makes sys.settrace, sys.setprofile, and monitoring generally thread-safe. Mostly uses a stop-the-world approach and synchronization around the code object's _co_instrumentation_version. There may be a little bit of extra synchronization around the monitoring data that's required to be TSAN clean.	2024-04-19 14:47:42 -07:00
Mark Shannon	7e6fa5fced	GH-116202: Incorporate invalidation check into _START_EXECUTOR. (GH-118044)	2024-04-19 09:26:42 +01:00
Mark Shannon	d3bd6b5f3f	GH-115419: Improve list of escaping functions (GH-118054)	2024-04-19 09:25:07 +01:00
Donghee Na	94444ea45a	gh-112069: Add _PySet_NextEntryRef to be thread-safe. (gh-117990)	2024-04-19 00:18:22 +09:00
Irit Katriel	c179c0e6cb	gh-117680: make _PyInstructionSequence a PyObject and use it in tests (#117629 )	2024-04-17 16:42:04 +01:00
Mark Shannon	147cd0581e	GH-117760: Streamline the trashcan mechanism (GH-117763)	2024-04-17 11:08:05 +01:00
Jeff Glass	acf69e09c6	gh-115178: Add Counts of UOp Pairs to pystats (GH-115181)	2024-04-16 14:27:18 +01:00
Victor Stinner	2cc916e147	gh-117613: Enhance test_clinic @defining_class tests (#117896 )	2024-04-16 09:32:51 +02:00
Eric Snow	eca53620e3	gh-94673: Clarify About Runtime State Related to Static Builtin Types (gh-117761) Guido pointed out to me that some details about the per-interpreter state for the builtin types aren't especially clear. I'm addressing that by: * adding a comment explaining that state * adding some asserts to point out the relationship between each index and the interp/global runtime state	2024-04-12 16:39:27 -06:00
Sam Gross	4ad8f090cc	gh-117376: Partial implementation of deferred reference counting (#117696 ) This marks objects as using deferred refrence counting using the `ob_gc_bits` field in the free-threaded build and collects those objects during GC.	2024-04-12 17:36:20 +00:00
Serhiy Storchaka	39a6b29756	gh-117764: Use Argument Clinic for signal.set_wakeup_fd() (GH-117777)	2024-04-12 11:21:00 +00:00
Erlend E. Aasland	deb921f851	gh-117431: Adapt bytes and bytearray .find() and friends to Argument Clinic (#117502 ) This change gives a significant speedup, as the METH_FASTCALL calling convention is now used. The following bytes and bytearray methods are adapted: - count() - find() - index() - rfind() - rindex() Co-authored-by: Inada Naoki <songofacandy@gmail.com>	2024-04-12 07:40:55 +00:00
Eric Snow	fd259fdabe	gh-76785: Handle Legacy Interpreters Properly (gh-117490) This is similar to the situation with threading._DummyThread. The methods (incl. __del__()) of interpreters.Interpreter objects must be careful with interpreters not created by interpreters.create(). The simplest thing to start with is to disable any method that modifies or runs in the interpreter. As part of this, the runtime keeps track of where an interpreter was created. We also handle interpreter "refcounts" properly.	2024-04-11 23:23:25 +00:00
Brett Simmers	f268e328ed	gh-116738: Make _abc module thread-safe (#117488 ) A collection of small changes aimed at making the `_abc` module safe to use in a free-threaded build.	2024-04-11 18:13:25 -04:00
Eric Snow	993c3cca16	gh-76785: Add More Tests to test_interpreters.test_api (gh-117662) In addition to the increase test coverage, this is a precursor to sorting out how we handle interpreters created directly via the C-API.	2024-04-10 18:37:01 -06:00
Sam Gross	1a6594f661	gh-117439: Make refleak checking thread-safe without the GIL (#117469 ) This keeps track of the per-thread total reference count operations in PyThreadState in the free-threaded builds. The count is merged into the interpreter's total when the thread exits.	2024-04-08 12:11:36 -04:00
mpage	df73179048	gh-111926: Make weakrefs thread-safe in free-threaded builds (#117168 ) Most mutable data is protected by a striped lock that is keyed on the referenced object's address. The weakref's hash is protected using the weakref's per-object lock. Note that this only affects free-threaded builds. Apart from some minor refactoring, the added code is all either gated by `ifdef`s or is a no-op (e.g. `Py_BEGIN_CRITICAL_SECTION`).	2024-04-08 10:58:38 -04:00
Ken Jin	375425abd1	Cases generator: Remove type_prop and passthrough (#117614 )	2024-04-08 06:26:52 +08:00
Michael Droettboom	b5e60918af	gh-117549: Match declaration order for _Py_BackoffCounter initializer (#117551 ) Otherwise it might not compile with C++ (or certain C compilers/flags?).	2024-04-04 14:14:35 -07:00
Dino Viehland	434bc593df	gh-112075: Make _PyDict_LoadGlobal thread safe (#117529 ) Make _PyDict_LoadGlobal threadsafe	2024-04-04 12:26:07 -07:00
Irit Katriel	04697bcfaf	gh-117494: extract the Instruction Sequence data structure into a separate file (#117496 )	2024-04-04 15:47:26 +00:00
Guido van Rossum	060a96f1a9	gh-116968: Reimplement Tier 2 counters (#117144 ) Introduce a unified 16-bit backoff counter type (``_Py_BackoffCounter``), shared between the Tier 1 adaptive specializer and the Tier 2 optimizer. The API used for adaptive specialization counters is changed but the behavior is (supposed to be) identical. The behavior of the Tier 2 counters is changed: - There are no longer dynamic thresholds (we never varied these). - All counters now use the same exponential backoff. - The counter for ``JUMP_BACKWARD`` starts counting down from 16. - The ``temperature`` in side exits starts counting down from 64.	2024-04-04 15:03:27 +00:00
Peter Lazorchak	1c43468886	gh-116168: Remove extra `_CHECK_STACK_SPACE` uops (#117242 ) This merges all `_CHECK_STACK_SPACE` uops in a trace into a single `_CHECK_STACK_SPACE_OPERAND` uop that checks whether there is enough stack space for all calls included in the entire trace.	2024-04-03 17:14:18 +00:00
Erlend E. Aasland	595bb496b0	gh-117431: Adapt bytes and bytearray .startswith() and .endswith() to Argument Clinic (#117495 ) This change gives a significant speedup, as the METH_FASTCALL calling convention is now used.	2024-04-03 13:11:14 +02:00
Eric Snow	f341d6017d	gh-76785: Add PyInterpreterConfig Helpers (gh-117170) These helpers make it easier to customize and inspect the config used to initialize interpreters. This is especially valuable in our tests. I found inspiration from the PyConfig API for the PyInterpreterConfig dict conversion stuff. As part of this PR I've also added a bunch of tests.	2024-04-02 20:35:52 +00:00
Mark Shannon	c32dc47aca	GH-115776: Embed the values array into the object, for "normal" Python objects. (GH-116115)	2024-04-02 11:59:21 +01:00
Irit Katriel	1d5479b236	gh-117411: move PyFutureFeatures to pycore_symtable.h and make it private (#117412 )	2024-04-02 10:34:49 +00:00
Sam Gross	19c1dd60c5	gh-117323: Make `cell` thread-safe in free-threaded builds (#117330 ) Use critical sections to lock around accesses to cell contents. The critical sections are no-ops in the default (with GIL) build.	2024-03-29 13:35:43 -04:00
Erlend E. Aasland	c1712ef066	gh-116664: Make module state Py_SETREF's in _warnings thread-safe (#116959 ) Mark the swap operations as critical sections. Add an internal Py_BEGIN_CRITICAL_SECTION_MUT API that takes a PyMutex pointer instead of a PyObject pointer.	2024-03-28 15:05:08 +00:00
Irit Katriel	262fb911ab	gh-117288: Allocate fewer label IDs in _PyCfg_ToInstructionSequence (#117290 )	2024-03-27 17:38:19 +00:00
Irit Katriel	79be75735c	gh-115775: Compiler adds __static_attributes__ field to classes (#115913 )	2024-03-26 15:18:17 +00:00
Mark Shannon	8bef34f625	GH-117108: Set the "old space bit" to "visited" for all young objects (#117213 ) Change old space bit of young objects from 0 to gcstate->visited_space. This ensures that any object created and collected during cycle GC has the bit set correctly.	2024-03-26 11:11:42 +00:00
Mark Shannon	bf82f77957	GH-116422: Tier2 hot/cold splitting (GH-116813) Splits the "cold" path, deopts and exits, from the "hot" path, reducing the size of most jitted instructions, at the cost of slower exits.	2024-03-26 09:35:11 +00:00
Mark Shannon	e28477f214	GH-117108: Change the size of the GC increment to about 1% of the total heap size. (GH-117120)	2024-03-22 18:43:25 +00:00
Guido van Rossum	570a82d46a	gh-117045: Add code object to function version cache (#117028 ) Changes to the function version cache: - In addition to the function object, also store the code object, and allow the latter to be retrieved even if the function has been evicted. - Stop assigning new function versions after a critical attribute (e.g. `__code__`) has been modified; the version is permanently reset to zero in this case. - Changes to `__annotations__` are no longer considered critical. (This fixes gh-109998.) Changes to the Tier 2 optimization machinery: - If we cannot map a function version to a function, but it is still mapped to a code object, we continue projecting the trace. The operand of the `_PUSH_FRAME` and `_POP_FRAME` opcodes can be either NULL, a function object, or a code object with the lowest bit set. This allows us to trace through code that calls an ephemeral function, i.e., a function that may not be alive when we are constructing the executor, e.g. a generator expression or certain nested functions. We will lose globals removal inside such functions, but we can still do other peephole operations (and even possibly [call inlining](https://github.com/python/cpython/pull/116290), if we decide to do it), which only need the code object. As before, if we cannot retrieve the code object from the cache, we stop projecting.	2024-03-21 12:37:41 -07:00
Sam Gross	1f72fb5447	gh-116522: Refactor `_PyThreadState_DeleteExcept` (#117131 ) Split `_PyThreadState_DeleteExcept` into two functions: - `_PyThreadState_RemoveExcept` removes all thread states other than one passed as an argument. It returns the removed thread states as a linked list. - `_PyThreadState_DeleteList` deletes those dead thread states. It may call destructors, so we want to "start the world" before calling `_PyThreadState_DeleteList` to avoid potential deadlocks.	2024-03-21 11:21:02 -07:00
Michael Droettboom	50369e6c34	gh-116996: Add pystats about _Py_uop_analyse_and_optimize (GH-116997)	2024-03-22 01:27:46 +08:00
Eric Snow	617158e078	gh-76785: Drop PyInterpreterID_Type (gh-117101) I added it quite a while ago as a strategy for managing interpreter lifetimes relative to the PEP 554 (now 734) implementation. Relatively recently I refactored that implementation to no longer rely on InterpreterID objects. Thus now I'm removing it.	2024-03-21 17:15:02 +00:00
Victor Stinner	8bea6c411d	gh-115754: Add Py_GetConstant() function (#116883 ) Add Py_GetConstant() and Py_GetConstantBorrowed() functions. In the limited C API version 3.13, getting Py_None, Py_False, Py_True, Py_Ellipsis and Py_NotImplemented singletons is now implemented as function calls at the stable ABI level to hide implementation details. Getting these constants still return borrowed references. Add _testlimitedcapi/object.c and test_capi/test_object.py to test Py_GetConstant() and Py_GetConstantBorrowed() functions.	2024-03-21 16:07:00 +00:00
Eric Snow	5a76d1be8e	gh-105716: Update interp->threads.main After Fork (gh-117049) I missed this in gh-109921. We also update Py_Exit() to call _PyInterpreterState_SetNotRunningMain(), if necessary.	2024-03-21 10:06:35 -06:00
Eric Snow	bbee57fa8c	gh-76785: Clean Up Interpreter ID Conversions (gh-117048) Mostly we unify the two different implementations of the conversion code (from PyObject * to int64_t. We also drop the PyArg_ParseTuple()-style converter function, as well as rename and move PyInterpreterID_LookUp().	2024-03-21 09:56:12 -06:00
Mark Shannon	15309329b6	GH-108362: Incremental Cycle GC (GH-116206)	2024-03-20 08:54:42 +00:00
Guido van Rossum	7e1f38f2de	gh-116916: Remove separate next_func_version counter (#116918 ) Somehow we ended up with two separate counter variables tracking "the next function version". Most likely this was a historical accident where an old branch was updated incorrectly. This PR merges the two counters into a single one: `interp->func_state.next_version`.	2024-03-18 11:11:10 -07:00
Victor Stinner	5e0a070dfe	gh-116809: Restore removed _PyErr_ChainExceptions1() function (#116900 )	2024-03-16 21:37:11 +01:00
mpage	33da0e844c	gh-114271: Fix race in `Thread.join()` (#114839 ) There is a race between when `Thread._tstate_lock` is released[^1] in `Thread._wait_for_tstate_lock()` and when `Thread._stop()` asserts[^2] that it is unlocked. Consider the following execution involving threads A, B, and C: 1. A starts. 2. B joins A, blocking on its `_tstate_lock`. 3. C joins A, blocking on its `_tstate_lock`. 4. A finishes and releases its `_tstate_lock`. 5. B acquires A's `_tstate_lock` in `_wait_for_tstate_lock()`, releases it, but is swapped out before calling `_stop()`. 6. C is scheduled, acquires A's `_tstate_lock` in `_wait_for_tstate_lock()` but is swapped out before releasing it. 7. B is scheduled, calls `_stop()`, which asserts that A's `_tstate_lock` is not held. However, C holds it, so the assertion fails. The race can be reproduced[^3] by inserting sleeps at the appropriate points in the threading code. To do so, run the `repro_join_race.py` from the linked repo. There are two main parts to this PR: 1. `_tstate_lock` is replaced with an event that is attached to `PyThreadState`. The event is set by the runtime prior to the thread being cleared (in the same place that `_tstate_lock` was released). `Thread.join()` blocks waiting for the event to be set. 2. `_PyInterpreterState_WaitForThreads()` provides the ability to wait for all non-daemon threads to exit. To do so, an `is_daemon` predicate was added to `PyThreadState`. This field is set each time a thread is created. `threading._shutdown()` now calls into `_PyInterpreterState_WaitForThreads()` instead of waiting on `_tstate_lock`s. [^1]: `441affc9e7/Lib/threading.py (L1201)` [^2]: `441affc9e7/Lib/threading.py (L1115)` [^3]: `8194653279` --------- Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com> Co-authored-by: Antoine Pitrou <antoine@python.org>	2024-03-16 13:56:30 +01:00
Mark Shannon	2cf18a4430	GH-116422: Modify a few uops so that they can be supported by tier 2 with hot/cold splitting (GH-116832)	2024-03-15 10:48:00 +00:00
Victor Stinner	7bbb9b57e6	gh-111696, PEP 737: Add %T and %N to PyUnicode_FromFormat() (#116839 )	2024-03-14 22:23:00 +00:00
Victor Stinner	c432df6d56	gh-111696, PEP 737: Add PyType_GetModuleName() function (#116824 ) Co-authored-by: Eric Snow <ericsnowcurrently@gmail.com>	2024-03-14 18:17:43 +00:00
Mark Shannon	61e54bfcee	GH-116422: Factor out eval breaker checks at end of calls into its own micro-op. (GH-116817)	2024-03-14 16:31:47 +00:00
Matthias Diener	3265087c07	Fix code comment regarding DK_ENTRIES (GH-113960) fix code comment regarding dict entries	2024-03-12 15:05:30 +01:00
Victor Stinner	3cc5ae5c2c	gh-85283: Convert grp extension to the limited C API (#116611 ) posixmodule.h: remove check on the limited C API, since these helpers are not part of the public C API.	2024-03-12 00:46:53 +00:00
Victor Stinner	113053a070	gh-110850: Fix _PyTime_FromSecondsDouble() API (#116606 ) Return 0 on success. Set an exception and return -1 on error. Fix os.timerfd_settime(): properly report exceptions on _PyTime_FromSecondsDouble() failure. No longer export _PyTime_FromSecondsDouble().	2024-03-11 16:35:29 +00:00
Brett Simmers	2731913dd5	gh-116167: Allow disabling the GIL with `PYTHON_GIL=0` or `-X gil=0` (#116338 ) In free-threaded builds, running with `PYTHON_GIL=0` will now disable the GIL. Follow-up issues track work to re-enable the GIL when loading an incompatible extension, and to disable the GIL by default. In order to support re-enabling the GIL at runtime, all GIL-related data structures are initialized as usual, and disabling the GIL simply sets a flag that causes `take_gil()` and `drop_gil()` to return early.	2024-03-11 11:02:58 -04:00
Mark Shannon	b6ae6da1bd	GH-116596: Better determination of escaping uops. (GH-116597)	2024-03-11 13:37:48 +00:00
Donghee Na	6c4fc209e1	gh-112536: Define MI_TSAN to 1 for --with-mimalloc and --with-thread-sanitizer (gh-116558)	2024-03-11 22:25:55 +09:00
Mark Shannon	4e5df2013f	GH-116468: Use constants instead of `oparg` in stack effects when `oparg` is known to be a constant. (GH-116469)	2024-03-11 09:30:15 +00:00
Dino Viehland	7db871e4fa	gh-112075: Support freeing object memory via QSBR (#116344 ) Free objects with qsbr if shared	2024-03-08 09:56:36 -08:00
Ken Jin	41457c7fdb	gh-116381: Remove bad specializations, add fail stats (GH-116464) * Remove bad specializations, add fail stats	2024-03-08 00:21:21 +08:00
Ken Jin	7114cf20c0	gh-116381: Specialize CONTAINS_OP (GH-116385) * Specialize CONTAINS_OP * 📜🤖 Added by blurb_it. * Add PyAPI_FUNC for JIT --------- Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2024-03-07 03:30:11 +08:00
Sam Gross	c012c8ab7b	gh-115103: Delay reuse of mimalloc pages that store PyObjects (#115435 ) This implements the delayed reuse of mimalloc pages that contain Python objects in the free-threaded build. Allocations of the same size class are grouped in data structures called pages. These are different from operating system pages. For thread-safety, we want to ensure that memory used to store PyObjects remains valid as long as there may be concurrent lock-free readers; we want to delay using it for other size classes, in other heaps, or returning it to the operating system. When a mimalloc page becomes empty, instead of immediately freeing it, we tag it with a QSBR goal and insert it into a per-thread state linked list of pages to be freed. When mimalloc needs a fresh page, we process the queue and free any still empty pages that are now deemed safe to be freed. Pages waiting to be freed are still available for allocations of the same size class and allocating from a page prevent it from being freed. There is additional logic to handle abandoned pages when threads exit.	2024-03-06 09:42:11 -05:00
Mark Shannon	27858e2a17	GH-113710: Tier 2 optimizer: check the function instead of checking globals. (GH-116410)	2024-03-06 13:12:23 +00:00
Sam Gross	72714c0266	gh-115103: Enable internal mimalloc assertions in debug builds (#116343 ) This sets `MI_DEBUG` to `2` in debug builds to enable `mi_assert_internal()` calls. Expensive internal assertions are not enabled. This also disables an assertion in free-threaded builds that would be triggered by the free-threaded GC because we traverse heaps that are not owned by the current thread.	2024-03-05 13:54:20 -05:00
cui fliter	e7ba6e9dbe	chore: fix typos (#116345 ) Signed-off-by: cui fliter <imcusg@gmail.com>	2024-03-05 09:05:52 -07:00
Mark Shannon	23db9c6227	GH-115685: Split `_TO_BOOL_ALWAYS_TRUE` into micro-ops (GH-116352)	2024-03-05 15:23:08 +00:00
Mark Shannon	0c81ce1360	GH-115819: Eliminate Boolean guards when value is known (GH-116355)	2024-03-05 15:06:00 +00:00
Mark Shannon	cbf3d38cbe	GH-115685: Optimize `TO_BOOL` and variants based on truthiness of input. (GH-116311)	2024-03-05 11:23:46 +00:00
Brett Cannon	90a1e9880f	GH-116226: include `pthread_stubs.h` in `pycore_pythreads.h` (#116227 )	2024-03-01 15:22:31 -08:00
mpage	9e88173d36	gh-114271: Make `_thread.ThreadHandle` thread-safe in free-threaded builds (GH-115190) Make `_thread.ThreadHandle` thread-safe in free-threaded builds We protect the mutable state of `ThreadHandle` using a `_PyOnceFlag`. Concurrent operations (i.e. `join` or `detach`) on `ThreadHandle` block until it is their turn to execute or an earlier operation succeeds. Once an operation has been applied successfully all future operations complete immediately. The `join()` method is now idempotent. It may be called multiple times but the underlying OS thread will only be joined once. After `join()` succeeds, any future calls to `join()` will succeed immediately. The internal thread handle `detach()` method has been removed.	2024-03-01 13:43:12 -08:00
Brett Simmers	339c8e1c13	gh-115999: Disable the specializing adaptive interpreter in free-threaded builds (#116013 ) For now, disable all specialization when the GIL might be disabled.	2024-02-29 21:53:32 -05:00
Ken Jin	d01886c5c9	gh-115685: Type/values propagate for TO_BOOL in tier 2 (GH-115686)	2024-03-01 06:13:38 +08:00
Guido van Rossum	0656509033	gh-116088: Insert bottom checks after all sym_set_...() calls (#116089 ) This changes the `sym_set_...()` functions to return a `bool` which is `false` when the symbol is `bottom` after the operation. All calls to such functions now check this result and go to `hit_bottom`, a special error label that prints a different message and then reports that it wasn't able to optimize the trace. No executor will be produced in this case.	2024-02-29 18:55:29 +00:00
Brandt Bucher	f0df35eeca	GH-115802: JIT "small" code for Windows (GH-115964)	2024-02-29 08:11:28 -08:00
Guido van Rossum	3409bc29c9	gh-115859: Re-enable T2 optimizer pass by default (#116062 ) This undoes the temporary default disabling of the T2 optimizer pass in gh-115860. - Add a new test that reproduces Brandt's example from gh-115859; it indeed crashes before gh-116028 with PYTHONUOPSOPTIMIZE=1 - Re-enable the optimizer pass in T2, stop checking PYTHONUOPSOPTIMIZE - Rename the env var to disable T2 entirely to PYTHON_UOPS_OPTIMIZE (must be explicitly set to 0 to disable) - Fix skipIf conditions on tests in test_opt.py accordingly - Export sym_is_bottom() (for debugging) - Fix various things in the `_BINARY_OP_` specializations in the abstract interpreter: - DECREF(temp) - out-of-space check after sym_new_const() - add sym_matches_type() checks, so even if we somehow reach a binary op with symbolic constants of the wrong type on the stack we won't trigger the type assert	2024-02-28 22:38:01 +00:00
Guido van Rossum	e2a3e4b748	gh-115816: Improve internal symbols API in optimizer (#116028 ) - Any `sym_set_...` call that attempts to set conflicting information cause the symbol to become `bottom` (contradiction). - All `sym_is...` and similar calls return false or NULL for `bottom`. - Everything's tested. - The tests still pass with `PYTHONUOPSOPTIMIZE=1`.	2024-02-28 17:55:56 +00:00
Pablo Galindo Salgado	1752b51012	gh-115773: Add tests to exercise the _Py_DebugOffsets structure (#115774 )	2024-02-28 10:17:34 +00:00
Jelle Zijlstra	d53560deb2	gh-105858: Expose some union-related objects as internal APIs (GH-116025) We now use these in the AST parsing code after gh-105880. A few comparable types (e.g., NoneType) are already exposed as internal APIs.	2024-02-28 09:56:40 +00:00
Jelle Zijlstra	ed4dfd8825	gh-105858: Improve AST node constructors (#105880 ) Demonstration: >>> ast.FunctionDef.__annotations__ {'name': <class 'str'>, 'args': <class 'ast.arguments'>, 'body': list[ast.stmt], 'decorator_list': list[ast.expr], 'returns': ast.expr \| None, 'type_comment': str \| None, 'type_params': list[ast.type_param]} >>> ast.FunctionDef() <stdin>:1: DeprecationWarning: FunctionDef.__init__ missing 1 required positional argument: 'name'. This will become an error in Python 3.15. <stdin>:1: DeprecationWarning: FunctionDef.__init__ missing 1 required positional argument: 'args'. This will become an error in Python 3.15. <ast.FunctionDef object at 0x101959460> >>> node = ast.FunctionDef(name="foo", args=ast.arguments()) >>> node.decorator_list [] >>> ast.FunctionDef(whatever="you want", name="x", args=ast.arguments()) <stdin>:1: DeprecationWarning: FunctionDef.__init__ got an unexpected keyword argument 'whatever'. Support for arbitrary keyword arguments is deprecated and will be removed in Python 3.15. <ast.FunctionDef object at 0x1019581f0>	2024-02-27 18:13:03 -08:00
Mark Shannon	6ecfcfe894	GH-115816: Assorted naming and formatting changes to improve maintainability. (GH-115987) * Rename _Py_UOpsAbstractInterpContext to _Py_UOpsContext and _Py_UOpsSymType to _Py_UopsSymbol. * #define shortened form of _Py_uop_... names for improved readability.	2024-02-27 13:25:02 +00:00
Mark Shannon	10fbcd6c5d	GH-115816: Make tier2 optimizer symbols testable, and add a few tests. (GH-115953)	2024-02-27 10:51:26 +00:00
Dino Viehland	1002fbe12e	gh-112075: Iterating a dict shouldn't require locks (#115108 ) Makes iteration of a dict be lock free for the forward iteration case.	2024-02-22 12:02:39 -08:00
AN Long	87a65a5bd4	gh-115304: Add doc for initializing PyMutex as a global variable (#115305 )	2024-02-21 12:35:53 -05:00
Victor Stinner	e4c34f04a1	gh-110850: Cleanup PyTime API: PyTime_t are nanoseconds (#115753 ) PyTime_t no longer uses an arbitrary unit, it's always a number of nanoseconds (64-bit signed integer). * Rename _PyTime_FromNanosecondsObject() to _PyTime_FromLong(). * Rename _PyTime_AsNanosecondsObject() to _PyTime_AsLong(). * Remove pytime_from_nanoseconds(). * Remove pytime_as_nanoseconds(). * Remove _PyTime_FromNanoseconds().	2024-02-21 11:46:00 +01:00
Victor Stinner	77430b6a32	gh-110850: Replace private _PyTime_MAX with public PyTime_MAX (#115751 ) Remove references to the old names _PyTime_MIN and _PyTime_MAX, now that PyTime_MIN and PyTime_MAX are public. Replace also _PyTime_MIN with PyTime_MIN.	2024-02-21 08:11:40 +00:00
Donghee Na	259730bbb5	gh-112087: Make list_{concat, repeat, inplace_repeat, ass_item) to be thread-safe (gh-115605)	2024-02-21 01:38:09 +00:00
Dino Viehland	54071460d7	gh-112075: Accessing a single element should optimistically avoid locking (#115109 ) Makes accessing a single element thread safe and typically lock free	2024-02-20 17:08:14 -08:00
Dino Viehland	176df09adb	gh-112075: Make PyDictKeysObject thread-safe (#114741 ) Adds locking for shared PyDictKeysObject's for dictionaries	2024-02-20 16:40:37 -08:00
Victor Stinner	145bc2d638	gh-110850: Use public PyTime functions (#115746 ) Replace private _PyTime functions with public PyTime functions. random_seed_time_pid() now reports errors to its caller.	2024-02-20 23:31:30 +00:00
Victor Stinner	52d1477566	gh-110850: Rename internal PyTime C API functions (#115734 ) Rename functions: * _PyTime_GetSystemClock() => _PyTime_TimeUnchecked() * _PyTime_GetPerfCounter() => _PyTime_PerfCounterUnchecked() * _PyTime_GetMonotonicClock() => _PyTime_MonotonicUnchecked() * _PyTime_GetSystemClockWithInfo() => _PyTime_TimeWithInfo() * _PyTime_GetMonotonicClockWithInfo() => _PyTime_MonotonicWithInfo() * _PyTime_GetMonotonicClockWithInfo() => _PyTime_MonotonicWithInfo() Changes: * Remove "typedef PyTime_t PyTime_t;" which was "typedef PyTime_t _PyTime_t;" before a previous rename. * Update comments of "Unchecked" functions. * Remove invalid PyTime_Time() comment.	2024-02-20 22:16:37 +00:00
Sam Gross	e3ad6ca56f	gh-115103: Implement delayed free mechanism for free-threaded builds (#115367 ) This adds `_PyMem_FreeDelayed()` and supporting functions. The `_PyMem_FreeDelayed()` function frees memory with the same allocator as `PyMem_Free()`, but after some delay to ensure that concurrent lock-free readers have finished.	2024-02-20 13:04:37 -05:00
Victor Stinner	d207c7cd5a	gh-110850: Cleanup pycore_time.h includes (#115724 ) <pycore_time.h> include is no longer needed to get the PyTime_t type in internal header files. This type is now provided by <Python.h> include. Add <pycore_time.h> includes to C files instead.	2024-02-20 16:50:43 +00:00
Sam Gross	cc82e33af9	gh-115491: Keep some fields valid across allocations (free-threading) (#115573 ) This avoids filling the memory occupied by ob_tid, ob_ref_local, and ob_ref_shared with debug bytes (e.g., 0xDD) in mimalloc in the free-threaded build.	2024-02-20 10:36:40 -05:00
Victor Stinner	9af80ec83d	gh-110850: Replace _PyTime_t with PyTime_t (#115719 ) Run command: sed -i -e 's!\<_PyTime_t\>!PyTime_t!g' $(find -name ".c" -o -name ".h")	2024-02-20 15:02:27 +00:00
Brett Simmers	0749244d13	gh-112175: Add `eval_breaker` to `PyThreadState` (#115194 ) This change adds an `eval_breaker` field to `PyThreadState`. The primary motivation is for performance in free-threaded builds: with thread-local eval breakers, we can stop a specific thread (e.g., for an async exception) without interrupting other threads. The source of truth for the global instrumentation version is stored in the `instrumentation_version` field in PyInterpreterState. Threads usually read the version from their local `eval_breaker`, where it continues to be colocated with the eval breaker bits.	2024-02-20 09:57:48 -05:00
Ken Jin	dcba21f905	gh-115687: Split up guards from COMPARE_OP (GH-115688)	2024-02-20 11:30:49 +00:00
Mark Shannon	626c414995	GH-115457: Support splitting and replication of micro ops. (GH-115558)	2024-02-20 10:50:59 +00:00
Mark Shannon	7b21403ccd	GH-112354: Initial implementation of warm up on exits and trace-stitching (GH-114142)	2024-02-20 09:39:55 +00:00
Donghee Na	8db8d7118e	gh-111968: Split _Py_async_gen_asend_freelist out of _Py_async_gen_fr… (gh-115546)	2024-02-17 10:03:10 +09:00
Sam Gross	5903190727	gh-115103: Implement delayed memory reclamation (QSBR) (#115180 ) This adds a safe memory reclamation scheme based on FreeBSD's "GUS" and quiescent state based reclamation (QSBR). The API provides a mechanism for callers to detect when it is safe to free memory that may be concurrently accessed by readers.	2024-02-16 15:25:19 -05:00
Sam Gross	b24c9161a6	gh-112529: Make the GC scheduling thread-safe (#114880 ) The GC keeps track of the number of allocations (less deallocations) since the last GC. This buffers the count in thread-local state and uses atomic operations to modify the per-interpreter count. The thread-local buffering avoids contention on shared state. A consequence is that the GC scheduling is not as precise, so "test_sneaky_frame_object" is skipped because it requires that the GC be run exactly after allocating a frame object.	2024-02-16 11:22:27 -05:00
Donghee Na	321d13fd2b	gh-111968: Split _Py_dictkeys_freelist out of _Py_dict_freelist (gh-115505)	2024-02-16 01:01:36 +00:00
Dino Viehland	454d7963e3	gh-113743: Use per-interpreter locks for types (#115541 ) Move type-lock to per-interpreter lock to avoid heavy contention in interpreters test	2024-02-15 16:28:31 -08:00
Dino Viehland	ae460d450a	gh-113743: Make the MRO cache thread-safe in free-threaded builds (#113930 ) Makes _PyType_Lookup thread safe, including: Thread safety of the underlying cache. Make mutation of mro and type members thread safe Also _PyType_GetMRO and _PyType_GetBases are currently returning borrowed references which aren't safe.	2024-02-15 10:54:57 -08:00
monkeyman192	298bcdc185	gh-112433: Add optional _align_ attribute to ctypes.Structure (GH-113790)	2024-02-15 16:40:20 +02:00
Sam Gross	ad4f909e0e	gh-115432: Add critical section variant that handles a NULL object (#115433 ) This adds `Py_XBEGIN_CRITICAL_SECTION` and `Py_XEND_CRITICAL_SECTION`, which accept a possibly NULL object as an argument. If the argument is NULL, then nothing is locked or unlocked. Otherwise, they behave like `Py_BEGIN/END_CRITICAL_SECTION`.	2024-02-15 08:37:54 -05:00
mpage	dc978f6ab6	gh-112050: Make collections.deque thread-safe in free-threaded builds (#113830 ) Use critical sections to make deque methods that operate on mutable state thread-safe when the GIL is disabled. This is mostly accomplished by using the @critical_section Argument Clinic directive, though there are a few places where this was not possible and critical sections had to be manually acquired/released.	2024-02-15 09:22:47 +01:00
Sam Gross	326119d373	gh-112529: Use _PyThread_Id() in mimalloc in free-threaded build (#115488 ) The free-threaded GC uses mimallocs segment thread IDs to restore the overwritten `ob_tid` thread ids in PyObjects. For that reason, it's important that PyObjects and mimalloc use the same identifiers.	2024-02-14 16:41:29 -05:00
mpage	a95b1a56bb	gh-115041: Add wrappers that are atomic only in free-threaded builds (#115046 ) These are intended to be used in places where atomics are required in free-threaded builds but not in the default build. We don't want to introduce the potential performance overhead of an atomic operation in the default build.	2024-02-14 15:15:05 -05:00
Sam Gross	17773fcb86	gh-115441: Fix missing braces warning (#115460 ) Removes `_py_object_state_INIT`. We want to initialize the `object_state` field to zero.	2024-02-14 12:27:39 -05:00
Donghee Na	f15795c9a0	gh-111968: Rename freelist related struct names to Eric's suggestion (gh-115329)	2024-02-14 00:32:51 +00:00
Eric Snow	514b1c91b8	gh-76785: Improved Subinterpreters Compatibility with 3.12 (gh-115424) For the most part, these changes make is substantially easier to backport subinterpreter-related code to 3.12, especially the related modules (e.g. _xxsubinterpreters). The main motivation is to support releasing a PyPI package with the 3.13 capabilities compiled for 3.12. A lot of the changes here involve either hiding details behind macros/functions or splitting up some files.	2024-02-13 14:56:49 -07:00
Mark Shannon	681778c56a	GH-113710: Improve `_SET_IP` and `_CHECK_VALIDITY` (GH-115248)	2024-02-13 16:28:19 +00:00
Mark Shannon	f9f6156c5a	GH-113710: Backedge counter improvements. (GH-115166)	2024-02-13 14:16:37 +00:00
Ken Jin	7cce857622	gh-114058: Foundations of the Tier2 redundancy eliminator (GH-115085) --------- Co-authored-by: Mark Shannon <9448417+markshannon@users.noreply.github.com> Co-authored-by: Jules <57632293+JuliaPoo@users.noreply.github.com> Co-authored-by: Guido van Rossum <gvanrossum@users.noreply.github.com>	2024-02-13 21:24:48 +08:00
Steve Dower	ea25f32d5f	gh-89240: Enable multiprocessing on Windows to use large process pools (GH-107873) We add _winapi.BatchedWaitForMultipleObjects to wait for larger numbers of handles. This is an internal module, hence undocumented, and should be used with caution. Check the docstring for info before using BatchedWaitForMultipleObjects.	2024-02-13 00:28:35 +00:00
mpage	de7d67b19b	gh-114271: Make `PyInterpreterState.threads.count` thread-safe in free-threaded builds (gh-115093) Use atomics to mutate PyInterpreterState.threads.count.	2024-02-12 10:44:00 -07:00
Petr Viktorin	879f4546bf	gh-110850: Add PyTime_t C API (GH-115215) * gh-110850: Add PyTime_t C API Add PyTime_t API: * PyTime_t type. * PyTime_MIN and PyTime_MAX constants. * PyTime_AsSecondsDouble(), PyTime_Monotonic(), PyTime_PerfCounter() and PyTime_GetSystemClock() functions. Co-authored-by: Victor Stinner <vstinner@python.org>	2024-02-12 18:13:10 +01:00
Mark Shannon	8144661017	GH-113710: Fix updating of dict version tag and add watched dict stats (GH-115221)	2024-02-12 16:07:38 +00:00
Donghee Na	d4d5bae147	gh-111968: Refactor _PyXXX_Fini to integrate with _PyObject_ClearFreeLists (gh-114899)	2024-02-10 00:57:04 +00:00
Sam Gross	a3af3cb4f4	gh-110481: Implement inter-thread queue for biased reference counting (#114824 ) Biased reference counting maintains two refcount fields in each object: `ob_ref_local` and `ob_ref_shared`. The true refcount is the sum of these two fields. In some cases, when refcounting operations are split across threads, the ob_ref_shared field can be negative (although the total refcount must be at least zero). In this case, the thread that decremented the refcount requests that the owning thread give up ownership and merge the refcount fields.	2024-02-09 17:08:32 -05:00
Carl Meyer	8f0998e844	gh-114828: parenthesize non-atomic macro definitions in pycore_symtable.h (#115143 )	2024-02-07 13:19:47 -07:00
Mark Shannon	8a3c499ffe	GH-108362: Revert "GH-108362: Incremental GC implementation (GH-108038)" (#115132 ) Revert "GH-108362: Incremental GC implementation (GH-108038)" This reverts commit `36518e69d7`.	2024-02-07 12:38:34 +00:00
Dino Viehland	92abb01240	gh-112075: Add critical sections for most dict APIs (#114508 ) Starts adding thread safety to dict objects. Use @critical_section for APIs which are exposed via argument clinic and don't directly correlate with a public C API which needs to acquire the lock Use a _lock_held suffix for keeping changes to complicated functions simple and just wrapping them with a critical section Acquire and release the lock in an existing function where it won't be overly disruptive to the existing logic	2024-02-06 14:03:43 -08:00
Sam Gross	b6228b521b	gh-115035: Mark ThreadHandles as non-joinable earlier after forking (#115042 ) This marks dead ThreadHandles as non-joinable earlier in `PyOS_AfterFork_Child()` before we execute any Python code. The handles are stored in a global linked list in `_PyRuntimeState` because `fork()` affects the entire process.	2024-02-06 14:45:04 -05:00
Mariusz Felisiak	1a10437a14	gh-91602: Add iterdump() support for filtering database objects (#114501 ) Add optional 'filter' parameter to iterdump() that allows a "LIKE" pattern for filtering database objects to dump. Co-authored-by: Erlend E. Aasland <erlend@python.org>	2024-02-06 12:34:56 +01:00
Dino Viehland	bcccf1fb63	gh-112075: Add gc shared bits (#114931 ) Add GC shared flags for objects to the GC bit states in free-threaded builds	2024-02-05 10:35:59 -08:00
Mark Shannon	36518e69d7	GH-108362: Incremental GC implementation (GH-108038)	2024-02-05 18:28:51 +00:00
Andrew Rogers	b3f0b698da	gh-104530: Enable native Win32 condition variables by default (GH-104531)	2024-02-02 13:50:51 +00:00
Mark Shannon	0e71a295e9	GH-113710: Add a "globals to constants" pass (GH-114592) Converts specializations of `LOAD_GLOBAL` into constants during tier 2 optimization.	2024-02-02 12:14:34 +00:00
Donghee Na	13907968d7	gh-111968: Use per-thread freelists for dict in free-threading (gh-114323)	2024-02-01 20:53:53 +00:00
Sam Gross	587d480203	gh-112529: Remove PyGC_Head from object pre-header in free-threaded build (#114564 ) * gh-112529: Remove PyGC_Head from object pre-header in free-threaded build This avoids allocating space for PyGC_Head in the free-threaded build. The GC implementation for free-threaded CPython does not use the PyGC_Head structure. * The trashcan mechanism uses the `ob_tid` field instead of `_gc_prev` in the free-threaded build. * The GDB libpython.py file now determines the offset of the managed dict field based on whether the running process is a free-threaded build. Those are identified by the `ob_ref_local` field in PyObject. * Fixes `_PySys_GetSizeOf()` which incorrectly incorrectly included the size of `PyGC_Head` in the size of static `PyTypeObject`.	2024-02-01 12:29:19 -08:00
He Weidong	97cc58f977	Fix comment in pycore_runtime.h (GH-110540)	2024-02-01 19:27:53 +00:00
Donghee Na	7b9d406729	gh-112087: Make PyList_{Append,Size,GetSlice} to be thread-safe (gh-114651)	2024-02-01 08:58:08 +09:00
Eugene Toder	1f515e8a10	gh-112919: Speed-up datetime, date and time.replace() (GH-112921) Use argument clinic and call new_* functions directly. This speeds up these functions 6x to 7.5x when calling with keyword arguments.	2024-01-30 15:19:46 +00:00
Dino Viehland	0cd9bacb8a	gh-112075: Dictionary global version counter should use atomic increments (#114568 ) Dictionary global version counter should use atomic increments	2024-01-29 09:47:54 -08:00
mpage	c87233fd3f	gh-112050: Adapt collections.deque to Argument Clinic (#113963 )	2024-01-29 15:08:23 +00:00
Brandt Bucher	f6d9e5926b	GH-113464: Add a JIT backend for tier 2 (GH-113465) Add an option (--enable-experimental-jit for configure-based builds or --experimental-jit for PCbuild-based ones) to build an experimental just-in-time compiler, based on copy-and-patch (https://fredrikbk.com/publications/copy-and-patch.pdf). See Tools/jit/README.md for more information on how to install the required build-time tooling.	2024-01-28 18:48:48 -08:00
Neil Schemenauer	7a7bce5a0a	gh-113055: Use pointer for interp->obmalloc state (gh-113412) For interpreters that share state with the main interpreter, this points to the same static memory structure. For interpreters with their own obmalloc state, it is heap allocated. Add free_obmalloc_arenas() which will free the obmalloc arenas and radix tree structures for interpreters with their own obmalloc state. Co-authored-by: Eric Snow <ericsnowcurrently@gmail.com>	2024-01-26 19:38:14 -08:00
Donghee Na	699779256e	gh-111968: Unify freelist naming schema to Eric's suggestion (gh-114581)	2024-01-27 00:25:16 +09:00
Sam Gross	b52fc70d1a	gh-112529: Implement GC for free-threaded builds (#114262 ) * gh-112529: Implement GC for free-threaded builds This implements a mark and sweep GC for the free-threaded builds of CPython. The implementation relies on mimalloc to find GC tracked objects (i.e., "containers").	2024-01-25 10:27:36 -08:00
Dino Viehland	4850410b60	gh-112075: Add try-incref functions from nogil branch for use in dict thread safety (#114512 ) * Bring in a subset of biased reference counting: https://github.com/colesbury/nogil/commit/b6b12a9a94e The NoGIL branch has functions for attempting to do an incref on an object which may or may not be in flight. This just brings those functions over so that they will be usable from in the dict implementation to get items w/o holding a lock. There's a handful of small simple modifications: Adding inline to the force inline functions to avoid a warning, and switching from _Py_ALWAYS_INLINE to Py_ALWAYS_INLINE as that's available Remove _Py_REF_LOCAL_SHIFT as it doesn't exist yet (and is currently 0 in the 3.12 nogil branch anyway) ob_ref_shared is currently Py_ssize_t and not uint32_t, so use that _PY_LIKELY doesn't exist, so drop it _Py_ThreadLocal becomes _Py_IsOwnedByCurrentThread Add '_PyInterpreterState_GET()' to _Py_IncRefTotal calls. Co-Authored-By: Sam Gross <colesbury@gmail.com>	2024-01-25 09:34:03 -08:00
Michael Droettboom	ea3cd0498c	gh-114312: Collect stats for unlikely events (GH-114493)	2024-01-25 11:10:51 +00:00
Mark Shannon	981d172f7f	GH-112354: `END_FOR` instruction to only pop one value. (GH-114247) * Compiler emits END_FOR; POP_TOP instead of END_FOR. To support tier 2 side exits in loops.	2024-01-24 15:10:17 +00:00
Mark Shannon	384429d1c0	GH-113710: Add a tier 2 peephole optimization pass. (GH-114487) * Convert _LOAD_CONST to inline versions * Remove PEP 523 checks	2024-01-24 12:08:31 +00:00
Sam Gross	441affc9e7	gh-111964: Implement stop-the-world pauses (gh-112471) The `--disable-gil` builds occasionally need to pause all but one thread. Some examples include: * Cyclic garbage collection, where this is often called a "stop the world event" * Before calling `fork()`, to ensure a consistent state for internal data structures * During interpreter shutdown, to ensure that daemon threads aren't accessing Python objects This adds the following functions to implement global and per-interpreter pauses: * `_PyEval_StopTheWorldAll()` and `_PyEval_StartTheWorldAll()` (for the global runtime) * `_PyEval_StopTheWorld()` and `_PyEval_StartTheWorld()` (per-interpreter) (The function names may change.) These functions are no-ops outside of the `--disable-gil` build.	2024-01-23 11:08:23 -07:00
Sam Gross	412920a41e	gh-112532: Improve mimalloc page visiting (#114133 ) This adds support for visiting abandoned pages in mimalloc and improves the performance of the page visiting code. Abandoned pages contain memory blocks from threads that have exited. At some point, they may be later reclaimed by other threads. We still need to visit those pages in the free-threaded GC because they contain live objects. This also reduces the overhead of visiting mimalloc pages: * Special cases for full, empty, and pages containing only a single block. * Fix free_map to use one bit instead of one byte per block. * Use fast integer division by a constant algorithm when computing block offset from block size and index.	2024-01-22 13:10:21 -08:00
Sam Gross	1d6d5e854c	gh-112529: Use GC heaps for GC allocations in free-threaded builds (gh-114157) * gh-112529: Use GC heaps for GC allocations in free-threaded builds The free-threaded build's garbage collector implementation will need to find GC objects by traversing mimalloc heaps. This hooks up the allocation calls with the correct heaps by using a thread-local "current_obj_heap" variable. * Refactor out setting heap based on type	2024-01-21 01:14:45 +09:00
Donghee Na	7fa511ba57	gh-111968: Use per-thread freelists for generator in free-threading (gh-114189)	2024-01-18 18:15:00 +00:00
Sam Gross	b331381485	gh-112529: Track if debug allocator is used as underlying allocator (#113747 ) * gh-112529: Track if debug allocator is used as underlying allocator The GC implementation for free-threaded builds will need to accurately detect if the debug allocator is used because it affects the offset of the Python object from the beginning of the memory allocation. The current implementation of `_PyMem_DebugEnabled` only considers if the debug allocator is the outer-most allocator; it doesn't handle the case of "hooks" like tracemalloc being used on top of the debug allocator. This change enables more accurate detection of the debug allocator by tracking when debug hooks are enabled. * Simplify _PyMem_DebugEnabled	2024-01-16 13:42:15 -08:00
Donghee Na	867f59f234	gh-111968: Use per-thread freelists for PyContext in free-threading (gh-114122)	2024-01-16 16:14:56 +00:00
Serhiy Storchaka	d2d8332f71	gh-113626: Add allow_code parameter in marshal functions (GH-113648) Passing allow_code=False prevents serialization and de-serialization of code objects which is incompatible between Python versions.	2024-01-16 18:05:15 +02:00
Donghee Na	3eae76554b	gh-111968: Use per-thread slice_cache in free-threading (gh-113972)	2024-01-16 00:38:57 +09:00
Mark Shannon	ac10947ba7	GH-112354: `_GUARD_IS_TRUE_POP` side-exits to target the next instruction, not themselves. (GH-114078)	2024-01-15 11:41:06 +00:00
Joseph Pearson	9a71750a29	Fix a grammatical error in `pycore_pymem.h` (#112993 )	2024-01-12 22:25:52 +00:00
Ken Jin	ac92527c08	gh-113710: Add types to the interpreter DSL (#113711 ) Co-authored-by: Jules <57632293+JuliaPoo@users.noreply.github.com> Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2024-01-13 01:30:27 +08:00
Brandt Bucher	30e6cbdba2	GH-113860: Get rid of `_PyUOpExecutorObject` (GH-113954)	2024-01-12 11:58:23 +00:00
Donghee Na	2e7577b622	gh-111968: Use per-thread freelists for tuple in free-threading (gh-113921)	2024-01-12 03:46:28 +09:00
Nikita Sobolev	2ac4cf4743	gh-112640: Add `kwdefaults` parameter to `types.FunctionType.__new__` (#112641 )	2024-01-11 00:42:30 -08:00
Donghee Na	c65ae26f2b	gh-111968: Unify naming scheme for freelist (gh-113919)	2024-01-11 08:51:51 +09:00
Sam Gross	73ae2023a7	gh-113753: Clear finalized bit when putting PyAsyncGenASend back into free list (#113754 )	2024-01-10 10:18:38 -08:00
Donghee Na	f728f7242c	gh-111968: Use per-thread freelists for float in free-threading (gh-113886)	2024-01-10 15:47:13 +00:00
Mark Shannon	a0c9cf9456	GH-113860: All executors are now defined in terms of micro ops. Convert counter executor to use uops. (GH-113864)	2024-01-10 15:44:34 +00:00
Donghee Na	57bdc6c30d	gh-111968: Introduce _PyFreeListState and _PyFreeListState_GET API (gh-113584)	2024-01-10 08:04:41 +09:00
Pablo Galindo Salgado	a03ec20bcd	gh-110721: Remove unused code from suggestions.c after moving PyErr_Display to use the traceback module (#113712 )	2024-01-08 15:10:45 +00:00
Sam Gross	99854ce170	gh-113688: Split up gcmodule.c (gh-113715) This splits part of Modules/gcmodule.c of into Python/gc.c, which now contains the core garbage collection implementation. The Python module remain in the Modules/gcmodule.c file.	2024-01-05 12:17:16 -08:00
Sam Gross	0b7476080b	gh-112532: Tag mimalloc heaps and pages (#113742 ) * gh-112532: Tag mimalloc heaps and pages Mimalloc pages are data structures that contain contiguous allocations of the same block size. Note that they are distinct from operating system pages. Mimalloc pages are contained in segments. When a thread exits, it abandons any segments and contained pages that have live allocations. These segments and pages may be later reclaimed by another thread. To support GC and certain thread-safety guarantees in free-threaded builds, we want pages to only be reclaimed by the corresponding heap in the claimant thread. For example, we want pages containing GC objects to only be claimed by GC heaps. This allows heaps and pages to be tagged with an integer tag that is used to ensure that abandoned pages are only claimed by heaps with the same tag. Heaps can be initialized with a tag (0-15); any page allocated by that heap copies the corresponding tag. * Fix conversion warning	2024-01-05 12:08:50 -08:00
Sam Gross	fcb3c2a444	gh-112532: Isolate abandoned segments by interpreter (#113717 ) * gh-112532: Isolate abandoned segments by interpreter Mimalloc segments are data structures that contain memory allocations along with metadata. Each segment is "owned" by a thread. When a thread exits, it abandons its segments to a global pool to be later reclaimed by other threads. This changes the pool to be per-interpreter instead of process-wide. This will be important for when we use mimalloc to find GC objects in the `--disable-gil` builds. We want heaps to only store Python objects from a single interpreter. Absent this change, the abandoning and reclaiming process could break this isolation. * Add missing '&_mi_abandoned_default' to 'tld_empty'	2024-01-04 22:21:40 +00:00
Donghee Na	0c3455a969	gh-111926: Set up basic sementics of weakref API for freethreading (gh-113621) --------- Co-authored-by: Sam Gross <colesbury@gmail.com>	2024-01-03 13:25:27 +00:00
Sam Gross	acf3bcc886	gh-112532: Use separate mimalloc heaps for GC objects (gh-113263) * gh-112532: Use separate mimalloc heaps for GC objects In `--disable-gil` builds, we now use four separate heaps in anticipation of using mimalloc to find GC objects when the GIL is disabled. To support this, we also make a few changes to mimalloc: * `mi_heap_t` and `mi_tld_t` initialization is split from allocation. This allows us to have a `mi_tld_t` per-`PyThreadState`, which is important to keep interpreter isolation, since the same OS thread may run in multiple interpreters (using different PyThreadStates.) * Heap abandoning (mi_heap_collect_ex) can now be called from a different thread than the one that created the heap. This is necessary because we may clear and delete the containing PyThreadStates from a different thread during finalization and after fork(). * Use enum instead of defines and guard mimalloc includes. * The enum typedef will be convenient for future PRs that use the type. * Guarding the mimalloc includes allows us to unconditionally include pycore_mimalloc.h from other header files that rely on things like `struct _mimalloc_thread_state`. * Only define _mimalloc_thread_state in Py_GIL_DISABLED builds	2023-12-27 01:53:20 +09:00
Yilei Yang	48c49739f5	gh-106905: Use separate structs to track recursion depth in each PyAST_mod2obj call. (GH-113035) Co-authored-by: Gregory P. Smith [Google LLC] <greg@krypto.org>	2023-12-25 19:36:59 +02:00
Mark Shannon	723f4d6698	GH-111485: Delete the old generator code. (GH-113321)	2023-12-21 12:46:28 +00:00
Mark Shannon	e96f26083b	GH-111485: Generate instruction and uop metadata (GH-113287)	2023-12-20 14:27:25 +00:00
Christopher Chavez	a545a86ec6	gh-111178: Make slot functions in typeobject.c have compatible types (GH-112752)	2023-12-20 15:13:44 +01:00
Sam Gross	5ae75e1be2	gh-111964: Add _PyRWMutex a "readers-writer" lock (gh-112859) This adds `_PyRWMutex`, a "readers-writer" lock, which wil be used to serialize global stop-the-world pauses with per-interpreter pauses.	2023-12-15 18:56:55 -07:00
Mark Shannon	e24eccbc1c	GH-111485: Sort metadata tables for easier checking of future diffs (GH-113101)	2023-12-14 16:41:52 +00:00
Mark Shannon	6873555955	GH-112354: Treat _EXIT_TRACE like an unconditional side exit (GH-113104)	2023-12-14 14:26:44 +00:00
Eric Snow	c6e614fd81	gh-76785: Avoid Pickled TracebackException for Propagated Subinterpreter Exceptions (gh-113036) We need the TracebackException of uncaught exceptions for a single purpose: the error display. Thus we only need to pass the formatted error display between interpreters. Passing a pickled TracebackException is overkill.	2023-12-13 00:31:30 +00:00
Sam Gross	a3c031884d	gh-112723: Call `PyThreadState_Clear()` from the correct interpreter (#112776 ) The `PyThreadState_Clear()` function must only be called with the GIL held and must be called from the same interpreter as the passed in thread state. Otherwise, any Python objects on the thread state may be destroyed using the wrong interpreter, leading to memory corruption. This is also important for `Py_GIL_DISABLED` builds because free lists will be associated with PyThreadStates and cleared in `PyThreadState_Clear()`. This fixes two places that called `PyThreadState_Clear()` from the wrong interpreter and adds an assertion to `PyThreadState_Clear()`.	2023-12-12 17:20:21 -07:00
Eric Snow	8a4c1f3ff1	gh-76785: Show the Traceback for Uncaught Subinterpreter Exceptions (gh-113034) When an exception is uncaught in Interpreter.exec_sync(), it helps to show that exception's error display if uncaught in the calling interpreter. We do so here by generating a TracebackException in the subinterpreter and passing it between interpreters using pickle.	2023-12-13 00:00:54 +00:00
Mark Shannon	956023826a	GH-108866: Guarantee forward progress in executors. (GH-113006)	2023-12-12 19:02:24 +00:00
Eric Snow	86a77f4e1a	gh-76785: Fixes for test.support.interpreters (gh-112982) This involves a number of changes for PEP 734.	2023-12-12 08:24:31 -07:00
Mark Shannon	0c55f27060	GH-111485: Factor out tier 2 code generation from the rest of the interpreter code generator (GH-112968)	2023-12-12 12:12:17 +00:00
Sam Gross	fdee7b7b3e	gh-112532: Require mimalloc in `--disable-gil` builds (gh-112883)	2023-12-12 09:04:48 +09:00
Mark Shannon	c27e9d5d17	GH-111485: Factor out generation of uop IDs from cases generator. (GH-112877)	2023-12-11 14:14:36 +00:00
Neil Schemenauer	890ce430d9	gh-112867: fix for WITH_PYMALLOC_RADIX_TREE=0 (GH-112885) The _obmalloc_usage structure is only defined if the obmalloc radix tree is enabled.	2023-12-09 13:50:48 -08:00
Sam Gross	cf6110ba13	gh-111924: Use PyMutex for Runtime-global Locks. (gh-112207) This replaces some usages of PyThread_type_lock with PyMutex, which does not require memory allocation to initialize. This simplifies some of the runtime initialization and is also one step towards avoiding changing the default raw memory allocator during initialize/finalization, which can be non-thread-safe in some circumstances.	2023-12-07 12:33:40 -07:00
Sam Gross	db460735af	gh-112538: Add internal-only _PyThreadStateImpl "wrapper" for PyThreadState (gh-112560) Every PyThreadState instance is now actually a _PyThreadStateImpl. It is safe to cast from `PyThreadState` to `_PyThreadStateImpl` and back. The _PyThreadStateImpl will contain fields that we do not want to expose in the public C API.	2023-12-07 12:11:45 -07:00
Sam Gross	2d76be251d	gh-111962: Make dtoa thread-safe in `--disable-gil` builds. (#112049 ) This updates `dtoa.c` to avoid using the Bigint free-list in --disable-gil builds and to pre-computes the needed powers of 5 during interpreter initialization. * gh-111962: Make dtoa thread-safe in `--disable-gil` builds. This avoids using the Bigint free-list in `--disable-gil` builds and pre-computes the needed powers of 5 during interpreter initialization. * Fix size of cached powers of 5 array. We need the powers of 5 up to 5*512 because we only jump straight to underflow when the exponent is less than -512 (or larger than 308). Rename Py_NOGIL to Py_GIL_DISABLED * Changes from review * Fix assertion placement	2023-12-07 13:47:55 +00:00
andrewluotechnologies	9c3458e058	gh-112125: Fix None.__ne__(None) returning NotImplemented instead of False (#112504 )	2023-12-07 13:56:01 +01:00
Mark Shannon	b449415b2f	GH-111485: Separate out parsing, analysis and code-gen phases of tier 1 code generator (GH-112299)	2023-12-07 12:49:40 +00:00
Victor Stinner	828451dfde	gh-111545: Add Py_HashPointer() function (#112096 ) * Implement _Py_HashPointerRaw() as a static inline function. * Add Py_HashPointer() tests to test_capi.test_hash. * Keep _Py_HashPointer() function as an alias to Py_HashPointer().	2023-12-06 15:09:22 +01:00
Victor Stinner	a74902a14c	gh-106550: Fix sign conversion in pycore_code.h (#112613 ) Fix sign conversion in pycore_code.h: use unsigned integers and cast explicitly when needed.	2023-12-04 11:42:58 +01:00
Victor Stinner	5c5022b862	gh-112567: Add _PyTimeFraction C API (#112568 ) Use a fraction internally in the _PyTime API to reduce the risk of integer overflow: simplify the fraction using Greatest Common Divisor (GCD). The fraction API is used by time functions: perf_counter(), monotonic() and process_time(). For example, QueryPerformanceFrequency() usually returns 10 MHz on Windows 10 and newer. The fraction SEC_TO_NS / frequency = 1_000_000_000 / 10_000_000 can be simplified to 100 / 1. * Add _PyTimeFraction type. * Add functions: * _PyTimeFraction_Set() * _PyTimeFraction_Mul() * _PyTimeFraction_Resolution() * No longer check "numer * denom <= _PyTime_MAX" in _PyTimeFraction_Set(). _PyTimeFraction_Mul() uses _PyTime_Mul() which handles integer overflow.	2023-12-01 19:50:10 +01:00
Victor Stinner	05a370abd6	gh-112567: Add _Py_GetTicksPerSecond() function (#112587 ) * Move _PyRuntimeState.time to _posixstate.ticks_per_second and time_module_state.ticks_per_second. * Add time_module_state.clocks_per_second. * Rename _PyTime_GetClockWithInfo() to py_clock(). * Rename _PyTime_GetProcessTimeWithInfo() to py_process_time(). * Add process_time_times() helper function, called by py_process_time(). * os.times() is now always built: no longer rely on HAVE_TIMES.	2023-12-01 17:05:56 +01:00
Pablo Galindo Salgado	a73aa48e6b	gh-112367: Only free perf trampoline arenas at shutdown (#112368 ) Signed-off-by: Pablo Galindo <pablogsal@gmail.com>	2023-12-01 13:20:51 +00:00
Irit Katriel	07ebd46f9e	gh-112519: Make it possible to specify instruction flags for pseudo instructions in bytecodes.c (#112520 )	2023-11-30 11:03:30 +00:00
Kirill Podoprigora	0785c68559	gh-111972: Make Unicode name C APIcapsule initialization thread-safe (#112249 )	2023-11-30 11:12:49 +01:00
Guido van Rossum	e723700190	Rename ...Uop... to ...UOp... (uppercase O) for consistency (#112327 ) * Rename _PyUopExecute to _PyUOpExecute (uppercase O) for consistency * Also rename _PyUopName and _PyUOp_Replacements, and some output strings	2023-11-28 17:10:11 -08:00
Grant Ramsay	e954ac7205	gh-63284: Add support for TLS-PSK (pre-shared key) to the ssl module (#103181 ) Add support for TLS-PSK (pre-shared key) to the ssl module. --------- Co-authored-by: Oleg Iarygin <oleg@arhadthedev.net> Co-authored-by: Gregory P. Smith <greg@krypto.org>	2023-11-27 04:01:44 +00:00
Eric Snow	9e56eedd01	gh-76785: Return an "excinfo" Object From Interpreter.run() (gh-111573)	2023-11-23 00:55:00 +00:00
Eric Snow	790db85c77	gh-76785: Add _PyType_GetModuleName() to the Internal C-API (gh-112323) The new function corresponds to the existing (public) PyType_GetName() and PyType_GetQualName().	2023-11-22 15:03:33 -07:00
Guido van Rossum	8deb8bc2e5	gh-112287: Speed up Tier 2 (uop) interpreter a little (#112286 ) This makes the Tier 2 interpreter a little faster. I calculated by about 3%, though I hesitate to claim an exact number. This starts by doubling the trace size limit (to 512), making it more likely that loops fit in a trace. The rest of the approach is to only load `oparg` and `operand` in cases that use them. The code generator know when these are used. For `oparg`, it will conditionally emit ``` oparg = CURRENT_OPARG(); ``` at the top of the case block. (The `oparg` variable may be referenced multiple times by the instructions code block, so it must be in a variable.) For `operand`, it will use `CURRENT_OPERAND()` directly instead of referencing the `operand` variable, which no longer exists. (There is only one place where this will be used.)	2023-11-20 11:25:32 -08:00
Guido van Rossum	1995955173	gh-106529: Make FOR_ITER a viable uop (#112134 ) This uses the new mechanism whereby certain uops are replaced by others during translation, using the `_PyUop_Replacements` table. We further special-case the `_FOR_ITER_TIER_TWO` uop to update the deoptimization target to point just past the corresponding `END_FOR` opcode. Two tiny code cleanups are also part of this PR.	2023-11-20 10:08:53 -08:00
Hugo van Kemenade	3b3ec0d77f	gh-111863: Rename `Py_NOGIL` to `Py_GIL_DISABLED` (#111864 ) Rename Py_NOGIL to Py_GIL_DISABLED	2023-11-20 15:52:00 +02:00
Donghee Na	7c9f2677fb	gh-111926: Update _PyWeakref_IS_DEAD to be thread-safe (gh-112267)	2023-11-20 07:36:45 +09:00
Guido van Rossum	be0bd54c6b	gh-106529: Cleanups split off gh-112134 (#112214 ) - Double max trace size to 256 - Add a dependency on executor_cases.c.h for ceval.o - Mark `_SPECIALIZE_UNPACK_SEQUENCE` as `TIER_ONE_ONLY` - Add debug output back showing the optimized trace - Bunch of cleanups to Tools/cases_generator/	2023-11-17 11:49:42 -08:00
Sam Gross	446f18a911	gh-111956: Add thread-safe one-time initialization. (gh-111960)	2023-11-16 12:19:54 -07:00
Victor Stinner	58469244be	gh-112026: Restore removed private C API (#112115 ) Restore removed private C API functions, macros and structures which have no simple replacement for now: * _PyDict_GetItem_KnownHash() * _PyDict_NewPresized() * _PyHASH_BITS * _PyHASH_IMAG * _PyHASH_INF * _PyHASH_MODULUS * _PyHASH_MULTIPLIER * _PyLong_Copy() * _PyLong_FromDigits() * _PyLong_New() * _PyLong_Sign() * _PyObject_CallMethodId() * _PyObject_CallMethodNoArgs() * _PyObject_CallMethodOneArg() * _PyObject_CallOneArg() * _PyObject_EXTRA_INIT * _PyObject_FastCallDict() * _PyObject_GetAttrId() * _PyObject_Vectorcall() * _PyObject_VectorcallMethod() * _PyStack_AsDict() * _PyThread_CurrentFrames() * _PyUnicodeWriter structure * _PyUnicodeWriter_Dealloc() * _PyUnicodeWriter_Finish() * _PyUnicodeWriter_Init() * _PyUnicodeWriter_Prepare() * _PyUnicodeWriter_PrepareKind() * _PyUnicodeWriter_WriteASCIIString() * _PyUnicodeWriter_WriteChar() * _PyUnicodeWriter_WriteLatin1String() * _PyUnicodeWriter_WriteStr() * _PyUnicodeWriter_WriteSubstring() * _PyUnicode_AsString() * _PyUnicode_FromId() * _PyVectorcall_Function() * _Py_HashDouble() * _Py_HashPointer() * _Py_IDENTIFIER() * _Py_c_abs() * _Py_c_diff() * _Py_c_neg() * _Py_c_pow() * _Py_c_prod() * _Py_c_quot() * _Py_c_sum() * _Py_static_string() * _Py_static_string_init()	2023-11-15 16:38:31 +00:00
Mark Shannon	4bbb367ba6	GH-111848: Set the IP when de-optimizing (GH-112065) * Replace jumps with deopts in tier 2 * Fewer special cases of uop names * Add target field to uop IR * Remove more redundant SET_IP and _CHECK_VALIDITY micro-ops * Extend whitelist of non-escaping API functions.	2023-11-15 15:48:58 +00:00
Mark Shannon	a519b87958	GH-111848: Convert remaining jumps to deopts into tier 2 code. (GH-112045)	2023-11-14 15:30:33 +00:00
Victor Stinner	4f04172c92	gh-111262: Add PyDict_Pop() function (#112028 ) _PyDict_Pop_KnownHash(): remove the default value and the return type becomes an int. Co-authored-by: Stefan Behnel <stefan_ml@behnel.de> Co-authored-by: Antoine Pitrou <pitrou@free.fr>	2023-11-14 12:51:00 +00:00
Irit Katriel	36aab34fab	gh-107149: make new opcode util functions private rather than public and unstable (#112042 )	2023-11-14 00:31:02 +00:00
Serhiy Storchaka	771bd3c94a	Add private _PyUnicode_AsUTF8NoNUL() function (GH-111957) Like PyUnicode_AsUTF8(), but check for embedded null characters.	2023-11-10 21:31:36 +02:00
Mark Shannon	25c4956488	GH-109369: Exit tier 2 if executor is invalid (GH-111657)	2023-11-09 11:19:51 +00:00
Irit Katriel	30ec968bef	gh-111354: remove comparisons with enum values, variable reuse, unused imports in genobject.c (#111708 )	2023-11-09 10:27:20 +00:00
Sam Gross	31c90d5838	gh-111569: Implement Python critical section API (gh-111571) Critical sections are helpers to replace the global interpreter lock with finer grained locking. They provide similar guarantees to the GIL and avoid the deadlock risk that plain locking involves. Critical sections are implicitly ended whenever the GIL would be released. They are resumed when the GIL would be acquired. Nested critical sections behave as if the sections were interleaved.	2023-11-08 15:39:29 -07:00
Mark Shannon	06efb60264	GH-111848: Tidy up tier 2 handling of FOR_ITER specialization by using DEOPT_IF instead of jumps. (GH-111849)	2023-11-08 13:31:55 +00:00
Mark Shannon	931f4438c9	GH-111485: Allow arbitrary annotations on instructions and micro-ops. (GH-111697)	2023-11-07 09:42:39 +00:00
Brandt Bucher	3e99c9cbf6	GH-111485: Make BEFORE_WITH a uop (GH-111812)	2023-11-06 16:42:49 -08:00
Eric Snow	d4426e8d00	gh-76785: Move _Py_excinfo Functions Out of the Internal C-API (gh-111715) I added _Py_excinfo to the internal API (and added its functions in Python/errors.c) in gh-111530 (`9322ce9`). Since then I've had a nagging sense that I should have added the type and functions in its own PR. While I do plan on using _Py_excinfo outside crossinterp.c very soon (see gh-111572/gh-111573), I'd still feel more comfortable if the _Py_excinfo stuff went in as its own PR. Hence, here we are. (FWIW, I may combine that with gh-111572, which I may, in turn, combine with gh-111573. We'll see.)	2023-11-06 11:09:22 -07:00
Antoine Pitrou	0e9c364f4a	GH-110829: Ensure Thread.join() joins the OS thread (#110848 ) Joining a thread now ensures the underlying OS thread has exited. This is required for safer fork() in multi-threaded processes. --------- Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2023-11-04 13:59:24 +00:00
Tian Gao	e0afed7e27	gh-103615: Use local events for opcode tracing (GH-109472) * Use local monitoring for opcode trace * Remove f_opcode_trace_set * Add test for setting f_trace_opcodes after settrace	2023-11-03 16:39:50 +00:00
Michael Droettboom	2bc01cc0c7	gh-111652: Fix --enable-pystats build (GH-111653)	2023-11-03 15:21:16 +00:00
scoder	24ddaee5ca	gh-106168: Revert the "size before item" setting (#111683 ) gh-106168: Update the size only after setting the item, to avoid temporary inconsistencies. Also remove the "what's new" sentence regarding the size setting since tuples cannot grow after allocation.	2023-11-03 11:02:39 +00:00
Irit Katriel	d49aba5a7a	gh-111354: Simplify _PyGen_yf by moving some of its work to the compiler and frame state (#111648 )	2023-11-03 10:01:36 +00:00
Serhiy Storchaka	26c0e5e03a	gh-108082: Remove _PyErr_WriteUnraisableMsg() (GH-111643) Replace the remaining calls with PyErr_FormatUnraisable().	2023-11-03 09:45:53 +02:00
Irit Katriel	52cc4af6ae	gh-111354: simplify detection of RESUME after YIELD_VALUE at except-depth 1 (#111459 )	2023-11-02 10:18:43 +00:00
Eric Snow	9322ce90ac	gh-76785: Crossinterp utils additions (gh-111530) This moves several general internal APIs out of _xxsubinterpretersmodule.c and into the new Python/crossinterp.c (and the corresponding internal headers). Specifically: * _Py_excinfo, etc.: the initial implementation for non-object exception snapshots (in pycore_pyerrors.h and Python/errors.c) * _PyXI_exception_info, etc.: helpers for passing an exception beween interpreters (wraps _Py_excinfo) * _PyXI_namespace, etc.: helpers for copying a dict of attrs between interpreters * _PyXI_Enter(), _PyXI_Exit(): functions that abstract out the transitions between one interpreter and a second that will do some work temporarily Again, these were all abstracted out of _xxsubinterpretersmodule.c as generalizations. I plan on proposing these as public API at some point.	2023-11-01 17:36:40 -06:00
Mark Shannon	b14e882428	GH-111485: Use micro-ops to split specialization code from base action (GH-111561)	2023-11-01 10:53:27 +00:00
Mark Shannon	2904d99839	GH-111485: Remove some special cases from the code generator and bytecodes.c (GH-111540)	2023-10-31 13:21:07 +00:00
Mark Shannon	d27acd4461	GH-111485: Increment `next_instr` consistently at the start of the instruction. (GH-111486)	2023-10-31 10:09:54 +00:00
Michael Droettboom	84b4533e84	gh-109329: Count tier2 opcode misses (#110561 ) This keeps a separate 'miss' counter for each micro-opcode, incremented whenever a guard uop takes a deoptimization side exit.	2023-10-30 17:02:45 -07:00
Eric Snow	c6fe0869ab	gh-76785: Move the Cross-Interpreter Code to Its Own File (gh-111502) This is partly to clear this stuff out of pystate.c, but also in preparation for moving some code out of _xxsubinterpretersmodule.c. This change also moves this stuff to the internal API (new: Include/internal/pycore_crossinterp.h). @vstinner did this previously and I undid it. Now I'm re-doing it. :/	2023-10-30 16:53:10 -06:00
Victor Stinner	801741ff81	gh-90815: Fix mimalloc atomic.h on Windows arm64 (#111527 ) mi_atomic_load_explicit() casts 'p' argument to drop the 'const' qualifier on Windows arm64 platform. Fix the compiler warning: 'function': different 'const' qualifiers (compiling source file ..\Objects\mimalloc\options.c)	2023-10-30 22:33:49 +00:00
Sam Gross	6dfb8fe023	gh-110481: Implement biased reference counting (gh-110764)	2023-10-30 16:06:09 +00:00
Dino Viehland	05f2f0ac92	gh-90815: Add mimalloc memory allocator (#109914 ) * Add mimalloc v2.12 Modified src/alloc.c to remove include of alloc-override.c and not compile new handler. Did not include the following files: - include/mimalloc-new-delete.h - include/mimalloc-override.h - src/alloc-override-osx.c - src/alloc-override.c - src/static.c - src/region.c mimalloc is thread safe and shares a single heap across all runtimes, therefore finalization and getting global allocated blocks across all runtimes is different. * mimalloc: minimal changes for use in Python: - remove debug spam for freeing large allocations - use same bytes (0xDD) for freed allocations in CPython and mimalloc This is important for the test_capi debug memory tests * Don't export mimalloc symbol in libpython. * Enable mimalloc as Python allocator option. * Add mimalloc MIT license. * Log mimalloc in Lib/test/pythoninfo.py. * Document new mimalloc support. * Use macro defs for exports as done in: https://github.com/python/cpython/pull/31164/ Co-authored-by: Sam Gross <colesbury@gmail.com> Co-authored-by: Christian Heimes <christian@python.org> Co-authored-by: Victor Stinner <vstinner@python.org>	2023-10-30 15:43:11 +00:00
Savannah Ostrowski	4a929d432b	GH-111339: Fix initialization and finalization of static optimizer types (GH-111430)	2023-10-29 13:53:25 -07:00
gsallam	21f068d80c	gh-109587: Allow "precompiled" perf-trampolines to largely mitigate the cost of enabling perf-trampolines (#109666 )	2023-10-27 03:57:29 +00:00
Irit Katriel	a0c414c35d	gh-111354: define names for RESUME oparg values (#111365 )	2023-10-26 16:30:18 +01:00
Irit Katriel	67a91f78e4	gh-109094: replace frame->prev_instr by frame->instr_ptr (#109095 )	2023-10-26 13:43:10 +00:00
Pablo Galindo Salgado	90a1b2859f	gh-67224: Show source lines in tracebacks when using the -c option when running Python (#111200 )	2023-10-26 15:17:28 +09:00
scoder	a8a89fcd1f	gh-106320: Re-add some PyLong/PyDict C-API functions (GH-#111162) * gh-106320: Re-add _PyLong_FromByteArray(), _PyLong_AsByteArray() and _PyLong_GCD() to the public header files since they are used by third-party packages and there is no efficient replacement. See https://github.com/python/cpython/issues/111140 See https://github.com/python/cpython/issues/111139 * gh-111262: Re-add _PyDict_Pop() to have a C-API until a new public one is designed.	2023-10-25 11:33:48 +02:00
Brandt Bucher	e5168ff3f8	GH-109214: _SET_IP before _PUSH_FRAME (but not _POP_FRAME) (GH-111001)	2023-10-24 13:27:42 -07:00
Radislav Chugunov	47d3e2ed93	gh-109894: Fix initialization of static `MemoryError` in subinterpreter (gh-110911) Fixes #109894 * set `interp.static_objects.last_resort_memory_error.args` to empty tuple to avoid crash on `PyErr_Display()` call * allow `_PyExc_InitGlobalObjects()` to be called on subinterpreter init --------- Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2023-10-23 17:06:59 -06:00
Mark Shannon	52e902ccf0	GH-109369: Add machinery for deoptimizing tier2 executors, both individually and globally. (GH-110384)	2023-10-23 14:49:09 +01:00
Eric Snow	c58c63fdf6	gh-84570: Add Timeouts to SendChannel.send() and RecvChannel.recv() (gh-110567)	2023-10-17 23:05:49 +00:00
Eric Snow	a53d7cb672	gh-84570: Send-Wait Fixes for _xxinterpchannels (gh-111006) There were a few things I did in gh-110565 that need to be fixed. I also forgot to add tests in that PR. (Note that this PR exposes a refleak introduced by gh-110246. I'll take care of that separately.)	2023-10-17 16:32:00 -06:00
Donghee Na	2dcc57008b	gh-109693: Remove pycore_atomic.h (gh-110992)	2023-10-18 00:33:50 +09:00
Victor Stinner	6db6b30ac2	gh-85283: Build winsound extension with limited C API (#110978 ) Replace type->tp_name with PyType_GetQualName().	2023-10-17 15:57:10 +02:00
Victor Stinner	be5e8a0103	gh-110964: Remove private _PyArg functions (#110966 ) Move the following private functions and structures to pycore_modsupport.h internal C API: * _PyArg_BadArgument() * _PyArg_CheckPositional() * _PyArg_NoKeywords() * _PyArg_NoPositional() * _PyArg_ParseStack() * _PyArg_ParseStackAndKeywords() * _PyArg_Parser structure * _PyArg_UnpackKeywords() * _PyArg_UnpackKeywordsWithVararg() * _PyArg_UnpackStack() * _Py_ANY_VARARGS() Changes: * Python/getargs.h now includes pycore_modsupport.h to export functions. * clinic.py now adds pycore_modsupport.h when one of these functions is used. * Add pycore_modsupport.h includes when a C extension uses one of these functions. * Define Py_BUILD_CORE_MODULE in C extensions which now include directly or indirectly (via code generated by Argument Clinic) pycore_modsupport.h: * _csv * _curses_panel * _dbm * _gdbm * _multiprocessing.posixshmem * _sqlite.row * _statistics * grp * resource * syslog * _testcapi: bad_get() no longer uses METH_FASTCALL calling convention but METH_VARARGS. Replace _PyArg_UnpackStack() with PyArg_ParseTuple(). * _testcapi: add PYTESTCAPI_NEED_INTERNAL_API macro which is defined by _testcapi sub-modules which need the internal C API (pycore_modsupport.h): exceptions.c, float.c, vectorcall.c, watchers.c. * Remove Include/cpython/modsupport.h header file. Include/modsupport.h no longer includes the removed header file. * Fix mypy clinic.py	2023-10-17 14:30:31 +02:00
Donghee Na	86559ddfec	gh-109693: Update _gil_runtime_state.locked to use pyatomic.h (gh-110836)	2023-10-17 07:32:50 +09:00
Donghee Na	b2ab210aae	gh-109693: Update pyruntimestate._finalizing to use pyatomic.h (gh-110837)	2023-10-13 16:40:15 +00:00
Pablo Galindo Salgado	e1d8c65e1d	gh-110805: Allow the repl to show source code and complete tracebacks (#110775 )	2023-10-13 09:25:37 +00:00
Donghee Na	2566434e59	gh-109693: Update _gil_runtime_state.last_holder to use pyatomic.h (#110605 )	2023-10-13 10:07:27 +09:00
Pablo Galindo Salgado	e7331365b4	gh-110721: Use the traceback module for PyErr_Display() and fallback to the C implementation (#110702 )	2023-10-12 14:52:14 +00:00
Irit Katriel	7dd3c2b800	gh-109094: remove redundant arg to _PyFrame_PushTrampolineUnchecked (GH-110759)	2023-10-12 11:02:42 +01:00
Mark Shannon	19b7ead5eb	GH-109214: Convert _SAVE_CURRENT_IP to _SET_IP in tier 2 trace creation. (GH-110755)	2023-10-12 10:34:32 +01:00
Donghee Na	5bc1b7f08d	gh-109693: Update pycore_interp.h to use pyatomic.h (#110604 )	2023-10-10 23:17:08 +09:00
Donghee Na	67e8d416cc	gh-109693: Use pyatomic.h for signal module (gh-110480)	2023-10-10 08:26:29 +09:00
Eric Snow	7bd560ce8d	gh-76785: Add SendChannel.send_buffer() (#110246 ) (This is still a test module.)	2023-10-09 07:39:51 -06:00
Masaru Tsuchiyama	de2a4036cb	gh-108277: Add os.timerfd_create() function (#108382 ) Add wrapper for timerfd_create, timerfd_settime, and timerfd_gettime to os module. Co-authored-by: Serhiy Storchaka <storchaka@gmail.com> Co-authored-by: Adam Turner <9087854+AA-Turner@users.noreply.github.com> Co-authored-by: Erlend E. Aasland <erlend.aasland@protonmail.com> Co-authored-by: Victor Stinner <vstinner@python.org>	2023-10-07 19:33:22 +02:00
Sam Gross	6e97a9647a	gh-109549: Add new states to PyThreadState to support PEP 703 (gh-109915) This adds a new field 'state' to PyThreadState that can take on one of three values: _Py_THREAD_ATTACHED, _Py_THREAD_DETACHED, or _Py_THREAD_GC. The "attached" and "detached" states correspond closely to acquiring and releasing the GIL. The "gc" state is current unused, but will be used to implement stop-the-world GC for --disable-gil builds in the near future.	2023-10-05 09:46:33 -06:00
Sam Gross	cf6f23b0e3	gh-88402: Add new sysconfig variables on Windows (GH-110049) Co-authored-by: Filipe Laíns <filipe.lains@gmail.com>	2023-10-04 22:50:29 +00:00
Eric Snow	80dc39e1dc	gh-110310: Add a Per-Interpreter XID Registry for Heap Types (gh-110311) We do the following: * add a per-interpreter XID registry (PyInterpreterState.xidregistry) * put heap types there (keep static types in _PyRuntimeState.xidregistry) * clear the registries during interpreter/runtime finalization * avoid duplicate entries in the registry (when _PyCrossInterpreterData_RegisterClass() is called more than once for a type) * use Py_TYPE() instead of PyObject_Type() in _PyCrossInterpreterData_Lookup() The per-interpreter registry helps preserve isolation between interpreters. This is important when heap types are registered, which is something we haven't been doing yet but I will likely do soon.	2023-10-04 16:35:27 -06:00
Michael Droettboom	e561e98058	GH-109329: Add tier 2 stats (GH-109913)	2023-10-04 14:52:28 -07:00
Mark Shannon	bf4bc36069	GH-109369: Merge all eval-breaker flags and monitoring version into one word. (GH-109846)	2023-10-04 16:09:48 +01:00
Guido van Rossum	7c149a76b2	gh-104909: Split more LOAD_ATTR specializations (GH-110317) * Split LOAD_ATTR_MODULE * Split LOAD_ATTR_WITH_HINT * Split _GUARD_TYPE_VERSION out of the latter * Split LOAD_ATTR_CLASS * Split LOAD_ATTR_NONDESCRIPTOR_WITH_VALUES * Fix indent of DEOPT_IF in macros * Split LOAD_ATTR_METHOD_LAZY_DICT * Split LOAD_ATTR_NONDESCRIPTOR_NO_DICT * Fix omission of _CHECK_ATTR_METHOD_LAZY_DICT	2023-10-04 16:08:02 +01:00
Guido van Rossum	625ecbe92e	gh-109979: Unify _GUARD_TYPE_VERSION{,_STORE} (#110301 ) Now the target for `DEOPT_IF()` is auto-filled, we don't need a separate `_GUARD_TYPE_VERSION_STORE` uop.	2023-10-03 22:37:21 +00:00
Victor Stinner	d73501602f	gh-108867: Add PyThreadState_GetUnchecked() function (#108870 ) Add PyThreadState_GetUnchecked() function: similar to PyThreadState_Get(), but don't issue a fatal error if it is NULL. The caller is responsible to check if the result is NULL. Previously, this function was private and known as _PyThreadState_UncheckedGet().	2023-10-03 16:53:51 +00:00
Eric Snow	f5198b09e1	gh-109860: Use a New Thread State When Switching Interpreters, When Necessary (gh-110245) In a few places we switch to another interpreter without knowing if it has a thread state associated with the current thread. For the main interpreter there wasn't much of a problem, but for subinterpreters we were mostly okay re-using the tstate created with the interpreter (located via PyInterpreterState_ThreadHead()). There was a good chance that tstate wasn't actually in use by another thread. However, there are no guarantees of that. Furthermore, re-using an already used tstate is currently fragile. To address this, now we create a new thread state in each of those places and use it. One consequence of this change is that PyInterpreterState_ThreadHead() may not return NULL (though that won't happen for the main interpreter).	2023-10-03 09:20:48 -06:00
Eric Snow	1dd9dee45d	gh-105716: Support Background Threads in Subinterpreters Consistently (gh-109921) The existence of background threads running on a subinterpreter was preventing interpreters from getting properly destroyed, as well as impacting the ability to run the interpreter again. It also affected how we wait for non-daemon threads to finish. We add PyInterpreterState.threads.main, with some internal C-API functions.	2023-10-02 20:12:12 +00:00
Victor Stinner	7513994c92	gh-110014: Include explicitly <unistd.h> header (#110155 ) * Remove unused <locale.h> includes. * Remove unused <fcntl.h> include in traceback.h. * Remove redundant <assert.h> and <stddef.h> includes. They are already included by "Python.h". * Remove <object.h> include in faulthandler.c. Python.h already includes it. * Add missing <stdbool.h> in pycore_pythread.h if HAVE_PTHREAD_STUBS is defined. * Fix also warnings in pthread_stubs.h: don't redefine macros if they are already defined, like the __NEED_pthread_t macro.	2023-09-30 20:06:45 +00:00
Victor Stinner	74e425ec18	gh-110014: Fix _POSIX_THREADS and _POSIX_SEMAPHORES usage (#110139 ) * pycore_pythread.h is now the central place to make sure that _POSIX_THREADS and _POSIX_SEMAPHORES macros are defined if available. * Make sure that pycore_pythread.h is included when _POSIX_THREADS and _POSIX_SEMAPHORES macros are tested. * PY_TIMEOUT_MAX is now defined as a constant, since its value depends on _POSIX_THREADS, instead of being defined as a macro. * Prevent integer overflow in the preprocessor when computing PY_TIMEOUT_MAX_VALUE on Windows: replace "0xFFFFFFFELL * 1000 < LLONG_MAX" with "0xFFFFFFFELL < LLONG_MAX / 1000". * Document the change and give hints how to fix affected code. * Add an exception for PY_TIMEOUT_MAX name to smelly.py * Add PY_TIMEOUT_MAX to the stable ABI	2023-09-30 19:25:54 +02:00
Guido van Rossum	5bb6f0fcba	gh-104909: Split some more insts into ops (#109943 ) These are the most popular specializations of `LOAD_ATTR` and `STORE_ATTR` that weren't already viable uops: * Split LOAD_ATTR_METHOD_WITH_VALUES * Split LOAD_ATTR_METHOD_NO_DICT * Split LOAD_ATTR_SLOT * Split STORE_ATTR_SLOT * Split STORE_ATTR_INSTANCE_VALUE Also: * Add `-v` flag to code generator which prints a list of non-viable uops (easter-egg: it can print execution counts -- see source) * Double _Py_UOP_MAX_TRACE_LENGTH to 128 I had dropped one of the DEOPT_IF() calls! :-(	2023-09-27 15:27:44 -07:00
Eric Snow	32466c97c0	gh-109793: Allow Switching Interpreters During Finalization (gh-109794) Essentially, we should check the thread ID rather than the thread state pointer.	2023-09-27 13:41:06 -06:00
Serhiy Storchaka	b8d1744e7b	gh-109611: Add convenient C API function _PyFile_Flush() (GH-109612)	2023-09-23 09:35:30 +03:00

... 4 5 6 7 8 ...

1645 Commits