cpython

Commit Graph

Author	SHA1	Message	Date
Sam Gross	f4997bb3ac	gh-123923: Defer refcounting for `f_funcobj` in `_PyInterpreterFrame` (#124026 ) Use a `_PyStackRef` and defer the reference to `f_funcobj` when possible. This avoids some reference count contention in the common case of executing the same code object from multiple threads concurrently in the free-threaded build.	2024-09-24 20:08:18 +00:00
Mark Shannon	c13e7d98fb	GH-118093: Specialize `CALL_KW` (GH-123006)	2024-08-16 17:11:24 +01:00
Mark Shannon	7a65439b93	GH-122390: Replace `_Py_GetbaseOpcode` with `_Py_GetBaseCodeUnit` (GH-122942)	2024-08-13 14:22:57 +01:00
Brandt Bucher	9621a7d017	GH-118093: Handle some polymorphism before requiring progress in tier two (GH-122843)	2024-08-12 12:39:31 -07:00
Mark Shannon	df13a1821a	GH-118095: Add tier two support for BINARY_SUBSCR_GETITEM (GH-120793)	2024-08-01 16:19:05 -07:00
Victor Stinner	fda6bd842a	Replace PyObject_Del with PyObject_Free (#122453 ) PyObject_Del() is just a alias to PyObject_Free() kept for backward compatibility. Use directly PyObject_Free() instead.	2024-08-01 14:12:33 +02:00
Brandt Bucher	7797182b78	GH-118093: Improve handling of short and mid-loop traces (GH-122252)	2024-07-29 14:49:17 -07:00
Brandt Bucher	64857d849f	GH-122294: Burn in the addresses of side exits (GH-122295)	2024-07-26 09:40:15 -07:00
Brandt Bucher	d9efa45d74	GH-118093: Add tier two support for BINARY_OP_INPLACE_ADD_UNICODE (GH-122253)	2024-07-25 14:45:07 -07:00
Brandt Bucher	5f6001130f	GH-118093: Add tier two support for LOAD_ATTR_PROPERTY (GH-122283)	2024-07-25 10:45:28 -07:00
Brandt Bucher	794546fd53	GH-118093: Remove invalidated executors from side exits (GH-121885)	2024-07-24 09:16:30 -07:00
Brandt Bucher	7b36b67b1e	GH-118093: Add tier two support to several instructions (GH-121884)	2024-07-18 14:24:58 -07:00
Brandt Bucher	33903c53db	GH-116017: Get rid of _COLD_EXITs (GH-120960)	2024-07-01 13:17:40 -07:00
Ken Jin	22b0de2755	gh-117139: Convert the evaluation stack to stack refs (#118450 ) This PR sets up tagged pointers for CPython. The general idea is to create a separate struct _PyStackRef for everything on the evaluation stack to store the bits. This forces the C compiler to warn us if we try to cast things or pull things out of the struct directly. Only for free threading: We tag the low bit if something is deferred - that means we skip incref and decref operations on it. This behavior may change in the future if Mark's plans to defer all objects in the interpreter loop pans out. This implies a strict stack reference discipline is required. ALL incref and decref operations on stackrefs must use the stackref variants. It is unsafe to untag something then do normal incref/decref ops on it. The new incref and decref variants are called dup and close. They mimic a "handle" API operating on these stackrefs. Please read Include/internal/pycore_stackref.h for more information! --------- Co-authored-by: Mark Shannon <9448417+markshannon@users.noreply.github.com>	2024-06-27 03:10:43 +08:00
Victor Stinner	9e4a81f00f	gh-120642: Move private PyCode APIs to the internal C API (#120643 ) * Move _Py_CODEUNIT and related functions to pycore_code.h. * Move _Py_BackoffCounter to pycore_backoff.h. * Move Include/cpython/optimizer.h content to pycore_optimizer.h. * Remove Include/cpython/optimizer.h. * Remove PyUnstable_Replace_Executor(). Rename functions: * PyUnstable_GetExecutor() => _Py_GetExecutor() * PyUnstable_GetOptimizer() => _Py_GetOptimizer() * PyUnstable_SetOptimizer() => _Py_SetTier2Optimizer() * PyUnstable_Optimizer_NewCounter() => _PyOptimizer_NewCounter() * PyUnstable_Optimizer_NewUOpOptimizer() => _PyOptimizer_NewUOpOptimizer()	2024-06-26 13:54:03 +02:00
Brandt Bucher	a47abdb45d	GH-117062: Make _JUMP_TO_TOP a general absolute jump (GH-120854)	2024-06-24 08:35:10 -07:00
Mark Shannon	274f844830	GH-120619: Clean up `RETURN_VALUE` instruction (GH-120624) * Rename _POP_FRAME to _RETURN_VALUE as it returns a value as well as popping a frame. * Remove remaining _POP_FRAMEs	2024-06-17 14:40:11 +01:00
Xie Yanbo	9e052619a6	Fix typos in documentation and comments (#119763 )	2024-06-04 10:22:22 +00:00
Victor Stinner	f6da790122	gh-111389: Add PyHASH_MULTIPLIER constant (#119214 )	2024-05-21 19:51:51 +02:00
Petr Viktorin	941eea0a27	gh-118771: Ensure names defined in optimizer.h start with Py/_Py (GH-118825)	2024-05-10 18:20:12 +02:00
Mark Shannon	1ab6356ebe	GH-118095: Use broader specializations of CALL in tier 1, for better tier 2 support of calls. (GH-118322) * Add CALL_PY_GENERAL, CALL_BOUND_METHOD_GENERAL and call CALL_NON_PY_GENERAL specializations. * Remove CALL_PY_WITH_DEFAULTS specialization * Use CALL_NON_PY_GENERAL in more cases when otherwise failing to specialize	2024-05-04 12:11:11 +01:00
Mark Shannon	da2cfc4cb6	GH-113464: Remove the extra jump via `_SIDE_EXIT` in `_EXIT_TRACE` (GH-118545)	2024-05-04 08:50:24 +01:00
Mark Shannon	72867c962c	GH-118095: Unify the behavior of tier 2 FOR_ITER branch micro-ops (GH-118420) * Target _FOR_ITER_TIER_TWO at POP_TOP following the matching END_FOR * Modify _GUARD_NOT_EXHAUSTED_RANGE, _GUARD_NOT_EXHAUSTED_LIST and _GUARD_NOT_EXHAUSTED_TUPLE so that they also target the POP_TOP following the matching END_FOR	2024-05-02 16:17:59 +01:00
Mark Shannon	67bba9dd0f	GH-117442: Check eval-breaker at start (rather than end) of tier 2 loops (GH-118482)	2024-05-02 13:10:31 +01:00
Brandt Bucher	49baa656cb	GH-115802: Use the GHC calling convention in JIT code (GH-118287)	2024-05-01 08:05:53 -07:00
Anthony Shaw	beb653cc24	gh-117958: Expose JIT code via method in UOpExecutor (#117959 )	2024-05-01 07:11:14 -07:00
Mark Shannon	f6fab21721	GH-118095: Make invalidating and clearing executors memory safe (GH-118459)	2024-05-01 11:34:50 +01:00
Guido van Rossum	7d83f7bcc4	gh-118335: Configure Tier 2 interpreter at build time (#118339 ) The code for Tier 2 is now only compiled when configured with `--enable-experimental-jit[=yes\|interpreter]`. We drop support for `PYTHON_UOPS` and -`Xuops`, but you can disable the interpreter or JIT at runtime by setting `PYTHON_JIT=0`. You can also build it without enabling it by default using `--enable-experimental-jit=yes-off`; enable with `PYTHON_JIT=1`. On Windows, the `build.bat` script supports `--experimental-jit`, `--experimental-jit-off`, `--experimental-interpreter`. In the C code, `_Py_JIT` is defined as before when the JIT is enabled; the new variable `_Py_TIER2` is defined when the JIT or the interpreter is enabled. It is actually a bitmask: 1: JIT; 2: default-off; 4: interpreter.	2024-04-30 18:26:34 -07:00
Mark Shannon	5b05d452cd	GH-118095: Add tier 2 support for YIELD_VALUE (GH-118380)	2024-04-30 11:33:13 +01:00
Mark Shannon	ab6eda0ee5	GH-118095: Allow a variant of RESUME_CHECK in tier 2 (GH-118286)	2024-04-29 07:54:05 +01:00
Mark Shannon	3e06c7f719	GH-118095: Add dynamic exit support and FOR_ITER_GEN support to tier 2 (GH-118279)	2024-04-26 18:08:50 +01:00
Mark Shannon	f180b31e76	GH-118095: Handle `RETURN_GENERATOR` in tier 2 (GH-118180)	2024-04-25 11:32:47 +01:00
Mark Shannon	83235f7791	GH-115419: Move setting the instruction pointer to error exit stubs (GH-118088)	2024-04-24 14:41:30 +01:00
Mark Shannon	77cd0428b6	GH-118095: Convert DEOPT_IFs on likely side exits to EXIT_IFs (GH-118106) Covert DEOPT_IFs on likely side exits to EXIT_IFs	2024-04-24 14:37:55 +01:00
Guido van Rossum	7e87d30f1f	gh-118074: Immortal executors are not GC-able (#118182 ) Better version of gh-118117. Just check for immortality instead of an address range check.	2024-04-23 13:38:23 -07:00
Guido van Rossum	1b85b3424c	GH-118074: Executors in the COLD_EXITS array are not GC'able (#118117 )	2024-04-22 16:20:39 -07:00
Mark Shannon	7e6fa5fced	GH-116202: Incorporate invalidation check into _START_EXECUTOR. (GH-118044)	2024-04-19 09:26:42 +01:00
Guido van Rossum	060a96f1a9	gh-116968: Reimplement Tier 2 counters (#117144 ) Introduce a unified 16-bit backoff counter type (``_Py_BackoffCounter``), shared between the Tier 1 adaptive specializer and the Tier 2 optimizer. The API used for adaptive specialization counters is changed but the behavior is (supposed to be) identical. The behavior of the Tier 2 counters is changed: - There are no longer dynamic thresholds (we never varied these). - All counters now use the same exponential backoff. - The counter for ``JUMP_BACKWARD`` starts counting down from 16. - The ``temperature`` in side exits starts counting down from 64.	2024-04-04 15:03:27 +00:00
Mark Shannon	bf82f77957	GH-116422: Tier2 hot/cold splitting (GH-116813) Splits the "cold" path, deopts and exits, from the "hot" path, reducing the size of most jitted instructions, at the cost of slower exits.	2024-03-26 09:35:11 +00:00
Ken Jin	6c83352bfe	gh-117180: Complete call sequence when trace stack overflow (GH-117184) --------- Co-authored-by: Peter Lazorchak <lazorchakp@gmail.com> Co-authored-by: Guido van Rossum <gvanrossum@users.noreply.github.com> Co-authored-by: Guido van Rossum <gvanrossum@gmail.com>	2024-03-24 06:19:17 +08:00
Guido van Rossum	570a82d46a	gh-117045: Add code object to function version cache (#117028 ) Changes to the function version cache: - In addition to the function object, also store the code object, and allow the latter to be retrieved even if the function has been evicted. - Stop assigning new function versions after a critical attribute (e.g. `__code__`) has been modified; the version is permanently reset to zero in this case. - Changes to `__annotations__` are no longer considered critical. (This fixes gh-109998.) Changes to the Tier 2 optimization machinery: - If we cannot map a function version to a function, but it is still mapped to a code object, we continue projecting the trace. The operand of the `_PUSH_FRAME` and `_POP_FRAME` opcodes can be either NULL, a function object, or a code object with the lowest bit set. This allows us to trace through code that calls an ephemeral function, i.e., a function that may not be alive when we are constructing the executor, e.g. a generator expression or certain nested functions. We will lose globals removal inside such functions, but we can still do other peephole operations (and even possibly [call inlining](https://github.com/python/cpython/pull/116290), if we decide to do it), which only need the code object. As before, if we cannot retrieve the code object from the cache, we stop projecting.	2024-03-21 12:37:41 -07:00
Mark Shannon	15309329b6	GH-108362: Incremental Cycle GC (GH-116206)	2024-03-20 08:54:42 +00:00
Michael Droettboom	0f278012e8	gh-116808: Fix optimized trace length histogram (GH-116827)	2024-03-19 11:06:43 +00:00
Guido van Rossum	76d0868907	Cleanup tier2 debug output (#116920 ) Various tweaks, including a slight refactor of the special cases for `_PUSH_FRAME`/`_POP_FRAME` to show the actual operand emitted.	2024-03-18 11:08:43 -07:00
Michael Droettboom	cef0ec1a3c	gh-116760: Fix pystats for trace attempts (GH-116761) There are now at least two bytecodes that may attempt to optimize, JUMP_BACK, and more recently, COLD_EXIT. Only the JUMP_BACK was counting the attempt in the stats. This moves that counter to uop_optimize itself so it should always happen no matter where it is called from.	2024-03-13 22:13:33 +00:00
Mark Shannon	b6ae6da1bd	GH-116596: Better determination of escaping uops. (GH-116597)	2024-03-11 13:37:48 +00:00
Guido van Rossum	d444dec09a	Fix debug output for optimized executor (#116337 ) This adjusts `length` rather than using `length+1` all over the place.	2024-03-05 10:05:29 -08:00
Guido van Rossum	3409bc29c9	gh-115859: Re-enable T2 optimizer pass by default (#116062 ) This undoes the temporary default disabling of the T2 optimizer pass in gh-115860. - Add a new test that reproduces Brandt's example from gh-115859; it indeed crashes before gh-116028 with PYTHONUOPSOPTIMIZE=1 - Re-enable the optimizer pass in T2, stop checking PYTHONUOPSOPTIMIZE - Rename the env var to disable T2 entirely to PYTHON_UOPS_OPTIMIZE (must be explicitly set to 0 to disable) - Fix skipIf conditions on tests in test_opt.py accordingly - Export sym_is_bottom() (for debugging) - Fix various things in the `_BINARY_OP_` specializations in the abstract interpreter: - DECREF(temp) - out-of-space check after sym_new_const() - add sym_matches_type() checks, so even if we somehow reach a binary op with symbolic constants of the wrong type on the stack we won't trigger the type assert	2024-02-28 22:38:01 +00:00
Michael Droettboom	b05afdd5ec	gh-115168: Add pystats counter for invalidated executors (GH-115169)	2024-02-26 17:51:47 +00:00
Guido van Rossum	4ee6bdfbaa	gh-115727: Reduce confidence even on 100% predicted jumps (#115748 ) The theory is that even if we saw a jump go in the same direction the last 16 times we got there, we shouldn't be overly confident that it's still going to go the same way in the future. This PR makes it so that in the extreme cases, the confidence is multiplied by 0.9 instead of remaining unchanged. For unpredictable jumps, there is no difference (still 0.5). For somewhat predictable jumps, we interpolate.	2024-02-22 12:23:48 -08:00

1 2 3

129 Commits