cpython

Commit Graph

Author	SHA1	Message	Date
Serhiy Storchaka	d08c788822	gh-123497: New limit for Python integers on 64-bit platforms (GH-123724) Instead of being limited just by the size of addressable memory (2**63 bytes), Python integers are now also limited by the number of bits, so the number of bits now always fits in a 64-bit integer. Both limits are much larger than what might be available in practice, so it doesn't affect users. _PyLong_NumBits() and _PyLong_Frexp() are now always successful.	2024-09-29 10:40:20 +03:00
Irit Katriel	e07154fd1e	gh-123958: apply docstring removal optimization in ast_opt instead of codegen (#123959)	2024-09-11 17:02:28 +00:00
Serhiy Storchaka	32c7dbb2bc	gh-121485: Always use 64-bit integers for integers bits count (GH-121486) Use 64-bit integers instead of platform-specific size_t or Py_ssize_t to represent the number of bits in a Python integer.	2024-08-30 08:13:24 +03:00
Bogdan Romanyuk	be083cee34	gh-123344: Add missing ast optimizations for PEP 696 (#123377) Co-authored-by: Kirill Podoprigora <kirill.bast9@mail.ru> Co-authored-by: Jelle Zijlstra <jelle.zijlstra@gmail.com>	2024-08-28 06:38:56 -07:00
Serhiy Storchaka	efcd65cd84	gh-122313: Clean up deep recursion guarding code in the compiler (GH-122640) Add ENTER_RECURSIVE and LEAVE_RECURSIVE macros in ast.c, ast_opt.c and symtable.c. Remove VISIT_QUIT macro in symtable.c. The current recursion depth counter only needs to be updated during normal execution -- all functions should just return an error code if an error occurs.	2024-08-03 12:45:45 +03:00
Steve Dower	e731554337	Fixes loop variables to be the same types as their limit (GH-120958)	2024-06-24 17:11:47 +01:00
Petr Viktorin	6f1d448bc1	gh-113993: Allow interned strings to be mortal, and fix related issues (GH-120520) * Add an InternalDocs file describing how interning should work and how to use it. * Add internal functions to explicitly request what kind of interning is done: - `_PyUnicode_InternMortal` - `_PyUnicode_InternImmortal` - `_PyUnicode_InternStatic` * Switch uses of `PyUnicode_InternInPlace` to those. * Disallow using `_Py_SetImmortal` on strings directly. You should use `_PyUnicode_InternImmortal` instead: - Strings should be interned before immortalization, otherwise you're possibly interning an immortalizing copy. - `_Py_SetImmortal` doesn't handle the `SSTATE_INTERNED_MORTAL` to `SSTATE_INTERNED_IMMORTAL` update, and those flags can't be changed in backports, as they are now part of public API and version-specific ABI. * Add private `_only_immortal` argument for `sys.getunicodeinternedsize`, used in refleak test machinery. * Make sure the statically allocated string singletons are unique. This means these sets are now disjoint: - `_Py_ID` - `_Py_STR` (including the empty string) - one-character latin-1 singletons Now, when you intern a singleton, that exact singleton will be interned. * Add a `_Py_LATIN1_CHR` macro, use it instead of `_Py_ID`/`_Py_STR` for one-character latin-1 singletons everywhere (including Clinic). * Intern `_Py_STR` singletons at startup. * For free-threaded builds, intern `_Py_LATIN1_CHR` singletons at startup. * Beef up the tests. Cover internal details (marked with `@cpython_only`). * Add lots of assertions Co-Authored-By: Eric Snow <ericsnowcurrently@gmail.com>	2024-06-21 17:19:31 +02:00
Mark Shannon	17b73ab99e	GH-113655: Lower the C recursion limit on various platforms (GH-113944)	2024-01-16 09:32:01 +00:00
Kirill Podoprigora	7162c3a1e8	gh-111133: Remove unnecessary PyFrozenSet_Check() calls in const folding (GH-111137) frozenset does not support multiplication.	2023-10-20 23:37:38 +03:00
Victor Stinner	b0edf3b98e	GH-91079: Rename C_RECURSION_LIMIT to Py_C_RECURSION_LIMIT (#108507) Symbols of the C API should be prefixed by "Py_" to avoid conflict with existing names in 3rd party C extensions on "#include <Python.h>". test.pythoninfo now logs Py_C_RECURSION_LIMIT constant and other _testcapi and _testinternalcapi constants.	2023-09-08 09:48:28 +00:00
Mark Shannon	fa45958450	GH-107263: Increase C stack limit for most functions, except `_PyEval_EvalFrameDefault()` (GH-107535) * Set C recursion limit to 1500, set cost of eval loop to 2 frames, and compiler multiply to 2.	2023-08-04 10:10:29 +01:00
Victor Stinner	5e4af2a3e9	gh-106320: Move private _PySet API to the internal API (#107041) * Add pycore_setobject.h header file. * Move the following API to the internal C API: * _PySet_Dummy * _PySet_NextEntry() * _PySet_Update()	2023-07-22 17:04:34 +02:00
Irit Katriel	f990bb8b2d	gh-105148: make _PyASTOptimizeState internal to ast_opt.c (#105149)	2023-05-31 20:21:46 +01:00
Jelle Zijlstra	a5f244d627	gh-104656: Rename typeparams AST node to type_params (#104657)	2023-05-21 21:25:09 -07:00
Jelle Zijlstra	24d8b88420	gh-103763: Implement PEP 695 (#103764) This implements PEP 695, Type Parameter Syntax. It adds support for: - Generic functions (def func[T](): ...) - Generic classes (class X[T](): ...) - Type aliases (type X = ...) - New scoping when the new syntax is used within a class body - Compiler and interpreter changes to support the new syntax and scoping rules Co-authored-by: Marc Mueller <30130371+cdce8p@users.noreply.github.com> Co-authored-by: Eric Traut <eric@traut.com> Co-authored-by: Larry Hastings <larry@hastings.org> Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2023-05-15 20:36:23 -07:00
John Belmonte	69621d1b09	gh-104018: remove unused format "z" handling in string formatfloat() (#104107) This is a cleanup overlooked in PR #104033.	2023-05-07 10:11:42 +05:30
Mark Shannon	7559f5fda9	GH-101291: Rearrange the size bits in PyLongObject (GH-102464) * Eliminate all remaining uses of Py_SIZE and Py_SET_SIZE on PyLongObject, adding asserts. * Change layout of size/sign bits in longobject to support future addition of immortal ints and tagged medium ints. * Add functions to hide some internals of long object, and for setting sign and digit count. * Replace uses of IS_MEDIUM_VALUE macro with _PyLong_IsCompact().	2023-03-22 14:49:51 +00:00
Victor Stinner	d8f239d86e	gh-99300: Use Py_NewRef() in Python/ directory (#99302) Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in C files of the Python/ directory.	2022-11-10 09:03:39 +01:00
Mark Shannon	76449350b3	GH-91079: Decouple C stack overflow checks from Python recursion checks. (GH-96510)	2022-10-05 01:34:03 +01:00
John Belmonte	b0b836b20c	bpo-45995: add "z" format specifier to coerce negative 0 to zero (GH-30049) Add "z" format specifier to coerce negative 0 to zero. See https://github.com/python/cpython/issues/90153 (originally https://bugs.python.org/issue45995) for discussion. This covers `str.format()` and f-strings. Old-style string interpolation is not supported. Co-authored-by: Mark Dickinson <dickinsm@gmail.com>	2022-04-11 15:34:18 +01:00
Eric Snow	12360aa159	bpo-46541: Discover the global strings. (gh-31346) Instead of manually enumerating the global strings in generate_global_objects.py, we extrapolate the list from usage of _Py_ID() and _Py_STR() in the source files. This is partly inspired by gh-31261. https://bugs.python.org/issue46541	2022-02-14 17:36:51 -07:00
Eric Snow	81c72044a1	bpo-46541: Replace core use of _Py_IDENTIFIER() with statically initialized global objects. (gh-30928) We're no longer using _Py_IDENTIFIER() (or _Py_static_string()) in any core CPython code. It is still used in a number of non-builtin stdlib modules. The replacement is: PyUnicodeObject (not pointer) fields under _PyRuntimeState, statically initialized as part of _PyRuntime. A new _Py_GET_GLOBAL_IDENTIFIER() macro facilitates lookup of the fields (along with _Py_GET_GLOBAL_STRING() for non-identifier strings). https://bugs.python.org/issue46541#msg411799 explains the rationale for this change. The core of the change is in: * (new) Include/internal/pycore_global_strings.h - the declarations for the global strings, along with the macros * Include/internal/pycore_runtime_init.h - added the static initializers for the global strings * Include/internal/pycore_global_objects.h - where the struct in pycore_global_strings.h is hooked into _PyRuntimeState * Tools/scripts/generate_global_objects.py - added generation of the global string declarations and static initializers I've also added a --check flag to generate_global_objects.py (along with make check-global-objects) to check for unused global strings. That check is added to the PR CI config. The remainder of this change updates the core code to use _Py_GET_GLOBAL_IDENTIFIER() instead of _Py_IDENTIFIER() and the related _PyId functions (likewise for _Py_GET_GLOBAL_STRING() instead of _Py_static_string()). This includes adding a few functions where there wasn't already an alternative to _PyId(), replacing the _Py_Identifier * parameter with PyObject *. The following are not changed (yet): * stop using _Py_IDENTIFIER() in the stdlib modules * (maybe) get rid of _Py_IDENTIFIER(), etc. entirely -- this may not be doable as at least one package on PyPI using this (private) API * (maybe) intern the strings during runtime init https://bugs.python.org/issue46541	2022-02-08 13:39:07 -07:00
Victor Stinner	7d8b69e1d1	bpo-46670: Remove unused macros in the Python directory (GH-31192)	2022-02-07 16:21:52 +01:00
Irit Katriel	d60457a667	bpo-45292: [PEP-654] add except* (GH-29581)	2021-12-14 16:48:15 +00:00
Serhiy Storchaka	87d5180b7a	Remove misleading comment in the AST optimizer (#29825)	2021-11-29 11:19:59 +03:00
Mark Shannon	b931077375	bpo-45753: Make recursion checks more efficient. (GH-29524) * Uses recursion remaining, instead of recursion depth to speed up check against recursion limit.	2021-11-16 11:01:57 +00:00
Serhiy Storchaka	8b01067318	bpo-28307: Tests and fixes for optimization of C-style formatting (GH-26318) Fix errors: * "%10.s" should be equal to "%10.0s", not "%10s". * Tuples with starred expressions caused a SyntaxError.	2021-05-23 19:06:48 +03:00
Serhiy Storchaka	a0bd9e9c11	bpo-28307: Convert simple C-style formatting with literal format into f-string. (GH-5012) C-style formatting with literal format containing only format codes %s, %r and %a (with optional width, precision and alignment) will be converted to an equivalent f-string expression. It can speed up formatting more than 2 times by eliminating runtime parsing of the format string and creating temporary tuple.	2021-05-08 22:33:10 +03:00
Brandt Bucher	dbe60ee09d	bpo-43892: Validate the first term of complex literal value patterns (GH-25735)	2021-04-29 17:19:28 -07:00
Nick Coghlan	1e7b858575	bpo-43892: Make match patterns explicit in the AST (GH-25585) Co-authored-by: Brandt Bucher <brandtbucher@gmail.com>	2021-04-28 22:58:44 -07:00
Serhiy Storchaka	face87c94e	bpo-42609: Check recursion depth in the AST validator and optimizer (GH-23744)	2021-04-25 13:38:00 +03:00
Pablo Galindo	b0544ba77c	bpo-38605: Revert making 'from __future__ import annotations' the default (GH-25490) This reverts commits `044a1048ca` and `1be456ae9d`, adapting the code to changes that happened after it.	2021-04-21 12:41:19 +01:00
Victor Stinner	d27f8d2e07	bpo-43244: Rename pycore_ast.h functions to _PyAST_xxx() (GH-25252) Rename AST functions of pycore_ast.h to use the "_PyAST_" prefix. Remove macros creating aliases without prefix. For example, Module() becomes _PyAST_Module(). Update Grammar/python.gram to use _PyAST_xxx() functions.	2021-04-07 21:34:22 +02:00
Victor Stinner	d36d6a9c18	bpo-43244: Remove Yield macro from pycore_ast.h (GH-25243) * pycore_ast.h no longer defines the Yield macro. * Fix a compiler warning on Windows: "warning C4005: 'Yield': macro redefinition". * Python-ast.c now defines directly functions with their real _Py_xxx() name, rather than xxx(). * Remove "#undef Yield" in C files including pycore_ast.h.	2021-04-07 13:01:09 +02:00
Victor Stinner	8370e07e1e	bpo-43244: Remove the pyarena.h header (GH-25007) Remove the pyarena.h header file with functions: * PyArena_New() * PyArena_Free() * PyArena_Malloc() * PyArena_AddPyObject() These functions were undocumented, excluded from the limited C API, and were only used internally by the compiler. Add pycore_pyarena.h header. Rename functions: * PyArena_New() => _PyArena_New() * PyArena_Free() => _PyArena_Free() * PyArena_Malloc() => _PyArena_Malloc() * PyArena_AddPyObject() => _PyArena_AddPyObject()	2021-03-24 02:23:01 +01:00
Victor Stinner	a81fca6ec8	bpo-43244: Add pycore_compile.h header file (GH-25000) Remove the compiler functions using "struct _mod" type, because the public AST C API was removed: * PyAST_Compile() * PyAST_CompileEx() * PyAST_CompileObject() * PyFuture_FromAST() * PyFuture_FromASTObject() These functions were undocumented and excluded from the limited C API. Rename functions: * PyAST_CompileObject() => _PyAST_Compile() * PyFuture_FromASTObject() => _PyFuture_FromAST() Moreover, _PyFuture_FromAST() is no longer exported (replace PyAPI_FUNC() with extern). _PyAST_Compile() remains exported for test_peg_generator. Remove also compatibility functions: * PyAST_Compile() * PyAST_CompileEx() * PyFuture_FromAST()	2021-03-24 00:51:50 +01:00
Victor Stinner	526fdeb227	bpo-43244: Add pycore_ast.h header file (GH-24908) Move _PyAST_GetDocString() and _PyAST_ExprAsUnicode() functions to the internal C API: from Include/ast.h to a new Include/internal/pycore_ast.h header file. Don't export these functions anymore: replace PyAPI_FUNC() with extern. Remove also unused includes.	2021-03-17 23:50:50 +01:00
Brandt Bucher	145bf269df	bpo-42128: Structural Pattern Matching (PEP 634) (GH-22917) Co-authored-by: Guido van Rossum <guido@python.org> Co-authored-by: Talin <viridia@gmail.com> Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2021-02-26 14:51:55 -08:00
Nick Coghlan	8805a4dad2	bpo-42282: Fold constants inside named expressions (GH-23190) * The AST optimiser wasn't descending into named expressions, so any constant subexpressions weren't being folded at compile time * Remove "default:" clauses inside the AST optimiser code to reduce the risk of similar bugs passing unnoticed in future compiler changes	2020-11-07 22:35:17 +10:00
Batuhan Taskaya	044a1048ca	bpo-38605: Make 'from __future__ import annotations' the default (GH-20434) The hard part was making all the tests pass; there are some subtle issues here, because apparently the future import wasn't tested very thoroughly in previous Python versions. For example, `inspect.signature()` returned type objects normally (except for forward references), but strings with the future import. We changed it to try and return type objects by calling `typing.get_type_hints()`, but fall back on returning strings if that function fails (which it may do if there are future references in the annotations that require passing in a specific namespace to resolve).	2020-10-06 13:03:02 -07:00
Pablo Galindo	a5634c4067	bpo-41746: Add type information to asdl_seq objects (GH-22223) * Add new capability to the PEG parser to type variable assignments. For instance: ``` | a[asdl_stmt_seq]=';'.small_stmt+ [';'] NEWLINE { a } ``` Add new sequence types from the asdl definition (automatically generated) * Make `asdl_seq` type a generic aliasing pointer type. * Create a new `asdl_generic_seq` for the generic case using `void`. The old `asdl_seq_GET`/`asdl_seq_SET` macros now are typed. * New `asdl_seq_GET_UNTYPED`/`asdl_seq_SET_UNTYPED` macros for dealing with generic sequences. * Changes all possible `asdl_seq` types to use specific versions everywhere.	2020-09-16 19:42:00 +01:00
Pablo Galindo	c5fc156852	bpo-40334: PEP 617 implementation: New PEG parser for CPython (GH-19503) Co-authored-by: Guido van Rossum <guido@python.org> Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-04-22 23:29:27 +01:00
Pablo Galindo	33986465bd	bpo-39522: Always initialise kind attribute in constant ast nodes (GH-19525)	2020-04-14 21:40:41 +01:00
Pablo Galindo	d112c600ab	bpo-39220: Do not optimise annotation if 'from __future__ import annotations' is used (GH-17866) Do not apply AST-based optimizations if 'from __future__ import annotations' is used in order to prevent information loss in the final version of the annotations.	2020-03-18 23:02:09 +00:00
Serhiy Storchaka	13d52c2686	bpo-34822: Simplify AST for subscription. (GH-9605) * Remove the slice type. * Make Slice a kind of the expr type instead of the slice type. * Replace ExtSlice(slices) with Tuple(slices, Load()). * Replace Index(value) with a value itself. All non-terminal nodes in AST for expressions are now of the expr type.	2020-03-10 18:52:34 +02:00
Batuhan Taşkaya	d82e469048	bpo-39639: Remove the AST "Suite" node and associated code (GH-18513) The AST "Suite" node is no longer used and it can be removed from the ASDL definition and related structures (compiler, visitors, ...). Co-Authored-By: Victor Stinner <vstinner@python.org> Co-authored-by: Brett Cannon <54418+brettcannon@users.noreply.github.com> Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-03-04 16:16:46 +00:00
Anthony Sottile	b121a4a45f	Fix constant folding optimization for positional only arguments (GH-17837)	2020-01-05 17:03:56 +00:00
Ivan Levkivskyi	9932a22897	bpo-33416: Add end positions to Python AST (GH-11605) The majority of this PR is tediously passing `end_lineno` and `end_col_offset` everywhere. Here are non-trivial points: * It is not possible to reconstruct end positions in AST "on the fly", some information is lost after an AST node is constructed, so we need two more attributes for every AST node `end_lineno` and `end_col_offset`. * I add end position information to both CST and AST. Although it may be technically possible to avoid adding end positions to CST, the code becomes more cumbersome and less efficient. * Since the end position is not known for non-leaf CST nodes while the next token is added, this requires a bit of extra care (see `_PyNode_FinalizeEndPos`). Unless I made some mistake, the algorithm should be linear. * For statements, I "trim" the end position of suites to not include the terminal newlines and dedent (this seems to be what people would expect), for example in ```python class C: pass pass ``` the end line and end column for the class definition is (2, 8). * For `end_col_offset` I use the common Python convention for indexing, for example for `pass` the `end_col_offset` is 4 (not 3), so that `[0:4]` gives one the source code that corresponds to the node. * I added a helper function `ast.get_source_segment()`, to get source text segment corresponding to a given AST node. It is also useful for testing. An (inevitable) downside of this PR is that AST now takes almost 25% more memory. I think however it is probably justified by the benefits.	2019-01-22 11:18:22 +00:00
Victor Stinner	5f2df88b63	bpo-35177: Add dependencies between header files (GH-10361) * ast.h now includes Python-ast.h and node.h * parsetok.h now includes node.h and grammar.h * symtable.h now includes Python-ast.h * Modify asdl_c.py to enhance Python-ast.h: * Add #ifndef/#define Py_PYTHON_AST_H to be able to include the header twice * Add "extern { ... }" for C++ * Undefine "Yield" macro conflicting with winbase.h * Remove "#undef Yield" from C files, it's now done in Python-ast.h * Remove now useless includes in C files	2018-11-12 00:56:19 +01:00
Serhiy Storchaka	3f22811fef	bpo-32892: Use ast.Constant instead of specific constant AST types. (GH-9445)	2018-09-27 17:42:37 +03:00

58 Commits