cpython

Commit Graph

Author	SHA1	Message	Date
Victor Stinner	578ebc5d5f	gh-108767: Replace ctype.h functions with pyctype.h functions (#108772 ) Replace <ctype.h> locale dependent functions with Python "pyctype.h" locale independent functions: * Replace isalpha() with Py_ISALPHA(). * Replace isdigit() with Py_ISDIGIT(). * Replace isxdigit() with Py_ISXDIGIT(). * Replace tolower() with Py_TOLOWER(). Leave Modules/_sre/sre.c unchanged, it uses locale dependent functions on purpose. Include explicitly <ctype.h> in _decimal.c to get isascii().	2023-09-01 18:36:53 +02:00
Victor Stinner	e59a95238b	gh-108444: Remove _PyLong_AsInt() function (#108461 ) * Update Parser/asdl_c.py to regenerate Python/Python-ast.c. * Remove _PyLong_AsInt() alias to PyLong_AsInt().	2023-08-25 11:13:59 +02:00
Dennis Sweeney	86617518c4	gh-108179: Add error message for parser stack overflows (#108256 )	2023-08-22 08:41:50 +01:00
Victor Stinner	0dd3fc2a64	gh-108216: Cleanup #include in internal header files (#108228 ) * Add missing includes. * Remove unused includes. * Update old include/symbol names to newer names. * Mention at least one included symbol. * Sort includes. * Update Tools/cases_generator/generate_cases.py used to generated pycore_opcode_metadata.h. * Update Parser/asdl_c.py used to generate pycore_ast.h. * Cleanup also includes in _testcapimodule.c and _testinternalcapi.c.	2023-08-21 18:05:59 +00:00
Irit Katriel	10a91d7e98	gh-108113: Make it possible to create an optimized AST (#108154 )	2023-08-21 16:31:30 +00:00
Lysandros Nikolaou	d66bc9e8a7	gh-107967: Fix infinite recursion on invalid escape sequence warning (#107968 )	2023-08-15 11:26:42 +00:00
Mark Shannon	fa45958450	GH-107263: Increase C stack limit for most functions, except `_PyEval_EvalFrameDefault()` (GH-107535) * Set C recursion limit to 1500, set cost of eval loop to 2 frames, and compiler mutliply to 2.	2023-08-04 10:10:29 +01:00
Pablo Galindo Salgado	da8f87b7ea	gh-107015: Remove async_hacks from the tokenizer (#107018 )	2023-07-26 16:34:15 +01:00
Victor Stinner	1a3faba9f1	gh-106869: Use new PyMemberDef constant names (#106871 ) * Remove '#include "structmember.h"'. * If needed, add <stddef.h> to get offsetof() function. * Update Parser/asdl_c.py to regenerate Python/Python-ast.c. * Replace: * T_SHORT => Py_T_SHORT * T_INT => Py_T_INT * T_LONG => Py_T_LONG * T_FLOAT => Py_T_FLOAT * T_DOUBLE => Py_T_DOUBLE * T_STRING => Py_T_STRING * T_OBJECT => _Py_T_OBJECT * T_CHAR => Py_T_CHAR * T_BYTE => Py_T_BYTE * T_UBYTE => Py_T_UBYTE * T_USHORT => Py_T_USHORT * T_UINT => Py_T_UINT * T_ULONG => Py_T_ULONG * T_STRING_INPLACE => Py_T_STRING_INPLACE * T_BOOL => Py_T_BOOL * T_OBJECT_EX => Py_T_OBJECT_EX * T_LONGLONG => Py_T_LONGLONG * T_ULONGLONG => Py_T_ULONGLONG * T_PYSSIZET => Py_T_PYSSIZET * T_NONE => _Py_T_NONE * READONLY => Py_READONLY * PY_AUDIT_READ => Py_AUDIT_READ * READ_RESTRICTED => Py_AUDIT_READ * PY_WRITE_RESTRICTED => _Py_WRITE_RESTRICTED * RESTRICTED => (READ_RESTRICTED \| _Py_WRITE_RESTRICTED)	2023-07-25 15:28:30 +02:00
Victor Stinner	7d41ead919	gh-106320: Remove _PyBytes_Join() C API (#107144 ) Move private _PyBytes functions to the internal C API (pycore_bytesobject.h): * _PyBytes_DecodeEscape() * _PyBytes_FormatEx() * _PyBytes_FromHex() * _PyBytes_Join() No longer export these functions.	2023-07-23 20:10:12 +00:00
Victor Stinner	d228825e08	gh-106320: Remove _PyOS_ReadlineTState API (#107034 ) Remove _PyOS_ReadlineTState variable from the public C API. The symbol is still exported for the readline shared extension.	2023-07-22 14:45:56 +00:00
Menelaos Kotoglou	76e20c361c	gh-106989: Remove tok report warnings (#106993 )	2023-07-22 14:23:23 +02:00
Serhiy Storchaka	be1b968dc1	gh-106521: Remove _PyObject_LookupAttr() function (GH-106642)	2023-07-12 08:57:10 +03:00
Lysandros Nikolaou	dfe4de2038	gh-106396: Special-case empty format spec to gen empty JoinedStr node (#106401 )	2023-07-04 14:19:08 +02:00
Victor Stinner	d8c5d76da2	gh-106320: Remove private _PyUnicode codecs C API functions (#106385 ) Remove private _PyUnicode codecs C API functions: move them to the internal C API (pycore_unicodeobject.h). No longer export most of these functions.	2023-07-04 07:29:52 +00:00
Victor Stinner	c5afc97fc2	gh-106320: Remove private _PyErr C API functions (#106356 ) Remove private _PyErr C API functions: move them to the internal C API (pycore_pyerrors.h).	2023-07-03 10:48:50 +00:00
Inada Naoki	d5bd32fb48	gh-104922: remove PY_SSIZE_T_CLEAN (#106315 )	2023-07-02 15:07:46 +09:00
Nikita Sobolev	46c1097868	gh-106145: Make `end_{lineno,col_offset}` required on `type_param` nodes (#106224 )	2023-06-30 23:45:08 +00:00
Victor Stinner	8c5f74fc89	gh-106023: Update code using _PyObject_FastCall() (#106257 ) Replace _PyObject_FastCall() calls with PyObject_Vectorcall().	2023-06-30 01:05:01 +00:00
Pablo Galindo Salgado	13237a2da8	gh-98931: Add custom error messages to invalid import/from with multiple targets (#105985 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2023-06-22 15:56:40 +00:00
Lysandros Nikolaou	6586cee27f	gh-105938: Emit a SyntaxWarning for escaped braces in an f-string (#105939 )	2023-06-20 12:38:46 +00:00
Brandt Bucher	a4056c8f9c	GH-105588: Add missing error checks to some obj2ast_* converters (GH-105589)	2023-06-15 15:45:13 -07:00
Lysandros Nikolaou	d382ad4915	gh-105820: Fix tok_mode expression buffer in file & readline tokenizer (#105828 )	2023-06-15 16:21:24 +00:00
Pablo Galindo Salgado	12b6d844d8	gh-105800: Issue SyntaxWarning in f-strings for invalid escape sequences (#105801 )	2023-06-15 01:08:12 +01:00
Lysandros Nikolaou	abfbab6415	gh-105718: Fix buffer allocation in tokenizer with readline (#105728 )	2023-06-13 16:18:11 +01:00
Pablo Galindo Salgado	b047fa5e56	gh-105549: Tokenize separately NUMBER and NAME tokens and allow 0-prefixed literals (#105555 )	2023-06-09 21:39:01 +01:00
Pablo Galindo Salgado	c0a6ed3934	gh-105259: Ensure we don't show newline characters for trailing NEWLINE tokens (#105364 )	2023-06-06 12:52:16 +01:00
Pablo Galindo Salgado	41de54378d	gh-105194: Fix format specifier escaped characters in f-strings (#105231 )	2023-06-02 13:33:26 +02:00
Jelle Zijlstra	77d2579586	gh-104799: Default missing lists in AST to the empty list (#104834 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2023-06-01 18:39:39 -07:00
Victor Stinner	ef300937c2	gh-92536: Remove PyUnicode_READY() calls (#105210 ) Since Python 3.12, PyUnicode_READY() does nothing and always returns 0.	2023-06-02 01:33:17 +02:00
Lysandros Nikolaou	70f315c2d6	gh-105042: Disable unmatched parens syntax error in python tokenize (#105061 )	2023-05-30 22:52:52 +01:00
Pablo Galindo Salgado	9216e69a87	gh-105069: Add a readline-like callable to the tokenizer to consume input iteratively (#105070 )	2023-05-30 22:43:34 +01:00
Marta Gómez Macías	96fff35325	gh-105017: Include CRLF lines in strings and column numbers (#105030 ) Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2023-05-28 15:15:53 +01:00
Marta Gómez Macías	86d8f48935	gh-105017: Fix including additional NL token when using CRLF (#105022 ) Co-authored-by: Pablo Galindo Salgado <Pablogsal@gmail.com>	2023-05-27 16:50:43 +00:00
Petr Vaněk	6e62eb2e70	Fix indentation in Parser/tokenizer.c (#105012 )	2023-05-27 12:41:50 +01:00
Jelle Zijlstra	ba73473f4c	gh-104799: Move location of type_params AST fields (#104828 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2023-05-26 05:54:37 -07:00
Stepfen Shawn	705e387dd8	Fix typo in the tokenizer (#104950 )	2023-05-26 02:50:33 +00:00
Lysandros Nikolaou	c90a862cdc	gh-104866: Tokenize should emit NEWLINE after exiting block with comment (#104870 )	2023-05-24 17:18:17 +01:00
Brandt Bucher	357bed0bcd	GH-104668: Don't call PyOS_* hooks in subinterpreters (GH-104674)	2023-05-22 19:34:34 +00:00
Cristián Maureira-Fredes	0a7796052a	gh-102856: Allow comments inside multi-line f-string expresions (#104006 )	2023-05-22 10:30:07 +00:00
Jelle Zijlstra	a5f244d627	gh-104656: Rename typeparams AST node to type_params (#104657 )	2023-05-21 21:25:09 -07:00
Serhiy Storchaka	f3466bc040	gh-98836: Extend PyUnicode_FromFormat() (GH-98838) * Support for conversion specifiers o (octal) and X (uppercase hexadecimal). * Support for length modifiers j (intmax_t) and t (ptrdiff_t). * Length modifiers are now applied to all integer conversions. * Support for wchar_t C strings (%ls and %lV). * Support for variable width and precision (). Support for flag - (left alignment).	2023-05-22 00:32:39 +03:00
Marta Gómez Macías	6715f91edc	gh-102856: Python tokenizer implementation for PEP 701 (#104323 ) This commit replaces the Python implementation of the tokenize module with an implementation that reuses the real C tokenizer via a private extension module. The tokenize module now implements a compatibility layer that transforms tokens from the C tokenizer into Python tokenize tokens for backward compatibility. As the C tokenizer does not emit some tokens that the Python tokenizer provides (such as comments and non-semantic newlines), a new special mode has been added to the C tokenizer mode that currently is only used via the extension module that exposes it to the Python layer. This new mode forces the C tokenizer to emit these new extra tokens and add the appropriate metadata that is needed to match the old Python implementation. Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2023-05-21 01:03:02 +01:00
Pablo Galindo Salgado	ff7f731632	gh-104658: Fix location of unclosed quote error for multiline f-strings (#104660 )	2023-05-20 14:07:05 +01:00
Jelle Zijlstra	24d8b88420	gh-103763: Implement PEP 695 (#103764 ) This implements PEP 695, Type Parameter Syntax. It adds support for: - Generic functions (def func[T](): ...) - Generic classes (class X[T](): ...) - Type aliases (type X = ...) - New scoping when the new syntax is used within a class body - Compiler and interpreter changes to support the new syntax and scoping rules Co-authored-by: Marc Mueller <30130371+cdce8p@users.noreply.github.com> Co-authored-by: Eric Traut <eric@traut.com> Co-authored-by: Larry Hastings <larry@hastings.org> Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2023-05-15 20:36:23 -07:00
Hugo van Kemenade	d513ddee94	Trim trailing whitespace and test on CI (#104275 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2023-05-08 17:03:52 +03:00
Eric Snow	a9c6e0618f	gh-99113: Add Py_MOD_PER_INTERPRETER_GIL_SUPPORTED (gh-104205) Here we are doing no more than adding the value for Py_mod_multiple_interpreters and using it for stdlib modules. We will start checking for it in gh-104206 (once PyInterpreterState.ceval.own_gil is added in gh-104204).	2023-05-05 21:11:27 +00:00
Pablo Galindo Salgado	eba64d2afb	gh-104169: Ensure the tokenizer doesn't overwrite previous errors (#104170 )	2023-05-04 15:15:26 +01:00
Lysandros Nikolaou	ef0df5284f	gh-97556: Raise null bytes syntax error upon null in multiline string (GH-104136)	2023-05-04 14:26:23 +02:00
jx124	5078eedc5b	gh-104016: Fixed off by 1 error in f string tokenizer (#104047 ) Co-authored-by: sunmy2019 <59365878+sunmy2019@users.noreply.github.com> Co-authored-by: Ken Jin <kenjin@python.org> Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2023-05-01 19:15:47 +00:00
chgnrdv	d5a97074d2	gh-103824: fix use-after-free error in Parser/tokenizer.c (#103993 )	2023-05-01 15:26:43 +00:00
Lysandros Nikolaou	9169a56fad	gh-103656: Transfer f-string buffers to parser to avoid use-after-free (GH-103896) Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2023-04-27 01:33:31 +00:00
Lysandros Nikolaou	57f8f9a66d	gh-103718: Correctly set f-string buffers in all cases (GH-103815) Turns out we always need to remember/restore fstring buffers in all of the stack of tokenizer modes, cause they might change to `TOK_REGULAR_MODE` and have newlines inside the braces (which is when we need to reallocate the buffer and restore the fstring ones).	2023-04-25 01:31:21 +00:00
Lysandros Nikolaou	cb157a1a35	GH-103727: Avoid advancing tokenizer too far in f-string mode (GH-103775)	2023-04-24 12:30:21 -06:00
Lysandros Nikolaou	05b3ce7339	GH-103718: Correctly cache and restore f-string buffers when needed (GH-103719)	2023-04-23 13:06:10 -06:00
Nikita Sobolev	0fd3891758	gh-102310: Change error range for invalid bytes literals (#103663 )	2023-04-22 18:08:27 -06:00
Pablo Galindo Salgado	d4aa8578b1	gh-102856: Clean some of the PEP 701 tokenizer implementation (#103634 )	2023-04-19 14:51:31 -06:00
Pablo Galindo Salgado	1ef61cf71a	gh-102856: Initial implementation of PEP 701 (#102855 ) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com> Co-authored-by: Batuhan Taskaya <isidentical@gmail.com> Co-authored-by: Marta Gómez Macías <mgmacias@google.com> Co-authored-by: sunmy2019 <59365878+sunmy2019@users.noreply.github.com>	2023-04-19 11:18:16 -05:00
Chenxi Mao	7703def37e	GH-102711: Fix warnings found by clang (#102712 ) There are some warnings if build python via clang: Parser/pegen.c:812:31: warning: a function declaration without a prototype is deprecated in all versions of C [-Wstrict-prototypes] _PyPegen_clear_memo_statistics() ^ void Parser/pegen.c:820:29: warning: a function declaration without a prototype is deprecated in all versions of C [-Wstrict-prototypes] _PyPegen_get_memo_statistics() ^ void Fix it to make clang happy. Signed-off-by: Chenxi Mao <chenxi.mao@suse.com>	2023-03-28 10:52:22 +02:00
Max Bachmann	c6858d1e7f	gh-102255: Improve build support for Windows API partitions (GH-102256) Add `MS_WINDOWS_DESKTOP`, `MS_WINDOWS_APPS`, `MS_WINDOWS_SYSTEM` and `MS_WINDOWS_GAMES` preprocessor definitions to allow switching off functionality missing from particular API partitions ("partitions" are used in Windows to identify overlapping subsets of APIs). CPython only officially supports `MS_WINDOWS_DESKTOP` and `MS_WINDOWS_SYSTEM` (APPS is included by normal desktop builds, but APPS without DESKTOP is not covered). Other configurations are a convenience for people building their own runtimes. `MS_WINDOWS_GAMES` is for the Xbox subset of the Windows API, which is also available on client OS, but is restricted compared to `MS_WINDOWS_DESKTOP`. These restrictions may change over time, as they relate to the build headers rather than the OS support, and so we assume that Xbox builds will use the latest available version of the GDK.	2023-03-09 21:09:12 +00:00
Pablo Galindo Salgado	f533f216e6	gh-102416: Do not memoize incorrectly loop rules in the parser (#102467 )	2023-03-06 14:41:53 +01:00
Eric Snow	880437d4ec	gh-100227: Move _str_replace_inf to PyInterpreterState (gh-102333) https://github.com/python/cpython/issues/100227	2023-02-28 14:16:39 -07:00
abel1502	448c7d154e	Fix some typos in asdl_c.py (GH-101757)	2023-02-09 21:10:46 -06:00
Mark Shannon	feec49c407	GH-101578: Normalize the current exception (GH-101607) * Make sure that the current exception is always normalized. * Remove redundant type and traceback fields for the current exception. * Add new API functions: PyErr_GetRaisedException, PyErr_SetRaisedException * Add new API functions: PyException_GetArgs, PyException_SetArgs	2023-02-08 09:31:12 +00:00
Stepfen Shawn	a1e051a237	gh-100940: Change "char str" to "const char str" in KeywordToken: It is an immutable string. (#100936 )	2023-01-18 21:02:48 +00:00
Pablo Galindo Salgado	1de4395f62	gh-101046: Fix a potential memory leak in the parser when raising MemoryError (#101051 )	2023-01-16 18:45:37 +00:00
Eric Snow	0415cf895f	gh-81057: Move the Cached Parser Dummy Name to _PyRuntimeState (#100277 )	2022-12-16 13:48:03 +00:00
Eric Snow	91a8e002c2	gh-81057: Move More Globals to _PyRuntimeState (gh-100092) https://github.com/python/cpython/issues/81057	2022-12-07 15:56:31 -07:00
Eric Snow	d47ffeb9e3	gh-90110: Clean Up the C-analyzer Globals Lists (gh-100091) https://github.com/python/cpython/issues/90110	2022-12-07 15:02:47 -07:00
Pablo Galindo Salgado	97e7004cfe	gh-100050: Fix an assertion error when raising unclosed parenthesis errors in the tokenizer (GH-100065) Automerge-Triggered-By: GH:pablogsal	2022-12-06 15:09:56 -08:00
Pablo Galindo Salgado	417206a05c	gh-99891: Fix infinite recursion in the tokenizer when showing warnings (GH-99893) Automerge-Triggered-By: GH:pablogsal	2022-11-30 03:36:06 -08:00
Lysandros Nikolaou	6d8da238cc	gh-90994: Improve error messages upon call arguments syntax errors (GH-96893)	2022-11-21 00:15:05 +01:00
Pablo Galindo Salgado	e13d1d9dda	gh-99581: Fix a buffer overflow in the tokenizer when copying lines that fill the available buffer (#99605 )	2022-11-20 20:20:03 +00:00
Lysandros Nikolaou	9c4232ae89	gh-99211: Point to except/except* on syntax errors when mixing them (GH-99215) Automerge-Triggered-By: GH:lysnikolaou	2022-11-20 09:11:02 -08:00
Eric Snow	3c57971a2d	gh-81057: Move Globals in Core Code to _PyRuntimeState (gh-99496) This is the first of several changes to consolidate non-object globals in core code. https://github.com/python/cpython/issues/81057	2022-11-15 09:45:11 -07:00
Victor Stinner	f13f466474	gh-99300: Use Py_NewRef() in Python/Python-ast.c (#99499 ) Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in Python/Python-ast.c. Update Parser/asdl_c.py to regenerate code.	2022-11-15 10:29:56 +01:00
Eric Snow	a088290f9d	gh-81057: Move Global Variables Holding Objects to _PyRuntimeState. (gh-99487) This moves nearly all remaining object-holding globals in core code (other than static types). https://github.com/python/cpython/issues/81057	2022-11-14 13:50:56 -07:00
Victor Stinner	4ce2a202c7	gh-99300: Use Py_NewRef() in Parser/ directory (#99330 ) Replace Py_INCREF() with Py_NewRef() in C files of the Parser/ directory and in the PEG generator.	2022-11-10 15:30:05 +01:00
Victor Stinner	231d83b724	gh-99300: Use Py_NewRef() in Python/ directory (#99317 ) Replace Py_INCREF() and Py_XINCREF() with Py_NewRef() and Py_XNewRef() in C files of the Python/ directory. Update Parser/asdl_c.py to regenerate Python/Python-ast.c.	2022-11-10 11:23:36 +01:00
Irit Katriel	61b6c40b64	gh-99153: set location on SyntaxError for try with both except and except* (GH-99160)	2022-11-06 15:36:19 +00:00
Victor Stinner	a60ddd31be	gh-98401: Invalid escape sequences emits SyntaxWarning (#99011 ) A backslash-character pair that is not a valid escape sequence now generates a SyntaxWarning, instead of DeprecationWarning. For example, re.compile("\d+\.\d+") now emits a SyntaxWarning ("\d" is an invalid escape sequence), use raw strings for regular expression: re.compile(r"\d+\.\d+"). In a future Python version, SyntaxError will eventually be raised, instead of SyntaxWarning. Octal escapes with value larger than 0o377 (ex: "\477"), deprecated in Python 3.11, now produce a SyntaxWarning, instead of DeprecationWarning. In a future Python version they will be eventually a SyntaxError. codecs.escape_decode() and codecs.unicode_escape_decode() are left unchanged: they still emit DeprecationWarning. * The parser only emits SyntaxWarning for Python 3.12 (feature version), and still emits DeprecationWarning on older Python versions. * Fix SyntaxWarning by using raw strings in Tools/c-analyzer/ and wasm_build.py.	2022-11-03 17:53:25 +01:00
Pablo Galindo Salgado	395d4285bf	gh-98931: Improve error message when the user types 'import x from y' instead of 'from y import x' (#98932 )	2022-11-01 13:01:20 +00:00
Victor Stinner	1863302d61	gh-97669: Create Tools/build/ directory (#97963 ) Create Tools/build/ directory. Move the following scripts from Tools/scripts/ to Tools/build/: * check_extension_modules.py * deepfreeze.py * freeze_modules.py * generate_global_objects.py * generate_levenshtein_examples.py * generate_opcode_h.py * generate_re_casefix.py * generate_sre_constants.py * generate_stdlib_module_names.py * generate_token.py * parse_html5_entities.py * smelly.py * stable_abi.py * umarshal.py * update_file.py * verify_ensurepip_wheels.py Update references to these scripts.	2022-10-17 12:01:00 +02:00
Lysandros Nikolaou	3de08ce8c1	gh-97997: Add col_offset field to tokenizer and use that for AST nodes (#98000 )	2022-10-07 14:38:35 -07:00
Lysandros Nikolaou	cbf0afd8a1	gh-97973: Return all necessary information from the tokenizer (GH-97984) Right now, the tokenizer only returns type and two pointers to the start and end of the token. This PR modifies the tokenizer to return the type and set all of the necessary information, so that the parser does not have to this.	2022-10-06 16:07:17 -07:00
Mark Shannon	76449350b3	GH-91079: Decouple C stack overflow checks from Python recursion checks. (GH-96510)	2022-10-05 01:34:03 +01:00
Pablo Galindo Salgado	aab01e3524	gh-96670: Raise SyntaxError when parsing NULL bytes (#97594 )	2022-09-27 23:23:42 +01:00
Lysandros Nikolaou	7e36abbb78	gh-91210: Improve error message when non-default param follows default (GH-95933) - Improve error message when parameter without a default follows one with a default - Show same error message when positional-only params precede the default/non-default sequence	2022-09-17 10:09:28 -07:00
Matthias Görgens	81e36f350b	gh-96678: Fix UB of null pointer arithmetic (GH-96782) Automerge-Triggered-By: GH:pablogsal	2022-09-13 06:14:35 -07:00
Michael Droettboom	8bc356a7dd	gh-96268: Fix loading invalid UTF-8 (#96270 ) This makes tokenizer.c:valid_utf8 match stringlib/codecs.h:decode_utf8. It also fixes an off-by-one error introduced in 3.10 for the line number when the tokenizer reports bad UTF8.	2022-09-07 14:23:54 -07:00
Michael Droettboom	05692c67c5	gh-96611: Fix error message for invalid UTF-8 in mid-multiline string (#96623 )	2022-09-07 00:12:16 +01:00
Nikita Sobolev	2c7d2e8d46	gh-96587: Raise `SyntaxError` for PEP654 on older `feature_version` (#96588 )	2022-09-05 17:54:09 +01:00
Gregory P. Smith	511ca94520	gh-95778: CVE-2020-10735: Prevent DoS by very large int() (#96499 ) Integer to and from text conversions via CPython's bignum `int` type is not safe against denial of service attacks due to malicious input. Very large input strings with hundred thousands of digits can consume several CPU seconds. This PR comes fresh from a pile of work done in our private PSRT security response team repo. Signed-off-by: Christian Heimes [Red Hat] <christian@python.org> Tons-of-polishing-up-by: Gregory P. Smith [Google] <greg@krypto.org> Reviews via the private PSRT repo via many others (see the NEWS entry in the PR). <!-- gh-issue-number: gh-95778 --> * Issue: gh-95778 <!-- /gh-issue-number --> I wrote up [a one pager for the release managers](https://docs.google.com/document/d/1KjuF_aXlzPUxTK4BMgezGJ2Pn7uevfX7g0_mvgHlL7Y/edit#). Much of that text wound up in the Issue. Backports PRs already exist. See the issue for links.	2022-09-02 09:35:08 -07:00
Shantanu	a965db37f2	gh-94996: Disallow lambda pos only params with feature_version < (3, 8) (GH-95934)	2022-08-12 20:41:02 +02:00
Shantanu	b5e3ea2862	gh-94996: Disallow parsing pos only params with feature_version < (3, 8) (GH-94997)	2022-08-12 19:27:50 +02:00
Christian Heimes	b4c857d0fd	gh-95876: Fix format string in pegen error location code (#95877 )	2022-08-11 09:55:57 +01:00
Honglin Zhu	b946f529ef	gh-95355: Check tokens[0] after allocating memory (GH-95356) #95355 Automerge-Triggered-By: GH:pablogsal	2022-07-28 03:00:34 -07:00
Pablo Galindo Salgado	0047447294	gh-95185: Check recursion depth in the AST constructor (#95186 ) Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>	2022-07-24 15:58:52 +01:00
Shantanu	0daba82221	gh-94949: Disallow parsing parenthesised ctx mgr with old feature_version (#94950 ) * gh-94949: Disallow parsing parenthesised ctx manager with old feature_version * 📜🤖 Added by blurb_it. * Allow it with feature_version=(3, 9) as well Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2022-07-18 22:10:49 +01:00
Shantanu	ae0be5a53b	gh-94947: Disallow parsing walrus with feature_version < (3, 8) (#94948 ) * gh-94947: Disallow parsing walrus with feature_version < (3, 8) * oops, commit the parser * 📜🤖 Added by blurb_it. Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>	2022-07-18 10:20:12 +01:00

1 2 3 4 5 ...

1232 Commits