cpython

Commit Graph

Author	SHA1	Message	Date
Brandt Bucher	dbe60ee09d	bpo-43892: Validate the first term of complex literal value patterns (GH-25735)	2021-04-29 17:19:28 -07:00
Nick Coghlan	1e7b858575	bpo-43892: Make match patterns explicit in the AST (GH-25585) Co-authored-by: Brandt Bucher <brandtbucher@gmail.com>	2021-04-28 22:58:44 -07:00
Pablo Galindo	a77aac4fca	bpo-43914: Highlight invalid ranges in SyntaxErrors (#25525) To improve the user experience understanding what part of the error messages associated with SyntaxErrors is wrong, we can highlight the whole error range and not only place the caret at the first character. In this way: >>> foo(x, z for z in range(10), t, w) File "<stdin>", line 1 foo(x, z for z in range(10), t, w) ^ SyntaxError: Generator expression must be parenthesized becomes >>> foo(x, z for z in range(10), t, w) File "<stdin>", line 1 foo(x, z for z in range(10), t, w) ^^^^^^^^^^^^^^^^^^^^ SyntaxError: Generator expression must be parenthesized	2021-04-23 14:27:05 +01:00
Pablo Galindo	56c95dfe27	bpo-43859: Improve the error message for IndentationError exceptions (GH-25431)	2021-04-21 15:28:21 +01:00
Pablo Galindo	b5b98bd8f8	bpo-43823: Fix location of one of the errors for invalid dictionary literals (GH-25427)	2021-04-16 00:45:42 +01:00
Pablo Galindo	b280248be8	bpo-43822: Improve syntax errors for missing commas (GH-25377)	2021-04-15 21:38:45 +01:00
Pablo Galindo	da74350174	bpo-43823: Improve syntax errors for invalid dictionary literals (GH-25378)	2021-04-15 14:06:39 +01:00
Pablo Galindo	30ed93bfec	bpo-43797: Handle correctly invalid assignments inside function calls and generators (GH-25390)	2021-04-13 17:51:21 +01:00
Pablo Galindo	d9151cb453	Ensure that early = are not matched by the parser as invalid comparisons (GH-25375)	2021-04-13 02:32:33 +01:00
Pablo Galindo	b86ed8e3bb	bpo-43797: Improve syntax error for invalid comparisons (#25317) * bpo-43797: Improve syntax error for invalid comparisons * Update Lib/test/test_fstring.py Co-authored-by: Guido van Rossum <gvanrossum@gmail.com> * Apply review comments * can't -> cannot Co-authored-by: Guido van Rossum <gvanrossum@gmail.com>	2021-04-12 16:59:30 +01:00
Matthew Suozzo	75a06f067b	bpo-43798: Add source location attributes to alias (GH-25324) * Add source location attributes to alias. * Move alias star construction to pegen helper. Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com> Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2021-04-10 22:56:28 +02:00
Victor Stinner	d27f8d2e07	bpo-43244: Rename pycore_ast.h functions to _PyAST_xxx() (GH-25252) Rename AST functions of pycore_ast.h to use the "_PyAST_" prefix. Remove macros creating aliases without prefix. For example, Module() becomes _PyAST_Module(). Update Grammar/python.gram to use _PyAST_xxx() functions.	2021-04-07 21:34:22 +02:00
Pablo Galindo	8efad61963	bpo-41064: Improve syntax error for invalid usage of '**' in f-strings (GH-25006)	2021-03-24 19:34:17 +00:00
Victor Stinner	6af528b4ab	bpo-43244: Fix test_peg_generators on Windows (GH-24913) Don't redefine Py_DebugFlag, it's already defined in pydebug.h which is included by Python.h	2021-03-18 09:54:13 +01:00
Pablo Galindo	08fb8ac99a	bpo-42128: Add 'missing :' syntax error message to match statements (GH-24733)	2021-03-18 01:03:11 +00:00
Jozef Grajciar	c994ffe695	bpo-11717: fix ssize_t redefinition error when targeting 32bit Windows app (GH-24479)	2021-03-01 11:18:33 +00:00
Brandt Bucher	145bf269df	bpo-42128: Structural Pattern Matching (PEP 634) (GH-22917) Co-authored-by: Guido van Rossum <guido@python.org> Co-authored-by: Talin <viridia@gmail.com> Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2021-02-26 14:51:55 -08:00
Pablo Galindo	206cbdab16	bpo-43149: Improve error message for exception group without parentheses (GH-24467)	2021-02-07 18:42:21 +00:00
Pablo Galindo	d4e6ed7e5f	bpo-43121: Fix incorrect SyntaxError message for missing comma (GH-24436)	2021-02-03 23:29:26 +00:00
Pablo Galindo	58fb156edd	bpo-42997: Improve error message for missing : before suites (GH-24292) * Add to the peg generator a new directive ('&&') that expects a token and hard-fails the parse if the token is not found. This allows syntax errors for missing tokens to be emitted quickly. * Use the new grammar element to hard-fail if the ':' is missing before suites.	2021-02-02 19:54:22 +00:00
Pablo Galindo	835f14ff8e	bpo-43017: Improve error message for unparenthesised tuples in comprehensions (GH-24314)	2021-01-31 22:52:56 +00:00
Lysandros Nikolaou	07dcd86cee	bpo-42860: Remove type error from grammar (GH-24156) This is only there so that alternative implementations written in statically-typed languages can use this grammar without having type errors in the way. Automerge-Triggered-By: GH:lysnikolaou	2021-01-07 14:31:25 -08:00
Lysandros Nikolaou	2ea320dddd	bpo-40631: Disallow single parenthesized star target (GH-24027)	2021-01-03 01:14:21 +02:00
Pablo Galindo	43c4fb6c90	bpo-30858: Improve error location for expressions with assignments (GH-23753) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-12-13 16:46:48 +00:00
Pablo Galindo	9bdc40ee3e	Refactor the grammar to match the language specification docs (GH-23574)	2020-11-30 19:42:38 +00:00
Pablo Galindo	b0aba1fcdc	bpo-42381: Allow walrus in set literals and set comprehensions (GH-23332) Currently, walruses are not allowed in set literals and set comprehensions: >>> {y := 4, 42, 33} File "<stdin>", line 1 {y := 4, 42, 33} ^ SyntaxError: invalid syntax but they should be allowed as well per PEP 572	2020-11-17 01:17:12 +00:00
Lysandros Nikolaou	cae60187cf	bpo-42316: Allow unparenthesized walrus operator in indexes (GH-23317)	2020-11-17 01:09:35 +02:00
Lysandros Nikolaou	cb3e5ed071	bpo-42374: Allow unparenthesized walrus in genexps (GH-23319) This fixes a regression that was introduced by the new parser. Automerge-Triggered-By: GH:lysnikolaou	2020-11-16 15:08:35 -08:00
Lysandros Nikolaou	02cdfc93f8	bpo-42218: Correctly handle errors in left-recursive rules (GH-23065) Left-recursive rules need to check for errors explicitly, since even if the rule returns NULL, the parsing might continue and lead to long-distance failures. Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-10-31 20:31:41 +02:00
Pablo Galindo	06f8c3328d	bpo-42214: Fix check for NOTEQUAL token in the PEG parser for the barry_as_flufl rule (GH-23048)	2020-10-30 23:48:42 +00:00
Lysandros Nikolaou	15acc4eaba	bpo-41659: Disallow curly brace directly after primary (GH-22996)	2020-10-27 20:54:20 +02:00
Lysandros Nikolaou	bca7014032	bpo-42123: Run the parser two times and only enable invalid rules on the second run (GH-22111) * Implement running the parser a second time for the error messages The first parser run is only responsible for detecting whether there is a `SyntaxError` or not. If there isn't, the AST gets returned. Otherwise, the parser is run a second time with all the `invalid_*` rules enabled so that all the customized error messages get produced.	2020-10-27 00:42:04 +02:00
Lysandros Nikolaou	2e5ca9e3f6	bpo-41746: Cast to typed seqs in CHECK macros to avoid type erasure (GH-22864)	2020-10-21 22:53:14 +03:00
Batuhan Taskaya	48f305fd12	bpo-41979: Accept star-unpacking on with-item targets (GH-22611) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-10-09 10:56:48 +01:00
Pablo Galindo	a5634c4067	bpo-41746: Add type information to asdl_seq objects (GH-22223) * Add new capability to the PEG parser to type variable assignments. For instance: ``` | a[asdl_stmt_seq]=';'.small_stmt+ [';'] NEWLINE { a } ``` Add new sequence types from the asdl definition (automatically generated) * Make `asdl_seq` type a generic aliasing pointer type. * Create a new `asdl_generic_seq` for the generic case using `void`. The old `asdl_seq_GET`/`asdl_seq_SET` macros are now typed. * New `asdl_seq_GET_UNTYPED`/`asdl_seq_SET_UNTYPED` macros for dealing with generic sequences. * Change all possible `asdl_seq` types to use specific versions everywhere.	2020-09-16 19:42:00 +01:00
Pablo Galindo	315a61f7a9	bpo-41697: Correctly handle KeywordOrStarred when parsing arguments in the parser (GH-22077)	2020-09-03 15:29:32 +01:00
Pablo Galindo	4a97b1517a	bpo-41690: Use a loop to collect args in the parser instead of recursion (GH-22053) This program can segfault the parser by stack overflow: ``` import ast code = "f(" + ",".join(['a' for _ in range(100000)]) + ")" print("Ready!") ast.parse(code) ``` The reason is that the rule for arguments has a simple recursion when collecting args: args[expr_ty]: [...] | a=named_expression b=[',' c=args { c }] { [...] }	2020-09-02 17:44:19 +01:00
Pablo Galindo	1ac0cbca36	bpo-41215: Don't use NULL by default in the PEG parser keyword list (GH-21355) Automerge-Triggered-By: @lysnikolaou	2020-07-06 12:31:16 -07:00
Batuhan Taskaya	c8f29ad986	bpo-40769: Allow extra surrounding parentheses for invalid annotated assignment rule (GH-20387)	2020-06-27 19:33:08 +01:00
Lysandros Nikolaou	4b85e60601	bpo-41119: Output correct error message for list/tuple followed by colon (GH-21160)	2020-06-26 00:22:36 +01:00
Lysandros Nikolaou	6c4e0bd974	bpo-41060: Avoid SEGFAULT when calling GET_INVALID_TARGET in the grammar (GH-21020) `GET_INVALID_TARGET` might unexpectedly return `NULL`, which if not caught will cause a SEGFAULT. Therefore, this commit introduces a new inline function `RAISE_SYNTAX_ERROR_INVALID_TARGET` that always checks for `GET_INVALID_TARGET` returning NULL and can be used in the grammar, replacing the long C ternary operation used till now.	2020-06-21 03:18:01 +01:00
Lysandros Nikolaou	01ece63d42	bpo-40334: Produce better error messages on invalid targets (GH-20106) The following error messages get produced: - `cannot delete ...` for invalid `del` targets - `... is an illegal 'for' target` for invalid targets in for statements - `... is an illegal 'with' target` for invalid targets in with statements Additionally, a few `cut`s were added in various places before the invocation of the `invalid_*` rule, in order to speed things up. Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-06-19 00:10:43 +01:00
Pablo Galindo	1ed83adb0e	bpo-40939: Remove the old parser (GH-20768) This commit removes the old parser, the deprecated parser module, the old parser compatibility flags and environment variables and all associated support code and documentation.	2020-06-11 17:30:46 +01:00
Victor Stinner	9e5d30cc99	bpo-39882: Py_FatalError() logs the function name (GH-18819) The Py_FatalError() function is replaced with a macro which logs automatically the name of the current function, unless the Py_LIMITED_API macro is defined. Changes: * Add _Py_FatalErrorFunc() function. * Remove the function name from the message of Py_FatalError() calls which included the function name. * Update tests.	2020-03-07 00:54:20 +01:00
Inada Naoki	09415ff0eb	fix warnings by adding more const (GH-12924)	2019-04-23 20:39:37 +09:00
Pablo Galindo	f2cf1e3e28	bpo-36623: Clean parser headers and include files (GH-12253) After the removal of pgen, multiple header and function prototypes that lack implementation or are unused are still lying around.	2019-04-13 17:05:14 +01:00
Guido van Rossum	dcfcd146f8	bpo-35766: Merge typed_ast back into CPython (GH-11645)	2019-01-31 12:40:27 +01:00
Ivan Levkivskyi	9932a22897	bpo-33416: Add end positions to Python AST (GH-11605) The majority of this PR is tediously passing `end_lineno` and `end_col_offset` everywhere. Here are non-trivial points: * It is not possible to reconstruct end positions in AST "on the fly", some information is lost after an AST node is constructed, so we need two more attributes for every AST node `end_lineno` and `end_col_offset`. * I add end position information to both CST and AST. Although it may be technically possible to avoid adding end positions to CST, the code becomes more cumbersome and less efficient. * Since the end position is not known for non-leaf CST nodes while the next token is added, this requires a bit of extra care (see `_PyNode_FinalizeEndPos`). Unless I made some mistake, the algorithm should be linear. * For statements, I "trim" the end position of suites to not include the terminal newlines and dedent (this seems to be what people would expect), for example in ```python class C: pass pass ``` the end line and end column for the class definition is (2, 8). * For `end_col_offset` I use the common Python convention for indexing, for example for `pass` the `end_col_offset` is 4 (not 3), so that `[0:4]` gives one the source code that corresponds to the node. * I added a helper function `ast.get_source_segment()`, to get source text segment corresponding to a given AST node. It is also useful for testing. An (inevitable) downside of this PR is that AST now takes almost 25% more memory. I think however it is probably justified by the benefits.	2019-01-22 11:18:22 +00:00
Berker Peksag	2a65ecb780	Issue #26130: Remove redundant variable 's' from Parser/parser.c Patch by Oren Milman.	2016-03-28 00:45:28 +03:00
Serhiy Storchaka	c679227e31	Issue #1772673: The type of `char` arguments now changed to `const char`.	2013-10-19 21:03:34 +03:00
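
Commit 9932a22897 (bpo-33416) describes the `end_lineno` and `end_col_offset` attributes and the `ast.get_source_segment()` helper. The following is a minimal sketch of how they can be read from Python, not code taken from the commit; the sample source string and variable names are made up for illustration, and it assumes an interpreter new enough to include that change.

```python
import ast

# Illustrative two-statement module (ASCII only, so the UTF-8 byte
# offsets stored on the nodes coincide with string indices).
source = "total = price * tax\npass\n"
tree = ast.parse(source)
assign, pass_stmt = tree.body

# lineno is 1-based, col_offset is 0-based, end_col_offset is exclusive,
# so slicing the line with [col_offset:end_col_offset] yields the node text.
pass_span = (pass_stmt.lineno, pass_stmt.col_offset,
             pass_stmt.end_lineno, pass_stmt.end_col_offset)
line = source.splitlines()[pass_stmt.lineno - 1]
pass_text = line[pass_stmt.col_offset:pass_stmt.end_col_offset]

# get_source_segment() performs the same slicing for any node that
# carries position information.
segment = ast.get_source_segment(source, assign.value)

print(pass_span)
print(pass_text)
print(segment)
```

Here `pass_text` matches the commit's remark that `[0:4]` recovers the source of a `pass` statement, and `segment` is the right-hand side of the assignment.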


86 Commits
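
The reproduction quoted in commit 4a97b1517a (bpo-41690) can be turned into a small regression check. This is a sketch under the assumption that the interpreter running it already contains the loop-based argument collection; on a build without the fix, the same input may crash the process instead of raising an exception. The constant name `ARG_COUNT` is introduced here for illustration only.

```python
import ast

# Same shape as the program in the commit message: one call with
# 100000 positional arguments.
ARG_COUNT = 100_000
code = "f(" + ",".join("a" for _ in range(ARG_COUNT)) + ")"

# With args collected in a loop, parsing should not exhaust the C stack.
tree = ast.parse(code)
call = tree.body[0].value
arg_total = len(call.args)

print(arg_total)
```

The call node holds its arguments in one flat list, so the resulting AST is shallow even though the input is very wide.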