cpython

Commit Graph

Author	SHA1	Message	Date
Pablo Galindo	06f8c3328d	bpo-42214: Fix check for NOTEQUAL token in the PEG parser for the barry_as_flufl rule (GH-23048)	2020-10-30 23:48:42 +00:00
Lysandros Nikolaou	15acc4eaba	bpo-41659: Disallow curly brace directly after primary (GH-22996)	2020-10-27 20:54:20 +02:00
Lysandros Nikolaou	bca7014032	bpo-42123: Run the parser two times and only enable invalid rules on the second run (GH-22111) * Implement running the parser a second time for the errors messages The first parser run is only responsible for detecting whether there is a `SyntaxError` or not. If there isn't the AST gets returned. Otherwise, the parser is run a second time with all the `invalid_*` rules enabled so that all the customized error messages get produced.	2020-10-27 00:42:04 +02:00
Lysandros Nikolaou	2e5ca9e3f6	bpo-41746: Cast to typed seqs in CHECK macros to avoid type erasure (GH-22864)	2020-10-21 22:53:14 +03:00
Batuhan Taskaya	48f305fd12	bpo-41979: Accept star-unpacking on with-item targets (GH-22611) Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-10-09 10:56:48 +01:00
Pablo Galindo	a5634c4067	bpo-41746: Add type information to asdl_seq objects (GH-22223) * Add new capability to the PEG parser to type variable assignments. For instance: ``` \| a[asdl_stmt_seq]=';'.small_stmt+ [';'] NEWLINE { a } ``` Add new sequence types from the asdl definition (automatically generated) * Make `asdl_seq` type a generic aliasing pointer type. * Create a new `asdl_generic_seq` for the generic case using `void`. The old `asdl_seq_GET`/`ast_seq_SET` macros now are typed. * New `asdl_seq_GET_UNTYPED`/`ast_seq_SET_UNTYPED` macros for dealing with generic sequences. * Changes all possible `asdl_seq` types to use specific versions everywhere.	2020-09-16 19:42:00 +01:00
Pablo Galindo	315a61f7a9	bpo-41697: Correctly handle KeywordOrStarred when parsing arguments in the parser (GH-22077)	2020-09-03 15:29:32 +01:00
Pablo Galindo	4a97b1517a	bpo-41690: Use a loop to collect args in the parser instead of recursion (GH-22053) This program can segfault the parser by stack overflow: ``` import ast code = "f(" + ",".join(['a' for _ in range(100000)]) + ")" print("Ready!") ast.parse(code) ``` the reason is that the rule for arguments has a simple recursion when collecting args: args[expr_ty]: [...] \| a=named_expression b=[',' c=args { c }] { [...] }	2020-09-02 17:44:19 +01:00
Pablo Galindo	72cabb2aa6	bpo-40939: Use the new grammar for the grammar specification documentation (GH-19969) (We censor the heck out of actions and some other stuff using a custom "highlighter".) Co-authored-by: Guido van Rossum <guido@python.org>	2020-07-27 11:20:36 -07:00
Guido van Rossum	508ed2d912	Delete remaining references to Grammar/Grammar from docs (#21624 ) (Ironically, the file itself remains, see https://github.com/we-like-parsers/cpython/issues/135.)	2020-07-26 08:27:52 -07:00
Batuhan Taskaya	c8f29ad986	bpo-40769: Allow extra surrounding parentheses for invalid annotated assignment rule (GH-20387)	2020-06-27 19:33:08 +01:00
Lysandros Nikolaou	4b85e60601	bpo-41119: Output correct error message for list/tuple followed by colon (GH-21160)	2020-06-26 00:22:36 +01:00
Lysandros Nikolaou	6c4e0bd974	bpo-41060: Avoid SEGFAULT when calling GET_INVALID_TARGET in the grammar (GH-21020) `GET_INVALID_TARGET` might unexpectedly return `NULL`, which if not caught will cause a SEGFAULT. Therefore, this commit introduces a new inline function `RAISE_SYNTAX_ERROR_INVALID_TARGET` that always checks for `GET_INVALID_TARGET` returning NULL and can be used in the grammar, replacing the long C ternary operation used till now.	2020-06-21 03:18:01 +01:00
Lysandros Nikolaou	01ece63d42	bpo-40334: Produce better error messages on invalid targets (GH-20106) The following error messages get produced: - `cannot delete ...` for invalid `del` targets - `... is an illegal 'for' target` for invalid targets in for statements - `... is an illegal 'with' target` for invalid targets in with statements Additionally, a few `cut`s were added in various places before the invocation of the `invalid_*` rule, in order to speed things up. Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-06-19 00:10:43 +01:00
Pablo Galindo	b4282dd150	Remove unnecessary grammar decorations and change header (GH-20819)	2020-06-12 00:51:44 +01:00
Lysandros Nikolaou	bcd7deed91	bpo-40939: Remove PEG parser easter egg (__new_parser__) (#20802 ) It no longer serves a purpose (there's only one parser) and having "new" in any name will eventually look odd. Also, it impinges on a potential sub-namespace, `__new_...__`.	2020-06-11 09:09:21 -07:00
Pablo Galindo	c6483c9896	Raise specialised syntax error for invalid lambda parameters (GH-20776)	2020-06-10 14:07:06 +01:00
Pablo Galindo	9f495908c5	bpo-40903: Handle multiple '=' in invalid assignment rules in the PEG parser (GH-20697) Automerge-Triggered-By: @pablogsal	2020-06-07 18:57:00 -07:00
Lysandros Nikolaou	ae14583302	bpo-40334: Produce better error messages for non-parenthesized genexps (GH-20153) The error message, generated for a non-parenthesized generator expression in function calls, was still the generic `invalid syntax`, when the generator expression wasn't appearing as the first argument in the call. With this patch, even on input like `f(a, b, c for c in d, e)`, the correct error message gets produced.	2020-05-22 01:56:52 +01:00
Batuhan Taskaya	b8a65ec1d3	bpo-40715: Reject dict unpacking on dict comprehensions (GH-20292) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com> Co-authored-by: Pablo Galindo <pablogsal@gmail.com>	2020-05-21 23:39:56 +01:00
Batuhan Taskaya	72e0aa2fd2	bpo-40176: Improve error messages for trailing comma on from import (GH-20294)	2020-05-21 21:41:58 +01:00
Lysandros Nikolaou	75b863aa97	bpo-40334: Reproduce error message for type comments on bare '*' in the new parser (GH-20151)	2020-05-18 20:14:47 +01:00
Pablo Galindo	16ab07063c	bpo-40334: Correctly identify invalid target in assignment errors (GH-20076) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-05-15 02:04:52 +01:00
Lysandros Nikolaou	ce21cfca7b	bpo-40618: Disallow invalid targets in augassign and except clauses (GH-20083) This commit fixes the new parser to disallow invalid targets in the following scenarios: - Augmented assignments must only accept a single target (Name, Attribute or Subscript), but no tuples or lists. - `except` clauses should only accept a single `Name` as a target. Co-authored-by: Pablo Galindo <Pablogsal@gmail.com>	2020-05-14 21:13:50 +01:00
Lysandros Nikolaou	a15c9b3a05	bpo-40334: Always show the caret on SyntaxErrors (GH-20050) This commit fixes SyntaxError locations when the caret is not displayed, by doing the following: - `col_number` always gets set to the location of the offending node/expr. When no caret is to be displayed, this gets achieved by setting the object holding the error line to None. - Introduce a new function `_PyPegen_raise_error_known_location`, which can be called, when an arbitrary `lineno`/`col_offset` needs to be passed. This function then gets used in the grammar (through some new macros and inline functions) so that SyntaxError locations of the new parser match that of the old.	2020-05-13 20:36:27 +01:00
Shantanu	27c0d9b54a	bpo-40334: produce specialized errors for invalid del targets (GH-19911)	2020-05-11 14:53:58 -07:00
Lysandros Nikolaou	4638c64295	bpo-40334: Error message for invalid default args in function call (GH-19973) When parsing something like `f(g()=2)`, where the name of a default arg is not a NAME, but an arbitrary expression, a specialised error message is emitted.	2020-05-07 11:44:06 +01:00
Pablo Galindo	99db2a1db7	bpo-40334: Allow trailing comma in parenthesised context managers (GH-19964)	2020-05-06 22:54:34 +01:00
Lysandros Nikolaou	999ec9ab6a	bpo-40334: Add type to the assignment rule in the grammar file (GH-19963)	2020-05-06 19:11:04 +01:00
Lysandros Nikolaou	e10e7c771b	bpo-40334: Spacialized error message for invalid args after bare '' (GH-19865) When parsing things like `def f(): pass` the old parser used to output `SyntaxError: named arguments must follow bare *`, which the new parser wasn't able to do.	2020-05-04 11:58:31 +01:00
Shantanu	603d354626	bpo-40493: fix function type comment parsing (GH-19894) The grammar for func_type_input rejected things like `(*t1) ->t2`. This fixes that. Automerge-Triggered-By: @gvanrossum	2020-05-03 22:08:14 -07:00
Guido van Rossum	3941d9700b	bpo-40334: Refactor lambda_parameters similar to parameters (GH-19830)	2020-05-01 17:42:03 +01:00
Pablo Galindo	d955241469	bpo-40334: Correct return value of func_type_comment (GH-19833)	2020-05-01 08:32:09 -07:00
Batuhan Taskaya	76c1b4d5c5	bpo-40334: Improve column offsets for thrown syntax errors by Pegen (GH-19782)	2020-05-01 14:13:43 +01:00
Lysandros Nikolaou	3e0a6f37df	bpo-40334: Add support for feature_version in new PEG parser (GH-19827) `ast.parse` and `compile` support a `feature_version` parameter that tells the parser to parse the input string, as if it were written in an older Python version. The `feature_version` is propagated to the tokenizer, which uses it to handle the three different stages of support for `async` and `await`. Additionally, it disallows the following at parser level: - The '@' operator in < 3.5 - Async functions in < 3.5 - Async comprehensions in < 3.6 - Underscores in numeric literals in < 3.6 - Await expression in < 3.5 - Variable annotations in < 3.6 - Async for-loops in < 3.5 - Async with-statements in < 3.5 - F-strings in < 3.6 Closes we-like-parsers/cpython#124.	2020-04-30 20:27:52 -07:00
Guido van Rossum	c001c09e90	bpo-40334: Support type comments (GH-19780) This implements full support for # type: <type> comments, # type: ignore <stuff> comments, and the func_type parsing mode for ast.parse() and compile(). Closes https://github.com/we-like-parsers/cpython/issues/95. (For now, you need to use the master branch of mypy, since another issue unique to 3.9 had to be fixed there, and there's no mypy release yet.) The only thing missing is `feature_version=N`, which is being tracked in https://github.com/we-like-parsers/cpython/issues/124.	2020-04-30 12:12:19 -07:00
Pablo Galindo	2b74c835a7	bpo-40334: Support CO_FUTURE_BARRY_AS_BDFL in the new parser (GH-19721) This commit also allows to pass flags to the new parser in all interfaces and fixes a bug in the parser generator that was causing to inline rules with actions, making them disappear.	2020-04-27 18:02:07 +01:00
Pablo Galindo	c5fc156852	bpo-40334: PEP 617 implementation: New PEG parser for CPython (GH-19503) Co-authored-by: Guido van Rossum <guido@python.org> Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>	2020-04-22 23:29:27 +01:00
Brandt Bucher	be501ca241	bpo-39702: Relax grammar restrictions on decorators (PEP 614) (GH-18570)	2020-03-03 14:25:44 -08:00
Pablo Galindo	8565f6b6db	bpo-35814: Allow unpacking in r.h.s of annotated assignment expressions (GH-13760)	2019-06-03 08:34:20 +01:00
Pablo Galindo	8c77b8cb91	bpo-36540: PEP 570 -- Implementation (GH-12701) This commit contains the implementation of PEP570: Python positional-only parameters. * Update Grammar/Grammar with new typedarglist and varargslist * Regenerate grammar files * Update and regenerate AST related files * Update code object * Update marshal.c * Update compiler and symtable * Regenerate importlib files * Update callable objects * Implement positional-only args logic in ceval.c * Regenerate frozen data * Update standard library to account for positional-only args * Add test file for positional-only args * Update other test files to account for positional-only args * Add News entry * Update inspect module and related tests	2019-04-29 13:36:57 +01:00
Guido van Rossum	495da29225	bpo-35975: Support parsing earlier minor versions of Python 3 (GH-12086) This adds a `feature_version` flag to `ast.parse()` (documented) and `compile()` (hidden) that allow tweaking the parser to support older versions of the grammar. In particular if `feature_version` is 5 or 6, the hacks for the `async` and `await` keyword from PEP 492 are reinstated. (For 7 or higher, these are unconditionally treated as keywords, but they are still special tokens rather than `NAME` tokens that the parser driver recognizes.) https://bugs.python.org/issue35975	2019-03-07 12:38:08 -08:00
Xtreak	d4fceaafb8	bpo-35877: Make parenthesis optional for named expression in while statement (GH-11724) * Add parenthesis optional in named expressions for while statement * Add NEWS entry	2019-02-01 14:40:16 -07:00
Guido van Rossum	dcfcd146f8	bpo-35766: Merge typed_ast back into CPython (GH-11645)	2019-01-31 12:40:27 +01:00
Ivan Levkivskyi	62c35a8a8f	bpo-35814: Allow same r.h.s. in annotated assignments as in normal ones (GH-11667)	2019-01-25 01:39:19 +00:00
Emily Morehouse	8f59ee01be	bpo-35224: PEP 572 Implementation (#10497 ) * Add tokenization of := - Add token to Include/token.h. Add token to documentation in Doc/library/token.rst. - Run `./python Lib/token.py` to regenerate Lib/token.py. - Update Parser/tokenizer.c: add case to handle `:=`. * Add initial usage of := in grammar. * Update Python.asdl to match the grammar updates. Regenerated Include/Python-ast.h and Python/Python-ast.c * Update AST and compiler files in Python/ast.c and Python/compile.c. Basic functionality, this isn't scoped properly * Regenerate Lib/symbol.py using `./python Lib/symbol.py` * Tests - Fix failing tests in test_parser.py due to changes in token numbers for internal representation * Tests - Add simple test for := token * Tests - Add simple tests for named expressions using expr and suite * Tests - Update number of levels for nested expressions to prevent stack overflow * Update symbol table to handle NamedExpr * Update Grammar to allow assignment expressions in if statements. Regenerate Python/graminit.c accordingly using `make regen-grammar` * Tests - Add additional tests for named expressions in RoundtripLegalSyntaxTestCase, based on examples and information directly from PEP 572 Note: failing tests are currently commented out (4 out of 24 tests currently fail) * Tests - Add temporary syntax test failure tests in test_parser.py Note: There is an outstanding TODO for this -- syntax tests need to be moved to a different file (presumably test_syntax.py), but this is covering what needs to be tested at the moment, and it's more convenient to run a single test for the time being * Add support for allowing assignment expressions as function argument annotations. Uncomment tests for these cases because they all pass now! * Tests - Move existing syntax tests out of test_parser.py and into test_named_expressions.py. Refactor syntax tests to use unittest * Add TargetScopeError exception to extend SyntaxError Note: This simply creates the TargetScopeError exception, it is not yet used anywhere * Tests - Update tests per PEP 572 Continue refactoring test suite: The named expression test suite now checks for any invalid cases that throw exceptions (no longer limited to SyntaxErrors), assignment tests to ensure that variables are properly assigned, and scope tests to ensure that variable availability and values are correct Note: - There are still tests that are marked to skip, as they are not yet implemented - There are approximately 300 lines of the PEP that have not yet been addressed, though these may be deferred * Documentation - Small updates to XXX/todo comments - Remove XXX from child description in ast.c - Add comment with number of previously supported nested expressions for 3.7.X in test_parser.py * Fix assert in seq_for_testlist() * Cleanup - Denote "Not implemented -- No keyword args" on failing test case. Fix PEP8 error for blank lines at beginning of test classes in test_parser.py * Tests - Wrap all file opens in `with...as` to ensure files are closed * WIP: handle f(a := 1) * Tests and Cleanup - No longer skips keyword arg test. Keyword arg test now uses a simpler test case and does not rely on an external file. Remove print statements from ast.c * Tests - Refactor last remaining test case that relied on on external file to use a simpler test case without the dependency * Tests - Add better description of remaning skipped tests. Add test checking scope when using assignment expression in a function argument * Tests - Add test for nested comprehension, testing value and scope. Fix variable name in skipped comprehension scope test * Handle restriction of LHS for named expressions - can only assign to LHS of type NAME. Specifically, restrict assignment to tuples This adds an alternative set_context specifically for named expressions, set_namedexpr_context. Thus, context is now set differently for standard assignment versus assignment for named expressions in order to handle restrictions. * Tests - Update negative test case for assigning to lambda to match new error message. Add negative test case for assigning to tuple * Tests - Reorder test cases to group invalid syntax cases and named assignment target errors * Tests - Update test case for named expression in function argument - check that result and variable are set correctly * Todo - Add todo for TargetScopeError based on Guido's comment (`2b3acd37bd (r30472562)`) * Tests - Add named expression tests for assignment operator in function arguments Note: One of two tests are skipped, as function arguments are currently treating an assignment expression inside of parenthesis as one child, which does not properly catch the named expression, nor does it count arguments properly * Add NamedStore to expr_context. Regenerate related code with `make regen-ast` * Add usage of NamedStore to ast_for_named_expr in ast.c. Update occurances of checking for Store to also handle NamedStore where appropriate * Add ste_comprehension to _symtable_entry to track if the namespace is a comprehension. Initialize ste_comprehension to 0. Set set_comprehension to 1 in symtable_handle_comprehension * s/symtable_add_def/symtable_add_def_helper. Add symtable_add_def to handle grabbing st->st_cur and passing it to symtable_add_def_helper. This now allows us to call the original code from symtable_add_def by instead calling symtable_add_def_helper with a different ste. * Refactor symtable_record_directive to take lineno and col_offset as arguments instead of stmt_ty. This allows symtable_record_directive to be used for stmt_ty and expr_ty * Handle elevating scope for named expressions in comprehensions. * Handle error for usage of named expression inside a class block * Tests - No longer skip scope tests. Add additional scope tests * Cleanup - Update error message for named expression within a comprehension within a class. Update comments. Add assert for symtable_extend_namedexpr_scope to validate that we always find at least a ModuleScope if we don't find a Class or FunctionScope * Cleanup - Add missing case for NamedStore in expr_context_name. Remove unused var in set_namedexpr_content * Refactor - Consolidate set_context and set_namedexpr_context to reduce duplicated code. Special cases for named expressions are handled by checking if ctx is NamedStore * Cleanup - Add additional use cases for ast_for_namedexpr in usage comment. Fix multiple blank lines in test_named_expressions * Tests - Remove unnecessary test case. Renumber test case function names * Remove TargetScopeError for now. Will add back if needed * Cleanup - Small comment nit for consistency * Handle positional argument check with named expression * Add TargetScopeError exception definition. Add documentation for TargetScopeError in c-api docs. Throw TargetScopeError instead of SyntaxError when using a named expression in a comprehension within a class scope * Increase stack size for parser by 200. This is a minimal change (approx. 5kb) and should not have an impact on any systems. Update parser test to allow 99 nested levels again * Add TargetScopeError to exception_hierarchy.txt for test_baseexception.py_ * Tests - Major update for named expression tests, both in test_named_expressions and test_parser - Add test for TargetScopeError - Add tests for named expressions in comprehension scope and edge cases - Add tests for named expressions in function arguments (declarations and call sites) - Reorganize tests to group them more logically * Cleanup - Remove unnecessary comment * Cleanup - Comment nitpicks * Explicitly disallow assignment expressions to a name inside parentheses, e.g.: ((x) := 0) - Add check for LHS types to detect a parenthesis then a name (see note) - Add test for this scenario - Update tests for changed error message for named assignment to a tuple (also, see note) Note: This caused issues with the previous error handling for named assignment to a LHS that contained an expression, such as a tuple. Thus, the check for the LHS of a named expression must be changed to be more specific if we wish to maintain the previous error messages * Cleanup - Wrap lines more strictly in test file * Revert "Explicitly disallow assignment expressions to a name inside parentheses, e.g.: ((x) := 0)" This reverts commit f1531400ca7d7a2d148830c8ac703f041740896d. * Add NEWS.d entry * Tests - Fix error in test_pickle.test_exceptions by adding TargetScopeError to list of exceptions * Tests - Update error message tests to reflect improved messaging convention (s/can't/cannot) * Remove cases that cannot be reached in compile.c. Small linting update. * Update Grammar/Tokens to add COLONEQUAL. Regenerate all files * Update TargetScopeError PRE_INIT and POST_INIT, as this was purposefully left out when fixing rebase conflicts * Add NamedStore back and regenerate files * Pass along line number and end col info for named expression * Simplify News entry * Fix compiler warning and explicity mark fallthrough	2019-01-24 16:49:56 -07:00
Serhiy Storchaka	8ac658114d	bpo-30455: Generate all token related code and docs from Grammar/Tokens. (GH-10370) "Include/token.h", "Lib/token.py" (containing now some data moved from "Lib/tokenize.py") and new files "Parser/token.c" (containing the code moved from "Parser/tokenizer.c") and "Doc/library/token-list.inc" (included in "Doc/library/token.rst") are now generated from "Grammar/Tokens" by "Tools/scripts/generate_token.py". The script overwrites files only if needed and can be used on the read-only sources tree. "Lib/symbol.py" is now generated by "Tools/scripts/generate_symbol_py.py" instead of been executable itself. Added new make targets "regen-token" and "regen-symbol" which are now dependencies of "regen-all". The documentation contains now strings for operators and punctuation tokens.	2018-12-22 11:18:40 +02:00
David Cuthbert	fd97d1f1af	bpo-32117: Allow tuple unpacking in return and yield statements (gh-4509) Iterable unpacking is now allowed without parentheses in yield and return statements, e.g. ``yield 1, 2, 3, *rest``. Thanks to David Cuthbert for the change and jChapman for added tests.	2018-09-21 18:31:15 -07:00
Jelle Zijlstra	ac317700ce	bpo-30406: Make async and await proper keywords (#1669 ) Per PEP 492, 'async' and 'await' should become proper keywords in 3.7.	2017-10-05 23:24:46 -04:00
Lisa Hewus Fresh	384899dfae	bpo-30737: Update DevGuide links to new URL (GH-3228) Update old devguide links from https://docs.python.org/devguide to https://devguide.python.org	2017-08-30 09:37:43 -07:00

1 2 3 4

175 Commits