cpython

Commit Graph

Author	SHA1	Message	Date
Pablo Galindo Salgado	1ef61cf71a	gh-102856: Initial implementation of PEP 701 (#102855 ) Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com> Co-authored-by: Batuhan Taskaya <isidentical@gmail.com> Co-authored-by: Marta Gómez Macías <mgmacias@google.com> Co-authored-by: sunmy2019 <59365878+sunmy2019@users.noreply.github.com>	2023-04-19 11:18:16 -05:00
Pablo Galindo Salgado	e13d1d9dda	gh-99581: Fix a buffer overflow in the tokenizer when copying lines that fill the available buffer (#99605 )	2022-11-20 20:20:03 +00:00
Michael Droettboom	23e83a8465	gh-94808: Coverage: Test that maximum indentation level is handled (#95926 ) * gh-94808: Coverage: Test that maximum indentation level is handled * Use "compile" rather than "exec"	2022-10-06 10:39:17 -07:00
Serhiy Storchaka	6927632492	Remove trailing spaces (GH-31695)	2022-03-05 17:47:00 +02:00
Pablo Galindo Salgado	a0efc0c196	bpo-46091: Correctly calculate indentation levels for whitespace lines with continuation characters (GH-30130)	2022-01-25 22:12:14 +00:00
Serhiy Storchaka	a5a56154f1	Remove trailing spaces. (GH-28706)	2021-10-03 16:58:14 +03:00
Pablo Galindo Salgado	a24676bedc	Add tests for the C tokenizer and expose it as a private module (GH-27924)	2021-08-24 17:50:05 +01:00
Pablo Galindo Salgado	b6bde9fc42	bpo-44667: Treat correctly lines ending with comments and no newlines in the Python tokenizer (GH-27499)	2021-07-31 02:17:09 +01:00
Hai Shi	4660597b51	bpo-40275: Use new test.support helper submodules in tests (GH-21448)	2020-08-03 18:49:18 +02:00
Serhiy Storchaka	9355868458	bpo-41043: Escape literal part of the path for glob(). (GH-20994)	2020-06-20 11:10:31 +03:00
Emily Morehouse	8f59ee01be	bpo-35224: PEP 572 Implementation (#10497 ) * Add tokenization of := - Add token to Include/token.h. Add token to documentation in Doc/library/token.rst. - Run `./python Lib/token.py` to regenerate Lib/token.py. - Update Parser/tokenizer.c: add case to handle `:=`. * Add initial usage of := in grammar. * Update Python.asdl to match the grammar updates. Regenerated Include/Python-ast.h and Python/Python-ast.c * Update AST and compiler files in Python/ast.c and Python/compile.c. Basic functionality, this isn't scoped properly * Regenerate Lib/symbol.py using `./python Lib/symbol.py` * Tests - Fix failing tests in test_parser.py due to changes in token numbers for internal representation * Tests - Add simple test for := token * Tests - Add simple tests for named expressions using expr and suite * Tests - Update number of levels for nested expressions to prevent stack overflow * Update symbol table to handle NamedExpr * Update Grammar to allow assignment expressions in if statements. Regenerate Python/graminit.c accordingly using `make regen-grammar` * Tests - Add additional tests for named expressions in RoundtripLegalSyntaxTestCase, based on examples and information directly from PEP 572 Note: failing tests are currently commented out (4 out of 24 tests currently fail) * Tests - Add temporary syntax test failure tests in test_parser.py Note: There is an outstanding TODO for this -- syntax tests need to be moved to a different file (presumably test_syntax.py), but this is covering what needs to be tested at the moment, and it's more convenient to run a single test for the time being * Add support for allowing assignment expressions as function argument annotations. Uncomment tests for these cases because they all pass now! * Tests - Move existing syntax tests out of test_parser.py and into test_named_expressions.py. Refactor syntax tests to use unittest * Add TargetScopeError exception to extend SyntaxError Note: This simply creates the TargetScopeError exception, it is not yet used anywhere * Tests - Update tests per PEP 572 Continue refactoring test suite: The named expression test suite now checks for any invalid cases that throw exceptions (no longer limited to SyntaxErrors), assignment tests to ensure that variables are properly assigned, and scope tests to ensure that variable availability and values are correct Note: - There are still tests that are marked to skip, as they are not yet implemented - There are approximately 300 lines of the PEP that have not yet been addressed, though these may be deferred * Documentation - Small updates to XXX/todo comments - Remove XXX from child description in ast.c - Add comment with number of previously supported nested expressions for 3.7.X in test_parser.py * Fix assert in seq_for_testlist() * Cleanup - Denote "Not implemented -- No keyword args" on failing test case. Fix PEP8 error for blank lines at beginning of test classes in test_parser.py * Tests - Wrap all file opens in `with...as` to ensure files are closed * WIP: handle f(a := 1) * Tests and Cleanup - No longer skips keyword arg test. Keyword arg test now uses a simpler test case and does not rely on an external file. Remove print statements from ast.c * Tests - Refactor last remaining test case that relied on on external file to use a simpler test case without the dependency * Tests - Add better description of remaning skipped tests. Add test checking scope when using assignment expression in a function argument * Tests - Add test for nested comprehension, testing value and scope. Fix variable name in skipped comprehension scope test * Handle restriction of LHS for named expressions - can only assign to LHS of type NAME. Specifically, restrict assignment to tuples This adds an alternative set_context specifically for named expressions, set_namedexpr_context. Thus, context is now set differently for standard assignment versus assignment for named expressions in order to handle restrictions. * Tests - Update negative test case for assigning to lambda to match new error message. Add negative test case for assigning to tuple * Tests - Reorder test cases to group invalid syntax cases and named assignment target errors * Tests - Update test case for named expression in function argument - check that result and variable are set correctly * Todo - Add todo for TargetScopeError based on Guido's comment (`2b3acd37bd (r30472562)`) * Tests - Add named expression tests for assignment operator in function arguments Note: One of two tests are skipped, as function arguments are currently treating an assignment expression inside of parenthesis as one child, which does not properly catch the named expression, nor does it count arguments properly * Add NamedStore to expr_context. Regenerate related code with `make regen-ast` * Add usage of NamedStore to ast_for_named_expr in ast.c. Update occurances of checking for Store to also handle NamedStore where appropriate * Add ste_comprehension to _symtable_entry to track if the namespace is a comprehension. Initialize ste_comprehension to 0. Set set_comprehension to 1 in symtable_handle_comprehension * s/symtable_add_def/symtable_add_def_helper. Add symtable_add_def to handle grabbing st->st_cur and passing it to symtable_add_def_helper. This now allows us to call the original code from symtable_add_def by instead calling symtable_add_def_helper with a different ste. * Refactor symtable_record_directive to take lineno and col_offset as arguments instead of stmt_ty. This allows symtable_record_directive to be used for stmt_ty and expr_ty * Handle elevating scope for named expressions in comprehensions. * Handle error for usage of named expression inside a class block * Tests - No longer skip scope tests. Add additional scope tests * Cleanup - Update error message for named expression within a comprehension within a class. Update comments. Add assert for symtable_extend_namedexpr_scope to validate that we always find at least a ModuleScope if we don't find a Class or FunctionScope * Cleanup - Add missing case for NamedStore in expr_context_name. Remove unused var in set_namedexpr_content * Refactor - Consolidate set_context and set_namedexpr_context to reduce duplicated code. Special cases for named expressions are handled by checking if ctx is NamedStore * Cleanup - Add additional use cases for ast_for_namedexpr in usage comment. Fix multiple blank lines in test_named_expressions * Tests - Remove unnecessary test case. Renumber test case function names * Remove TargetScopeError for now. Will add back if needed * Cleanup - Small comment nit for consistency * Handle positional argument check with named expression * Add TargetScopeError exception definition. Add documentation for TargetScopeError in c-api docs. Throw TargetScopeError instead of SyntaxError when using a named expression in a comprehension within a class scope * Increase stack size for parser by 200. This is a minimal change (approx. 5kb) and should not have an impact on any systems. Update parser test to allow 99 nested levels again * Add TargetScopeError to exception_hierarchy.txt for test_baseexception.py_ * Tests - Major update for named expression tests, both in test_named_expressions and test_parser - Add test for TargetScopeError - Add tests for named expressions in comprehension scope and edge cases - Add tests for named expressions in function arguments (declarations and call sites) - Reorganize tests to group them more logically * Cleanup - Remove unnecessary comment * Cleanup - Comment nitpicks * Explicitly disallow assignment expressions to a name inside parentheses, e.g.: ((x) := 0) - Add check for LHS types to detect a parenthesis then a name (see note) - Add test for this scenario - Update tests for changed error message for named assignment to a tuple (also, see note) Note: This caused issues with the previous error handling for named assignment to a LHS that contained an expression, such as a tuple. Thus, the check for the LHS of a named expression must be changed to be more specific if we wish to maintain the previous error messages * Cleanup - Wrap lines more strictly in test file * Revert "Explicitly disallow assignment expressions to a name inside parentheses, e.g.: ((x) := 0)" This reverts commit f1531400ca7d7a2d148830c8ac703f041740896d. * Add NEWS.d entry * Tests - Fix error in test_pickle.test_exceptions by adding TargetScopeError to list of exceptions * Tests - Update error message tests to reflect improved messaging convention (s/can't/cannot) * Remove cases that cannot be reached in compile.c. Small linting update. * Update Grammar/Tokens to add COLONEQUAL. Regenerate all files * Update TargetScopeError PRE_INIT and POST_INIT, as this was purposefully left out when fixing rebase conflicts * Add NamedStore back and regenerate files * Pass along line number and end col info for named expression * Simplify News entry * Fix compiler warning and explicity mark fallthrough	2019-01-24 16:49:56 -07:00
Serhiy Storchaka	8ac658114d	bpo-30455: Generate all token related code and docs from Grammar/Tokens. (GH-10370) "Include/token.h", "Lib/token.py" (containing now some data moved from "Lib/tokenize.py") and new files "Parser/token.c" (containing the code moved from "Parser/tokenizer.c") and "Doc/library/token-list.inc" (included in "Doc/library/token.rst") are now generated from "Grammar/Tokens" by "Tools/scripts/generate_token.py". The script overwrites files only if needed and can be used on the read-only sources tree. "Lib/symbol.py" is now generated by "Tools/scripts/generate_symbol_py.py" instead of been executable itself. Added new make targets "regen-token" and "regen-symbol" which are now dependencies of "regen-all". The documentation contains now strings for operators and punctuation tokens.	2018-12-22 11:18:40 +02:00
Sergey Fedoseev	b796e7dcdc	Fixed several assertTrue() that were intended to be assertEqual(). (GH-8191) Fixed also testing the "always" warning filter.	2018-07-09 18:25:55 +03:00
Ammar Askar	c4ef4896ea	bpo-33899: Make tokenize module mirror end-of-file is end-of-line behavior (GH-7891) Most of the change involves fixing up the test suite, which previously made the assumption that there wouldn't be a new line if the input didn't end in one. Contributed by Ammar Askar.	2018-07-06 10:19:08 +03:00
Thomas Kluyver	c56b17bd8c	bpo-12486: Document tokenize.generate_tokens() as public API (#6957 ) * Document tokenize.generate_tokens() * Add news file * Add test for generate_tokens * Document behaviour around ENCODING token * Add generate_tokens to __all__	2018-06-05 10:26:39 -07:00
Jelle Zijlstra	ac317700ce	bpo-30406: Make async and await proper keywords (#1669 ) Per PEP 492, 'async' and 'await' should become proper keywords in 3.7.	2017-10-05 23:24:46 -04:00
Stéphane Wirtel	90addd6d1c	bpo-31029: test_tokenize Add missing import unittest (#2865 )	2017-07-25 16:33:53 +03:00
Albert-Jan Nijburg	fc354f0785	bpo-25324: copy tok_name before changing it (#1608 ) * add test to check if were modifying token * copy list so import tokenize doesnt have side effects on token * shorten line * add tokenize tokens to token.h to get them to show up in token * move ERRORTOKEN back to its previous location, and fix nitpick * copy comments from token.h automatically * fix whitespace and make more pythonic * change to fix comments from @haypo * update token.rst and Misc/NEWS * change wording * some more wording changes	2017-05-31 16:00:21 +02:00
Albert-Jan Nijburg	c471ca448c	bpo-30377: Simplify handling of COMMENT and NL in tokenize.py (#1607 )	2017-05-24 14:31:57 +03:00
Jim Fasarakis-Hilliard	d4914e9041	Add ELLIPSIS and RARROW. Add tests (#666 )	2017-03-14 21:16:15 +01:00
Brett Cannon	a721abac29	Issue #26331 : Implement the parsing part of PEP 515. Thanks to Georg Brandl for the patch.	2016-09-09 14:57:09 -07:00
Zachary Ware	724f6a67f2	Rename test_pep####.py files	2016-09-09 12:55:37 -07:00
Zachary Ware	a0154c0f0e	Fix running test_tokenize directly	2016-09-09 12:55:14 -07:00
Eric V. Smith	1c8222c80a	Issue 25311: Add support for f-strings to tokenize.py. Also added some comments to explain what's happening, since it's not so obvious.	2015-10-26 04:37:55 -04:00
Eric V. Smith	6731774216	Issue 25422: Add tests for multi-line string tokenization. Also remove truncated tokens.	2015-10-16 20:45:53 -04:00
Serhiy Storchaka	6f5175de15	Issue #25317 : Converted doctests in test_tokenize to unittests. Made test_tokenize discoverable.	2015-10-06 18:23:12 +03:00
Serhiy Storchaka	5f6fa82617	Issue #25317 : Converted doctests in test_tokenize to unittests. Made test_tokenize discoverable.	2015-10-06 18:16:28 +03:00
Yury Selivanov	96ec934e75	Issue #24619 : Simplify async/await tokenization. This commit simplifies async/await tokenization in tokenizer.c, tokenize.py & lib2to3/tokenize.py. Previous solution was to keep a stack of async-def & def blocks, whereas the new approach is just to remember position of the outermost async-def block. This change won't bring any parsing performance improvements, but it makes the code much easier to read and validate.	2015-07-23 15:01:58 +03:00
Yury Selivanov	8fb307cd65	Issue #24619 : New approach for tokenizing async/await. This commit fixes how one-line async-defs and defs are tracked by tokenizer. It allows to correctly parse invalid code such as: >>> async def f(): ... def g(): pass ... async = 10 and valid code such as: >>> async def f(): ... async def g(): pass ... await z As a consequence, is is now possible to have one-line 'async def foo(): await ..' functions: >>> async def foo(): return await bar()	2015-07-22 13:33:45 +03:00
Jason R. Coombs	a95a476b3a	Issue #20387 : Merge test and patch from 3.4.4	2015-06-28 11:13:30 -04:00
Jason R. Coombs	b6d1cdda8e	Issue #20387 : Correct test to properly capture expectation.	2015-06-25 22:42:24 -04:00
Jason R. Coombs	5713b3c5bf	Issue #20387 : Add test capturing failure to roundtrip indented code in tokenize module.	2015-06-20 19:52:22 -04:00
Jason R. Coombs	7cf36387e4	Remove unused import and remove doctest-only import into doctests.	2015-06-20 19:13:50 -04:00
Victor Stinner	24d262af0b	(Merge 3.5) Issue #23840 : tokenize.open() now closes the temporary binary file on error to fix a resource warning.	2015-05-26 00:46:44 +02:00
Victor Stinner	387729e183	Issue #23840 : tokenize.open() now closes the temporary binary file on error to fix a resource warning.	2015-05-26 00:43:58 +02:00
Yury Selivanov	8085b80c18	Issue 24226: Fix parsing of many sequential one-line 'def' statements.	2015-05-18 12:50:52 -04:00
Yury Selivanov	7544508f02	PEP 0492 -- Coroutines with async and await syntax. Issue #24017 .	2015-05-11 22:57:16 -04:00
Serhiy Storchaka	ee4c0b9dcf	Issue #23681 : Fixed Python 2 to 3 poring bugs. Indexing bytes retiurns an integer, not bytes.	2015-03-20 16:48:02 +02:00
Serhiy Storchaka	74a49ac3f5	Issue #23681 : Fixed Python 2 to 3 poring bugs. Indexing bytes retiurns an integer, not bytes.	2015-03-20 16:46:19 +02:00
Benjamin Peterson	d51374ed78	PEP 465: a dedicated infix operator for matrix multiplication (closes #21176 )	2014-04-09 23:55:56 -04:00
Terry Jan Reedy	9dc3a36c84	Issue #9974 : When untokenizing, use row info to insert backslash+newline. Original patches by A. Kuchling and G. Rees (#12691).	2014-02-23 23:33:08 -05:00
Terry Jan Reedy	938ba685dc	Issue #20750 , Enable roundtrip tests for new 5-tuple untokenize. The constructed examples and all but 7 of the test/test_*.py files (run with -ucpu) pass. Remove those that fail the new test from the selection list. Patch partly based on patches by G. Brandl (#8478) and G. Rees (#12691).	2014-02-23 18:00:31 -05:00
Terry Jan Reedy	5b8d2c3af7	Issue #8478 : Untokenizer.compat now processes first token from iterator input. Patch based on lines from Georg Brandl, Eric Snow, and Gareth Rees.	2014-02-17 23:12:16 -05:00
Terry Jan Reedy	58edfd9ff1	whitespace	2014-02-17 16:49:06 -05:00
Terry Jan Reedy	5e6db31368	Untokenize: An logically incorrect assert tested user input validity. Replace it with correct logic that raises ValueError for bad input. Issues #8478 and #12691 reported the incorrect logic. Add an Untokenize test case and an initial test method.	2014-02-17 16:45:48 -05:00
Serhiy Storchaka	768c16ce02	Issue #18960 : Fix bugs with Python source code encoding in the second line. * The first line of Python script could be executed twice when the source encoding (not equal to 'utf-8') was specified on the second line. * Now the source encoding declaration on the second line isn't effective if the first line contains anything except a comment. * As a consequence, 'python -x' works now again with files with the source encoding declarations specified on the second file, and can be used again to make Python batch files on Windows. * The tokenize module now ignore the source encoding declaration on the second line if the first line contains anything except a comment. * IDLE now ignores the source encoding declaration on the second line if the first line contains anything except a comment. * 2to3 and the findnocoding.py script now ignore the source encoding declaration on the second line if the first line contains anything except a comment.	2014-01-09 18:36:09 +02:00
Serhiy Storchaka	dafea85190	Issue #18873 : The tokenize module, IDLE, 2to3, and the findnocoding.py script now detect Python source code encoding only in comment lines.	2013-09-16 23:51:56 +03:00
Ezio Melotti	fafa8b7797	#16152 : merge with 3.2.	2012-11-03 17:46:51 +02:00
Ezio Melotti	2cc3b4ba9f	#16152 : fix tokenize to ignore whitespace at the end of the code when no newline is found. Patch by Ned Batchelder.	2012-11-03 17:38:43 +02:00
Florent Xicluna	fed2c51eea	Merge branch	2012-07-07 12:26:56 +02:00

1 2 3

107 Commits