cpython

Commit Graph

Author	SHA1	Message	Date
Serhiy Storchaka	882cb79afa	gh-56166: Deprecate passing confusing positional arguments in re functions (#107778 ) Deprecate passing optional arguments maxsplit, count and flags in module-level functions re.split(), re.sub() and re.subn() as positional. They should only be passed by keyword.	2023-08-16 13:35:35 -07:00
wulmer	0af247da09	gh-102111: Add link to string escape sequences in re module (#106995 ) Co-authored-by: Alex Waygood <Alex.Waygood@Gmail.com>	2023-07-23 02:50:38 -06:00
wulmer	149748ea4f	Fix Sphinx warnings in `re` module docs (#107044 )	2023-07-22 16:44:44 +01:00
Skip Montanaro	bcadcde712	gh-102259: Fix re doc issue regarding right square brackets (#102264 ) Co-authored-by: Terry Jan Reedy <tjreedy@udel.edu>	2023-02-25 21:22:16 -05:00
Ilya Kulakov	dbc1e696eb	gh-99308: Clarify re docs for byte pattern group names (#99311 )	2022-12-25 12:25:27 +05:30
Stanley	36a0b1d0dd	gh-69929: re docs: Add more specific definition of \w (#92015 ) Co-authored-by: Jelle Zijlstra <jelle.zijlstra@gmail.com>	2022-12-19 19:07:31 -08:00
Stanley	286e3c76a9	gh-99087: Add missing newline for prompts in docs (GH-98993) Add newline for prompts so copying to REPL does not cause errors.	2022-12-08 19:31:19 -08:00
ram vikram singh	e0f91deb59	GH-98906 ```re``` module: ```search() vs. match()``` section should mention ```fullmatch()``` (GH-98916) Mention fullmatch along with search and match.	2022-11-30 17:52:21 -05:00
Victor Stinner	a60ddd31be	gh-98401: Invalid escape sequences emits SyntaxWarning (#99011 ) A backslash-character pair that is not a valid escape sequence now generates a SyntaxWarning, instead of DeprecationWarning. For example, re.compile("\d+\.\d+") now emits a SyntaxWarning ("\d" is an invalid escape sequence), use raw strings for regular expression: re.compile(r"\d+\.\d+"). In a future Python version, SyntaxError will eventually be raised, instead of SyntaxWarning. Octal escapes with value larger than 0o377 (ex: "\477"), deprecated in Python 3.11, now produce a SyntaxWarning, instead of DeprecationWarning. In a future Python version they will be eventually a SyntaxError. codecs.escape_decode() and codecs.unicode_escape_decode() are left unchanged: they still emit DeprecationWarning. * The parser only emits SyntaxWarning for Python 3.12 (feature version), and still emits DeprecationWarning on older Python versions. * Fix SyntaxWarning by using raw strings in Tools/c-analyzer/ and wasm_build.py.	2022-11-03 17:53:25 +01:00
Serhiy Storchaka	e9ac890c02	gh-98740: Fix validation of conditional expressions in RE (GH-98764) In very rare circumstances the JUMP opcode could be confused with the argument of the opcode in the "then" part which doesn't end with the JUMP opcode. This led to incorrect detection of the final JUMP opcode and incorrect calculation of the size of the subexpression. NOTE: Changed return value of functions _validate_inner() and _validate_charset() in Modules/_sre/sre.c. Now they return 0 on success, -1 on failure, and 1 if the last op is JUMP (which usually is a failure). Previously they returned 1 on success and 0 on failure.	2022-11-03 09:23:46 +02:00
Athos Ribeiro	0ceafa7fa4	Add re.VERBOSE flag documentation example (#97678 ) The current re.VERBOSE documentation example leaves space for ambiguous interpretation. One may read that spaces within the `(?:` token are spaces inside the non-capturing group (such as `(?: )`). This patch removes the ambiguity by including examples after the statement.	2022-10-04 17:39:42 -07:00
Baptiste Mispelon	642d1fa81f	gh-92727: Add example of named group in doc for re.Match.__getitem__ (#92730 )	2022-05-28 13:11:08 -05:00
Stanley	b7a6610bc8	gh-73137: Added sub-subsection headers for flags in re (#93000 ) Fixes #73137	2022-05-22 18:52:17 -07:00
谭九鼎	bd30461298	re docs: fix source code link (#92819 )	2022-05-16 17:04:17 -07:00
Serhiy Storchaka	a84a56d80f	gh-91760: More strict rules for numerical group references and group names in RE (GH-91792) Only sequence of ASCII digits is now accepted as a numerical reference. The group name in bytes patterns and replacement strings can now only contain ASCII letters and digits and underscore.	2022-05-08 19:19:29 +03:00
Serhiy Storchaka	19dca04121	gh-91760: Deprecate group names and numbers which will be invalid in future (GH-91794) Only sequence of ASCII digits will be accepted as a numerical reference. The group name in bytes patterns and replacement strings could only contain ASCII letters and digits and underscore.	2022-04-30 13:13:46 +03:00
谭九鼎	faa12088c1	chore/docs: fix rst style and typo (GH-32331) Current: ![图片](https://user-images.githubusercontent.com/24759802/161704413-30fc91e8-ccd1-4617-8483-bc54ec970f30.png) After this change: ![图片](https://user-images.githubusercontent.com/24759802/161704636-a5458192-a93a-40af-8bde-90ba80fdb53f.png) Trivial so I don't think it needs news or issue Automerge-Triggered-By: GH:JulienPalard	2022-04-05 02:08:00 -07:00
Serhiy Storchaka	c6cd3cc93c	bpo-47081: Replace "qualifiers" with "quantifiers" in the re module documentation (GH-32028) It is a more commonly used term.	2022-03-22 11:44:47 +02:00
Serhiy Storchaka	345b390ed6	bpo-433030: Add support of atomic grouping in regular expressions (GH-31982) * Atomic grouping: (?>...). * Possessive quantifiers: x++, x+, x?+, x{m,n}+. Equivalent to (?>x+), (?>x), (?>x?), (?>x{m,n}). Co-authored-by: Jeffrey C. Jacobs <timehorse@users.sourceforge.net>	2022-03-21 18:28:22 +02:00
Serhiy Storchaka	92a6abf72e	bpo-47066: Convert a warning about flags not at the start of the regular expression into error (GH-31994)	2022-03-19 16:10:44 +02:00
andrei kulakov	fea7290a0e	bpo-31369: include ``RegexFlag`` in ``re.__all__`` (GH-30279) * added RegexFlag to re.__all__; added RegexFlag.NOFLAG Co-authored-by: Jelle Zijlstra <jelle.zijlstra@gmail.com>	2022-02-04 19:54:28 -08:00
Rim Chatti	dbd62e74da	Fix the "Finding all Adverbs" example (GH-21420) Co-authored-by: Serhiy Storchaka <storchaka@gmail.com>	2021-10-09 21:46:56 +03:00
Serhiy Storchaka	64f9e7b19d	bpo-44940: Clarify the documentation of re.findall() (GH-27849) Co-authored-by: Ken Jin <28750310+Fidget-Spinner@users.noreply.github.com> Co-authored-by: Vedran Čačić <vedgar+github@gmail.com>	2021-08-22 10:24:20 +03:00
Noah Kantrowitz	be42c06bb0	Update URLs in comments and metadata to use HTTPS (GH-27458)	2021-07-30 15:54:46 +02:00
Raymond Hettinger	bf1a81258c	Minor modernization and readability improvement to the tokenizer example (GH-19558)	2020-04-16 19:54:13 -07:00
Ricardo Bánffy	15ae75d660	bpo-38294: Add list of no-longer-escaped chars to re.escape documentation. (GH-16442) Prior to 3.7, re.escape escaped many characters that don't have special meaning in Python, but that use to require escaping in other tools and languages. This commit aims to make it clear which characters were, but are no longer escaped.	2019-10-07 23:54:35 +03:00
Julien Palard	1fae844451	Doc: Fix missing negation. (GH-14640) Reported by Hug Capella on docs@. Automerge-Triggered-By: @matrixise	2019-09-11 08:55:22 -07:00
Robert DiPietro	fb6c1f8d3b	Fix typo in re.escape documentation (GH-14722)	2019-07-13 16:35:04 +08:00
mollison	5ebfa840a1	bpo-36645: Fix ambiguous formatting in re.sub() documentation (GH-12879)	2019-04-22 01:14:45 +03:00
Serhiy Storchaka	a180b007d9	bpo-28450: Fix and improve the documentation for unknown escapes in RE. (GH-11920)	2019-02-25 17:58:30 +02:00
animalize	4a7f44a2ed	bpo-34294: re module, fix wrong capturing groups in rare cases. (GH-11546) Need to reset capturing groups between two SRE(match) callings in loops, this fixes wrong capturing groups in rare cases. Also add a missing index in re.rst.	2019-02-18 15:26:37 +02:00
Pablo Galindo	e8239b8e81	Add information about DeprecationWarning for invalid escaped characters in the re module (GH-5255)	2019-01-20 18:57:56 +00:00
Raymond Hettinger	b83942c755	Cleanup and improve the regex tokenizer example. (GH-10426) 1) Convert weird field name "typ" to the more standard "type". 2) For the NUMBER type, convert the value to an int() or float(). 3) Simplify ``group(kind)`` to the shorter and faster ``group()`` call. 4) Simplify logic go a single if-elif chain to make this easier to extend. 5) Reorder the tests to match the order the tokens are specified. This isn't necessary for correctness but does make the example easier to follow. 6) Move the "column" calculation before the if-elif chain so that users have the option of using this value in error messages.	2018-11-09 01:19:33 -08:00
Serhiy Storchaka	913876d824	bpo-35054: Add yet more index entries for symbols. (GH-10121)	2018-10-28 13:41:26 +02:00
Serhiy Storchaka	ddb961d2ab	bpo-35054: Add more index entries for symbols. (GH-10064)	2018-10-26 09:00:49 +03:00
Stéphane Wirtel	859c068e52	bpo-34962: make doctest in Doc/ now passes, and is enforced in CI (GH-9806)	2018-10-12 09:51:05 +02:00
Andrés Delfino	7dfbd49671	Correct grammar mistake in re.rst. (GH-9745)	2018-10-06 22:48:30 +03:00
Andrés Delfino	5092439c2c	bpo-33892: Doc: Use gender neutral words (GH-7770)	2018-06-18 13:34:30 +09:00
Stéphane Wirtel	19177fbd5d	bpo-33503: Fix the broken pypi link in the source and the documentation (GH-6814)	2018-05-15 14:58:35 -04:00
Berker Peksag	a0a42d22d8	Fix a reference to the MRE book in re docs (GH-1113) Reported by Maksym Nikulyak on docs.p.o.	2018-03-23 16:46:52 +03:00
Serhiy Storchaka	a445feb729	bpo-30688: Support \N{name} escapes in re patterns. (GH-5588) Co-authored-by: Jonathan Eunice <jonathan.eunice@gmail.com>	2018-02-10 00:08:17 +02:00
Cheryl Sabella	66771422d0	bpo-32614: Modify re examples to use a raw string to prevent warning (GH-5265) Modify RE examples in documentation to use raw strings to prevent DeprecationWarning. Add text to REGEX HOWTO to highlight the deprecation. Approved by Serhiy Storchaka.	2018-02-02 16:16:27 -05:00
Serhiy Storchaka	fbb490fd2f	bpo-32308: Replace empty matches adjacent to a previous non-empty match in re.sub(). (#4846 )	2018-01-04 11:06:13 +02:00
Serhiy Storchaka	70d56fb525	bpo-25054, bpo-1647489: Added support of splitting on zerowidth patterns. (#4471 ) Also fixed searching patterns that could match an empty string.	2017-12-04 14:29:05 +02:00
Serhiy Storchaka	c615be5166	Use raw strings in the re module examples. (#4616 )	2017-11-28 22:51:38 +02:00
Serhiy Storchaka	05cb728d68	bpo-30349: Raise FutureWarning for nested sets and set operations (#1553 ) in regular expressions.	2017-11-16 12:38:26 +02:00
Serhiy Storchaka	b0b44b4b33	bpo-15606: Improve the re.VERBOSE documentation. (#4366 )	2017-11-14 17:21:26 +02:00
Serhiy Storchaka	3557b05c5a	bpo-31690: Allow the inline flags "a", "L", and "u" to be used as group flags for RE. (#3885 )	2017-10-24 23:31:42 +03:00
Serhiy Storchaka	cd195e2a7a	bpo-31714: Improved regular expression documentation. (#3907 )	2017-10-14 11:14:26 +03:00
Serhiy Storchaka	0b5e61ddca	bpo-30397: Add re.Pattern and re.Match. (#1646 )	2017-10-04 20:09:49 +03:00

1 2 3 4 5

209 Commits