cpython

Commit Graph

Author	SHA1	Message	Date
Fredrik Lundh	29c4ba9ada	SRE 0.9.8: passes the entire test suite -- reverted REPEAT operator to use "repeat context" strategy (from 0.8.X), but done right this time. -- got rid of backtracking stack; use nested SRE_MATCH calls instead (should probably put it back again in 0.9.9 ;-) -- properly reset state in scanner mode -- don't use aggressive inlining by default	2000-08-01 18:20:07 +00:00
Fredrik Lundh	8a3ebf8ca8	-- SRE 0.9.6 sync. this includes: + added "regs" attribute + fixed "pos" and "endpos" attributes + reset "lastindex" and "lastgroup" in scanner methods + removed (?P#id) syntax; the "lastindex" and "lastgroup" attributes are now always set + removed string module dependencies in sre_parse + better debugging support in sre_parse + various tweaks to build under 1.5.2	2000-07-23 21:46:17 +00:00
Fredrik Lundh	72b82ba16d	- fixed grouping error bug - changed "group" operator to "groupref"	2000-07-03 21:31:48 +00:00
Fredrik Lundh	6f01398236	- added lookbehind support (?<=pattern), (?<!pattern). the pattern must have a fixed width. - got rid of array-module dependencies; the match program is now stored inside the pattern object, rather than in an extra string buffer. - cleaned up a various of potential leaks, api abuses, and other minors in the engine module. - use mal's new isalnum macro, rather than my own workaround. - untabified test_sre.py. seems like I removed a couple of trailing spaces in the process...	2000-07-03 18:44:21 +00:00
Fredrik Lundh	c2301730b8	- experimental: added two new attributes to the match object: "lastgroup" is the name of the last matched capturing group, "lastindex" is the index of the same group. if no group was matched, both attributes are set to None. the (?P#) feature will be removed in the next release.	2000-07-02 22:25:39 +00:00
Fredrik Lundh	7cafe4d7e4	- actually enabled charset anchors in the engine (still not used by the code generator) - changed max repeat value in engine (to match earlier array fix) - added experimental "which part matched?" mechanism to sre; see http://hem.passagen.se/eff/2000_07_01_bot-archive.htm#416954 or python-dev for details.	2000-07-02 17:33:27 +00:00
Fredrik Lundh	3562f11764	-- use charset bitmaps where appropriate. this gives a 5-10% speedup for some tests, including the python tokenizer. -- added support for an optional charset anchor to the engine (currently unused by the code generator). -- removed workaround for array module bug.	2000-07-02 12:00:07 +00:00
Fredrik Lundh	c13222cdff	- fixed "{ in any other context" bug - minor comment touchups in the C module	2000-07-01 23:49:14 +00:00
Fredrik Lundh	22d2546520	today's SRE update: -- changed 1.6 to 2.0 in the file headers -- fixed ISALNUM macro for the unicode locale. this solution isn't perfect, but the best I can do with Python's current unicode database.	2000-07-01 17:50:59 +00:00
Fredrik Lundh	55a4f4a528	- fixed code generation error in multiline mode - fixed parser flag propagation (of all stupid bugs...)	2000-06-30 22:37:31 +00:00
Fredrik Lundh	4ccea94152	- reverted to "\x is binary byte" - removed evil tabs from sre_parse and sre_compile	2000-06-30 18:39:20 +00:00
Fredrik Lundh	0640e1161f	the mad patcher strikes again: -- added pickling support (only works if sre is imported) -- fixed wordsize problems in engine (instead of casting literals down to the character size, cast characters up to the literal size (same as the code word size). this prevents false hits when you're matching a unicode pattern against an 8-bit string. (unfortunately, this broke another test, but I think the test should be changed in this case; more on that on python-dev) -- added sre.purge function (unofficial, clears the cache)	2000-06-30 13:55:15 +00:00
Fredrik Lundh	43b3b49b5a	- fixed lookahead assertions (#10 , #11 , #12 ) - untabified sre_constants.py	2000-06-30 10:41:31 +00:00
Fredrik Lundh	b71624e698	- added support for (?P=name) (closes #3 and #7 from the status report)	2000-06-30 09:13:06 +00:00
Fredrik Lundh	90a0791322	- pedantic: make sure "python -t" doesn't complain...	2000-06-30 07:50:59 +00:00
Fredrik Lundh	01016fe972	- fixed split behaviour on empty matches - fixed compiler problems when using locale/unicode flags - fixed group/octal code parsing in sub/subn templates	2000-06-30 00:27:46 +00:00
Fredrik Lundh	8094611eb8	- fixed another split problem (those semantics are weird...) - got rid of $Id$'s (for the moment, at least). in other words, there should be no more "empty" checkins. - internal: some minor cleanups.	2000-06-29 18:03:25 +00:00
Fredrik Lundh	4781b07201	- make sure group names are valid identifiers (closes the "SRE: symbolic reference" bug)	2000-06-29 12:38:45 +00:00
Fredrik Lundh	75f2d675ed	- last patch broke parse_template; fixed by changing some tests in sre_patch back to previous version - fixed return value from findall - renamed a bunch of functions inside _sre (way too many leading underscores...) </F>	2000-06-29 11:34:28 +00:00
Fredrik Lundh	6c68dc7b1a	- removed "alpha only" licensing restriction - removed some hacks that worked around 1.6 alpha bugs - removed bogus test code from sre_parse	2000-06-29 10:34:56 +00:00
Fredrik Lundh	436c3d58a2	towards 1.6b1	2000-06-29 08:58:44 +00:00
Andrew M. Kuchling	815d5b934b	Patch from /F: this patch brings the CVS version of SRE in sync with the latest public snapshot.""	2000-06-09 14:08:07 +00:00
Guido van Rossum	b81e70ebdb	Fredrik Lundh: new snapshot. Mostly reindented. This one should work with unicode expressions, and compile a bit more silently.	2000-04-10 17:10:48 +00:00
Andrew M. Kuchling	e3ba931aa4	This patch looks large, but it just deletes the ^M characters and untabifies the files. No actual code changes were made.	2000-04-02 05:22:30 +00:00
Guido van Rossum	7627c0de69	Added Fredrik Lundh's sre module and its supporting cast. NOTE: THIS IS VERY ROUGH ALPHA CODE!	2000-03-31 14:58:54 +00:00
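
Several of the commits above describe user-visible regex features: fixed-width lookbehind, the "lastgroup" and "lastindex" match attributes, (?P=name) backreferences, and the "regs" attribute. The sketch below shows how these behave in the `re` module of a current Python 3, which descends from SRE. It is an illustration only; the sample patterns and variable names are made up for this example, and the 2000-era snapshots listed here may have behaved differently in detail.

```python
import re

# Fixed-width lookbehind: match "def" only when it is preceded by "abc".
positive = re.search(r"(?<=abc)def", "abcdef")
# Negative lookbehind: the only "def" is preceded by "abc", so nothing matches.
negative = re.search(r"(?<!abc)def", "abcdef")

# The lookbehind pattern must have a fixed width; "a+" is rejected at compile time.
try:
    re.compile(r"(?<=a+)b")
    variable_width_ok = True
except re.error:
    variable_width_ok = False

# "Which part matched?": lastgroup is the name and lastindex the index
# of the last matched capturing group.
token = re.compile(r"(?P<word>[a-z]+)|(?P<number>[0-9]+)")
m = token.match("42")
which = (m.lastgroup, m.lastindex)

# With no capturing group matched, both attributes are None.
plain = re.match(r"abc", "abc")

# (?P=name) refers back to the text matched by the named group.
quoted = re.match(r"(?P<quote>['\"]).*?(?P=quote)", "'hi' there")

# "regs" holds the (start, end) span of the whole match and of each group.
spans = re.match(r"(a)(b)", "ab").regs
```

Alternating named groups and reading `lastgroup` is a common way to build a small tokenizer, since it reports which alternative matched without testing each group in turn.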

125 Commits