cpython

Commit Graph

Author	SHA1	Message	Date
Brett Cannon	8ffe7bbb72	Remove unused variables and a variable initialization. Found using Clang's static analyzer.	2010-05-03 23:51:28 +00:00
Benjamin Peterson	e266d3e804	ready _sre types	2010-04-06 03:34:09 +00:00
Antoine Pitrou	efdddd3370	Issue #3299 : Fix possible crash in the _sre module when given bad argument values in debug mode. Patch by Victor Stinner.	2010-01-14 17:25:24 +00:00
Mark Dickinson	fe67bd9168	Issue #6561 : '\d' regular expression should not match characters of category [No]; only those of category [Nd]. (Backport of r74237 from py3k.)	2009-07-28 20:35:03 +00:00
Guido van Rossum	e3c4fd9cc0	- Issue #3629 : Fix sre "bytecode" validator for an end case. Reviewed by Amaury.	2008-09-10 14:27:00 +00:00
Guido van Rossum	8b762f05c7	Tracker issue 3487: sre "bytecode" verifier. This is a verifier for the binary code used by the _sre module (this is often called bytecode, though to distinguish it from Python bytecode I put it in quotes). I wrote this for Google App Engine, and am making the patch available as open source under the Apache 2 license. Below are the copyright statement and license, for completeness. # Copyright 2008 Google Inc. # # Licensed under the Apache License, Version 2.0 (the "License"); # you may not use this file except in compliance with the License. # You may obtain a copy of the License at # # http://www.apache.org/licenses/LICENSE-2.0 # # Unless required by applicable law or agreed to in writing, software # distributed under the License is distributed on an "AS IS" BASIS, # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. # See the License for the specific language governing permissions and # limitations under the License. It's not necessary to include these copyrights and bytecode in the source file. Google has signed a contributor's agreement with the PSF already.	2008-08-05 03:39:21 +00:00
Gregory P. Smith	dd96db63f6	This reverts r63675 based on the discussion in this thread: http://mail.python.org/pipermail/python-dev/2008-June/079988.html Python 2.6 should stick with PyString_* in its codebase. The PyBytes_* names in the spirit of 3.0 are available via a #define only. See the email thread.	2008-06-09 04:58:54 +00:00
Christian Heimes	593daf545b	Renamed PyString to PyBytes	2008-05-26 12:51:38 +00:00
Christian Heimes	4956d2b889	Silence Coverity false alerts with CIDs #172 , #183 , #184	2008-01-18 19:12:56 +00:00
Facundo Batista	4473d225a8	Issue 846388. Adds a call to PyErr_CheckSignals to SRE_MATCH so that signal handlers can be invoked during long regular expression matches. It also adds a new error return value indicating that an exception occurred in a signal handler during the match, allowing exceptions in the signal handler to propagate up to the main loop. Thanks Josh Hoyt and Ralf Schmitt.	2008-01-08 21:10:12 +00:00
Christian Heimes	e93237dfcc	#1629 : Renamed Py_Size, Py_Type and Py_Refcnt to Py_SIZE, Py_TYPE and Py_REFCNT. Macros for b/w compatibility are available.	2007-12-19 02:37:44 +00:00
Guido van Rossum	1ff91d95a2	Patch # 1140 (my code, approved by Effbot). Make sure the type of the return value of re.sub(x, y, z) is the type of y+x (i.e. unicode if either is unicode, str if they are both str) even if there are no substitutions or if x==z (which triggered various special cases in join_list()). Could be backported to 2.5; no need to port to 3.0.	2007-09-10 22:02:25 +00:00
Martin v. Löwis	6819210b9e	PEP 3123: Provide forward compatibility with Python 3.0, while keeping backwards compatibility. Add Py_Refcnt, Py_Type, Py_Size, and PyVarObject_HEAD_INIT.	2007-07-21 06:55:02 +00:00
Andrew M. Kuchling	36126c424a	Cause a PyObject_Malloc() failure to trigger a MemoryError, and then add 'if (PyErr_Occurred())' checks to various places so that NULL is returned properly. 2.4 backport candidate.	2006-10-04 13:42:43 +00:00
Neal Norwitz	ef0de023db	Try to handle a malloc failure. I'm not entirely sure this is correct. There might be something else we need to do to handle the exception. Klocwork # 212-213	2006-08-12 01:53:28 +00:00
Neal Norwitz	a6d80faf6c	Impl ssize_t	2006-06-12 03:05:40 +00:00
Georg Brandl	96a8c3954c	Make use of METH_O and METH_NOARGS where possible. Use Py_UnpackTuple instead of PyArg_ParseTuple where possible.	2006-05-29 21:04:52 +00:00
Georg Brandl	964f5978dc	METH_NOARGS functions do get called with two args.	2006-05-28 22:38:57 +00:00
Georg Brandl	fbef5888e7	Fix C function calling conventions in _sre module.	2006-05-28 22:14:04 +00:00
Jack Diederich	2d40077b4f	needforspeed: use PyObject_MALLOC instead of system malloc for small allocations. Use PyMem_MALLOC for larger (1k+) chunks. 1%-2% speedup.	2006-05-27 15:44:34 +00:00
Skip Montanaro	816a162265	C++ compiler cleanup: proper casts	2006-04-18 11:53:09 +00:00
Anthony Baxter	aefd8ca701	Move constructors, add some casts to make C++ compiler happy. Still a problem with the getstring() results in pattern_subx. Will come back to that.	2006-04-12 04:26:11 +00:00
Neal Norwitz	94a9c09e10	Rename sre.py -> re.py	2006-03-16 06:30:02 +00:00
Neal Norwitz	60da31660c	Thanks to Coverity, these were all reported by their Prevent tool. All of these (except _lsprof.c) should be backported. Particularly the hotshot change which validates sys.path. Can someone backport?	2006-03-07 04:48:24 +00:00
Martin v. Löwis	15e62742fa	Revert backwards-incompatible const changes.	2006-02-27 16:46:16 +00:00
Tim Peters	3d56350910	_compile(): raise an exception if downcasting to SRE_CODE loses information: OverflowError: regular expression code size limit exceeded Otherwise the compiled code is gibberish, possibly leading at least to wrong results or (as reported on c.l.py) internal sre errors at match time. I'm not sure how to test this. SRE_CODE is a 2-byte type on my box, and it's easy to create a regexp that causes the new exception to trigger here. But it may be a 4-byte type on other boxes, and creating a regexp large enough to trigger problems there would be pretty crazy. Bugfix candidate.	2006-01-21 02:47:53 +00:00
Neal Norwitz	1ac754fa10	Check return result from Py_InitModule*(). This API can fail. Probably should be backported.	2006-01-19 06:09:39 +00:00
Jeremy Hylton	af68c874a6	Add const to several API functions that take char . In C++, it's an error to pass a string literal to a char function without a const_cast(). Rather than require every C++ extension module to put a cast around string literals, fix the API to state the const-ness. I focused on parts of the API where people usually pass literals: PyArg_ParseTuple() and friends, Py_BuildValue(), PyMethodDef, the type slots, etc. Predictably, there were a large set of functions that needed to be fixed as a result of these changes. The most pervasive change was to make the keyword args list passed to PyArg_ParseTupleAndKewords() to be a const char kwlist[]. One cast was required as a result of the changes: A type object mallocs the memory for its tp_doc slot and later frees it. PyTypeObject says that tp_doc is const char ; but if the type was created by type_new(), we know it is safe to cast to char *.	2005-12-10 18:50:16 +00:00
Gustavo Niemeyer	166878f544	Fixing bug #1072259 in SRE.	2004-12-02 16:15:39 +00:00
Raymond Hettinger	9447874131	Add docstrings for regular expression objects and methods.	2004-09-24 04:31:19 +00:00
Gustavo Niemeyer	0506c64086	Fixing bug #817234 , which made SRE get into an infinite loop on empty final matches with finditer(). New test cases included for this bug and for #581080.	2004-09-03 18:11:59 +00:00
Nicholas Bastin	9ba301e589	Moved SunPro warning suppression into pyport.h and out of individual modules and objects.	2004-07-15 15:54:05 +00:00
Nicholas Bastin	1ce9e4cfc1	Fixed end-of-loop code not reached warning when using SunPro C	2004-06-17 18:27:18 +00:00
Raymond Hettinger	027bb633b6	Add weakref support to sockets and re pattern objects.	2004-05-31 03:09:25 +00:00
Gustavo Niemeyer	601b963be0	- Fixing annoying warnings.	2004-02-14 00:31:13 +00:00
Gustavo Niemeyer	2cbdc2a461	Cleaning up recursive pieces left in the reorganization.	2003-12-13 20:32:08 +00:00
Gustavo Niemeyer	0f0c06a5c2	Removing dead code.	2003-10-18 20:54:44 +00:00
Gustavo Niemeyer	ad3fc44ccb	Implemented non-recursive SRE matching.	2003-10-17 22:13:16 +00:00
Raymond Hettinger	8ae4689657	Simplify and speedup uses of Py_BuildValue(): * Py_BuildValue("(OOO)",a,b,c) --> PyTuple_Pack(3,a,b,c) * Py_BuildValue("()",a) --> PyTuple_New(0) * Py_BuildValue("O", a) --> Py_INCREF(a)	2003-10-12 19:09:37 +00:00
Gustavo Niemeyer	28b5bb33ea	Fixing bug described in patch #756032 , where SRE reads invalid data due to a corrupted end pointer.	2003-06-26 14:41:08 +00:00
Andrew MacIntyre	1a44448b24	Changes to sre.c after the application of patch #726869 have increased stack usage on FreeBSD, requiring the recursion limit to be lowered further. Building with gcc 2.95 (the standard compiler on FreeBSD 4.x) is now also affected. The underlying issue is that FreeBSD's pthreads implementation has a hard-coded 1MB stack size for the initial (or "primary") thread, which can not be changed without rebuilding libc_r. Exhausting this stack results in a bus error. Building without pthreads (configure --without-threads), or linking with the port of the Linux pthreads library (aka Linuxthreads) instead of libc_r, avoids this limitation. On OS/2, only gcc 3.2 is affected and the stack size is controllable, so the special handling has been removed.	2003-06-09 08:22:11 +00:00
Andrew M. Kuchling	c24fe36c57	Allow _sre.c to compile with Python 2.2	2003-04-30 13:09:08 +00:00
Gustavo Niemeyer	caf1c9dfe7	- Included detailed documentation in _sre.c explaining how, when, and why to use LASTMARK_SAVE()/LASTMARK_RESTORE(), based on the discussion in patch #712900. - Cleaned up LASTMARK_SAVE()/LASTMARK_RESTORE() usage, based on the established rules. - Moved the upper part of the just commited patch (relative to bug #725106) to outside the for() loop of BRANCH OP. There's no need to mark_save() in every loop iteration.	2003-04-27 14:42:54 +00:00
Gustavo Niemeyer	3646ab98af	Fix for part of the problem mentioned in #725149 by Greg Chapman. This problem is related to a wrong behavior from mark_save/restore(), which don't restore the mark_stack_base before restoring the marks. Greg's suggestion was to change the asserts, which happen to be the only recursive ops that can continue the loop, but the problem would happen to any operation with the same behavior. So, rather than hardcoding this into asserts, I have changed mark_save/restore() to always restore the stackbase before restoring the marks. Both solutions should fix these two cases, presented by Greg: >>> re.match('(a)(?:(?=(b))c)', 'abb').groups() ('b', None) >>> re.match('(a)((?!(b)))', 'abb').groups() ('b', None, None) The rest of the bug and patch in #725149 must be discussed further.	2003-04-27 13:25:21 +00:00
Gustavo Niemeyer	c34f2555bd	Applied patch #725106 , by Greg Chapman, fixing capturing groups within repeats of alternatives. The only change to the original patch was to convert the tests to the new test_re.py file. This patch fixes cases like: >>> re.match('((a)\|b)', 'abc').groups() ('b', '') Which is wrong (it's impossible to match the empty string), and incompatible with other regex systems, like the following examples show: % perl -e '"abc" =~ /^((a)\|b)/; print "$1 $2\n";' b a % echo "abc" \| sed -r -e "s/^((a)\|b)*/\1 \2\|/" b a\|c	2003-04-27 12:34:14 +00:00
Gustavo Niemeyer	c23fb77477	Applying patch #726869 by Andrew I MacIntyre, reducing in _sre.c the recursion limit for certain setups of FreeBSD and OS/2.	2003-04-27 06:58:54 +00:00
Gustavo Niemeyer	3c9068bbec	Made MAX_UNTIL/MIN_UNTIL code more coherent about mark protection, accordingly to further discussions with Greg Chapman in patch #712900.	2003-04-22 15:39:09 +00:00
Gustavo Niemeyer	be733ee7fb	More work on bug #672491 and patch #712900 . I've applied a modified version of Greg Chapman's patch. I've included the fixes without introducing the reorganization mentioned, for the sake of stability. Also, the second fix mentioned in the patch don't fix the mentioned problem anymore, because of the change introduced by patch #720991 (by Greg as well). The new fix wasn't complicated though, and is included as well. As a note. It seems that there are other places that require the "protection" of LASTMARK_SAVE()/LASTMARK_RESTORE(), and are just waiting for someone to find how to break them. Particularly, I belive that every recursion of SRE_MATCH() should be protected by these macros. I won't do that right now since I'm not completely sure about this, and we don't have much time for testing until the next release.	2003-04-20 07:35:44 +00:00
Gustavo Niemeyer	1aca359e89	- Fixed bug #672491 . This change restores the behavior of lastindex/lastgroup to be compliant with previous python versions, by backing out the changes made in revision 2.84 which affected this. The bugfix for backtracking is still maintained.	2003-04-20 00:45:13 +00:00
Martin v. Löwis	78e2f06cc6	Fully support 32-bit codes. Enable BIGCHARSET in UCS-4 builds.	2003-04-19 12:56:08 +00:00

1 2 3

137 Commits