cpython

Commit Graph

Author	SHA1	Message	Date
Fred Drake	bd3090d4d6	Added test suite for the new HTMLParser module, originally from the TAL/PageTemplate package for Zope. This only needed a little boilerplate change; the tests themselves are unchanged.	2001-05-18 15:32:59 +00:00
Guido van Rossum	8846d7178b	A much improved HTML parser -- a replacement for sgmllib. The API is derived from but not quite compatible with that of sgmllib, so it's a new file. I suppose it needs documentation, and htmllib needs to be changed to use this instead of sgmllib, and sgmllib needs to be declared obsolete. But that can all be done later. This code was first published as part of TAL (part of Zope Page Templates), but that was strongly based on sgmllib anyway. Authors are Fred drake and Guido van Rossum.	2001-05-18 14:50:52 +00:00
Tim Peters	1928314ef4	Speed dictresize by collapsing its two passes into one; the reason given in the comments for using two passes was bogus, as the only object that can get decref'ed due to the copy is the dummy key, and decref'ing dummy can't have side effects (for one thing, dummy is immortal! for another, it's a string object, not a potentially dangerous user-defined object).	2001-05-17 22:25:34 +00:00
Jack Jansen	b686791b77	Added pymactoolboxglue.c and changed the exported symbols having to do with this.	2001-05-17 22:14:36 +00:00
Jack Jansen	e8f53bb779	Dynamically loaded toolbox modules don't need to link against each other anymore, due to the new glue code that ties them together.	2001-05-17 22:12:55 +00:00
Jack Jansen	f57a4a2e97	Glue code to connect obj_New and obj_Convert routines (the PyArg_Parse and Py_BuildTuple helpers) from one dynamically imported module to another.	2001-05-17 22:11:44 +00:00
Jack Jansen	0e04eecdbf	First step in porting MacPython modules to OSX/unix: break all references between modules except for the obj_New() and obj_Convert() routines, the PyArg_Parse and Py_BuildValue helpers. And these can now be vectored through glue routines (by defining USE_TOOLBOX_OBJECT_GLUE) which will do the necessary imports, whereupon the module's init routine will tell the glue routine about the real conversion routine address and everything is fine again.	2001-05-17 21:58:34 +00:00
Guido van Rossum	99f9baa331	Fixed botched indent in _init_mac() code. (It may never be executed, but it still can't have any syntax errors. Went a little too fast there, Jack? :-)	2001-05-17 15:03:14 +00:00
Jack Jansen	dd13a20490	Made distutils understand the MacPython Carbon runtime model. Distutils will build for the runtime model you are currently using for the interpreter.	2001-05-17 12:52:01 +00:00
Jack Jansen	6f84ed5afc	Fixed macroman<->latin1 conversion. Some chars don't exist in latin1, but at least the roundtrip results in the same macroman characters.	2001-05-17 12:45:13 +00:00
Jack Jansen	dfebe90c4c	Fixed macroman<->latin1 conversion. Some characters don't exist in latin1, but at least the roundtrip gives the correct macroman characters again.	2001-05-17 12:35:13 +00:00
Marc-André Lemburg	716cf91839	Moved the encoding map building logic from the individual mapping codec files to codecs.py and added logic so that multi mappings in the decoding maps now result in mappings to None (undefined mapping) in the encoding maps.	2001-05-16 09:41:45 +00:00
Jack Jansen	5a4718e1ed	Bah, somehow the macroman<->iso-latin-1 translation got lost during the merge. Checking in one fixed file to make sure MacCVS Pro isn't the problem. If it isn't a flurry of checkins will follow tomorrow. If it is... well...	2001-05-15 20:22:08 +00:00
Tim Peters	d7ed3bf552	Speed tuple comparisons in two ways: 1. Omit the early-out EQ/NE "lengths different?" test. Was unable to find any real code where it triggered, but it always costs. The same is not true of list richcmps, where different-size lists appeared to get compared about half the time. 2. Because tuples are immutable, there's no need to refetch the lengths of both tuples from memory again on each loop trip. BUG ALERT: The tuple (and list) richcmp algorithm is arguably wrong, because it won't believe there's any difference unless Py_EQ returns false for some corresponding elements: >>> class C: ... def __lt__(x, y): return 1 ... __eq__ = __lt__ ... >>> C() < C() 1 >>> (C(),) < (C(),) 0 >>> That doesn't make sense -- provided you believe the defn. of C makes sense.	2001-05-15 20:12:59 +00:00
Marc-André Lemburg	fab96cc2ff	Add NEWS item for new string methods.	2001-05-15 18:38:45 +00:00
Tim Peters	30324a7363	Just changed "x,y" to "x, y" everywhere (i.e., inserted horizontal space after commas that didn't have any).	2001-05-15 17:19:16 +00:00
Guido van Rossum	acfdf156aa	Add quoted-printable codec	2001-05-15 15:34:07 +00:00
Fred Drake	c0dac1a58c	Beef up the unicode() description a bit, based on material from AMK's "What's New in Python ..." documents.	2001-05-15 15:27:53 +00:00
Fred Drake	da05e977f3	abspath(): Fix inconsistent indentation.	2001-05-15 15:23:01 +00:00
Marc-André Lemburg	2d9204199f	This patch changes the way the string .encode() method works slightly and introduces a new method .decode(). The major change is that strg.encode() will no longer try to convert Unicode returns from the codec into a string, but instead pass along the Unicode object as-is. The same is now true for all other codec return types. The underlying C APIs were changed accordingly. Note that even though this does have the potential of breaking existing code, the chances are low since conversion from Unicode previously took place using the default encoding which is normally set to ASCII rendering this auto-conversion mechanism useless for most Unicode encodings. The good news is that you can now use .encode() and .decode() with much greater ease and that the door was opened for better accessibility of the builtin codecs. As demonstration of the new feature, the patch includes a few new codecs which allow string to string encoding and decoding (rot13, hex, zip, uu, base64). Written by Marc-Andre Lemburg. Copyright assigned to the PSF.	2001-05-15 12:00:02 +00:00
Guido van Rossum	2e0a654f6e	Add warnings to the strop module, for to those functions that really are obsolete; three variables and the maketrans() function are not (yet) obsolete. Add a compensating warnings.filterwarnings() call to test_strop.py. Add this to the NEWS.	2001-05-15 02:14:44 +00:00
Guido van Rossum	9cba64318e	Ignore 'build' and 'Makefile.pre'.	2001-05-15 01:53:40 +00:00
Tim Peters	d401eddf91	Fix new compiler warnings. Also boost "start" from (C) int to long and return a (C) long: PyArg_ParseTuple and Py_BuildValue may not let us get at the size_t we really want, but C int is clearly too small for a 64-bit box, and both the start parameter and the return value should work for large mapped files even on 32-bit boxes. The code really needs to be rethought from scratch (not by me, though ...).	2001-05-14 23:19:12 +00:00
Tim Peters	58e0a8c130	SF patch #418147 Fixes to allow compiling w/ Borland, from Stephen Hansen.	2001-05-14 22:32:33 +00:00
Fred Drake	460f0691df	fcntl.ioctl(): Update error message; necessity noted by Michael Hudson.	2001-05-14 21:02:36 +00:00
Fred Drake	992d387540	Convert a couple of comments to docstrings -- PyUnit can use these when the regression test is run in verbose mode.	2001-05-14 19:15:23 +00:00
Tim Peters	95b3f78622	pprint's workhorse _safe_repr() function took time quadratic in the # of elements when crunching a list, dict or tuple. Now takes linear time instead -- huge speedup for even moderately large containers, and the code is notably simpler too. Added some basic "is the output correct?" tests to test_pprint.	2001-05-14 18:39:41 +00:00
Fred Drake	43913dd27c	Convert the pprint test to use PyUnit.	2001-05-14 17:41:20 +00:00
Fred Drake	b3384d3555	Make sure we include all of Python's numeric types in the data model description, so that the introduction of complex is not a surprise. This closes SF bug #423429.	2001-05-14 16:04:22 +00:00
Jack Jansen	6db483e679	Added a WITHOUT_FRAMEWORKS define to all the config files, so that on MacOS<=9 compiles use Universal Headers, not Carbon/Carbon.h.	2001-05-14 15:00:38 +00:00
Guido van Rossum	1bd797a257	Fix a typo, consistently spell ASCII in all caps, and insert blank lines between paragraphs in Mark Hammond's news item about the default encoding in posixmodule. Resist the temptation to reflow paragraphs.	2001-05-14 13:53:38 +00:00
Mark Hammond	26cffde4c2	Fix the Py_FileSystemDefaultEncoding checkin - declare the variable in a fileobject.h, and initialize it in bltinmodule.	2001-05-14 12:17:34 +00:00
Greg Stein	834f4dd7c9	Fix the .find() method for memory maps. 1) it didn't obey the "start" parameter (and when it does, we must validate the value) 2) the return value needs to be an absolute index, rather than relative to some arbitrary point in the file (checking CVS, it appears this method never worked; these changes bring it into line with typical .find() behavior)	2001-05-14 09:32:26 +00:00
Tim Peters	a814db579d	SF bug[ #423781 : pprint.isrecursive() broken.	2001-05-14 07:05:58 +00:00
Mark Hammond	2a0af79269	Add mention of the default file system encoding for Windows.	2001-05-14 03:09:36 +00:00
Tim Peters	a0599575aa	A disgusting "fix" for the test___all__ failure under Windows.	2001-05-13 09:01:06 +00:00
Mark Hammond	ef8b654bbe	Add support for Windows using "mbcs" as the default Unicode encoding when dealing with the file system. As discussed on python-dev and in patch 410465.	2001-05-13 08:04:26 +00:00
Tim Peters	342c65e19a	Aggressive reordering of dict comparisons. In case of collision, it stands to reason that me_key is much more likely to match the key we're looking for than to match dummy, and if the key is absent me_key is much more likely to be NULL than dummy: most dicts don't even have a dummy entry. Running instrumented dict code over the test suite and some apps confirmed that matching dummy was 200-300x less frequent than matching key in practice. So this reorders the tests to try the common case first. It can lose if a large dict with many collisions is mostly deleted, not resized, and then frequently searched, but that's hardly a case we should be favoring.	2001-05-13 06:43:53 +00:00
Tim Peters	2f228e75e4	Get rid of the superstitious "~" in dict hashing's "i = (~hash) & mask". The comment following used to say: /* We use ~hash instead of hash, as degenerate hash functions, such as for ints <sigh>, can have lots of leading zeros. It's not really a performance risk, but better safe than sorry. 12-Dec-00 tim: so ~hash produces lots of leading ones instead -- what's the gain? / That is, there was never a good reason for doing it. And to the contrary, as explained on Python-Dev last December, it tended to make the sum* (i + incr) & mask (which is the first table index examined in case of collison) the same "too often" across distinct hashes. Changing to the simpler "i = hash & mask" reduced the number of string-dict collisions (== # number of times we go around the lookup for-loop) from about 6 million to 5 million during a full run of the test suite (these are approximate because the test suite does some random stuff from run to run). The number of collisions in non-string dicts also decreased, but not as dramatically. Note that this may, for a given dict, change the order (wrt previous releases) of entries exposed by .keys(), .values() and .items(). A number of std tests suffered bogus failures as a result. For dicts keyed by small ints, or (less so) by characters, the order is much more likely to be in increasing order of key now; e.g., >>> d = {} >>> for i in range(10): ... d[i] = i ... >>> d {0: 0, 1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 6, 7: 7, 8: 8, 9: 9} >>> Unfortunately. people may latch on to that in small examples and draw a bogus conclusion. test_support.py Moved test_extcall's sortdict() into test_support, made it stronger, and imported sortdict into other std tests that needed it. test_unicode.py Excluced cp875 from the "roundtrip over range(128)" test, because cp875 doesn't have a well-defined inverse for unicode("?", "cp875"). See Python-Dev for excruciating details. Cookie.py Chaged various output functions to sort dicts before building strings from them. test_extcall Fiddled the expected-result file. This remains sensitive to native dict ordering, because, e.g., if there are multiple errors in a keyword-arg dict (and test_extcall sets up many cases like that), the specific error Python complains about first depends on native dict ordering.	2001-05-13 00:19:31 +00:00
Jack Jansen	0194ad5c7d	Got the first MacPython module working under MacOSX/MachO (gestalt). Main changes are including Carbon/Carbon.h in stead of the old headers (unless WITHOUT_FRAMEWORKS is defined, as it will be for classic MacPython) and selectively disabling all the stuff that is unneeded in a unix-Python (event handling, etc).	2001-05-12 22:46:35 +00:00
Jack Jansen	6e68a7e74f	Be more sensible about when to use TARGET_API_MAC_OS8 in stead of !TARGET_API_MAC_CARBON. This should greatly facilitate porting stuff to OSX in its MachO/BSD incarnation.	2001-05-12 21:31:34 +00:00
Jack Jansen	1bd0a712ad	Added iterobject.c to the project. And trying my first checkin at the same time.	2001-05-12 21:09:45 +00:00
Tim Peters	16cabc0a3d	Repair "module has no attribute xxx" error msg; bug introduced when switching from tp_getattr to tp_getattro.	2001-05-12 20:24:22 +00:00
Guido van Rossum	e9d7f0779d	Refactored, with some future plans in mind. This now uses the new gotofileline() method defined in FileList.py.	2001-05-12 12:30:04 +00:00
Guido van Rossum	6cb7a21934	Move the action of loading the configuration to the IdleConf module rather than the idle.py script. This has advantages and disadvantages; the biggest advantage being that we can more easily have an alternative main program.	2001-05-12 12:18:10 +00:00
Guido van Rossum	8451ebb435	Delete goodname() method, which is unused. Add gotofileline(), a convenience method which I intend to use in a variant. Rename test() to _test().	2001-05-12 12:11:36 +00:00
Tim Peters	d85e102337	Variant of patch #423262 : Change module attribute get & set Allow module getattr and setattr to exploit string interning, via the previously null module object tp_getattro and tp_setattro slots. Yields a very nice speedup for things like random.random and os.path etc.	2001-05-11 21:51:48 +00:00
Fred Drake	564a6cc8ca	Fix a minor style consistency issue. When getting a string buffer for a string we just created, use PyString_AS_STRING() instead of PyString_AsString() to avoid the call overhead and extra type check.	2001-05-11 20:12:26 +00:00
Fred Drake	7e473800c3	Fix one bare except: clause.	2001-05-11 19:52:57 +00:00
Fred Drake	ef4cdad090	[].index() raises ValueError if the value is not in the list, so only catch that instead of using a bare except clause.	2001-05-11 19:52:03 +00:00

1 2 3 4 5 ...

17760 Commits All Branches Search

17760 Commits

All Branches