cpython

Commit Graph

Author	SHA1	Message	Date
Andrew M. Kuchling	dc54f2be3f	Make 'bin' argument trigger DeprecationWarning	2004-08-07 16:27:24 +00:00
Walter Dörwald	70a6b49821	Replace backticks with repr() or "%r" From SF patch #852334.	2004-02-12 17:35:32 +00:00
Jeremy Hylton	e3a565eed0	Fix grammar in comment.	2003-06-29 16:59:59 +00:00
Guido van Rossum	443ada47c3	Remove unused _better_reduce (which will disappear soon) and _reconstructor (whose import here is a mystery to me).	2003-02-18 22:49:10 +00:00
Guido van Rossum	c53f009f94	Introducing __reduce_ex__, which is called with a protocol number argument if it exists in preference over __reduce__. Now Tim can go implement this in cPickle.c.	2003-02-18 22:05:12 +00:00
Tim Peters	080c88b912	cPickle.c, load_build(): Taught cPickle how to pick apart the optional proto 2 slot state. pickle.py, load_build(): CAUTION: Noted that cPickle's load_build and pickle's load_build really don't do the same things with the state, and didn't before this patch either. cPickle never tries to do .update(), and has no backoff if instance.__dict__ can't be retrieved. There are no tests that can tell the difference, and part of what cPickle's load_build() did looked accidental to me, so I don't know what the true intent is here. pickletester.py, test_pickle.py: Got rid of the hack for exempting cPickle from running some of the proto 2 tests. dictobject.c, PyDict_Next(): documented intended use.	2003-02-15 03:01:11 +00:00
Tim Peters	8587b3c073	Added a HIGHEST_PROTOCOL module attribute to pickle and cPickle.	2003-02-13 15:44:41 +00:00
Tim Peters	42f08ac1e3	Implemented batching for dicts in cPickle. This is after two failed attempts to merge the C list-batch and dict-batch code -- they worked, but it was a godawful mess to read.	2003-02-11 22:43:24 +00:00
Tim Peters	1092d64002	Implemented list batching in cPickle.	2003-02-11 21:06:20 +00:00
Guido van Rossum	cf117b0b40	Rename 'proto' keyword arg to 'protocol' . Greg Ward's suggestion.	2003-02-09 17:19:41 +00:00
Guido van Rossum	5aac4e6312	Move _better_reduce from copy.py to copy_reg.py, and also use it in pickle.py, where it makes save_newobj() unnecessary. Tests pass.	2003-02-06 22:57:00 +00:00
Guido van Rossum	98b922c0ec	Remove a debug print statement.	2003-02-06 19:30:38 +00:00
Jeremy Hylton	cc1fccbc1c	No need for a continuation line.	2003-02-06 16:23:01 +00:00
Jeremy Hylton	656d4519b5	Replace hasattr() + getattr() with single getattr() and default value.	2003-02-06 16:22:01 +00:00
Guido van Rossum	d4b920c8bd	Rename the extension registry variables to have leading underscores -- this clarifies that they are part of an internal API (albeit shared between pickle.py, copy_reg.py and cPickle.c). I'd like to do the same for copy_reg.dispatch_table, but worry that it might be used by existing code. This risk doesn't exist for the extension registry.	2003-02-04 01:54:49 +00:00
Guido van Rossum	93fe564217	_slotnames(): this is a fairly expensive calculation. Cache the outcome as __slotnames__ on the class. (Like __slots__, it's not safe to ask for this as an attribute -- you must look for it in the specific class's __dict__. But it must be set using attribute notation, because __dict__ is a read-only proxy.)	2003-02-03 19:46:54 +00:00
Guido van Rossum	868ecc22ab	_slotnames(): exclude __dict__ and __weakref__; these aren't real slots even though they can be listed in __slots__.	2003-02-03 18:10:09 +00:00
Guido van Rossum	795ea89cb5	Support keyword argument 'bin', with a pending deprecation warning.	2003-02-03 16:59:48 +00:00
Tim Peters	1d63c9f151	cPickle support for TUPLE[123]. Incidentally plugged several undetected overflow holes in Pdata_grow().	2003-02-02 20:29:39 +00:00
Tim Peters	bf2674be0e	long(string, base) now takes time linear in len(string) when base is a power of 2. Enabled the tail end of test_long() in pickletester.py because it no longer takes forever when run from test_pickle.py.	2003-02-02 07:51:32 +00:00
Tim Peters	ee1a53cbb1	cPickle.c: Full support for the new LONG1 and LONG4. Added comments. Assorted code cleanups; e.g., sizeof(char) is 1 by definition, so there's no need to do things like multiply by sizeof(char) in hairy malloc arguments. Fixed an undetected-overflow bug in readline_file(). longobject.c: Fixed a really stupid bug in the new _PyLong_NumBits. pickle.py: Fixed stupid bug in save_long(): When proto is 2, it wrote LONG1 or LONG4, but forgot to return then -- it went on to append the proto 1 LONG opcode too. Fixed equally stupid cancelling bugs in load_long1() and load_long4(): they returned the unpickled long instead of pushing it on the stack. The return values were ignored. Tests passed before only because save_long() pickled the long twice. Fixed bugs in encode_long(). Noted that decode_long() is quadratic-time despite our hopes, because long(string, 16) is still quadratic-time in len(string). It's hex() that's linear-time. I don't know a way to make decode_long() linear-time in Python, short of maybe transforming the 256's-complement bytes into marshal's funky internal format, and letting marshal decode that. It would be more valuable to make long(string, 16) linear time. pickletester.py: Added a global "protocols" vector so tests can try all the protocols in a sane way. Changed test_ints() and test_unicode() to do so. Added a new test_long(), but the tail end of it is disabled because it "takes forever" under pickle.py (but runs very quickly under cPickle: cPickle proto 2 for longs is linear-time).	2003-02-02 02:57:53 +00:00
Tim Peters	5bd2a79b22	The C pickle now knows how to deal with a proto= argument. Assorted code cleanups, and purged more references to text-vs-binary modes.	2003-02-01 16:45:06 +00:00
Guido van Rossum	7eff63abce	Change the default protocol back to 0. Add a feature suggested by Tim: a negative protocol value means to use the largest protocol value supported.	2003-01-31 19:42:31 +00:00
Guido van Rossum	25cb7dfb0f	Another extension to reduce(). It can return a 4- or 5-tuple now. The 4th item can be None or an iterator yielding list items, which are used to append() or extend() the object. The 5th item can be None or an iterator yielding a dict's (key, value) pairs, which are stuffed into the object using __setitem__. Also (as a separate, though related, feature) add "batching" for list and dict items. If you pickled a dict or list with a million items in the past, it would push a million items onto the stack. It now pushes only 1000 items at a time on the stack, using repeated APPENDS or SETITEMS opcodes. (For lists, I hope that using many short extend() calls doesn't exhibit quadratic behavior.)	2003-01-31 18:53:21 +00:00
Jeremy Hylton	4f0dcc9a9a	Provide __module__ attributes for functions defined in C and Python. __module__ is the string name of the module the function was defined in, just like __module__ of classes. In some cases, particularly for C functions, the __module__ may be None. Change PyCFunction_New() from a function to a macro, but keep an unused copy of the function around so that we don't change the binary API. Change pickle's save_global() to use whichmodule() if __module__ is None, but add the __module__ logic to whichmodule() since it might be used outside of pickle.	2003-01-31 18:33:18 +00:00
Guido van Rossum	f7f4517fae	Pass the object to save_reduce(), so the memoize() call can go into save_reduce(), before the state is pickled. This makes it possible for an object to be referenced from its own (mutable) state.	2003-01-31 17:17:49 +00:00
Guido van Rossum	d053b4b416	Add a magical feature to save_reduce so that __reduce__ can cause NEWOBJ to be generated.	2003-01-31 16:51:45 +00:00
Tim Peters	4b23f2b44b	It's Official: for LONG1/LONG4, a "byte count" of 0 is taken as a shortcut meaning 0L. This allows LONG1 to encode 0L in two bytes total.	2003-01-31 16:43:39 +00:00
Neal Norwitz	d17406830c	Fix typo	2003-01-31 04:04:23 +00:00
Tim Peters	91149821d3	Linear-time implementations of {encode,decode}_long.	2003-01-31 03:43:58 +00:00
Tim Peters	d01c1e91c4	load_inst(), load_obj(): Put the bulk of these into a common new _instantiate() method.	2003-01-30 15:41:46 +00:00
Guido van Rossum	9b40e804c7	There was a subtle big in save_newobj(): it used self.save_global(t) on the type instead of self.save(t). This defeated the purpose of NEWOBJ, because it didn't generate a BINGET opcode when t was already memoized; but moreover, it would generate multiple BINPUT opcodes for the same type! pickletools.dis() doesn't like this. How I found this? I was playing with picklesize.py in the datetime sandbox, and noticed that protocol 2 pickles for multiple objects were in fact larger than protocol 1 pickles! That was suspicious, so I decided to disassemble one of the pickles. This really needs a unit test, but I'm exhausted. I'll be late for work as it is. :-(	2003-01-30 06:37:41 +00:00
Guido van Rossum	4fba220f4a	Slight code rearrangement to avoid testing getstate twice.	2003-01-30 05:41:19 +00:00
Guido van Rossum	45486176ea	In save_newobj(), if an object's __getnewargs__ and __getstate__ are the same function, don't save the state or write a BUILD opcode. This is so that a type (e.g. datetime :-) can support protocol 2 using __getnewargs__ while also supporting protocol 0 and 1 using __getstate__. (Without this, the state would be pickled twice with protocol 2, unless __getstate__ is defined to return None, which breaks protocol 0 and 1.)	2003-01-30 05:39:04 +00:00
Guido van Rossum	ba884f3d22	Use %c rather than chr() to turn some ints into chars.	2003-01-29 20:14:23 +00:00
Guido van Rossum	5d9113d8be	Implement appropriate __getnewargs__ for all immutable subclassable builtin types. The special handling for these can now be removed from save_newobj(). Add some testing for this. Also add support for setting the 'fast' flag on the Python Pickler class, which suppresses use of the memo.	2003-01-29 17:58:45 +00:00
Guido van Rossum	586c9e813c	Declare Protocol 2 as implemented.	2003-01-29 06:16:12 +00:00
Guido van Rossum	255f3ee0a5	Support for extension codes. (By accident I checked in the tests first.)	2003-01-29 06:14:11 +00:00
Tim Peters	c0c12b5707	pickle: Comment repair. pickletools: Import decode_long from pickle instead of duplicating it.	2003-01-29 00:56:17 +00:00
Guido van Rossum	4e2491dbb1	Add a comment about how some built-in types should grow a __getnewargs__ method.	2003-01-28 22:31:25 +00:00
Guido van Rossum	b26a97aa50	Get rid of __safe_for_unpickling__ and safe_constructors. Also tidied up a few lines, got rid of apply(), added a comment.	2003-01-28 22:29:13 +00:00
Guido van Rossum	ac5b5d2e8b	Instead of bad hacks trying to worm around the inherited object.__reduce__, do a getattr() on the class so we can explicitly test for it. The reduce()-calling code becomes a bit more regular as a result. Also add support slots: if an object has slots, the default state is (dict, slots) where dict is the __dict__ or None, and slots is a dict mapping slot names to slot values. We do a best-effort approach to find slot names, assuming the __slots__ fields of classes aren't modified after class definition time to misrepresent the actual list of slots defined by a class.	2003-01-28 22:01:16 +00:00
Guido van Rossum	3d8c01b31c	The default __reduce__ on the base object type obscured any possibility of calling save_reduce(). Add a special hack for this. The tests for this are much simpler now (no __getstate__ or __getnewargs__ needed).	2003-01-28 19:48:18 +00:00
Guido van Rossum	54fb192508	Move the NEWOBJ-generating code to a separate function, and invoke it after checking for __reduce__.	2003-01-28 18:22:35 +00:00
Guido van Rossum	533dbcf250	Some experimental support for generating NEWOBJ with proto=2, and fixed a bug in load_newobj().	2003-01-28 17:55:05 +00:00
Tim Peters	a6ae9a2128	save_empty_tuple(): Comment on why we can't get rid of this.	2003-01-28 16:58:41 +00:00
Tim Peters	82ca59e002	save_dict(): Added a comment about the control flow NealN missed.	2003-01-28 16:47:59 +00:00
Tim Peters	13a25fb8e6	_is_string_secure(): This method is no longer used; removed it. (It was used before string-escape codecs were added to the core.)	2003-01-28 16:42:22 +00:00
Guido van Rossum	bc64e22ed6	Made save() fit on a page, while adding comments. (I moved some type checks to save_reduce(), which can also be called from a subclass.) Also tweaked some more comments.	2003-01-28 16:34:19 +00:00
Tim Peters	ad5a771fae	Got rid of the _quotes global. Used only once, and is trivial.	2003-01-28 16:23:33 +00:00
Guido van Rossum	1be3175992	Add a few comments. Change the way the protocol is checked (it must be one of 0, 1 or 2). I should note that the previous checkin also added NEWOBJ support to the unpickler -- but there's nothing yet that generates this.	2003-01-28 15:19:53 +00:00
Guido van Rossum	3a41c61dd4	Rename all variables 'object' to 'obj' to avoid conflicts with the type 'object'. Also minor docstring tweakage, and rearranged a few lines in save().	2003-01-28 15:10:22 +00:00
Guido van Rossum	cbe2dbddda	Don't memoize the empty tuple in protocol 0.	2003-01-28 14:40:16 +00:00
Tim Peters	d97da80dd5	save_tuple(): So long as the charter is rewriting for clarity, the snaky control flow had to be simplified.	2003-01-28 05:48:29 +00:00
Tim Peters	ff57bff16e	save_tuple(): I believe the new code for TUPLE{1,2,3} in proto 2 was incorrect for recursive tuples. Tried to repair; seems to work OK, but there are no checked-in tests for this yet.	2003-01-28 05:34:53 +00:00
Guido van Rossum	7d97d31a1b	OK, this is really the last one tonight! NEWFALSE and NEWTRUE.	2003-01-28 04:25:27 +00:00
Guido van Rossum	44f0ea5f73	More protocol 2: TUPLE1, TUPLE2, TUPLE3. Also moved the special case for empty tuples from save() to save_tuple().	2003-01-28 04:14:51 +00:00
Tim Peters	3b769835ca	save_inst(): Rewrote to have only one branch on self.bin. Also got rid of my recent XXX comment, taking a (what appears to be vanishingly small) chance and calling self.memoize() instead.	2003-01-28 03:51:36 +00:00
Guido van Rossum	d6c9e63af9	First baby steps towards implementing protocol 2: PROTO, LONG1 and LONG4.	2003-01-28 03:49:52 +00:00
Tim Peters	d95c2df3a9	Fixed odd whitespace after "if", which I believe I introduced long ago.	2003-01-28 03:41:54 +00:00
Tim Peters	8fda7bc48d	save_int(): Fixed two new off-by-1 glitches.	2003-01-28 03:40:52 +00:00
Guido van Rossum	e0b904232f	Add a comment explaining that struct.pack() beats marshal.dumps(), but marshal.loads() beats struct.unpack()! Possibly because the latter creates a one-tuple. :-(	2003-01-28 03:17:21 +00:00
Guido van Rossum	5c938d00a1	Got rid of mdumps; I timed it, and struct.pack("<i", x) is more than 40% faster than marshal.dumps(x)[1:]! (That's not counting the module attribute lookups, which can be avoided in either case.)	2003-01-28 03:03:08 +00:00
Tim Peters	f558da0f90	save_tuple(): Minor rewriting, and added a comment about the subtlety created by recursive tuples.	2003-01-28 02:09:55 +00:00
Tim Peters	209ad95b00	load_appends(): replaced .append() loop with an .extend().	2003-01-28 01:44:45 +00:00
Tim Peters	c23d18a955	Comments.	2003-01-28 01:41:51 +00:00
Tim Peters	064567e41a	save_dict(): Untangled most of the bin-vs-not-bin logic. Also used iteritems() instead of materializing a (possibly giant) list of the items.	2003-01-28 01:34:43 +00:00
Tim Peters	21c18f0bf5	save_list(): Rewrote, to untangle the proto 0 from the proto 1 cases. The code is much easier to follow now, and I bet it's faster too.	2003-01-28 01:15:46 +00:00
Tim Peters	22dc6f4f7a	save_list(): removed unused local "d".	2003-01-28 01:07:48 +00:00
Tim Peters	58d7846e68	save_list(): removed unused local "memo".	2003-01-28 01:07:10 +00:00
Tim Peters	bd1cdb9227	save_pers(): Switched the order of cases, to get rid of a "not", and to make the bin-vs-not-bin order consistent with what other routines try to do (they almost all handle the bin case first).	2003-01-28 01:03:10 +00:00
Tim Peters	518df0dae4	Several routines appeared to inline the guts of memoize(), possibly for some notion of low-level efficiency. Undid that, but left one routine alone: save_inst() claims it has a reason for not using memoize(). I don't understand that comment, so added an XXX comment there.	2003-01-28 01:00:38 +00:00
Tim Peters	b32a8317d7	save(): Fiddled the control flow to put the normal case where it belongs. This is a much smaller change than it may appear: the bulk of the function merely got unindented by one level.	2003-01-28 00:48:09 +00:00
Tim Peters	c9d7c4a656	Added XXX about save()'s special-casing of tuples -- I don't get it.	2003-01-28 00:43:26 +00:00
Tim Peters	22987e3cbd	save_bool(): simplified.	2003-01-28 00:26:14 +00:00
Tim Peters	cbd0a32d09	Repaired grammar in new comment.	2003-01-28 00:24:43 +00:00
Tim Peters	b377f8ac3e	clear_memo(): Repaired grammar in docstring.	2003-01-28 00:23:36 +00:00
Tim Peters	e105478a0c	Added one-line comments to the proto 2 opcodes.	2003-01-28 00:22:12 +00:00
Tim Peters	fdb8cfab08	Removed the new LONG2 opcode: it's extravagant. If LONG1 isn't enough, then the embedded argument consumes at least 256 bytes. The difference between a 3-byte prefix (LONG2 + 2 bytes) and a 5-byte prefix (LONG4 + 4 bytes) is at worst less than 1%. Note that binary strings and binary Unicode strings also have only "size is 1 byte, or size is 4 bytes?" flavors, and I expect for the same reason. The only place a 2-byte thingie was used was in BININT2, where the 2 bytes make up the entire embedded argument (and now EXT2 also does this); that's a large savings over 4 bytes, because the total opcode+argument size is so small in the BININT2/EXT2 case. Removed the TAKEN_FROM_ARGUMENT "number of bytes" code, and bifurcated it into TAKEN_FROM_ARGUMENT1 and TAKEN_FROM_ARGUMENT4. Now there's enough info in ArgumentDescriptor objects to deduce the # of bytes consumed by each opcode. Rearranged the order in which proto2 opcodes are listed in pickle.py.	2003-01-28 00:13:19 +00:00
Guido van Rossum	f29d3d6011	Begin the change from 'binary vs. text mode' to 'protocol 0, 1, 2'. The protocol now defaults to 1. Protocol 2 is still unimplemented.	2003-01-27 22:47:53 +00:00
Guido van Rossum	5a2d8f5e9a	Begin documenting protocol 2.	2003-01-27 21:44:25 +00:00
Tim Peters	bb38e306ae	Added some comments.	2003-01-27 21:25:41 +00:00
Tim Peters	e46b73f5b3	memoize(): Reworded the docs to try to disentangle the Pickler's memo dict from the Unpickler's memo (which is a different beast!).	2003-01-27 21:22:10 +00:00
Tim Peters	bbf63cd05c	Using marshal functions to pack & unpack 1-byte ints is an obscure & expensive way to spell chr() and ord().	2003-01-27 21:15:36 +00:00
Tim Peters	22a449aacc	Added a brief comment to each pickle opcode declaration.	2003-01-27 20:16:36 +00:00
Jeremy Hylton	3422c99de1	Raise PicklingError when __reduce__() fails, and add memoize() helper function to update the memo. The first element of the tuple returned by __reduce__() must be a callable. If it isn't the Unpickler will raise an error. Catch this error in the pickler and raise the error there. The memoize() helper also has a comment explaining how the memo works. So methods can't use memoize() because the write funny codes.	2003-01-24 19:29:52 +00:00
Jeremy Hylton	5e0f4e73a9	Remove inst_persistent_id() WANNI (we ain't never needed it). Add some simple tests of the persistence hooks.	2002-11-13 22:01:27 +00:00
Jeremy Hylton	f0cfdf7314	Fiddle comments and variable names in whichmodule().	2002-09-19 23:00:12 +00:00
Jeremy Hylton	065a5ab8fb	whichmodule() should skip dummy package entries in sys.modules. This fixes the charming, but unhelpful error message for >>> pickle.dumps(type.__new__) Can't pickle <built-in method __new__ of type object at 0x812a440>: it's not the same object as datetime.math.__new__ Bugfix candidate.	2002-09-19 22:57:26 +00:00
Martin v. Löwis	8a8da798a5	Patch #505705 : Remove eval in pickle and cPickle.	2002-08-14 07:46:28 +00:00
Jeremy Hylton	39c6116483	Given the persistent id code a shot at a class before calling save_global(). Some persistent picklers (well, probably, the only persistent pickler) would like to pickle some classes in a special way.	2002-07-16 19:47:43 +00:00
Raymond Hettinger	46ac8eb3c8	Code modernization. Replace v=s[i]; del s[i] with single lookup v=s.pop(i)	2002-06-30 03:39:14 +00:00
Raymond Hettinger	54f0222547	SF 563203. Replaced 'has_key()' with 'in'.	2002-06-01 14:18:47 +00:00
Neal Norwitz	efbb67b1a7	Remove comment about inheritance, look one line up	2002-05-30 12:12:04 +00:00
Raymond Hettinger	aef22fb9cd	Patch 560023 adding docstrings. 2.2 Candidate (after verifying modules were not updated after 2.2).	2002-05-29 16:18:42 +00:00
Tim Peters	8ac1495a6a	Whitespace normalization.	2002-05-23 15:15:30 +00:00
Raymond Hettinger	97394bc795	Patch 533291. Deprecate None return form of __reduce__.	2002-05-21 17:22:02 +00:00
Fred Drake	7f781c9aab	Add Pickler.clear_memo() so the pickle and cPickle modules are more similar.	2002-05-01 20:33:53 +00:00
Guido van Rossum	e276339cea	Implement an idea by Paul Rubin: Change pickling format for bools to use a backwards compatible encoding. This means you can pickle True or False on Python 2.3 and Python 2.2 or before will read it back as 1 or 0. The code used for pickling bools before would create pickles that could not be read in previous Python versions.	2002-04-05 19:30:08 +00:00
Guido van Rossum	77f6a65eb0	Add the 'bool' type and its values 'False' and 'True', as described in PEP 285. Everything described in the PEP is here, and there is even some documentation. I had to fix 12 unit tests; all but one of these were printing Boolean outcomes that changed from 0/1 to False/True. (The exception is test_unicode.py, which did a type(x) == type(y) style comparison. I could've fixed that with a single line using issubtype(x, type(y)), but instead chose to be explicit about those places where a bool is expected. Still to do: perhaps more documentation; change standard library modules to return False/True from predicates.	2002-04-03 22:41:51 +00:00

1 2 3 4 5

208 Commits