cpython

Commit Graph

Author	SHA1	Message	Date
Collin Winter	ce36ad8a46	Raise statement normalization in Lib/.	2007-08-30 01:19:48 +00:00
Walter Dörwald	3abcb013b8	Apply SF patch #1698994 : Add getstate() and setstate() methods to incrementalcodecs. Also forward port r54786 (fix the incremental utf_8_sig decoder).	2007-04-16 22:10:50 +00:00
Thomas Wouters	a977329b6f	Merge part of the trunk changes into the p3yk branch. This merges from 43030 (branch-creation time) up to 43067. 43068 and 43069 contain a little swapping action between re.py and sre.py, and this mightily confuses svn merge, so later changes are going in separately. This merge should break no additional tests. The last-merged revision is going in a 'last_merge' property on '.' (the branch directory.) Arbitrarily chosen, really; if there's a BCP for this, I couldn't find it, but we can easily change it afterwards ;)	2006-04-21 09:43:23 +00:00
Walter Dörwald	729c31f5c3	Reset internal buffers when seek() is called. This fixes SF bug #1156259 .	2005-03-14 19:06:30 +00:00
Walter Dörwald	69652035bc	SF patch #998993 : The UTF-8 and the UTF-16 stateful decoders now support decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful.	2004-09-07 20:24:22 +00:00
Tim Peters	469cdad822	Whitespace normalization.	2002-08-08 20:19:19 +00:00
Marc-André Lemburg	3ccb09cba3	Fix for bug #222395 : UTF-16 et al. don't handle .readline(). They now raise an NotImplementedError to hint to the truth ;-)	2002-04-05 12:12:00 +00:00
Marc-André Lemburg	92b550cdd8	This patch by Martin v. Loewis changes the UTF-16 codec to only write a BOM at the start of the stream and also to only read it as BOM at the start of a stream. Subsequent reading/writing of BOMs will read/write the BOM as ZWNBSP character. This is in sync with the Unicode specifications. Note that UTF-16 files will now have to start with a BOM mark in order to be readable by the codec.	2001-06-19 20:07:51 +00:00
Guido van Rossum	0229bf6001	Marc-Andre Lemburg: Unicode encodings.	2000-03-10 23:17:24 +00:00

9 Commits