cpython

Commit Graph

Author	SHA1	Message	Date
Thomas Wouters	49fd7fa443	Merge p3yk branch with the trunk up to revision 45595. This breaks a fair number of tests, all because of the codecs/_multibytecodecs issue described here (it's not a Py3K issue, just something Py3K discovers): http://mail.python.org/pipermail/python-dev/2006-April/064051.html Hye-Shik Chang promised to look for a fix, so no need to fix it here. The tests that are expected to break are: test_codecencodings_cn test_codecencodings_hk test_codecencodings_jp test_codecencodings_kr test_codecencodings_tw test_codecs test_multibytecodec This merge fixes an actual test failure (test_weakref) in this branch, though, so I believe merging is the right thing to do anyway.	2006-04-21 10:40:58 +00:00
Thomas Wouters	a977329b6f	Merge part of the trunk changes into the p3yk branch. This merges from 43030 (branch-creation time) up to 43067. 43068 and 43069 contain a little swapping action between re.py and sre.py, and this mightily confuses svn merge, so later changes are going in separately. This merge should break no additional tests. The last-merged revision is going in a 'last_merge' property on '.' (the branch directory.) Arbitrarily chosen, really; if there's a BCP for this, I couldn't find it, but we can easily change it afterwards ;)	2006-04-21 09:43:23 +00:00
Georg Brandl	8f4b4db676	typo	2006-03-09 10:16:42 +00:00
Walter Dörwald	b075fcea3a	Fix table header.	2006-02-21 18:51:32 +00:00
Georg Brandl	131e4f71ba	Add markup to new section in codecs docs	2006-01-23 21:33:48 +00:00
Walter Dörwald	b754fe4e7f	Fix typos.	2006-01-09 12:45:01 +00:00
Martin v. Löwis	412ed3b8a7	Patch #1177307 : UTF-8-Sig codec.	2006-01-08 10:45:39 +00:00
Walter Dörwald	d4bfe2c878	SF patch #1364946 : Add a reference link from the dcoumentation of the encode and decode methods to the documentation of the default error handlers.	2005-11-25 17:17:12 +00:00
Fred Drake	9984e706ff	add missing word	2005-10-20 17:52:05 +00:00
Walter Dörwald	007f8dfde2	Bug #1245379 : Add "unicode-1-1-utf-7" as an alias for "utf-7" as specified by RFC 1642.	2005-10-09 19:42:27 +00:00
Martin v. Löwis	56066d2e55	Return complete lines from codec stream readers even if there is an exception in later lines, resulting in correct line numbers for decoding errors in source code. Fixes #1178484. Will backport to 2.4.	2005-08-24 07:38:12 +00:00
Raymond Hettinger	68804315e0	SF Patch #1093896 : miscellaneous doc typos	2005-01-01 00:28:46 +00:00
Fred Drake	a2544ee7f0	fix typo in markup	2004-09-10 01:16:49 +00:00
Walter Dörwald	69652035bc	SF patch #998993 : The UTF-8 and the UTF-16 stateful decoders now support decoding incomplete input (when the input stream is temporarily exhausted). codecs.StreamReader now implements buffering, which enables proper readline support for the UTF-16 decoders. codecs.StreamReader.read() has a new argument chars which specifies the number of characters to return. codecs.StreamReader.readline() and codecs.StreamReader.readlines() have a new argument keepends. Trailing "\n"s will be stripped from the lines if keepends is false. Added C APIs PyUnicode_DecodeUTF8Stateful and PyUnicode_DecodeUTF16Stateful.	2004-09-07 20:24:22 +00:00
Hye-Shik Chang	2bb146f2f4	Bring CJKCodecs 1.1 into trunk. This completely reorganizes source and installed layouts to make maintenance simple and easy. And it also adds four new codecs; big5hkscs, euc-jis-2004, shift-jis-2004 and iso2022-jp-2004.	2004-07-18 03:06:29 +00:00
Hye-Shik Chang	910d8f1e89	Change CJK encoding aliases to their most popular variation of hyphen and underscores in consistency of non-CJK aliases. (Spotted by Mike Brown at SF #969415)	2004-07-17 14:44:43 +00:00
Skip Montanaro	78bace7442	add cp866 row	2004-07-02 02:14:34 +00:00
Skip Montanaro	ecf7a52bb8	link to the codecs page from the "".encode() description.	2004-07-01 19:26:04 +00:00
Hye-Shik Chang	5c5316f111	Add a new unicode codec: ptcp154 (Kazakh)	2004-03-19 08:06:07 +00:00
Hye-Shik Chang	3e2a306920	Add CJK codecs support as discussed on python-dev. (SF #873597 ) Several style fixes are suggested by Martin v. Loewis and Marc-Andre Lemburg. Thanks!	2004-01-17 14:29:29 +00:00
Raymond Hettinger	9a80c5dbc4	Added codec for bz2 compression.	2003-09-23 20:21:01 +00:00
Raymond Hettinger	7e43110f34	SF 810242. Fix doubled word errors.	2003-09-22 15:00:55 +00:00
Raymond Hettinger	aa1178b811	Minor typo	2003-09-01 23:13:04 +00:00
Raymond Hettinger	f17d65da3a	SF patch#786531 'the the' typo. Contributed by George Yoshida	2003-08-12 00:01:16 +00:00
Fred Drake	d24c767d5b	A variety of markup-level adjustments.	2003-07-16 05:17:23 +00:00
Raymond Hettinger	b5155e30ce	Fix typo.	2003-06-18 01:58:31 +00:00
Fred Drake	d4be747e1e	- comment out \moduleauthor that broke formatting until the formatting tools can be fixed; added XXX comment - general markup fixes	2003-04-30 15:02:07 +00:00
Martin v. Löwis	faf71ea5b3	Fix spelling of cedillas.	2003-04-18 21:48:56 +00:00
Martin v. Löwis	2548c730c1	Implement IDNA (Internationalized Domain Names in Applications).	2003-04-18 10:39:54 +00:00
Walter Dörwald	2e0b18af30	Change the treatment of positions returned by PEP293 error handers in the Unicode codecs: Negative positions are treated as being relative to the end of the input and out of bounds positions result in an IndexError. Also update the PEP and include an explanation of this in the documentation for codecs.register_error. Fixes a small bug in iconv_codecs: if the position from the callback is negative add it to the size instead of substracting it. From SF patch #677429.	2003-01-31 17:19:08 +00:00
Martin v. Löwis	5c37a7717d	Document standard encodings.	2002-12-31 12:39:07 +00:00
Walter Dörwald	72f861657a	Document additional error handling names available through PEP 293.	2002-11-19 21:51:35 +00:00
Walter Dörwald	430b1563dd	Add documentation for the PEP 293 functionality: The errors attribute can be changed after the reader/writer is created. For encoding there are two additional errors values: "xmlcharrefreplace" and "backslashreplace". These values can be extended via register_error().	2002-11-07 22:33:17 +00:00
Walter Dörwald	1a7a894d90	Move introductory sentence to where it belongs.	2002-11-02 13:32:07 +00:00
Raymond Hettinger	8a64d40949	Fix typo. Close SF Bug 606354.	2002-09-08 22:26:13 +00:00
Walter Dörwald	3aeb632c31	PEP 293 implemention (from SF patch http://www.python.org/sf/432401 )	2002-09-02 13:14:32 +00:00
Walter Dörwald	474458da48	Add constants BOM_UTF8, BOM_UTF16, BOM_UTF16_LE, BOM_UTF16_BE, BOM_UTF32, BOM_UTF32_LE and BOM_UTF32_BE that represent the Byte Order Mark in UTF-8, UTF-16 and UTF-32 encodings for little and big endian systems. The old names BOM32_* and BOM64_* were off by a factor of 2. This closes SF bug http://www.python.org/sf/555360	2002-06-04 15:16:29 +00:00
Skip Montanaro	b02ea65f92	typo	2002-04-17 19:33:06 +00:00
Skip Montanaro	6c7bc31089	added small clarification to the descriptions of encode() and decode()	2002-04-16 15:12:10 +00:00
Fred Drake	0aa811c527	Use the \note and \warning macros where appropriate.	2001-10-20 04:24:09 +00:00
Marc-André Lemburg	494f2aea8e	Docs and News item for the codecs.py additions.	2001-09-19 11:33:31 +00:00
Fred Drake	dc40ac0fe0	Added link to the "Python Codecs" project at SourceForge. Changed markup of the list of values for the list of meaningful "errors" values.	2001-01-22 20:17:54 +00:00
Fred Drake	602aa77d2f	Marc-Andre Lemburg <mal@lemburg.com>: Documentation for the codec base classes. Lots of markup adjustments by FLD. This closes SourceForge bug #115308, patch #101877.	2000-10-12 20:50:55 +00:00
Fred Drake	e1b304db37	Fix small typos and markup consistency nits.	2000-07-24 19:35:52 +00:00
Fred Drake	69ca950d1f	Make sure the \declaremodule uses the right name for the module! Clean up several markup problems & inconsistencies.	2000-04-06 16:09:59 +00:00
Fred Drake	b7979c756c	Marc-Andre Lemburg <mal@lemburg.com>: codecs module documentation, with some preliminary markup adjustments from FLD.	2000-04-06 14:21:58 +00:00

46 Commits