cpython

Commit Graph

Author	SHA1	Message	Date
Victor Stinner	d267ac20c3	bpo-36778: cp65001 encoding becomes an alias to utf_8 (GH-13230)	2019-05-10 03:19:54 +02:00
Inada Naoki	6a16b18224	bpo-36297: remove "unicode_internal" codec (GH-12342)	2019-03-18 15:44:11 +09:00
Anthony Sottile	ed2e9ab804	Remove obsolete comment about latin-1 in `normalize_encoding` (GH-8739) This docstring has drifted since python2: `ca079a3ea3/Lib/encodings/__init__.py (L68)`	2018-09-10 17:54:37 -07:00
Xiang Zhang	e4ce9fa89c	bpo-32943: Fix confusing error message for rot13 codec (GH-5869)	2018-03-25 12:09:21 +08:00
Victor Stinner	91106cd9ff	bpo-29240: PEP 540: Add a new UTF-8 Mode (#855 ) * Add -X utf8 command line option, PYTHONUTF8 environment variable and a new sys.flags.utf8_mode flag. * If the LC_CTYPE locale is "C" at startup: enable automatically the UTF-8 mode. * Add _winapi.GetACP(). encodings._alias_mbcs() now calls _winapi.GetACP() to get the ANSI code page * locale.getpreferredencoding() now returns 'UTF-8' in the UTF-8 mode. As a side effect, open() now uses the UTF-8 encoding by default in this mode. * Py_DecodeLocale() and Py_EncodeLocale() now use the UTF-8 encoding in the UTF-8 Mode. * Update subprocess._args_from_interpreter_flags() to handle -X utf8 * Skip some tests relying on the current locale if the UTF-8 mode is enabled. * Add test_utf8mode.py. * _Py_DecodeUTF8_surrogateescape() gets a new optional parameter to return also the length (number of wide characters). * pymain_get_global_config() and pymain_set_global_config() now always copy flag values, rather than only copying if the new value is greater than the old value.	2017-12-13 12:29:09 +01:00
Steve Dower	18591e4189	Revert #27959 : ImportError within an encoding module should also skip the encoding	2016-09-09 08:56:37 -07:00
Steve Dower	ef37dfcd84	Issue #28005 : Allow ImportErrors in encoding implementation to propagate.	2016-09-07 17:27:33 -07:00
Steve Dower	fe8f4c9e87	Issue #27959 : Prevent ImportError from escaping codec search function	2016-09-07 09:31:52 -07:00
Steve Dower	f5aba58480	Issue #27959 : Adds oem encoding, alias ansi to mbcs, move aliasmbcs to codec lookup	2016-09-06 19:42:27 -07:00
Victor Stinner	1a05d6c04d	PEP 7 style for if/else in C Add also a newline for readability in normalize_encoding().	2016-09-02 12:12:23 +02:00
Martin Panter	46f50726a0	Issue #27076 : Doc, comment and tests spelling fixes Most fixes to Doc/ and Lib/ directories by Ville Skyttä.	2016-05-26 05:35:26 +00:00
Brett Cannon	07b954d148	Add some "used with permission" mentions where external resources are referenced. Permission was validated prior to adding these markings.	2016-01-15 09:53:51 -08:00
Martin Panter	9ab96946ee	Issue #16473 : Merge codecs doc and test from 3.4 into 3.5	2015-09-12 01:22:17 +00:00
Martin Panter	06171bd52a	Issue #16473 : Fix byte transform codec documentation; test quotetabs=True This changes the equivalent functions listed for the Base-64, hex and Quoted- Printable codecs to reflect the functions actually used. Also mention and test the "quotetabs" setting for Quoted-Printable encoding.	2015-09-12 00:34:28 +00:00
Serhiy Storchaka	cd4a5cc339	Added forgotten new files for issues #22681 and #22682 .	2015-05-13 00:34:53 +03:00
Serhiy Storchaka	ad8a1c3fb2	Issue #22682 : Added support for the kz1048 encoding.	2015-05-12 23:16:55 +03:00
Serhiy Storchaka	85e7066278	Issue #22406 : Fixed the uu_codec codec incorrectly ported to 3.x. Based on patch by Martin Panter.	2014-11-07 14:06:19 +02:00
Serhiy Storchaka	519114df42	Issue #22406 : Fixed the uu_codec codec incorrectly ported to 3.x. Based on patch by Martin Panter.	2014-11-07 14:04:37 +02:00
Serhiy Storchaka	9c5553e122	Issue #21171 : Fixed undocumented filter API of the rot13 codec. Patch by Berker Peksag.	2014-04-13 17:08:51 +03:00
Serhiy Storchaka	a39938ff44	Issue #21171 : Fixed undocumented filter API of the rot13 codec. Patch by Berker Peksag.	2014-04-13 17:07:04 +03:00
Victor Stinner	7d00cc1a64	Issue #20574 : Implement incremental decoder for cp65001 code (Windows code page 65001, Microsoft UTF-8).	2014-03-17 23:08:06 +01:00
R David Murray	fb2c2db0fb	Merge #7475 : Remove references to '.transform' from transform codec docstrings.	2014-03-13 20:55:09 -04:00
R David Murray	e5cb836d4c	#7475 : Remove references to '.transform' from transform codec docstrings.	2014-03-13 20:54:30 -04:00
R David Murray	47d083cf1a	whatsnew: cp273 codec (#10907797 ) Also updated the docs and added the aliases mentioned by the references.	2014-03-07 21:00:34 -05:00
Serhiy Storchaka	94ee389308	Issue #19619 : Blacklist non-text codecs in method API str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type. The latter mechanism remains in place for third party non-text encodings. Backported changeset d68df99d7a57.	2014-02-24 14:43:03 +02:00
Serhiy Storchaka	e7f87e1262	Fixed incorrectly applying a patch for issue19668.	2013-11-23 19:50:47 +02:00
Serhiy Storchaka	be0c3250b1	Issue #19668 : Added support for the cp1125 encoding.	2013-11-23 18:52:23 +02:00
Nick Coghlan	9c1aed8f94	Close #7475 : Restore binary & text transform codecs The codecs themselves were restored in Python 3.2, this completes the restoration by adding back the convenience aliases. These aliases were originally left out due to confusing errors when attempting to use them with the text encoding specific convenience methods. Python 3.4 includes several improvements to those errors, thus permitting the aliases to be restored as well.	2013-11-23 11:13:36 +10:00
Nick Coghlan	c72e4e6dcc	Issue #19619 : Blacklist non-text codecs in method API str.encode, bytes.decode and bytearray.decode now use an internal API to throw LookupError for known non-text encodings, rather than attempting the encoding or decoding operation and then throwing a TypeError for an unexpected output type. The latter mechanism remains in place for third party non-text encodings.	2013-11-22 22:39:36 +10:00
Andrew Kuchling	ad8156e9b2	#1097797 : Add CP273 codec, and exercise it in the test suite	2013-11-10 13:44:30 -05:00
Brett Cannon	cd171c8e92	Issue #18200 : Back out usage of ModuleNotFoundError (8d28d44f3a9a)	2013-07-04 17:43:24 -04:00
Brett Cannon	0a140668fa	Issue #18200 : Update the stdlib (except tests) to use ModuleNotFoundError.	2013-06-13 20:57:26 -04:00
Victor Stinner	03c3e35d42	Add fast-path in PyUnicode_DecodeCharmap() for pure 8 bit encodings: cp037, cp500 and iso8859_1 codecs	2013-04-09 21:53:09 +02:00
Antoine Pitrou	7e19337ebc	Normalize whitespace	2012-06-16 22:50:54 +02:00
Antoine Pitrou	aaefac76dd	Issue #14874 : Restore charmap decoding speed to pre-PEP 393 levels. Patch by Serhiy Storchaka.	2012-06-16 22:48:21 +02:00
Antoine Pitrou	9768676f6f	Speed up IDNA for the common case	2011-11-10 22:49:20 +01:00
Florent Xicluna	aabbda5354	Merge 3.2	2011-10-28 14:52:29 +02:00
Florent Xicluna	5d1155c08e	Closes #13258 : Use callable() built-in in the standard library.	2011-10-28 14:45:05 +02:00
Victor Stinner	2f3ca9f20e	Close #13247 : Add cp65001 codec, the Windows UTF-8 (CP_UTF8)	2011-10-27 01:38:56 +02:00
Victor Stinner	b6f424043d	Issue #10807 : Remove base64, bz2, hex, quopri, rot13, uu and zlib codecs from the codec aliases. They are still accessible via codecs.lookup().	2011-01-02 19:50:36 +00:00
Georg Brandl	7c23ea2e88	Don't use deprecated aliases.	2010-12-06 22:25:25 +00:00
Georg Brandl	02524629f3	#7475 : add (un)transform method to bytes/bytearray and str, add back codecs that can be used with them from Python 2.	2010-12-02 18:06:51 +00:00
Florent Xicluna	e01de8f2f3	remove pointless coding cookies	2010-08-30 14:05:50 +00:00
Marc-André Lemburg	ff562506d4	Fix a typo in the alias target name for 'macintosh'.	2010-08-21 10:58:31 +00:00
Benjamin Peterson	23110e7361	alias macintosh to mac_roman #843590	2010-08-21 02:54:44 +00:00
Benjamin Peterson	5a6214afe2	Merged revisions 81499,81506 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81499 \| georg.brandl \| 2010-05-24 16:29:07 -0500 (Mon, 24 May 2010) \| 1 line #8016: add the CP858 codec (approved by Benjamin). (Also add CP720 to the tests, it was missing there.) ........ r81506 \| benjamin.peterson \| 2010-05-24 17:04:53 -0500 (Mon, 24 May 2010) \| 1 line set svn:eol-style ........	2010-06-27 22:41:29 +00:00
Victor Stinner	a92ad7ee2c	Merged revisions 81471-81472 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r81471 \| victor.stinner \| 2010-05-22 15:37:56 +0200 (sam., 22 mai 2010) \| 7 lines Issue #6268: More bugfixes about BOM, UTF-16 and UTF-32 * Fix seek() method of codecs.open(), don't write the BOM twice after seek(0) * Fix reset() method of codecs, UTF-16, UTF-32 and StreamWriter classes * test_codecs: use "w+" mode instead of "wt+". "t" mode is not supported by Solaris or Windows, but does it really exist? I found it the in the issue. ........ r81472 \| victor.stinner \| 2010-05-22 15:44:25 +0200 (sam., 22 mai 2010) \| 4 lines Fix my last commit (r81471) about codecs Rememder: don't touch the code just before a commit ........	2010-05-22 16:59:09 +00:00
Benjamin Peterson	75ad1fc089	Merged revisions 78806 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r78806 \| benjamin.peterson \| 2010-03-08 16:15:11 -0600 (Mon, 08 Mar 2010) \| 1 line set svn:eol-style on various files ........	2010-03-08 22:17:58 +00:00
Brett Cannon	5f4ec0451c	Fix a minor grammatical error.	2009-12-13 21:25:28 +00:00
Philip Jenvey	1309adb06a	Merged revisions 76337 via svnmerge from svn+ssh://pythondev@svn.python.org/python/trunk ........ r76337 \| philip.jenvey \| 2009-11-16 18:42:26 -0800 (Mon, 16 Nov 2009) \| 2 lines #1757126: fix typo with the cyrillic_asian alias ........	2009-11-17 03:43:14 +00:00

1 2 3 4

177 Commits