Serhiy Storchaka
de3ee5b94f
Issue #23071 : Added missing names to codecs.__all__. Patch by Martin Panter.
2014-12-20 17:42:38 +02:00
Serhiy Storchaka
166ebc4e5d
Issue #19676 : Added the "namereplace" error handler.
2014-11-25 13:57:17 +02:00
Serhiy Storchaka
85e7066278
Issue #22406 : Fixed the uu_codec codec incorrectly ported to 3.x.
...
Based on patch by Martin Panter.
2014-11-07 14:06:19 +02:00
Serhiy Storchaka
519114df42
Issue #22406 : Fixed the uu_codec codec incorrectly ported to 3.x.
...
Based on patch by Martin Panter.
2014-11-07 14:04:37 +02:00
Nick Coghlan
a0f33759fa
Merge fix for issue #22166 from 3.4
2014-09-15 23:55:16 +12:00
Nick Coghlan
8fad1676a2
Issue #22166 : clear codec caches in test_codecs
2014-09-15 23:50:44 +12:00
Victor Stinner
0d4e01ca07
Issue #13916 : Fix surrogatepass error handler on Windows
2014-05-16 14:46:20 +02:00
Serhiy Storchaka
88d8fb6af6
Issue #13916 : Disallowed the surrogatepass error handler for non UTF-*
...
encodings.
2014-05-15 14:37:42 +03:00
Victor Stinner
a57dfd033c
Issue #21488 : Add support of keyword arguments for codecs.encode and codecs.decode
2014-05-14 17:13:14 +02:00
Victor Stinner
07beb375b7
Issue #20574 : Remove duplicated test failing on Windows XP
2014-03-18 01:40:22 +01:00
Victor Stinner
f8cbf78bbd
Issue #20574 : Add more tests for cp65001
2014-03-17 23:16:02 +01:00
Victor Stinner
7d00cc1a64
Issue #20574 : Implement incremental decoder for cp65001 code
...
(Windows code page 65001, Microsoft UTF-8).
2014-03-17 23:08:06 +01:00
Victor Stinner
3633ce3301
Issue #20571 : skip test_readline() of test_codecs for Windows code page 65001.
...
The decoder does not support partial decoding yet for this code page.
2014-02-09 13:11:53 +01:00
Serhiy Storchaka
6cbf151032
Issue #20538 : UTF-7 incremental decoder produced inconsistant string when
...
input was truncated in BASE64 section.
2014-02-08 14:06:33 +02:00
Serhiy Storchaka
016a3f33a5
Issue #20538 : UTF-7 incremental decoder produced inconsistant string when
...
input was truncated in BASE64 section.
2014-02-08 14:01:29 +02:00
Nick Coghlan
96252cd724
Issue 20542: Temporarily skip failing test
2014-02-07 23:34:41 +10:00
Serhiy Storchaka
f28ba369dd
Issue #20532 : Tests which use _testcapi now are marked as CPython only.
2014-02-07 10:10:55 +02:00
Serhiy Storchaka
5cfc79deae
Issue #20532 : Tests which use _testcapi now are marked as CPython only.
2014-02-07 10:06:39 +02:00
Serhiy Storchaka
3dcb0cf9b1
Issue #20520 : Fixed readline test in test_codecs.
2014-02-06 09:27:28 +02:00
Serhiy Storchaka
5b4fab1ad7
Issue #20520 : Fixed readline test in test_codecs.
2014-02-06 09:26:56 +02:00
Serhiy Storchaka
dbe0982bc5
Issue #8260 : The read(), readline() and readlines() methods of
...
codecs.StreamReader returned incomplete data when were called after
readline() or read(size). Based on patch by Amaury Forgeot d'Arc.
2014-01-26 19:27:56 +02:00
Serhiy Storchaka
8003850e22
Issue #8260 : The read(), readline() and readlines() methods of
...
codecs.StreamReader returned incomplete data when were called after
readline() or read(size). Based on patch by Amaury Forgeot d'Arc.
2014-01-26 19:21:00 +02:00
Nick Coghlan
77b286b2cc
Close #20105 : set __traceback__ when chaining exceptions in C
2014-01-27 00:53:38 +10:00
Zachary Ware
efa2e04033
Issue19619: skip zlib error test when zlib not available
2013-12-30 14:54:11 -06:00
Serhiy Storchaka
2480c2ed59
Issue #15204 : Silence and check the 'U' mode deprecation warnings in tests.
...
Changed deprecation message in the fileinput module.
2013-11-24 23:13:26 +02:00
Serhiy Storchaka
be0c3250b1
Issue #19668 : Added support for the cp1125 encoding.
2013-11-23 18:52:23 +02:00
Nick Coghlan
9c1aed8f94
Close #7475 : Restore binary & text transform codecs
...
The codecs themselves were restored in Python 3.2, this
completes the restoration by adding back the convenience
aliases.
These aliases were originally left out due to confusing
errors when attempting to use them with the text encoding
specific convenience methods. Python 3.4 includes several
improvements to those errors, thus permitting the aliases
to be restored as well.
2013-11-23 11:13:36 +10:00
Nick Coghlan
c72e4e6dcc
Issue #19619 : Blacklist non-text codecs in method API
...
str.encode, bytes.decode and bytearray.decode now use an
internal API to throw LookupError for known non-text encodings,
rather than attempting the encoding or decoding operation and
then throwing a TypeError for an unexpected output type.
The latter mechanism remains in place for third party non-text
encodings.
2013-11-22 22:39:36 +10:00
Nick Coghlan
f1de55fb33
Also chain codec exceptions that allow weakrefs
...
The zlib and hex codecs throw custom exception types with
weakref support if the input type is valid, but the data
fails validation. Make sure the exception chaining in the
codec infrastructure can wrap those as well.
2013-11-19 22:33:10 +10:00
Serhiy Storchaka
58cf607d13
Issue #12892 : The utf-16* and utf-32* codecs now reject (lone) surrogates.
...
The utf-16* and utf-32* encoders no longer allow surrogate code points
(U+D800-U+DFFF) to be encoded.
The utf-32* decoders no longer decode byte sequences that correspond to
surrogate code points.
The surrogatepass error handler now works with the utf-16* and utf-32* codecs.
Based on patches by Victor Stinner and Kang-Hao (Kenny) Lu.
2013-11-19 11:32:41 +02:00
Nick Coghlan
4e553e2e52
Avoid triggering the refleak detector
2013-11-16 00:35:34 +10:00
Nick Coghlan
c4c2580d43
Close 19609: narrow scope of codec exc chaining
2013-11-15 21:47:37 +10:00
Nick Coghlan
8b097b4ed7
Close #17828 : better handling of codec errors
...
- output type errors now redirect users to the type-neutral
convenience functions in the codecs module
- stateless errors that occur during encoding and decoding
will now be automatically wrapped in exceptions that give
the name of the codec involved
2013-11-13 23:49:21 +10:00
Serhiy Storchaka
0e071c967c
Fixed tests for issue #19279 .
2013-10-19 21:14:57 +03:00
Serhiy Storchaka
55e092f545
Issue #19279 : UTF-7 decoder no more produces illegal strings.
2013-10-19 20:39:28 +03:00
Serhiy Storchaka
35804e4c63
Issue #19279 : UTF-7 decoder no more produces illegal strings.
2013-10-19 20:38:19 +03:00
Nick Coghlan
fdf239a855
Close #17839 : support bytes-like objects in base64 module
...
This mostly affected the encodebytes and decodebytes function
(which are used by base64_codec)
Also added a test to ensure all bytes-bytes codecs can handle
memoryview input and tests for handling of multidimensional
and non-bytes format input in the modern base64 API.
2013-10-03 00:43:22 +10:00
Serhiy Storchaka
7b07873b93
Add tests for raw-unicode-escape codec.
2013-01-29 11:41:34 +02:00
Serhiy Storchaka
799fd9c877
Add tests for raw-unicode-escape codec.
2013-01-29 11:41:01 +02:00
Serhiy Storchaka
c9c4338e2b
Add tests for raw-unicode-escape codec.
2013-01-29 11:40:00 +02:00
Serhiy Storchaka
d8f07cd374
Clean up escape-decode decoder tests.
2013-01-29 11:08:06 +02:00
Serhiy Storchaka
db6add7d71
Clean up escape-decode decoder tests.
2013-01-29 11:07:27 +02:00
Serhiy Storchaka
077cb347a9
Clean up escape-decode decoder tests.
2013-01-29 11:06:53 +02:00
Serhiy Storchaka
8fe5a9f9c3
Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.
2013-01-29 10:37:39 +02:00
Serhiy Storchaka
24193debd4
Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.
2013-01-29 10:28:07 +02:00
Serhiy Storchaka
d679377be7
Issue #16979 : Fix error handling bugs in the unicode-escape-decode decoder.
2013-01-29 10:20:44 +02:00
Serhiy Storchaka
f584aba3a5
Issue #16975 : Fix error handling bug in the escape-decode bytes decoder.
2013-01-25 23:33:22 +02:00
Serhiy Storchaka
e58785b200
Issue #16975 : Fix error handling bug in the escape-decode bytes decoder.
2013-01-25 23:32:41 +02:00
Serhiy Storchaka
ace3ad3bf7
Issue #16975 : Fix error handling bug in the escape-decode bytes decoder.
2013-01-25 23:31:43 +02:00
Serhiy Storchaka
55e2cb497b
Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping"
...
in any mapping, not only in an unicode string.
2013-01-15 15:30:04 +02:00
Serhiy Storchaka
45d16d9924
Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping"
...
in any mapping, not only in an unicode string.
2013-01-15 15:01:20 +02:00
Serhiy Storchaka
4fb8caee87
Issue #14850 : Now a chamap decoder treates U+FFFE as "undefined mapping"
...
in any mapping, not only in an unicode string.
2013-01-15 14:43:21 +02:00
Ezio Melotti
aabd0b0312
#16918 : merge with 3.3.
2013-01-11 06:05:51 +02:00
Ezio Melotti
5d3dba0d27
#16918 : test_codecs now works with unittest test discovery. Patch by Zachary Ware.
2013-01-11 06:02:07 +02:00
Ezio Melotti
e0b87edd7f
Merge fix for broken/disabled test.
2013-01-11 05:57:58 +02:00
Ezio Melotti
26ed234052
Enable a broken test and fix it.
2013-01-11 05:54:57 +02:00
Serhiy Storchaka
24a3ef6999
Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by
...
Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP
characters.
2013-01-08 23:41:55 +02:00
Serhiy Storchaka
ae3b32ad6b
Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by
...
Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP
characters.
2013-01-08 23:40:52 +02:00
Serhiy Storchaka
48e188e573
Issue #11461 : Fix the incremental UTF-16 decoder. Original patch by
...
Amaury Forgeot d'Arc. Added tests for partial decoding of non-BMP
characters.
2013-01-08 23:14:24 +02:00
Andrew Svetlov
2606a6f197
Issue #16719 : Get rid of WindowsError. Use OSError instead
...
Patch by Serhiy Storchaka.
2012-12-19 14:33:35 +02:00
Ezio Melotti
a0b5c46fa2
#16336 : merge with 3.2.
2012-11-03 23:04:41 +02:00
Ezio Melotti
540da76115
#16336 : fix input checking in the surrogatepass error handler. Patch by Serhiy Storchaka.
2012-11-03 23:03:39 +02:00
Philip Jenvey
5f9459fbed
merge with 3.2
2012-10-26 17:05:09 -07:00
Philip Jenvey
45c41494bf
bounds check for bad data (thanks amaury)
2012-10-26 17:01:53 -07:00
Antoine Pitrou
a1f7655fa7
Issue #15379 : Fix passing of non-BMP characters as integers for the charmap decoder (already working as unicode strings).
...
Patch by Serhiy Storchaka.
2012-09-23 20:00:04 +02:00
Antoine Pitrou
6f80f5d444
Issue #15379 : Fix passing of non-BMP characters as integers for the charmap decoder (already working as unicode strings).
...
Patch by Serhiy Storchaka.
2012-09-23 19:55:21 +02:00
Antoine Pitrou
5e36edbaba
Port additional tests from #14579 (the issue is already fixed).
2012-07-21 00:47:48 +02:00
Antoine Pitrou
b4bbee25b1
Issue #14579 : Fix CVE-2012-2135: vulnerability in the utf-16 decoder after error handling.
...
Patch by Serhiy Storchaka.
2012-07-21 00:45:14 +02:00
Victor Stinner
e3b47152a4
Write tests for invalid characters (U+00110000)
...
Test the following functions:
* codecs.raw_unicode_escape_decode()
* PyUnicode_FromWideChar()
* PyUnicode_FromUnicode()
* "unicode_internal" and "unicode_escape" decoders
2011-12-09 20:49:49 +01:00
Ezio Melotti
adc417ce36
#13406 : fix more deprecation warnings and move the deprecation of unicode-internal earlier in the code.
2011-11-17 12:23:34 +02:00
Ezio Melotti
345379a7f8
#13406 : correct the error message in check_warnings too.
2011-11-16 09:54:19 +02:00
Ezio Melotti
11060a4a48
#13406 : silence deprecation warnings in test_codecs.
2011-11-16 09:39:10 +02:00
Victor Stinner
040e16e3e8
"unicode_internal" codec has been deprecated: fix related tests
2011-11-15 22:44:05 +01:00
Victor Stinner
76a31a6bff
Cleanup decode_code_page_stateful() and encode_code_page()
...
* Fix decode_code_page_errors() result
* Inline decode_code_page() and encode_code_page_chunk()
* Replace the PyUnicodeObject type by PyObject
2011-11-04 00:05:13 +01:00
Victor Stinner
2f3ca9f20e
Close #13247 : Add cp65001 codec, the Windows UTF-8 (CP_UTF8)
2011-10-27 01:38:56 +02:00
Victor Stinner
9e92188f53
Issue #12281 : Fix test_codecs.test_cp932() on Windows XP
...
Cool! Decoding b'\x81\x00abc' from cp932 with replace error handler is now
giving the same result on all Windows versions.
2011-10-18 21:55:25 +02:00
Victor Stinner
62be4fb21f
Issue #12281 : Skip code page tests on non-Windows platforms
2011-10-18 21:46:37 +02:00
Victor Stinner
3a50e7056e
Issue #12281 : Rewrite the MBCS codec to handle correctly replace and ignore
...
error handlers on all Windows versions. The MBCS codec is now supporting all
error handlers, instead of only replace to encode and ignore to decode.
2011-10-18 21:21:00 +02:00
Antoine Pitrou
00b2c86d09
Fix text failures when ctypes is not available
...
(followup to Victor's 85d11cf67aa8 and 7a50e549bd11)
2011-10-05 13:01:41 +02:00
Victor Stinner
182d90d9ee
Fix test_codecs for Windows: check size of wchar_t, not sys.maxunicode
2011-09-29 19:53:55 +02:00
Martin v. Löwis
d63a3b8beb
Implement PEP 393.
2011-09-28 07:41:54 +02:00
Antoine Pitrou
2a20f9be70
Backport 0398f07d4827 (fix for weird buildbot failures)
2011-07-27 01:06:07 +02:00
Antoine Pitrou
d05066d1ee
Try to fix weird buildbot failures
2011-07-26 23:55:33 +02:00
Antoine Pitrou
5a24d82941
Add a test for issue #1813 : getlocale() failing under a Turkish locale
...
(not a problem under 3.x)
2011-07-24 02:41:54 +02:00
Antoine Pitrou
cf9d3c08c8
Issue #1813 : Fix codec lookup under Turkish locales.
2011-07-24 02:27:04 +02:00
Victor Stinner
0501070669
Revert my commit 3555cf6f9c98: "Issue #8796 : codecs.open() calls the builtin
...
open() function instead of using StreamReaderWriter. Deprecate StreamReader,
StreamWriter, StreamReaderWriter, StreamRecoder and EncodedFile() of the codec
module. Use the builtin open() function or io.TextIOWrapper instead."
"It has not been approved !" wrote Marc-Andre Lemburg.
2011-05-27 16:50:40 +02:00
Victor Stinner
98fe1a0c3b
Issue #8796 : codecs.open() calls the builtin open() function instead of using
...
StreamReaderWriter. Deprecate StreamReader, StreamWriter, StreamReaderWriter,
StreamRecoder and EncodedFile() of the codec module. Use the builtin open()
function or io.TextIOWrapper instead.
2011-05-27 01:51:18 +02:00
Victor Stinner
d6881701fb
Merge 3.2
2011-05-23 14:58:07 +02:00
Victor Stinner
b43dd4b8ca
Merge 3.1
2011-05-23 14:57:05 +02:00
Victor Stinner
2cca057284
test_codecs now removes the temporay file (created by the test)
2011-05-23 14:51:42 +02:00
Marc-André Lemburg
8f36af7a4c
Normalize the encoding names for Latin-1 and UTF-8 to
...
'latin-1' and 'utf-8'.
These are optimized in the Python Unicode implementation
to result in more direct processing, bypassing the codec
registry.
Also see issue11303.
2011-02-25 15:42:01 +00:00
Benjamin Peterson
28a4dce6a8
remove (un)transform methods
2010-12-12 01:33:04 +00:00
Victor Stinner
53a9dd776e
Issue #10546 : UTF-16-LE and UTF-16-BE *do* support non-BMP characters
...
Fix the doc and add tests.
2010-12-08 22:25:45 +00:00
Georg Brandl
02524629f3
#7475 : add (un)transform method to bytes/bytearray and str, add back codecs that can be used with them from Python 2.
2010-12-02 18:06:51 +00:00
Ezio Melotti
19f2aeba67
Merged revisions 86596 via svnmerge from
...
svn+ssh://pythondev@svn.python.org/python/branches/py3k
........
r86596 | ezio.melotti | 2010-11-20 21:04:17 +0200 (Sat, 20 Nov 2010) | 1 line
#9424 : Replace deprecated assert* methods in the Python test suite.
........
2010-11-21 01:30:29 +00:00
Ezio Melotti
b3aedd4862
#9424 : Replace deprecated assert* methods in the Python test suite.
2010-11-20 19:04:17 +00:00
Benjamin Peterson
5a6214afe2
Merged revisions 81499,81506 via svnmerge from
...
svn+ssh://pythondev@svn.python.org/python/trunk
........
r81499 | georg.brandl | 2010-05-24 16:29:07 -0500 (Mon, 24 May 2010) | 1 line
#8016 : add the CP858 codec (approved by Benjamin). (Also add CP720 to the tests, it was missing there.)
........
r81506 | benjamin.peterson | 2010-05-24 17:04:53 -0500 (Mon, 24 May 2010) | 1 line
set svn:eol-style
........
2010-06-27 22:41:29 +00:00
Victor Stinner
554f3f0081
Issue #850997 : mbcs encoding (Windows only) handles errors argument: strict
...
mode raises unicode errors. The encoder only supports "strict" and "replace"
error handlers, the decoder only supports "strict" and "ignore" error handlers.
2010-06-16 23:33:54 +00:00
Antoine Pitrou
6107a688ee
Merged revisions 81908 via svnmerge from
...
svn+ssh://pythondev@svn.python.org/python/branches/py3k
................
r81908 | antoine.pitrou | 2010-06-11 23:46:32 +0200 (ven., 11 juin 2010) | 11 lines
Merged revisions 81907 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r81907 | antoine.pitrou | 2010-06-11 23:42:26 +0200 (ven., 11 juin 2010) | 5 lines
Issue #8941 : decoding big endian UTF-32 data in UCS-2 builds could crash
the interpreter with characters outside the Basic Multilingual Plane
(higher than 0x10000).
........
................
2010-06-11 21:48:34 +00:00
Antoine Pitrou
cc0cfd3576
Merged revisions 81907 via svnmerge from
...
svn+ssh://pythondev@svn.python.org/python/trunk
........
r81907 | antoine.pitrou | 2010-06-11 23:42:26 +0200 (ven., 11 juin 2010) | 5 lines
Issue #8941 : decoding big endian UTF-32 data in UCS-2 builds could crash
the interpreter with characters outside the Basic Multilingual Plane
(higher than 0x10000).
........
2010-06-11 21:46:32 +00:00