Do not put a raw REPLACEMENT CHARACTER in the document.

2010-11-19 22:09:04 +00:00 · c8c60c2284
parent c5b0ec0a83
commit c8c60c2284
1 changed file with 4 additions and 1 deletion
--- a/Doc/howto/unicode.rst
+++ b/Doc/howto/unicode.rst
@@ -263,10 +263,13 @@ Unicode result).  The following examples show the differences::
    UnicodeDecodeError: 'utf8' codec can't decode byte 0x80 in position 0:
                        unexpected code byte
    >>> b'\x80abc'.decode("utf-8", "replace")
-    '<EFBFBD>abc'
+    '?abc'
    >>> b'\x80abc'.decode("utf-8", "ignore")
    'abc'

+(In this code example, the Unicode replacement character has been replaced by
+a question mark because it may not be displayed on some systems.)
+
 Encodings are specified as strings containing the encoding's name.  Python 3.2
 comes with roughly 100 different encodings; see the Python Library Reference at
 :ref:`standard-encodings` for a list.  Some encodings have multiple names; for