mirror of https://github.com/python/cpython
287 lines
10 KiB
ReStructuredText
287 lines
10 KiB
ReStructuredText
:mod:`base64` --- Base16, Base32, Base64, Base85 Data Encodings
|
|
===============================================================
|
|
|
|
.. module:: base64
|
|
:synopsis: RFC 3548: Base16, Base32, Base64 Data Encodings;
|
|
Base85 and Ascii85
|
|
|
|
|
|
.. index::
|
|
pair: base64; encoding
|
|
single: MIME; base64 encoding
|
|
|
|
This module provides functions for encoding binary data to printable
|
|
ASCII characters and decoding such encodings back to binary data.
|
|
It provides encoding and decoding functions for the encodings specified in
|
|
:rfc:`3548`, which defines the Base16, Base32, and Base64 algorithms,
|
|
and for the de-facto standard Ascii85 and Base85 encodings.
|
|
|
|
The :rfc:`3548` encodings are suitable for encoding binary data so that it can
|
|
safely sent by email, used as parts of URLs, or included as part of an HTTP
|
|
POST request. The encoding algorithm is not the same as the
|
|
:program:`uuencode` program.
|
|
|
|
There are two interfaces provided by this module. The modern interface
|
|
supports encoding :term:`bytes-like objects <bytes-like object>` to ASCII
|
|
:class:`bytes`, and decoding :term:`bytes-like objects <bytes-like object>` or
|
|
strings containing ASCII to :class:`bytes`. Both base-64 alphabets
|
|
defined in :rfc:`3548` (normal, and URL- and filesystem-safe) are supported.
|
|
|
|
The legacy interface does not support decoding from strings, but it does
|
|
provide functions for encoding and decoding to and from :term:`file objects
|
|
<file object>`. It only supports the Base64 standard alphabet, and it adds
|
|
newlines every 76 characters as per :rfc:`2045`. Note that if you are looking
|
|
for :rfc:`2045` support you probably want to be looking at the :mod:`email`
|
|
package instead.
|
|
|
|
|
|
.. versionchanged:: 3.3
|
|
ASCII-only Unicode strings are now accepted by the decoding functions of
|
|
the modern interface.
|
|
|
|
.. versionchanged:: 3.4
|
|
Any :term:`bytes-like object`\ s are now accepted by all
|
|
encoding and decoding functions in this module. Ascii85/Base85 support added.
|
|
|
|
The modern interface provides:
|
|
|
|
.. function:: b64encode(s, altchars=None)
|
|
|
|
Encode the :term:`bytes-like object` *s* using Base64 and return the encoded
|
|
:class:`bytes`.
|
|
|
|
Optional *altchars* must be a :term:`bytes-like object` of at least
|
|
length 2 (additional characters are ignored) which specifies an alternative
|
|
alphabet for the ``+`` and ``/`` characters. This allows an application to e.g.
|
|
generate URL or filesystem safe Base64 strings. The default is ``None``, for
|
|
which the standard Base64 alphabet is used.
|
|
|
|
|
|
.. function:: b64decode(s, altchars=None, validate=False)
|
|
|
|
Decode the Base64 encoded :term:`bytes-like object` or ASCII string
|
|
*s* and return the decoded :class:`bytes`.
|
|
|
|
Optional *altchars* must be a :term:`bytes-like object` or ASCII string of
|
|
at least length 2 (additional characters are ignored) which specifies the
|
|
alternative alphabet used instead of the ``+`` and ``/`` characters.
|
|
|
|
A :exc:`binascii.Error` exception is raised
|
|
if *s* is incorrectly padded.
|
|
|
|
If *validate* is ``False`` (the default), characters that are neither
|
|
in the normal base-64 alphabet nor the alternative alphabet are
|
|
discarded prior to the padding check. If *validate* is ``True``,
|
|
these non-alphabet characters in the input result in a
|
|
:exc:`binascii.Error`.
|
|
|
|
|
|
.. function:: standard_b64encode(s)
|
|
|
|
Encode :term:`bytes-like object` *s* using the standard Base64 alphabet
|
|
and return the encoded :class:`bytes`.
|
|
|
|
|
|
.. function:: standard_b64decode(s)
|
|
|
|
Decode :term:`bytes-like object` or ASCII string *s* using the standard
|
|
Base64 alphabet and return the decoded :class:`bytes`.
|
|
|
|
|
|
.. function:: urlsafe_b64encode(s)
|
|
|
|
Encode :term:`bytes-like object` *s* using the
|
|
URL- and filesystem-safe alphabet, which
|
|
substitutes ``-`` instead of ``+`` and ``_`` instead of ``/`` in the
|
|
standard Base64 alphabet, and return the encoded :class:`bytes`. The result
|
|
can still contain ``=``.
|
|
|
|
|
|
.. function:: urlsafe_b64decode(s)
|
|
|
|
Decode :term:`bytes-like object` or ASCII string *s*
|
|
using the URL- and filesystem-safe
|
|
alphabet, which substitutes ``-`` instead of ``+`` and ``_`` instead of
|
|
``/`` in the standard Base64 alphabet, and return the decoded
|
|
:class:`bytes`.
|
|
|
|
|
|
.. function:: b32encode(s)
|
|
|
|
Encode the :term:`bytes-like object` *s* using Base32 and return the
|
|
encoded :class:`bytes`.
|
|
|
|
|
|
.. function:: b32decode(s, casefold=False, map01=None)
|
|
|
|
Decode the Base32 encoded :term:`bytes-like object` or ASCII string *s* and
|
|
return the decoded :class:`bytes`.
|
|
|
|
Optional *casefold* is a flag specifying
|
|
whether a lowercase alphabet is acceptable as input. For security purposes,
|
|
the default is ``False``.
|
|
|
|
:rfc:`3548` allows for optional mapping of the digit 0 (zero) to the letter O
|
|
(oh), and for optional mapping of the digit 1 (one) to either the letter I (eye)
|
|
or letter L (el). The optional argument *map01* when not ``None``, specifies
|
|
which letter the digit 1 should be mapped to (when *map01* is not ``None``, the
|
|
digit 0 is always mapped to the letter O). For security purposes the default is
|
|
``None``, so that 0 and 1 are not allowed in the input.
|
|
|
|
A :exc:`binascii.Error` is raised if *s* is
|
|
incorrectly padded or if there are non-alphabet characters present in the
|
|
input.
|
|
|
|
|
|
.. function:: b16encode(s)
|
|
|
|
Encode the :term:`bytes-like object` *s* using Base16 and return the
|
|
encoded :class:`bytes`.
|
|
|
|
|
|
.. function:: b16decode(s, casefold=False)
|
|
|
|
Decode the Base16 encoded :term:`bytes-like object` or ASCII string *s* and
|
|
return the decoded :class:`bytes`.
|
|
|
|
Optional *casefold* is a flag specifying whether a
|
|
lowercase alphabet is acceptable as input. For security purposes, the default
|
|
is ``False``.
|
|
|
|
A :exc:`binascii.Error` is raised if *s* is
|
|
incorrectly padded or if there are non-alphabet characters present in the
|
|
input.
|
|
|
|
|
|
.. function:: a85encode(b, *, foldspaces=False, wrapcol=0, pad=False, adobe=False)
|
|
|
|
Encode the :term:`bytes-like object` *b* using Ascii85 and return the
|
|
encoded :class:`bytes`.
|
|
|
|
*foldspaces* is an optional flag that uses the special short sequence 'y'
|
|
instead of 4 consecutive spaces (ASCII 0x20) as supported by 'btoa'. This
|
|
feature is not supported by the "standard" Ascii85 encoding.
|
|
|
|
*wrapcol* controls whether the output should have newline (``b'\n'``)
|
|
characters added to it. If this is non-zero, each output line will be
|
|
at most this many characters long.
|
|
|
|
*pad* controls whether the input is padded to a multiple of 4
|
|
before encoding. Note that the ``btoa`` implementation always pads.
|
|
|
|
*adobe* controls whether the encoded byte sequence is framed with ``<~``
|
|
and ``~>``, which is used by the Adobe implementation.
|
|
|
|
.. versionadded:: 3.4
|
|
|
|
|
|
.. function:: a85decode(b, *, foldspaces=False, adobe=False, ignorechars=b' \\t\\n\\r\\v')
|
|
|
|
Decode the Ascii85 encoded :term:`bytes-like object` or ASCII string *b* and
|
|
return the decoded :class:`bytes`.
|
|
|
|
*foldspaces* is a flag that specifies whether the 'y' short sequence
|
|
should be accepted as shorthand for 4 consecutive spaces (ASCII 0x20).
|
|
This feature is not supported by the "standard" Ascii85 encoding.
|
|
|
|
*adobe* controls whether the input sequence is in Adobe Ascii85 format
|
|
(i.e. is framed with <~ and ~>).
|
|
|
|
*ignorechars* should be a :term:`bytes-like object` or ASCII string
|
|
containing characters to ignore
|
|
from the input. This should only contain whitespace characters, and by
|
|
default contains all whitespace characters in ASCII.
|
|
|
|
.. versionadded:: 3.4
|
|
|
|
|
|
.. function:: b85encode(b, pad=False)
|
|
|
|
Encode the :term:`bytes-like object` *b* using base85 (as used in e.g.
|
|
git-style binary diffs) and return the encoded :class:`bytes`.
|
|
|
|
If *pad* is true, the input is padded with ``b'\0'`` so its length is a
|
|
multiple of 4 bytes before encoding.
|
|
|
|
.. versionadded:: 3.4
|
|
|
|
|
|
.. function:: b85decode(b)
|
|
|
|
Decode the base85-encoded :term:`bytes-like object` or ASCII string *b* and
|
|
return the decoded :class:`bytes`. Padding is implicitly removed, if
|
|
necessary.
|
|
|
|
.. versionadded:: 3.4
|
|
|
|
|
|
.. note::
|
|
Both Base85 and Ascii85 have an expansion factor of 5 to 4 (5 Base85 or
|
|
Ascii85 characters can encode 4 binary bytes), while the better-known
|
|
Base64 has an expansion factor of 6 to 4. They are therefore more
|
|
efficient when space expensive. They differ by details such as the
|
|
character map used for encoding.
|
|
|
|
|
|
The legacy interface:
|
|
|
|
.. function:: decode(input, output)
|
|
|
|
Decode the contents of the binary *input* file and write the resulting binary
|
|
data to the *output* file. *input* and *output* must be :term:`file objects
|
|
<file object>`. *input* will be read until ``input.readline()`` returns an
|
|
empty bytes object.
|
|
|
|
|
|
.. function:: decodebytes(s)
|
|
decodestring(s)
|
|
|
|
Decode the :term:`bytes-like object` *s*, which must contain one or more
|
|
lines of base64 encoded data, and return the decoded :class:`bytes`.
|
|
``decodestring`` is a deprecated alias.
|
|
|
|
.. versionadded:: 3.1
|
|
|
|
|
|
.. function:: encode(input, output)
|
|
|
|
Encode the contents of the binary *input* file and write the resulting base64
|
|
encoded data to the *output* file. *input* and *output* must be :term:`file
|
|
objects <file object>`. *input* will be read until ``input.read()`` returns
|
|
an empty bytes object. :func:`encode` inserts a newline character (``b'\n'``)
|
|
after every 76 bytes of the output, as well as ensuring that the output
|
|
always ends with a newline, as per :rfc:`2045` (MIME).
|
|
|
|
|
|
.. function:: encodebytes(s)
|
|
encodestring(s)
|
|
|
|
Encode the :term:`bytes-like object` *s*, which can contain arbitrary binary
|
|
data, and return :class:`bytes` containing the base64-encoded data, with newlines
|
|
(``b'\n'``) inserted after every 76 bytes of output, and ensuring that
|
|
there is a trailing newline, as per :rfc:`2045` (MIME).
|
|
|
|
``encodestring`` is a deprecated alias.
|
|
|
|
|
|
An example usage of the module:
|
|
|
|
>>> import base64
|
|
>>> encoded = base64.b64encode(b'data to be encoded')
|
|
>>> encoded
|
|
b'ZGF0YSB0byBiZSBlbmNvZGVk'
|
|
>>> data = base64.b64decode(encoded)
|
|
>>> data
|
|
b'data to be encoded'
|
|
|
|
|
|
.. seealso::
|
|
|
|
Module :mod:`binascii`
|
|
Support module containing ASCII-to-binary and binary-to-ASCII conversions.
|
|
|
|
:rfc:`1521` - MIME (Multipurpose Internet Mail Extensions) Part One: Mechanisms for Specifying and Describing the Format of Internet Message Bodies
|
|
Section 5.2, "Base64 Content-Transfer-Encoding," provides the definition of the
|
|
base64 encoding.
|
|
|