cpython/Doc/library/gzip.rst

124 lines
4.7 KiB
ReStructuredText
Raw Normal View History

2007-08-15 11:28:01 -03:00
:mod:`gzip` --- Support for :program:`gzip` files
=================================================
.. module:: gzip
:synopsis: Interfaces for gzip compression and decompression using file objects.
**Source code:** :source:`Lib/gzip.py`
--------------
2008-03-28 05:06:56 -03:00
This module provides a simple interface to compress and decompress files just
like the GNU programs :program:`gzip` and :program:`gunzip` would.
The data compression is provided by the :mod:`zlib` module.
2007-08-15 11:28:01 -03:00
2008-03-28 05:06:56 -03:00
The :mod:`gzip` module provides the :class:`GzipFile` class which is modeled
after Python's File Object. The :class:`GzipFile` class reads and writes
2007-08-15 11:28:01 -03:00
:program:`gzip`\ -format files, automatically compressing or decompressing the
2008-03-28 05:06:56 -03:00
data so that it looks like an ordinary file object.
Note that additional file formats which can be decompressed by the
:program:`gzip` and :program:`gunzip` programs, such as those produced by
:program:`compress` and :program:`pack`, are not supported by this module.
2007-08-15 11:28:01 -03:00
The module defines the following items:
.. class:: GzipFile([filename[, mode[, compresslevel[, fileobj[, mtime]]]]])
2007-08-15 11:28:01 -03:00
Constructor for the :class:`GzipFile` class, which simulates most of the methods
of a file object, with the exception of the :meth:`readinto` and
:meth:`truncate` methods. At least one of *fileobj* and *filename* must be
given a non-trivial value.
The new class instance is based on *fileobj*, which can be a regular file, a
:class:`~StringIO.StringIO` object, or any other object which simulates a file. It
2007-08-15 11:28:01 -03:00
defaults to ``None``, in which case *filename* is opened to provide a file
object.
When *fileobj* is not ``None``, the *filename* argument is only used to be
2013-10-06 07:33:20 -03:00
included in the :program:`gzip` file header, which may include the original
2007-08-15 11:28:01 -03:00
filename of the uncompressed file. It defaults to the filename of *fileobj*, if
discernible; otherwise, it defaults to the empty string, and in this case the
original filename is not included in the header.
The *mode* argument can be any of ``'r'``, ``'rb'``, ``'a'``, ``'ab'``, ``'w'``,
or ``'wb'``, depending on whether the file will be read or written. The default
is the mode of *fileobj* if discernible; otherwise, the default is ``'rb'``. If
not given, the 'b' flag will be added to the mode to ensure the file is opened
in binary mode for cross-platform portability.
The *compresslevel* argument is an integer from ``0`` to ``9`` controlling
the level of compression; ``1`` is fastest and produces the least
compression, and ``9`` is slowest and produces the most compression. ``0``
is no compression. The default is ``9``.
2007-08-15 11:28:01 -03:00
The *mtime* argument is an optional numeric timestamp to be written to
2009-03-09 13:35:48 -03:00
the stream when compressing. All :program:`gzip` compressed streams are
required to contain a timestamp. If omitted or ``None``, the current
time is used. This module ignores the timestamp when decompressing;
however, some programs, such as :program:`gunzip`\ , make use of it.
The format of the timestamp is the same as that of the return value of
``time.time()`` and of the ``st_mtime`` attribute of the object returned
by ``os.stat()``.
2007-08-15 11:28:01 -03:00
Calling a :class:`GzipFile` object's :meth:`close` method does not close
*fileobj*, since you might wish to append more material after the compressed
data. This also allows you to pass a :class:`~StringIO.StringIO` object opened for
2007-08-15 11:28:01 -03:00
writing as *fileobj*, and retrieve the resulting memory buffer using the
:class:`StringIO` object's :meth:`~StringIO.StringIO.getvalue` method.
2007-08-15 11:28:01 -03:00
:class:`GzipFile` supports iteration and the :keyword:`with` statement.
2009-03-09 17:38:56 -03:00
.. versionchanged:: 2.7
Support for the :keyword:`with` statement was added.
.. versionchanged:: 2.7
Support for zero-padded files was added.
2007-08-15 11:28:01 -03:00
.. function:: open(filename[, mode[, compresslevel]])
This is a shorthand for ``GzipFile(filename,`` ``mode,`` ``compresslevel)``.
The *filename* argument is required; *mode* defaults to ``'rb'`` and
*compresslevel* defaults to ``9``.
2008-03-28 05:06:56 -03:00
.. _gzip-usage-examples:
Examples of usage
-----------------
Example of how to read a compressed file::
import gzip
f = gzip.open('file.txt.gz', 'rb')
2008-03-28 05:06:56 -03:00
file_content = f.read()
f.close()
Example of how to create a compressed GZIP file::
import gzip
content = "Lots of content here"
f = gzip.open('file.txt.gz', 'wb')
2008-03-28 05:06:56 -03:00
f.write(content)
f.close()
Example of how to GZIP compress an existing file::
import gzip
f_in = open('file.txt', 'rb')
f_out = gzip.open('file.txt.gz', 'wb')
2008-03-28 05:06:56 -03:00
f_out.writelines(f_in)
f_out.close()
f_in.close()
2007-08-15 11:28:01 -03:00
.. seealso::
Module :mod:`zlib`
The basic data compression module needed to support the :program:`gzip` file
format.