2007-08-15 11:28:22 -03:00
|
|
|
:mod:`gettext` --- Multilingual internationalization services
|
|
|
|
=============================================================
|
|
|
|
|
|
|
|
.. module:: gettext
|
|
|
|
:synopsis: Multilingual internationalization services.
|
2016-06-11 16:02:54 -03:00
|
|
|
|
2013-11-12 11:02:35 -04:00
|
|
|
.. moduleauthor:: Barry A. Warsaw <barry@python.org>
|
|
|
|
.. sectionauthor:: Barry A. Warsaw <barry@python.org>
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2011-01-27 16:38:46 -04:00
|
|
|
**Source code:** :source:`Lib/gettext.py`
|
|
|
|
|
|
|
|
--------------
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
The :mod:`gettext` module provides internationalization (I18N) and localization
|
|
|
|
(L10N) services for your Python modules and applications. It supports both the
|
|
|
|
GNU ``gettext`` message catalog API and a higher level, class-based API that may
|
|
|
|
be more appropriate for Python files. The interface described below allows you
|
|
|
|
to write your module and application messages in one natural language, and
|
|
|
|
provide a catalog of translated messages for running under different natural
|
|
|
|
languages.
|
|
|
|
|
|
|
|
Some hints on localizing your Python modules and applications are also given.
|
|
|
|
|
|
|
|
|
|
|
|
GNU :program:`gettext` API
|
|
|
|
--------------------------
|
|
|
|
|
|
|
|
The :mod:`gettext` module defines the following API, which is very similar to
|
|
|
|
the GNU :program:`gettext` API. If you use this API you will affect the
|
|
|
|
translation of your entire application globally. Often this is what you want if
|
|
|
|
your application is monolingual, with the choice of language dependent on the
|
|
|
|
locale of your user. If you are localizing a Python module, or if your
|
|
|
|
application needs to switch languages on the fly, you probably want to use the
|
|
|
|
class-based API instead.
|
|
|
|
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. function:: bindtextdomain(domain, localedir=None)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
Bind the *domain* to the locale directory *localedir*. More concretely,
|
|
|
|
:mod:`gettext` will look for binary :file:`.mo` files for the given domain using
|
|
|
|
the path (on Unix): :file:`localedir/language/LC_MESSAGES/domain.mo`, where
|
|
|
|
*languages* is searched for in the environment variables :envvar:`LANGUAGE`,
|
|
|
|
:envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and :envvar:`LANG` respectively.
|
|
|
|
|
|
|
|
If *localedir* is omitted or ``None``, then the current binding for *domain* is
|
|
|
|
returned. [#]_
|
|
|
|
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. function:: bind_textdomain_codeset(domain, codeset=None)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Bind the *domain* to *codeset*, changing the encoding of byte strings
|
|
|
|
returned by the :func:`lgettext`, :func:`ldgettext`, :func:`lngettext`
|
|
|
|
and :func:`ldngettext` functions.
|
|
|
|
If *codeset* is omitted, then the current binding is returned.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. function:: textdomain(domain=None)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
Change or query the current global domain. If *domain* is ``None``, then the
|
|
|
|
current global domain is returned, otherwise the global domain is set to
|
|
|
|
*domain*, which is returned.
|
|
|
|
|
|
|
|
|
2018-10-26 03:00:49 -03:00
|
|
|
.. index:: single: _; gettext
|
2007-08-15 11:28:22 -03:00
|
|
|
.. function:: gettext(message)
|
|
|
|
|
|
|
|
Return the localized translation of *message*, based on the current global
|
|
|
|
domain, language, and locale directory. This function is usually aliased as
|
|
|
|
:func:`_` in the local namespace (see examples below).
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: dgettext(domain, message)
|
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Like :func:`.gettext`, but look the message up in the specified *domain*.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
.. function:: ngettext(singular, plural, n)
|
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Like :func:`.gettext`, but consider plural forms. If a translation is found,
|
2007-08-15 11:28:22 -03:00
|
|
|
apply the plural formula to *n*, and return the resulting message (some
|
|
|
|
languages have more than two plural forms). If no translation is found, return
|
|
|
|
*singular* if *n* is 1; return *plural* otherwise.
|
|
|
|
|
|
|
|
The Plural formula is taken from the catalog header. It is a C or Python
|
|
|
|
expression that has a free variable *n*; the expression evaluates to the index
|
2013-11-19 12:05:20 -04:00
|
|
|
of the plural in the catalog. See
|
|
|
|
`the GNU gettext documentation <https://www.gnu.org/software/gettext/manual/gettext.html>`__
|
|
|
|
for the precise syntax to be used in :file:`.po` files and the
|
|
|
|
formulas for a variety of languages.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
.. function:: dngettext(domain, singular, plural, n)
|
|
|
|
|
|
|
|
Like :func:`ngettext`, but look the message up in the specified *domain*.
|
|
|
|
|
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. function:: lgettext(message)
|
|
|
|
.. function:: ldgettext(domain, message)
|
|
|
|
.. function:: lngettext(singular, plural, n)
|
2007-08-15 11:28:22 -03:00
|
|
|
.. function:: ldngettext(domain, singular, plural, n)
|
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Equivalent to the corresponding functions without the ``l`` prefix
|
|
|
|
(:func:`.gettext`, :func:`dgettext`, :func:`ngettext` and :func:`dngettext`),
|
|
|
|
but the translation is returned as a byte string encoded in the preferred
|
|
|
|
system encoding if no other encoding was explicitly set with
|
2007-08-15 11:28:22 -03:00
|
|
|
:func:`bind_textdomain_codeset`.
|
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. warning::
|
|
|
|
|
|
|
|
These functions should be avoided in Python 3, because they return
|
|
|
|
encoded bytes. It's much better to use alternatives which return
|
|
|
|
Unicode strings instead, since most Python applications will want to
|
|
|
|
manipulate human readable text as strings instead of bytes. Further,
|
|
|
|
it's possible that you may get unexpected Unicode-related exceptions
|
|
|
|
if there are encoding problems with the translated strings. It is
|
|
|
|
possible that the ``l*()`` functions will be deprecated in future Python
|
|
|
|
versions due to their inherent problems and limitations.
|
|
|
|
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
Note that GNU :program:`gettext` also defines a :func:`dcgettext` method, but
|
|
|
|
this was deemed not useful and so it is currently unimplemented.
|
|
|
|
|
|
|
|
Here's an example of typical usage for this API::
|
|
|
|
|
|
|
|
import gettext
|
|
|
|
gettext.bindtextdomain('myapplication', '/path/to/my/language/directory')
|
|
|
|
gettext.textdomain('myapplication')
|
|
|
|
_ = gettext.gettext
|
|
|
|
# ...
|
2007-09-04 04:15:32 -03:00
|
|
|
print(_('This is a translatable string.'))
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
Class-based API
|
|
|
|
---------------
|
|
|
|
|
|
|
|
The class-based API of the :mod:`gettext` module gives you more flexibility and
|
|
|
|
greater convenience than the GNU :program:`gettext` API. It is the recommended
|
2017-10-04 14:28:20 -03:00
|
|
|
way of localizing your Python applications and modules. :mod:`!gettext` defines
|
2007-08-15 11:28:22 -03:00
|
|
|
a "translations" class which implements the parsing of GNU :file:`.mo` format
|
2008-02-01 07:56:49 -04:00
|
|
|
files, and has methods for returning strings. Instances of this "translations"
|
|
|
|
class can also install themselves in the built-in namespace as the function
|
|
|
|
:func:`_`.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. function:: find(domain, localedir=None, languages=None, all=False)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
This function implements the standard :file:`.mo` file search algorithm. It
|
|
|
|
takes a *domain*, identical to what :func:`textdomain` takes. Optional
|
|
|
|
*localedir* is as in :func:`bindtextdomain` Optional *languages* is a list of
|
|
|
|
strings, where each string is a language code.
|
|
|
|
|
|
|
|
If *localedir* is not given, then the default system locale directory is used.
|
|
|
|
[#]_ If *languages* is not given, then the following environment variables are
|
|
|
|
searched: :envvar:`LANGUAGE`, :envvar:`LC_ALL`, :envvar:`LC_MESSAGES`, and
|
|
|
|
:envvar:`LANG`. The first one returning a non-empty value is used for the
|
|
|
|
*languages* variable. The environment variables should contain a colon separated
|
|
|
|
list of languages, which will be split on the colon to produce the expected list
|
|
|
|
of language code strings.
|
|
|
|
|
|
|
|
:func:`find` then expands and normalizes the languages, and then iterates
|
|
|
|
through them, searching for an existing file built of these components:
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
:file:`{localedir}/{language}/LC_MESSAGES/{domain}.mo`
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
The first such file name that exists is returned by :func:`find`. If no such
|
|
|
|
file is found, then ``None`` is returned. If *all* is given, it returns a list
|
|
|
|
of all file names, in the order in which they appear in the languages list or
|
|
|
|
the environment variables.
|
|
|
|
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. function:: translation(domain, localedir=None, languages=None, class_=None, fallback=False, codeset=None)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
Return a :class:`Translations` instance based on the *domain*, *localedir*,
|
|
|
|
and *languages*, which are first passed to :func:`find` to get a list of the
|
2007-08-15 11:28:22 -03:00
|
|
|
associated :file:`.mo` file paths. Instances with identical :file:`.mo` file
|
2008-07-17 15:15:35 -03:00
|
|
|
names are cached. The actual class instantiated is either *class_* if
|
|
|
|
provided, otherwise :class:`GNUTranslations`. The class's constructor must
|
2010-09-15 08:11:28 -03:00
|
|
|
take a single :term:`file object` argument. If provided, *codeset* will change
|
2017-06-20 11:13:29 -03:00
|
|
|
the charset used to encode translated strings in the
|
|
|
|
:meth:`~NullTranslations.lgettext` and :meth:`~NullTranslations.lngettext`
|
|
|
|
methods.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
If multiple files are found, later files are used as fallbacks for earlier ones.
|
|
|
|
To allow setting the fallback, :func:`copy.copy` is used to clone each
|
|
|
|
translation object from the cache; the actual instance data is still shared with
|
|
|
|
the cache.
|
|
|
|
|
2011-10-12 15:10:51 -03:00
|
|
|
If no :file:`.mo` file is found, this function raises :exc:`OSError` if
|
2007-08-15 11:28:22 -03:00
|
|
|
*fallback* is false (which is the default), and returns a
|
|
|
|
:class:`NullTranslations` instance if *fallback* is true.
|
|
|
|
|
2011-10-12 15:10:51 -03:00
|
|
|
.. versionchanged:: 3.3
|
|
|
|
:exc:`IOError` used to be raised instead of :exc:`OSError`.
|
|
|
|
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. function:: install(domain, localedir=None, codeset=None, names=None)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2009-07-26 11:54:51 -03:00
|
|
|
This installs the function :func:`_` in Python's builtins namespace, based on
|
2007-08-15 11:28:22 -03:00
|
|
|
*domain*, *localedir*, and *codeset* which are passed to the function
|
2008-07-14 11:32:15 -03:00
|
|
|
:func:`translation`.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
For the *names* parameter, please see the description of the translation
|
Merged revisions 74074,74077,74111,74188,74192-74193,74200,74252-74253,74258-74261 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74074 | georg.brandl | 2009-07-18 05:03:10 -0400 (Sat, 18 Jul 2009) | 1 line
#6513: fix example code: warning categories are classes, not instances.
........
r74077 | georg.brandl | 2009-07-18 05:43:40 -0400 (Sat, 18 Jul 2009) | 1 line
#6489: fix an ambiguity in getiterator() documentation.
........
r74111 | benjamin.peterson | 2009-07-20 09:30:10 -0400 (Mon, 20 Jul 2009) | 1 line
remove docs for deprecated -p option
........
r74188 | benjamin.peterson | 2009-07-23 10:25:31 -0400 (Thu, 23 Jul 2009) | 1 line
use bools
........
r74192 | georg.brandl | 2009-07-24 12:28:38 -0400 (Fri, 24 Jul 2009) | 1 line
Fix arg types of et#.
........
r74193 | georg.brandl | 2009-07-24 12:46:38 -0400 (Fri, 24 Jul 2009) | 1 line
Dont put "void" in signature for nullary functions.
........
r74200 | georg.brandl | 2009-07-25 09:02:15 -0400 (Sat, 25 Jul 2009) | 1 line
#6571: add index entries for more operators.
........
r74252 | georg.brandl | 2009-07-29 12:06:31 -0400 (Wed, 29 Jul 2009) | 1 line
#6593: fix link targets.
........
r74253 | georg.brandl | 2009-07-29 12:09:17 -0400 (Wed, 29 Jul 2009) | 1 line
#6591: add reference to ioctl in fcntl module for platforms other than Windows.
........
r74258 | georg.brandl | 2009-07-29 12:57:05 -0400 (Wed, 29 Jul 2009) | 1 line
Add a link to readline, and mention IPython and bpython.
........
r74259 | georg.brandl | 2009-07-29 13:07:21 -0400 (Wed, 29 Jul 2009) | 1 line
Fix some markup and small factual glitches found by M. Markert.
........
r74260 | georg.brandl | 2009-07-29 13:15:20 -0400 (Wed, 29 Jul 2009) | 1 line
Fix a few markup glitches.
........
r74261 | georg.brandl | 2009-07-29 13:50:25 -0400 (Wed, 29 Jul 2009) | 1 line
Rewrite the section about classes a bit; mostly tidbits, and a larger update to the section about "private" variables to reflect the Pythonic consensus better.
........
2009-07-29 16:54:39 -03:00
|
|
|
object's :meth:`~NullTranslations.install` method.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
As seen below, you usually mark the strings in your application that are
|
|
|
|
candidates for translation, by wrapping them in a call to the :func:`_`
|
|
|
|
function, like this::
|
|
|
|
|
2007-09-04 04:15:32 -03:00
|
|
|
print(_('This string will be translated.'))
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
For convenience, you want the :func:`_` function to be installed in Python's
|
2009-07-26 11:54:51 -03:00
|
|
|
builtins namespace, so it is easily accessible in all modules of your
|
2007-08-15 11:28:22 -03:00
|
|
|
application.
|
|
|
|
|
|
|
|
|
|
|
|
The :class:`NullTranslations` class
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
Translation classes are what actually implement the translation of original
|
|
|
|
source file message strings to translated message strings. The base class used
|
|
|
|
by all translation classes is :class:`NullTranslations`; this provides the basic
|
|
|
|
interface you can use to write your own specialized translation classes. Here
|
2017-10-04 14:28:20 -03:00
|
|
|
are the methods of :class:`!NullTranslations`:
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. class:: NullTranslations(fp=None)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2010-09-15 08:11:28 -03:00
|
|
|
Takes an optional :term:`file object` *fp*, which is ignored by the base class.
|
2007-08-15 11:28:22 -03:00
|
|
|
Initializes "protected" instance variables *_info* and *_charset* which are set
|
|
|
|
by derived classes, as well as *_fallback*, which is set through
|
|
|
|
:meth:`add_fallback`. It then calls ``self._parse(fp)`` if *fp* is not
|
|
|
|
``None``.
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: _parse(fp)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
No-op'd in the base class, this method takes file object *fp*, and reads
|
|
|
|
the data from the file, initializing its message catalog. If you have an
|
|
|
|
unsupported message catalog file format, you should override this method
|
|
|
|
to parse your format.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: add_fallback(fallback)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
Add *fallback* as the fallback object for the current translation object.
|
|
|
|
A translation object should consult the fallback if it cannot provide a
|
|
|
|
translation for a given message.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: gettext(message)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-10-04 14:28:20 -03:00
|
|
|
If a fallback has been set, forward :meth:`!gettext` to the fallback.
|
2017-06-20 11:13:29 -03:00
|
|
|
Otherwise, return *message*. Overridden in derived classes.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: ngettext(singular, plural, n)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-10-04 14:28:20 -03:00
|
|
|
If a fallback has been set, forward :meth:`!ngettext` to the fallback.
|
2017-06-20 11:13:29 -03:00
|
|
|
Otherwise, return *singular* if *n* is 1; return *plural* otherwise.
|
|
|
|
Overridden in derived classes.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. method:: lgettext(message)
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: lngettext(singular, plural, n)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-10-04 14:28:20 -03:00
|
|
|
Equivalent to :meth:`.gettext` and :meth:`.ngettext`, but the translation
|
2017-06-20 11:13:29 -03:00
|
|
|
is returned as a byte string encoded in the preferred system encoding
|
|
|
|
if no encoding was explicitly set with :meth:`set_output_charset`.
|
|
|
|
Overridden in derived classes.
|
|
|
|
|
|
|
|
.. warning::
|
|
|
|
|
|
|
|
These methods should be avoided in Python 3. See the warning for the
|
|
|
|
:func:`lgettext` function.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: info()
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
Return the "protected" :attr:`_info` variable.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: charset()
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Return the encoding of the message catalog file.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: output_charset()
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Return the encoding used to return translated messages in :meth:`.lgettext`
|
|
|
|
and :meth:`.lngettext`.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
.. method:: set_output_charset(charset)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Change the encoding used to return translated messages.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2009-05-17 10:00:36 -03:00
|
|
|
.. method:: install(names=None)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
This method installs :meth:`.gettext` into the built-in namespace,
|
2008-07-17 15:15:35 -03:00
|
|
|
binding it to ``_``.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
If the *names* parameter is given, it must be a sequence containing the
|
2009-07-26 11:54:51 -03:00
|
|
|
names of functions you want to install in the builtins namespace in
|
2017-06-20 11:13:29 -03:00
|
|
|
addition to :func:`_`. Supported names are ``'gettext'``, ``'ngettext'``,
|
2008-07-17 15:15:35 -03:00
|
|
|
``'lgettext'`` and ``'lngettext'``.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
Note that this is only one way, albeit the most convenient way, to make
|
|
|
|
the :func:`_` function available to your application. Because it affects
|
|
|
|
the entire application globally, and specifically the built-in namespace,
|
|
|
|
localized modules should never install :func:`_`. Instead, they should use
|
|
|
|
this code to make :func:`_` available to their module::
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
import gettext
|
|
|
|
t = gettext.translation('mymodule', ...)
|
|
|
|
_ = t.gettext
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2008-07-17 15:15:35 -03:00
|
|
|
This puts :func:`_` only in the module's global namespace and so only
|
|
|
|
affects calls within this module.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
The :class:`GNUTranslations` class
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
The :mod:`gettext` module provides one additional class derived from
|
|
|
|
:class:`NullTranslations`: :class:`GNUTranslations`. This class overrides
|
|
|
|
:meth:`_parse` to enable reading GNU :program:`gettext` format :file:`.mo` files
|
2008-07-14 11:32:15 -03:00
|
|
|
in both big-endian and little-endian format.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
:class:`GNUTranslations` parses optional meta-data out of the translation
|
|
|
|
catalog. It is convention with GNU :program:`gettext` to include meta-data as
|
|
|
|
the translation for the empty string. This meta-data is in :rfc:`822`\ -style
|
|
|
|
``key: value`` pairs, and should contain the ``Project-Id-Version`` key. If the
|
|
|
|
key ``Content-Type`` is found, then the ``charset`` property is used to
|
|
|
|
initialize the "protected" :attr:`_charset` instance variable, defaulting to
|
|
|
|
``None`` if not found. If the charset encoding is specified, then all message
|
|
|
|
ids and message strings read from the catalog are converted to Unicode using
|
2008-07-17 15:15:35 -03:00
|
|
|
this encoding, else ASCII encoding is assumed.
|
|
|
|
|
|
|
|
Since message ids are read as Unicode strings too, all :meth:`*gettext` methods
|
|
|
|
will assume message ids as Unicode strings, not byte strings.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
The entire set of key/value pairs are placed into a dictionary and set as the
|
|
|
|
"protected" :attr:`_info` instance variable.
|
|
|
|
|
2014-10-28 16:17:51 -03:00
|
|
|
If the :file:`.mo` file's magic number is invalid, the major version number is
|
|
|
|
unexpected, or if other problems occur while reading the file, instantiating a
|
|
|
|
:class:`GNUTranslations` class can raise :exc:`OSError`.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. class:: GNUTranslations
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
The following methods are overridden from the base class implementation:
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. method:: gettext(message)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Look up the *message* id in the catalog and return the corresponding message
|
|
|
|
string, as a Unicode string. If there is no entry in the catalog for the
|
|
|
|
*message* id, and a fallback has been set, the look up is forwarded to the
|
|
|
|
fallback's :meth:`~NullTranslations.gettext` method. Otherwise, the
|
|
|
|
*message* id is returned.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. method:: ngettext(singular, plural, n)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Do a plural-forms lookup of a message id. *singular* is used as the message id
|
|
|
|
for purposes of lookup in the catalog, while *n* is used to determine which
|
|
|
|
plural form to use. The returned message string is a Unicode string.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
If the message id is not found in the catalog, and a fallback is specified,
|
|
|
|
the request is forwarded to the fallback's :meth:`~NullTranslations.ngettext`
|
|
|
|
method. Otherwise, when *n* is 1 *singular* is returned, and *plural* is
|
|
|
|
returned in all other cases.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Here is an example::
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
n = len(os.listdir('.'))
|
|
|
|
cat = GNUTranslations(somefile)
|
|
|
|
message = cat.ngettext(
|
|
|
|
'There is %(num)d file in this directory',
|
|
|
|
'There are %(num)d files in this directory',
|
|
|
|
n) % {'num': n}
|
2009-01-03 17:18:54 -04:00
|
|
|
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. method:: lgettext(message)
|
|
|
|
.. method:: lngettext(singular, plural, n)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
Equivalent to :meth:`.gettext` and :meth:`.ngettext`, but the translation
|
|
|
|
is returned as a byte string encoded in the preferred system encoding
|
|
|
|
if no encoding was explicitly set with
|
|
|
|
:meth:`~NullTranslations.set_output_charset`.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
.. warning::
|
2008-07-14 11:32:15 -03:00
|
|
|
|
2017-06-20 11:13:29 -03:00
|
|
|
These methods should be avoided in Python 3. See the warning for the
|
|
|
|
:func:`lgettext` function.
|
2008-07-14 11:32:15 -03:00
|
|
|
|
|
|
|
|
2007-08-15 11:28:22 -03:00
|
|
|
Solaris message catalog support
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
The Solaris operating system defines its own binary :file:`.mo` file format, but
|
|
|
|
since no documentation can be found on this format, it is not supported at this
|
|
|
|
time.
|
|
|
|
|
|
|
|
|
|
|
|
The Catalog constructor
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
.. index:: single: GNOME
|
|
|
|
|
|
|
|
GNOME uses a version of the :mod:`gettext` module by James Henstridge, but this
|
|
|
|
version has a slightly different API. Its documented usage was::
|
|
|
|
|
|
|
|
import gettext
|
|
|
|
cat = gettext.Catalog(domain, localedir)
|
|
|
|
_ = cat.gettext
|
2007-09-04 04:15:32 -03:00
|
|
|
print(_('hello world'))
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
For compatibility with this older module, the function :func:`Catalog` is an
|
|
|
|
alias for the :func:`translation` function described above.
|
|
|
|
|
|
|
|
One difference between this module and Henstridge's: his catalog objects
|
|
|
|
supported access through a mapping API, but this appears to be unused and so is
|
|
|
|
not currently supported.
|
|
|
|
|
|
|
|
|
|
|
|
Internationalizing your programs and modules
|
|
|
|
--------------------------------------------
|
|
|
|
|
|
|
|
Internationalization (I18N) refers to the operation by which a program is made
|
|
|
|
aware of multiple languages. Localization (L10N) refers to the adaptation of
|
|
|
|
your program, once internationalized, to the local language and cultural habits.
|
|
|
|
In order to provide multilingual messages for your Python programs, you need to
|
|
|
|
take the following steps:
|
|
|
|
|
|
|
|
#. prepare your program or module by specially marking translatable strings
|
|
|
|
|
|
|
|
#. run a suite of tools over your marked files to generate raw messages catalogs
|
|
|
|
|
|
|
|
#. create language specific translations of the message catalogs
|
|
|
|
|
|
|
|
#. use the :mod:`gettext` module so that message strings are properly translated
|
|
|
|
|
|
|
|
In order to prepare your code for I18N, you need to look at all the strings in
|
|
|
|
your files. Any string that needs to be translated should be marked by wrapping
|
|
|
|
it in ``_('...')`` --- that is, a call to the function :func:`_`. For example::
|
|
|
|
|
|
|
|
filename = 'mylog.txt'
|
|
|
|
message = _('writing a log message')
|
|
|
|
fp = open(filename, 'w')
|
|
|
|
fp.write(message)
|
|
|
|
fp.close()
|
|
|
|
|
|
|
|
In this example, the string ``'writing a log message'`` is marked as a candidate
|
|
|
|
for translation, while the strings ``'mylog.txt'`` and ``'w'`` are not.
|
|
|
|
|
2013-11-19 12:05:20 -04:00
|
|
|
There are a few tools to extract the strings meant for translation.
|
|
|
|
The original GNU :program:`gettext` only supported C or C++ source
|
|
|
|
code but its extended version :program:`xgettext` scans code written
|
|
|
|
in a number of languages, including Python, to find strings marked as
|
|
|
|
translatable. `Babel <http://babel.pocoo.org/>`__ is a Python
|
|
|
|
internationalization library that includes a :file:`pybabel` script to
|
|
|
|
extract and compile message catalogs. François Pinard's program
|
|
|
|
called :program:`xpot` does a similar job and is available as part of
|
2014-10-29 06:26:56 -03:00
|
|
|
his `po-utils package <https://github.com/pinard/po-utils>`__.
|
2013-11-19 12:05:20 -04:00
|
|
|
|
|
|
|
(Python also includes pure-Python versions of these programs, called
|
|
|
|
:program:`pygettext.py` and :program:`msgfmt.py`; some Python distributions
|
|
|
|
will install them for you. :program:`pygettext.py` is similar to
|
|
|
|
:program:`xgettext`, but only understands Python source code and
|
|
|
|
cannot handle other programming languages such as C or C++.
|
|
|
|
:program:`pygettext.py` supports a command-line interface similar to
|
|
|
|
:program:`xgettext`; for details on its use, run ``pygettext.py
|
|
|
|
--help``. :program:`msgfmt.py` is binary compatible with GNU
|
|
|
|
:program:`msgfmt`. With these two programs, you may not need the GNU
|
|
|
|
:program:`gettext` package to internationalize your Python
|
|
|
|
applications.)
|
|
|
|
|
|
|
|
:program:`xgettext`, :program:`pygettext`, and similar tools generate
|
|
|
|
:file:`.po` files that are message catalogs. They are structured
|
2013-11-24 11:09:26 -04:00
|
|
|
human-readable files that contain every marked string in the source
|
|
|
|
code, along with a placeholder for the translated versions of these
|
|
|
|
strings.
|
2013-11-19 12:05:20 -04:00
|
|
|
|
|
|
|
Copies of these :file:`.po` files are then handed over to the
|
|
|
|
individual human translators who write translations for every
|
|
|
|
supported natural language. They send back the completed
|
|
|
|
language-specific versions as a :file:`<language-name>.po` file that's
|
|
|
|
compiled into a machine-readable :file:`.mo` binary catalog file using
|
|
|
|
the :program:`msgfmt` program. The :file:`.mo` files are used by the
|
|
|
|
:mod:`gettext` module for the actual translation processing at
|
|
|
|
run-time.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
How you use the :mod:`gettext` module in your code depends on whether you are
|
|
|
|
internationalizing a single module or your entire application. The next two
|
|
|
|
sections will discuss each case.
|
|
|
|
|
|
|
|
|
|
|
|
Localizing your module
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
If you are localizing your module, you must take care not to make global
|
|
|
|
changes, e.g. to the built-in namespace. You should not use the GNU ``gettext``
|
|
|
|
API but instead the class-based API.
|
|
|
|
|
|
|
|
Let's say your module is called "spam" and the module's various natural language
|
|
|
|
translation :file:`.mo` files reside in :file:`/usr/share/locale` in GNU
|
|
|
|
:program:`gettext` format. Here's what you would put at the top of your
|
|
|
|
module::
|
|
|
|
|
|
|
|
import gettext
|
|
|
|
t = gettext.translation('spam', '/usr/share/locale')
|
2017-06-20 11:13:29 -03:00
|
|
|
_ = t.gettext
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
Localizing your application
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
If you are localizing your application, you can install the :func:`_` function
|
|
|
|
globally into the built-in namespace, usually in the main driver file of your
|
|
|
|
application. This will let all your application-specific files just use
|
|
|
|
``_('...')`` without having to explicitly install it in each file.
|
|
|
|
|
|
|
|
In the simple case then, you need only add the following bit of code to the main
|
|
|
|
driver file of your application::
|
|
|
|
|
|
|
|
import gettext
|
|
|
|
gettext.install('myapplication')
|
|
|
|
|
2013-11-19 12:05:20 -04:00
|
|
|
If you need to set the locale directory, you can pass it into the
|
2008-07-14 11:32:15 -03:00
|
|
|
:func:`install` function::
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
import gettext
|
2008-07-14 11:32:15 -03:00
|
|
|
gettext.install('myapplication', '/usr/share/locale')
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
Changing languages on the fly
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
If your program needs to support many languages at the same time, you may want
|
|
|
|
to create multiple translation instances and then switch between them
|
|
|
|
explicitly, like so::
|
|
|
|
|
|
|
|
import gettext
|
|
|
|
|
|
|
|
lang1 = gettext.translation('myapplication', languages=['en'])
|
|
|
|
lang2 = gettext.translation('myapplication', languages=['fr'])
|
|
|
|
lang3 = gettext.translation('myapplication', languages=['de'])
|
|
|
|
|
|
|
|
# start by using language1
|
|
|
|
lang1.install()
|
|
|
|
|
|
|
|
# ... time goes by, user selects language 2
|
|
|
|
lang2.install()
|
|
|
|
|
|
|
|
# ... more time goes by, user selects language 3
|
|
|
|
lang3.install()
|
|
|
|
|
|
|
|
|
|
|
|
Deferred translations
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
In most coding situations, strings are translated where they are coded.
|
|
|
|
Occasionally however, you need to mark strings for translation, but defer actual
|
|
|
|
translation until later. A classic example is::
|
|
|
|
|
|
|
|
animals = ['mollusk',
|
|
|
|
'albatross',
|
2009-01-03 17:26:05 -04:00
|
|
|
'rat',
|
|
|
|
'penguin',
|
|
|
|
'python', ]
|
2007-08-15 11:28:22 -03:00
|
|
|
# ...
|
|
|
|
for a in animals:
|
2007-09-04 04:15:32 -03:00
|
|
|
print(a)
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
Here, you want to mark the strings in the ``animals`` list as being
|
|
|
|
translatable, but you don't actually want to translate them until they are
|
|
|
|
printed.
|
|
|
|
|
|
|
|
Here is one way you can handle this situation::
|
|
|
|
|
|
|
|
def _(message): return message
|
|
|
|
|
|
|
|
animals = [_('mollusk'),
|
|
|
|
_('albatross'),
|
2009-01-03 17:26:05 -04:00
|
|
|
_('rat'),
|
|
|
|
_('penguin'),
|
|
|
|
_('python'), ]
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
del _
|
|
|
|
|
|
|
|
# ...
|
|
|
|
for a in animals:
|
2007-09-04 04:15:32 -03:00
|
|
|
print(_(a))
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
This works because the dummy definition of :func:`_` simply returns the string
|
|
|
|
unchanged. And this dummy definition will temporarily override any definition
|
|
|
|
of :func:`_` in the built-in namespace (until the :keyword:`del` command). Take
|
|
|
|
care, though if you have a previous definition of :func:`_` in the local
|
|
|
|
namespace.
|
|
|
|
|
|
|
|
Note that the second use of :func:`_` will not identify "a" as being
|
2013-11-19 12:05:20 -04:00
|
|
|
translatable to the :program:`gettext` program, because the parameter
|
|
|
|
is not a string literal.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
Another way to handle this is with the following example::
|
|
|
|
|
|
|
|
def N_(message): return message
|
|
|
|
|
|
|
|
animals = [N_('mollusk'),
|
|
|
|
N_('albatross'),
|
2009-01-03 17:26:05 -04:00
|
|
|
N_('rat'),
|
|
|
|
N_('penguin'),
|
|
|
|
N_('python'), ]
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
# ...
|
|
|
|
for a in animals:
|
2007-09-04 04:15:32 -03:00
|
|
|
print(_(a))
|
2007-08-15 11:28:22 -03:00
|
|
|
|
2013-11-19 12:05:20 -04:00
|
|
|
In this case, you are marking translatable strings with the function
|
|
|
|
:func:`N_`, which won't conflict with any definition of :func:`_`.
|
|
|
|
However, you will need to teach your message extraction program to
|
|
|
|
look for translatable strings marked with :func:`N_`. :program:`xgettext`,
|
|
|
|
:program:`pygettext`, ``pybabel extract``, and :program:`xpot` all
|
2016-10-30 01:20:17 -03:00
|
|
|
support this through the use of the :option:`!-k` command-line switch.
|
2013-11-19 12:05:20 -04:00
|
|
|
The choice of :func:`N_` here is totally arbitrary; it could have just
|
|
|
|
as easily been :func:`MarkThisStringForTranslation`.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
Acknowledgements
|
|
|
|
----------------
|
|
|
|
|
|
|
|
The following people contributed code, feedback, design suggestions, previous
|
|
|
|
implementations, and valuable experience to the creation of this module:
|
|
|
|
|
|
|
|
* Peter Funk
|
|
|
|
|
|
|
|
* James Henstridge
|
|
|
|
|
|
|
|
* Juan David Ibáñez Palomar
|
|
|
|
|
|
|
|
* Marc-André Lemburg
|
|
|
|
|
|
|
|
* Martin von Löwis
|
|
|
|
|
|
|
|
* François Pinard
|
|
|
|
|
|
|
|
* Barry Warsaw
|
|
|
|
|
|
|
|
* Gustavo Niemeyer
|
|
|
|
|
|
|
|
.. rubric:: Footnotes
|
|
|
|
|
|
|
|
.. [#] The default locale directory is system dependent; for example, on RedHat Linux
|
|
|
|
it is :file:`/usr/share/locale`, but on Solaris it is :file:`/usr/lib/locale`.
|
|
|
|
The :mod:`gettext` module does not try to support these system dependent
|
|
|
|
defaults; instead its default is :file:`sys.prefix/share/locale`. For this
|
|
|
|
reason, it is always best to call :func:`bindtextdomain` with an explicit
|
|
|
|
absolute path at the start of your application.
|
|
|
|
|
|
|
|
.. [#] See the footnote for :func:`bindtextdomain` above.
|