Merged revisions 79314 via svnmerge from

svn+ssh://pythondev@svn.python.org/python/trunk

........
  r79314 | ezio.melotti | 2010-03-23 01:07:32 +0200 (Tue, 23 Mar 2010) | 1 line

  Update the version number of the Unicode Database in a few more places.
........
Ezio Melotti 2010-03-22 23:16:42 +00:00
parent 910bd51ea1
commit 4c5475d196
4 changed files with 15 additions and 14 deletions

@@ -403,7 +403,7 @@ These are grouped into categories such as "Letter", "Number", "Punctuation", or
from the above output, ``'Ll'`` means 'Letter, lowercase', ``'No'`` means
"Number, other", ``'Mn'`` is "Mark, nonspacing", and ``'So'`` is "Symbol,
other". See
-<http://unicode.org/Public/5.1.0/ucd/UCD.html#General_Category_Values> for a
+<http://www.unicode.org/reports/tr44/#General_Category_Values> for a
list of category codes.
References

@@ -15,12 +15,12 @@
This module provides access to the Unicode Character Database which defines
character properties for all Unicode characters. The data in this database is
-based on the :file:`UnicodeData.txt` file version 5.1.0 which is publicly
+based on the :file:`UnicodeData.txt` file version 5.2.0 which is publicly
available from ftp://ftp.unicode.org/.
The module uses the same names and symbols as defined by the UnicodeData File
-Format 5.1.0 (see http://www.unicode.org/Public/5.1.0/ucd/UCD.html). It defines
-the following functions:
+Format 5.2.0 (see http://www.unicode.org/reports/tr44/). It defines the
+following functions:
.. function:: lookup(name)

@@ -933,11 +933,13 @@ changes, or look through the Subversion logs for all the details.
a timeout was provided and the operation timed out.
(Contributed by Tim Lesher; :issue:`1674032`.)
-* The Unicode database provided by the :mod:`unicodedata` module
-remains at version 5.1.0, but Python now uses it internally to
-determine which characters are numeric, whitespace, or represent
-line breaks. The database also now includes information from the
-:file:`Unihan.txt` data file. (Patch by Anders Chrigström
+* The Unicode database has been updated to the version 5.2.0.
+(Updated by Florent Xicluna; :issue:`8024`.)
+* The Unicode database provided by the :mod:`unicodedata` is used
+internally to determine which characters are numeric, whitespace,
+or represent line breaks. The database also now includes information
+from the :file:`Unihan.txt` data file. (Patch by Anders Chrigström
and Amaury Forgeot d'Arc; :issue:`1571184`.)
* The :class:`UserDict` class is now a new-style class. (Changed by

@@ -1,8 +1,8 @@
/* ------------------------------------------------------------------------
-unicodedata -- Provides access to the Unicode 5.1 data base.
+unicodedata -- Provides access to the Unicode 5.2 data base.
-Data was extracted from the Unicode 5.1 UnicodeData.txt file.
+Data was extracted from the Unicode 5.2 UnicodeData.txt file.
Written by Marc-Andre Lemburg (mal@lemburg.com).
Modified for Python 2.0 by Fredrik Lundh (fredrik@pythonware.com)
@@ -1235,11 +1235,10 @@ PyDoc_STRVAR(unicodedata_docstring,
"This module provides access to the Unicode Character Database which\n\
defines character properties for all Unicode characters. The data in\n\
this database is based on the UnicodeData.txt file version\n\
-5.1.0 which is publically available from ftp://ftp.unicode.org/.\n\
+5.2.0 which is publically available from ftp://ftp.unicode.org/.\n\
\n\
The module uses the same names and symbols as defined by the\n\
-UnicodeData File Format 5.1.0 (see\n\
-http://www.unicode.org/Public/5.1.0/ucd/UCD.html).");
+UnicodeData File Format 5.2.0 (see http://www.unicode.org/reports/tr44/).");
static struct PyModuleDef unicodedatamodule = {
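
Reviewer note: the database version these hunks document can be checked from Python itself. The following is a minimal sketch, not part of the commit. It reads `unicodedata.unidata_version` and looks up the general category codes named in the patched HOWTO text. The helper name `categories` and the sample characters are assumptions chosen for illustration, and the version string printed depends on the interpreter being run.

```python
import unicodedata


def categories(text):
    """Return (character, general category code) pairs for each character.

    Hypothetical helper for illustration; it only wraps unicodedata.category().
    """
    return [(ch, unicodedata.category(ch)) for ch in text]


# Version of the Unicode Character Database compiled into this interpreter.
# The value varies by Python release, so no expected output is given here.
print("UCD version:", unicodedata.unidata_version)

# Sample characters (assumed for this sketch), one per category code
# mentioned in the HOWTO text: Ll, No, Mn, So.
samples = "a\u00bc\u0300\u00a9"
for ch, code in categories(samples):
    print("U+%04X" % ord(ch), code)
```

On any interpreter the four sample characters should report `Ll`, `No`, `Mn`, and `So` in that order, since those assignments have been stable across database versions.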