Martin v. Löwis
c350912990
Adjust CJK Ideograph range to Unicode 4.1.
2006-03-11 12:16:23 +00:00
Martin v. Löwis
0e2f9b2dfb
Fix refcounting bug.
2006-03-10 11:29:32 +00:00
Martin v. Löwis
5bd7c02298
Avoid forward-declaring the methods array.
...
Rename unicodedata.db* to unicodedata.ucd*
2006-03-10 11:20:04 +00:00
Martin v. Löwis
480f1bb67b
Update Unicode database to Unicode 4.1.
2006-03-09 23:38:20 +00:00
Thomas Wouters
1e365b265a
Remove gcc (4.0.x) warning about uninitialized value by explicitly setting
...
the sentinel value in the main function, rather than the helper. This
function could possibly do with an early-out if any of the helper calls ends
up with a len of 0, but I doubt it really matters (how common are malformed
hangul syllables, really?)
2006-03-01 21:58:30 +00:00
Martin v. Löwis
8b291e2d66
Patch #1213831 : Fix typo in unicodedata._getcode.
...
Will backport to Python 2.4.
2005-09-18 08:17:56 +00:00
Hye-Shik Chang
4c560ea05b
Correct URL to the official UnicodeData 3.2.0 resource. (Reported
...
by Darek Suchojad)
2005-06-04 07:31:48 +00:00
Hye-Shik Chang
cf18a5d67b
Fill docstrings for module and functions, extracted from the tex
...
documentation. (Patch #1173245 , Contributed by Jeremy Yallop)
2005-04-04 16:32:07 +00:00
Hye-Shik Chang
e9ddfbb412
SF #989185 : Drop unicode.iswide() and unicode.width() and add
...
unicodedata.east_asian_width(). You can still implement your own
simple width() function using it like this:
def width(u):
w = 0
for c in unicodedata.normalize('NFC', u):
cwidth = unicodedata.east_asian_width(c)
if cwidth in ('W', 'F'): w += 2
else: w += 1
return w
2004-08-04 07:38:35 +00:00
Hye-Shik Chang
69dc1c8f6a
Fix typo.
2004-07-15 04:30:25 +00:00
Martin v. Löwis
61e40bd897
Special case normalization of empty strings. Fixes #924361 .
...
Backported to 2.3.
2004-04-17 19:36:48 +00:00
Martin v. Löwis
d2171d2ba4
Overallocate target buffer for normalization more early. Fixes #834676 .
...
Backported to 2.3.
2003-11-06 20:47:57 +00:00
Neal Norwitz
e9c571f968
Fix SF bug #694816 , remove comparison of unsigned value < 0
2003-02-28 03:14:37 +00:00
Martin v. Löwis
2fb661fb80
Remove C++ comment.
2002-12-07 14:56:36 +00:00
Martin v. Löwis
b5c980b802
Add unidata_version. Bump generator version number.
2002-11-25 09:13:37 +00:00
Martin v. Löwis
8d93ca1383
Verify that the code in CJK UNIFIED IDEOGRAPH- actually denotes an ideograph.
2002-11-23 22:10:29 +00:00
Martin v. Löwis
677bde2dd1
Patch #626485 : Support Unicode normalization.
2002-11-23 22:08:15 +00:00
Martin v. Löwis
ef7fe2e813
Implement names for CJK unified ideographs. Add name to KeyError output.
...
Verify that the lookup for an existing name succeeds.
2002-11-23 18:01:32 +00:00
Martin v. Löwis
2f4be4e38a
Fix off-by-one error.
2002-11-23 17:11:06 +00:00
Martin v. Löwis
7d41e29c58
Patch #626548 : Support Hangul syllable names.
2002-11-23 12:22:32 +00:00
Martin v. Löwis
9def6a3a77
Update to Unicode 3.2 database.
2002-10-18 16:11:54 +00:00
Mark Hammond
62b1ab1b31
Replace DL_IMPORT with PyMODINIT_FUNC and remove "/export:init..." link
...
command line for Windows builds. This should allow MSVC to import and
build the Python MSVC6 project files without error.
2002-07-23 06:31:15 +00:00
Martin v. Löwis
14f8b4cfcb
Patch #568124 : Add doc string macros.
2002-06-13 20:33:02 +00:00
Andrew MacIntyre
74a3bec592
_Py prefix is verboten for static entry points
2002-06-13 11:55:14 +00:00
Fred Drake
a2bd8d3816
Remove direct manipulation of the module dict.
2002-04-03 21:39:26 +00:00
Andrew MacIntyre
7bf6833e17
OS/2 EMX port changes (Modules part of patch #450267 ):
...
Modules/
_hotshot.c
dbmmodule.c
fcntlmodule.c
main.c
pwdmodule.c
readline.c
selectmodule.c
signalmodule.c
termios.c
timemodule.c
unicodedata.c
2002-03-03 02:59:16 +00:00
Tim Peters
69b83b113f
unicodedata_decomposition(): sprintf -> PyOS_snprintf.
2001-11-30 07:23:05 +00:00
Fred Drake
6a16ea07b8
Kill a warning on the SGI compiler.
...
This is part of SF patch #434992 .
2001-07-19 21:11:13 +00:00
Fred Drake
f585bef504
Be a bit more strict in setting up the export of the C API for this
...
module; do not attempt to insert the API object into the module dict
if there was an error creating it.
2001-03-03 19:41:55 +00:00
Fredrik Lundh
b95896b2d2
renamed internal functions to avoid name clashes under OpenVMS
...
(fixes bug #132815 )
2001-02-18 22:06:17 +00:00
Fredrik Lundh
ae7636753e
stupid typo (for some reason, this only caused problems on OpenVMS).
2001-02-18 11:41:49 +00:00
Fredrik Lundh
06d126803c
Move uchhash functionality into unicodedata (after the recent
...
crop of changes, the files are small enough to do this). Also
adds "name" and "lookup" functions to unicodedata.
2001-01-24 07:59:11 +00:00
Fredrik Lundh
b2dfd73bdc
Unicode nits: Don't include unicodedatabase.h no more. And make sure
...
to build *all* tables in makeunicodedata.py.
2001-01-21 23:31:52 +00:00
Fredrik Lundh
7b7dd107b3
compress unicode decomposition tables (this saves another 55k)
2001-01-21 22:41:08 +00:00
Fredrik Lundh
cfcea49218
unicode database compression, step 2:
...
- fixed attributions
- moved decomposition data to a separate table, in preparation
for step 3 (which won't happen before 2.0 final, promise!)
- use relative paths in the generator script
I have a lot more stuff in the works for 2.1, but let's leave
that for another day...
2000-09-25 08:07:06 +00:00
Fredrik Lundh
a4287c29b3
unicode database compression, step 1:
...
- use unidb compression for the unicodedata module. on Windows,
the new unidatabase module is 120k, down from nearly 600k.
2000-09-24 21:45:34 +00:00
Guido van Rossum
16b1ad9c7d
Changing the CNRI copyright notice according to CNRI's instructions.
...
This is a notice without a date, which apparently is not a claim to
copyright but only advice to the reader. IANAL. :-)
2000-08-03 16:24:25 +00:00
Thomas Wouters
f3f33dcf03
Bunch of minor ANSIfications: 'void initfunc()' -> 'void initfunc(void)',
...
and a couple of functions that were missed in the previous batches. Not
terribly tested, but very carefully scrutinized, three times.
All these were found by the little findkrc.py that I posted to python-dev,
which means there might be more lurking. Cases such as this:
long
func(a, b)
long a;
long b; /* flagword */
{
and other cases where the last ; in the argument list isn't followed by a
newline and an opening curly bracket. Regexps to catch all are welcome, of
course ;)
2000-07-21 06:00:07 +00:00
Guido van Rossum
8a16054240
Marc-Andre Lemburg: The large unicode database table is broken in
...
pages of 4k entries each. This should fix compiler problems on some
platforms.
2000-03-31 17:26:12 +00:00
Guido van Rossum
2a70a3a8fc
Module unicodedata -- Provides access to the Unicode 3.0 data base.
...
Written by Marc-Andre Lemburg.
2000-03-10 23:10:21 +00:00