cpython

Commit Graph

Author	SHA1	Message	Date
Fredrik Lundh	7b7dd107b3	compress unicode decomposition tables (this saves another 55k)	2001-01-21 22:41:08 +00:00
Fredrik Lundh	9e9bcda547	forgot to check in the new makeunicodedata.py script	2001-01-21 17:01:31 +00:00
Fredrik Lundh	fad27aee11	Added 38,642 missing characters to the Unicode database (first-last ranges) -- but thanks to the 2.0 compression scheme, this doesn't add a single byte to the resulting binaries (!) Closes bug #117524	2000-11-03 20:24:15 +00:00
Fred Drake	9c6850510c	Remove bogus stdout redirection and use of sys.__stdout__; use augmented print statement instead.	2000-10-26 03:56:46 +00:00
Fredrik Lundh	375732cd41	- don't set the titlecase flag for uppercase letters (sorry, tim)	2000-09-25 23:03:34 +00:00
Fredrik Lundh	0f8fad4969	unicode database compression, step 3: - added decimal digit and digit properties to the unidb tables	2000-09-25 21:01:56 +00:00
Fredrik Lundh	e9133f7e2e	unicode database compression, step 3: - use unidb compression for the unicodectype module. smaller, faster, and slightly more portable... - also mention the unicode directory in Tools/README	2000-09-25 17:59:57 +00:00
Fredrik Lundh	cfcea49218	unicode database compression, step 2: - fixed attributions - moved decomposition data to a separate table, in preparation for step 3 (which won't happen before 2.0 final, promise!) - use relative paths in the generator script I have a lot more stuff in the works for 2.1, but let's leave that for another day...	2000-09-25 08:07:06 +00:00
Tim Peters	2101348830	Fiddled w/ /F's cool new splitbins function: documented it, generalized it a bit, sped it a lot primarily by removing the unused assumption that None was a legit bin entry (the function doesn't really need to assume that there's anything special about 0), added an optional "trace" argument, and in __debug__ mode added exhaustive verification that the decomposition is both correct and doesn't overstep any array bounds (which wasn't obvious to me from staring at the generated C code -- now I feel safe!). Did not commit a new unicodedata_db.h, as the one produced by this version is identical to the one already checked in.	2000-09-25 07:13:41 +00:00
Fredrik Lundh	f367cacb98	unicode database compression, step 1: - use unidb compression for the unicodedata module. on Windows, the new unidatabase module is 120k, down from nearly 600k.	2000-09-24 23:18:31 +00:00
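
The splitbins commit above (2101348830) describes decomposing one large table into smaller ones and then exhaustively verifying that the decomposition is correct. As a rough illustration of that general idea, here is a minimal two-level table split. It is a sketch under stated assumptions, not the code from makeunicodedata.py: the names `split_table` and `lookup`, the fixed `shift` argument, and the zero padding are all hypothetical choices made for this example.

```python
def split_table(values, shift):
    """Split `values` into (index, blocks) such that
    values[i] == blocks[(index[i >> shift] << shift) + (i & mask)].

    Hypothetical helper for illustration; not the real splitbins.
    """
    size = 1 << shift
    # Pad to a whole number of blocks (assumption: 0 is a safe filler).
    padded = list(values) + [0] * (-len(values) % size)

    index = []   # one entry per block: number of the unique block to use
    blocks = []  # all unique blocks, stored back to back
    seen = {}    # block contents -> unique block number

    for start in range(0, len(padded), size):
        block = tuple(padded[start:start + size])
        if block not in seen:
            seen[block] = len(blocks) >> shift
            blocks.extend(block)
        index.append(seen[block])
    return index, blocks


def lookup(index, blocks, shift, i):
    """Recover the original values[i] from the two smaller tables."""
    mask = (1 << shift) - 1
    return blocks[(index[i >> shift] << shift) + (i & mask)]


if __name__ == "__main__":
    # Mostly-empty data with a few populated runs, loosely like a sparse
    # character property table (made-up values, not Unicode data).
    values = [0] * 1000 + [1, 2, 3] + [0] * 1000 + [5] * 64
    shift = 4
    index, blocks = split_table(values, shift)

    # Exhaustive check, in the spirit of the verification the commit mentions.
    for i, expected in enumerate(values):
        assert lookup(index, blocks, shift, i) == expected

    print(len(values), "entries ->", len(index), "+", len(blocks))
```

Identical blocks are stored once and shared through the index, so long uniform runs cost almost nothing. That is one plausible reading of why adding large first-last character ranges (fad27aee11) did not grow the binaries, though the log itself does not spell out the mechanism.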

110 Commits