From d92e871ad8b71de1a276f94d14b797e2e049ced8 Mon Sep 17 00:00:00 2001 From: Mark Summerfield Date: Wed, 3 Oct 2007 08:53:21 +0000 Subject: [PATCH] Added note in footnote about string comparisons about unicodedata.normalize(). --- Doc/reference/expressions.rst | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/Doc/reference/expressions.rst b/Doc/reference/expressions.rst index 4e95ec36306..488f0907e09 100644 --- a/Doc/reference/expressions.rst +++ b/Doc/reference/expressions.rst @@ -1330,9 +1330,10 @@ groups from right to left). .. [#] While comparisons between unicode strings make sense at the byte level, they may be counter-intuitive to users. For example, the - strings ``u"\u00C7"`` and ``u"\u0327\u0043"`` compare differently, + strings ``u"\u00C7"`` and ``u"\u0043\u0327"`` compare differently, even though they both represent the same unicode character (LATIN - CAPTITAL LETTER C WITH CEDILLA). + CAPTITAL LETTER C WITH CEDILLA). To compare strings in a human + recognizable way, compare using :func:`unicodedata.normalize`. .. [#] The implementation computes this efficiently, without constructing lists or sorting.