Clarified meaning of \w and \W with respect to the UNICODE and LOCALE flags.

Closes SF bug #635595.
This commit is contained in:
Fred Drake 2002-11-12 23:12:54 +00:00
parent 82c7231071
commit 3d03968c75
1 changed files with 5 additions and 5 deletions

View File

@ -347,10 +347,10 @@ equivalent to the set \regexp{[ \e t\e n\e r\e f\e v]}.
equivalent to the set \regexp{[\textasciicircum\ \e t\e n\e r\e f\e v]}.
\item[\code{\e w}]When the \constant{LOCALE} and \constant{UNICODE}
flags are not specified,
matches any alphanumeric character; this is equivalent to the set
flags are not specified, matches any alphanumeric character and the
underscore; this is equivalent to the set
\regexp{[a-zA-Z0-9_]}. With \constant{LOCALE}, it will match the set
\regexp{[0-9_]} plus whatever characters are defined as letters for
\regexp{[0-9_]} plus whatever characters are defined as alphanumeric for
the current locale. If \constant{UNICODE} is set, this will match the
characters \regexp{[0-9_]} plus whatever is classified as alphanumeric
in the Unicode character properties database.
@ -359,9 +359,9 @@ in the Unicode character properties database.
flags are not specified, matches any non-alphanumeric character; this
is equivalent to the set \regexp{[{\textasciicircum}a-zA-Z0-9_]}. With
\constant{LOCALE}, it will match any character not in the set
\regexp{[0-9_]}, and not defined as a letter for the current locale.
\regexp{[0-9_]}, and not defined as alphanumeric for the current locale.
If \constant{UNICODE} is set, this will match anything other than
\regexp{[0-9_]} and characters marked at alphanumeric in the Unicode
\regexp{[0-9_]} and characters marked as alphanumeric in the Unicode
character properties database.
\item[\code{\e Z}]Matches only at the end of the string.