
\section{Standard Module \sectcode{regsub}}
\label{module-regsub}

\stmodindex{regsub}
This module defines a number of functions useful for working with
regular expressions (see built-in module \code{regex}).

Warning: these functions are not thread-safe.

\renewcommand{\indexsubitem}{(in module regsub)}

\begin{funcdesc}{sub}{pat\, repl\, str}
Replace the first occurrence of pattern \var{pat} in string
\var{str} with replacement \var{repl}.  If the pattern isn't found,
the string is returned unchanged.  The pattern may be a string or an
already compiled pattern.  The replacement may contain references of
the form \samp{\e \var{digit}} to subpatterns, as well as escaped
backslashes.
\end{funcdesc}

\begin{funcdesc}{gsub}{pat\, repl\, str}
Replace all (non-overlapping) occurrences of pattern \var{pat} in
string \var{str} with replacement \var{repl}.  The same rules as for
\code{sub()} apply.  Empty matches for the pattern are replaced only
when not adjacent to a previous match, so, for example,
\code{gsub('', '-', 'abc')} returns \code{'-a-b-c-'}.
\end{funcdesc}
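
The difference between \code{sub()} and \code{gsub()} can be sketched
with the standard \code{re} module.  This is only an approximation
under stated assumptions: \code{re} uses a different pattern syntax
than \code{regex} (groups are written \code{(...)} here), the helper
names \code{sub\_first} and \code{sub\_all} are hypothetical, and edge
cases around empty matches may differ from \code{gsub()}.

```python
import re

def sub_first(pat, repl, s):
    # Hypothetical stand-in for sub(): replace only the first match.
    return re.sub(pat, repl, s, count=1)

def sub_all(pat, repl, s):
    # Hypothetical stand-in for gsub(): replace every non-overlapping match.
    return re.sub(pat, repl, s)

text = "joe@host, ann@box"
pattern = r"(\w+)@(\w+)"   # two subpatterns, referenced as \1 and \2
swapped = r"\2 at \1"

first_only = sub_first(pattern, swapped, text)
everywhere = sub_all(pattern, swapped, text)
unchanged = sub_first("xyz", "-", "abc")   # no match: string comes back as-is
dashed = sub_all("", "-", "abc")           # empty pattern matches between characters
```

Here \code{first\_only} rewrites only the first address, while
\code{everywhere} rewrites both.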

\begin{funcdesc}{split}{str\, pat\optional{\, maxsplit}}
Split the string \var{str} into fields separated by delimiters matching
the pattern \var{pat}, and return a list containing the fields.  Only
non-empty matches for the pattern are considered, so e.g.
\code{split('a:b', ':*')} returns \code{['a', 'b']} and
\code{split('abc', '')} returns \code{['abc']}.  The optional
\var{maxsplit} argument defaults to 0.  If it is nonzero, at most
\var{maxsplit} splits occur, and the remainder of the string is
returned as the final element of the list.
\end{funcdesc}

\begin{funcdesc}{splitx}{str\, pat\optional{\, maxsplit}}
Split the string \var{str} into fields separated by delimiters matching
the pattern \var{pat}, and return a list containing the fields as well
as the separators.  For example, \code{splitx('a:::b', ':*')} returns
\code{['a', ':::', 'b']}.  Otherwise, this function behaves the same
as \code{split()}.
\end{funcdesc}
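
The contrast between \code{split()} and \code{splitx()} can be
approximated with \code{re.split()}.  The sketch below rests on
assumptions: the helper names \code{split\_like} and
\code{splitx\_like} are hypothetical, the pattern \code{:+} is used
instead of \code{:*} because \code{re} may treat empty matches
differently from \code{regsub}, and a pattern that contains its own
groups would add extra items to the \code{splitx\_like} result.

```python
import re

def split_like(s, pat, maxsplit=0):
    # Hypothetical stand-in for split(): delimiters are dropped.
    return re.split(pat, s, maxsplit=maxsplit)

def splitx_like(s, pat, maxsplit=0):
    # Hypothetical stand-in for splitx(): wrapping the pattern in a
    # capturing group makes re.split() keep the delimiters in the result.
    return re.split("(" + pat + ")", s, maxsplit=maxsplit)

fields = split_like("a:::b", ":+")
with_seps = splitx_like("a:::b", ":+")
limited = split_like("a:b:c", ":+", maxsplit=1)   # one split, rest kept whole
```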

\begin{funcdesc}{capwords}{s\optional{\, pat}}
Capitalize the words in string \var{s}, where words are separated by
delimiters matching the optional pattern \var{pat}.  The default
pattern treats any characters except letters, digits and underscores
as word delimiters.  Capitalization is done by changing the first
character of each word to upper case.
\end{funcdesc}
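
The capitalization rule can be illustrated with a small sketch built
on \code{re}.  It is not the module's implementation: the name
\code{capwords\_like} is hypothetical, and the default pattern is an
assumed \code{re} spelling of ``anything except letters, digits and
underscores''.

```python
import re

def capwords_like(s, pat=r"[^a-zA-Z0-9_]+"):
    # Split with a capturing group so the delimiters are kept:
    # even indexes hold words, odd indexes hold delimiters.
    parts = re.split("(" + pat + ")", s)
    for i in range(0, len(parts), 2):
        # Upper-case only the first character; leave the rest untouched.
        parts[i] = parts[i][:1].upper() + parts[i][1:]
    return "".join(parts)

default_result = capwords_like("hello, big_world 2nd")
custom_result = capwords_like("a:b c", ":")   # only ':' delimits words
```

With the default pattern the underscore does not end a word, so
\code{big\_world} becomes \code{Big\_world}.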

\begin{funcdesc}{clear_cache}{}
The regsub module maintains a cache of compiled regular expressions,
keyed on the regular expression string and the syntax of the regex
module at the time the expression was compiled.  This function clears
that cache.
\end{funcdesc}
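
A cache of this kind can be pictured as a dictionary keyed on the
pattern string together with the syntax setting.  The following is a
minimal sketch, not the module's code: it uses \code{re} and its
\var{flags} argument as a stand-in for the \code{regex} syntax, and
the names \code{\_cache} and \code{compile\_cached} are hypothetical.

```python
import re

_cache = {}   # hypothetical: maps (pattern string, flags) -> compiled pattern

def compile_cached(pat, flags=0):
    # The key includes the flags, so the same pattern string compiled
    # under a different setting gets its own cache entry.
    key = (pat, flags)
    if key not in _cache:
        _cache[key] = re.compile(pat, flags)
    return _cache[key]

def clear_cache():
    # Forget every compiled pattern.
    _cache.clear()

first = compile_cached("a+")
second = compile_cached("a+")                  # served from the cache
other = compile_cached("a+", re.IGNORECASE)    # different key, new entry
size_before = len(_cache)
clear_cache()
size_after = len(_cache)
```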