1998-08-10 16:42:37 -03:00
|
|
|
\section{\module{random} ---
|
1999-04-21 15:14:22 -03:00
|
|
|
Generate pseudo-random numbers}
|
1998-07-23 14:59:49 -03:00
|
|
|
|
1999-04-21 15:14:22 -03:00
|
|
|
\declaremodule{standard}{random}
|
1998-08-10 16:42:37 -03:00
|
|
|
\modulesynopsis{Generate pseudo-random numbers with various common
|
1999-04-21 15:14:22 -03:00
|
|
|
distributions.}
|
1998-07-23 14:59:49 -03:00
|
|
|
|
1997-04-03 18:41:49 -04:00
|
|
|
|
|
|
|
This module implements pseudo-random number generators for various
|
2001-01-24 19:06:53 -04:00
|
|
|
distributions.
|
2002-12-29 19:03:38 -04:00
|
|
|
|
2001-01-24 19:06:53 -04:00
|
|
|
For integers, uniform selection from a range.
|
2002-12-29 19:03:38 -04:00
|
|
|
For sequences, uniform selection of a random element, a function to
|
|
|
|
generate a random permutation of a list in-place, and a function for
|
|
|
|
random sampling without replacement.
|
|
|
|
|
2001-01-24 19:06:53 -04:00
|
|
|
On the real line, there are functions to compute uniform, normal (Gaussian),
|
|
|
|
lognormal, negative exponential, gamma, and beta distributions.
|
2002-12-29 19:03:38 -04:00
|
|
|
For generating distributions of angles, the von Mises distribution
|
|
|
|
is available.
|
2001-01-24 19:06:53 -04:00
|
|
|
|
|
|
|
Almost all module functions depend on the basic function
|
|
|
|
\function{random()}, which generates a random float uniformly in
|
2002-12-29 19:03:38 -04:00
|
|
|
the semi-open range [0.0, 1.0). Python uses the Mersenne Twister as
|
|
|
|
the core generator. It produces 53-bit precision floats and has a
|
|
|
|
period of 2**19937-1. The underlying implementation in C
|
|
|
|
is both fast and threadsafe. The Mersenne Twister is one of the most
|
|
|
|
extensively tested random number generators in existence. However, being
|
|
|
|
completely deterministic, it is not suitable for all purposes, and is
|
|
|
|
completely unsuitable for cryptographic purposes.
|
2001-01-24 19:06:53 -04:00
|
|
|
|
2001-01-24 23:36:26 -04:00
|
|
|
The functions supplied by this module are actually bound methods of a
|
2001-02-01 22:42:31 -04:00
|
|
|
hidden instance of the \class{random.Random} class. You can
|
|
|
|
instantiate your own instances of \class{Random} to get generators
|
|
|
|
that don't share state. This is especially useful for multi-threaded
|
|
|
|
programs, creating a different instance of \class{Random} for each
|
|
|
|
thread, and using the \method{jumpahead()} method to ensure that the
|
2002-12-29 19:03:38 -04:00
|
|
|
generated sequences seen by each thread don't overlap.
|
2001-02-01 22:42:31 -04:00
|
|
|
|
|
|
|
Class \class{Random} can also be subclassed if you want to use a
|
|
|
|
different basic generator of your own devising: in that case, override
|
|
|
|
the \method{random()}, \method{seed()}, \method{getstate()},
|
2001-01-25 02:23:18 -04:00
|
|
|
\method{setstate()} and \method{jumpahead()} methods.
|
2001-01-24 23:36:26 -04:00
|
|
|
|
2002-12-29 19:03:38 -04:00
|
|
|
As an example of subclassing, the \module{random} module provides
|
|
|
|
the \class{WichmannHill} class which implements an alternative generator
|
|
|
|
in pure Python. The class provides a backward compatible way to
|
|
|
|
reproduce results from earlier versions of Python which used the
|
|
|
|
Wichmann-Hill algorithm as the core generator.
|
|
|
|
\versionchanged[Substituted MersenneTwister for Wichmann-Hill]{2.3}
|
2001-01-26 06:00:39 -04:00
|
|
|
|
2001-01-24 23:36:26 -04:00
|
|
|
|
|
|
|
Bookkeeping functions:
|
2001-01-24 19:06:53 -04:00
|
|
|
|
|
|
|
\begin{funcdesc}{seed}{\optional{x}}
|
|
|
|
Initialize the basic random number generator.
|
2001-02-01 00:59:18 -04:00
|
|
|
Optional argument \var{x} can be any hashable object.
|
2001-04-21 02:56:06 -03:00
|
|
|
If \var{x} is omitted or \code{None}, current system time is used;
|
2001-02-01 00:59:18 -04:00
|
|
|
current system time is also used to initialize the generator when the
|
|
|
|
module is first imported.
|
2001-04-21 02:56:06 -03:00
|
|
|
If \var{x} is not \code{None} or an int or long,
|
2001-02-01 11:53:24 -04:00
|
|
|
\code{hash(\var{x})} is used instead.
|
2001-02-01 00:59:18 -04:00
|
|
|
If \var{x} is an int or long, \var{x} is used directly.
|
2001-01-24 20:39:16 -04:00
|
|
|
\end{funcdesc}
|
2000-04-03 17:13:55 -03:00
|
|
|
|
2001-01-24 23:36:26 -04:00
|
|
|
\begin{funcdesc}{getstate}{}
|
2001-02-01 22:42:31 -04:00
|
|
|
Return an object capturing the current internal state of the
|
|
|
|
generator. This object can be passed to \function{setstate()} to
|
|
|
|
restore the state.
|
2001-02-01 00:59:18 -04:00
|
|
|
\versionadded{2.1}
|
|
|
|
\end{funcdesc}
|
2001-01-24 23:36:26 -04:00
|
|
|
|
|
|
|
\begin{funcdesc}{setstate}{state}
|
|
|
|
\var{state} should have been obtained from a previous call to
|
2001-02-01 22:42:31 -04:00
|
|
|
\function{getstate()}, and \function{setstate()} restores the
|
|
|
|
internal state of the generator to what it was at the time
|
|
|
|
\function{setstate()} was called.
|
2001-02-01 00:59:18 -04:00
|
|
|
\versionadded{2.1}
|
2001-02-01 22:42:31 -04:00
|
|
|
\end{funcdesc}
|
2001-01-22 14:18:30 -04:00
|
|
|
|
2001-01-25 02:23:18 -04:00
|
|
|
\begin{funcdesc}{jumpahead}{n}
|
2002-12-29 19:03:38 -04:00
|
|
|
Change the internal state to one different from and likely far away from
|
|
|
|
the current state. \var{n} is a non-negative integer which is used to
|
|
|
|
scramble the current state vector. This is most useful in multi-threaded
|
2001-04-21 02:56:06 -03:00
|
|
|
programs, in conjuction with multiple instances of the \class{Random}
|
2002-12-29 19:03:38 -04:00
|
|
|
class: \method{setstate()} or \method{seed()} can be used to force all
|
|
|
|
instances into the same internal state, and then \method{jumpahead()}
|
|
|
|
can be used to force the instances' states far apart.
|
2001-02-01 00:59:18 -04:00
|
|
|
\versionadded{2.1}
|
2002-12-29 19:03:38 -04:00
|
|
|
\versionchanged[Instead of jumping to a specific state, \var{n} steps
|
|
|
|
ahead, \method{jumpahead(\var{n})} jumps to another state likely to be
|
|
|
|
separated by many steps.]{2.3}
|
2001-01-25 02:23:18 -04:00
|
|
|
\end{funcdesc}
|
2001-01-24 23:36:26 -04:00
|
|
|
|
2002-12-29 19:03:38 -04:00
|
|
|
|
2001-01-24 23:36:26 -04:00
|
|
|
Functions for integers:
|
2001-01-22 14:18:30 -04:00
|
|
|
|
|
|
|
\begin{funcdesc}{randrange}{\optional{start,} stop\optional{, step}}
|
|
|
|
Return a randomly selected element from \code{range(\var{start},
|
|
|
|
\var{stop}, \var{step})}. This is equivalent to
|
2001-01-24 19:06:53 -04:00
|
|
|
\code{choice(range(\var{start}, \var{stop}, \var{step}))},
|
|
|
|
but doesn't actually build a range object.
|
2001-01-22 14:18:30 -04:00
|
|
|
\versionadded{1.5.2}
|
|
|
|
\end{funcdesc}
|
|
|
|
|
2001-01-24 23:36:26 -04:00
|
|
|
\begin{funcdesc}{randint}{a, b}
|
|
|
|
Return a random integer \var{N} such that
|
|
|
|
\code{\var{a} <= \var{N} <= \var{b}}.
|
|
|
|
\end{funcdesc}
|
|
|
|
|
|
|
|
|
|
|
|
Functions for sequences:
|
|
|
|
|
|
|
|
\begin{funcdesc}{choice}{seq}
|
|
|
|
Return a random element from the non-empty sequence \var{seq}.
|
|
|
|
\end{funcdesc}
|
|
|
|
|
|
|
|
\begin{funcdesc}{shuffle}{x\optional{, random}}
|
|
|
|
Shuffle the sequence \var{x} in place.
|
|
|
|
The optional argument \var{random} is a 0-argument function
|
|
|
|
returning a random float in [0.0, 1.0); by default, this is the
|
|
|
|
function \function{random()}.
|
|
|
|
|
|
|
|
Note that for even rather small \code{len(\var{x})}, the total
|
|
|
|
number of permutations of \var{x} is larger than the period of most
|
|
|
|
random number generators; this implies that most permutations of a
|
|
|
|
long sequence can never be generated.
|
|
|
|
\end{funcdesc}
|
|
|
|
|
2002-11-12 13:41:57 -04:00
|
|
|
\begin{funcdesc}{sample}{population, k}
|
|
|
|
Return a \var{k} length list of unique elements chosen from the
|
|
|
|
population sequence. Used for random sampling without replacement.
|
2002-11-18 05:01:24 -04:00
|
|
|
\versionadded{2.3}
|
2002-11-12 13:41:57 -04:00
|
|
|
|
2002-11-18 05:01:24 -04:00
|
|
|
Returns a new list containing elements from the population while
|
|
|
|
leaving the original population unchanged. The resulting list is
|
|
|
|
in selection order so that all sub-slices will also be valid random
|
|
|
|
samples. This allows raffle winners (the sample) to be partitioned
|
|
|
|
into grand prize and second place winners (the subslices).
|
2002-11-12 13:41:57 -04:00
|
|
|
|
2002-11-18 05:01:24 -04:00
|
|
|
Members of the population need not be hashable or unique. If the
|
|
|
|
population contains repeats, then each occurrence is a possible
|
|
|
|
selection in the sample.
|
2002-11-12 13:41:57 -04:00
|
|
|
|
2002-11-18 05:01:24 -04:00
|
|
|
To choose a sample from a range of integers, use \function{xrange}
|
|
|
|
as an argument. This is especially fast and space efficient for
|
|
|
|
sampling from a large population: \code{sample(xrange(10000000), 60)}.
|
2002-11-12 13:41:57 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
2001-01-24 23:36:26 -04:00
|
|
|
|
|
|
|
The following functions generate specific real-valued distributions.
|
|
|
|
Function parameters are named after the corresponding variables in the
|
|
|
|
distribution's equation, as used in common mathematical practice; most of
|
|
|
|
these equations can be found in any statistics text.
|
|
|
|
|
2001-01-24 19:06:53 -04:00
|
|
|
\begin{funcdesc}{random}{}
|
|
|
|
Return the next random floating point number in the range [0.0, 1.0).
|
|
|
|
\end{funcdesc}
|
|
|
|
|
2001-01-22 14:18:30 -04:00
|
|
|
\begin{funcdesc}{uniform}{a, b}
|
2001-01-24 19:06:53 -04:00
|
|
|
Return a random real number \var{N} such that
|
2001-01-22 14:18:30 -04:00
|
|
|
\code{\var{a} <= \var{N} < \var{b}}.
|
|
|
|
\end{funcdesc}
|
2000-04-03 17:13:55 -03:00
|
|
|
|
1998-03-08 04:13:53 -04:00
|
|
|
\begin{funcdesc}{betavariate}{alpha, beta}
|
2001-01-22 14:18:30 -04:00
|
|
|
Beta distribution. Conditions on the parameters are
|
|
|
|
\code{\var{alpha} > -1} and \code{\var{beta} > -1}.
|
|
|
|
Returned values range between 0 and 1.
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
1998-03-08 04:13:53 -04:00
|
|
|
\begin{funcdesc}{cunifvariate}{mean, arc}
|
2001-01-22 14:18:30 -04:00
|
|
|
Circular uniform distribution. \var{mean} is the mean angle, and
|
|
|
|
\var{arc} is the range of the distribution, centered around the mean
|
|
|
|
angle. Both values must be expressed in radians, and can range
|
2001-01-24 19:06:53 -04:00
|
|
|
between 0 and \emph{pi}. Returned values range between
|
2001-01-22 14:18:30 -04:00
|
|
|
\code{\var{mean} - \var{arc}/2} and \code{\var{mean} +
|
2002-05-23 16:44:49 -03:00
|
|
|
\var{arc}/2} and are normalized to between 0 and \emph{pi}.
|
|
|
|
|
2002-05-23 18:07:19 -03:00
|
|
|
\deprecated{2.3}{Instead, use \code{(\var{mean} + \var{arc} *
|
|
|
|
(random.random() - 0.5)) \% math.pi}.}
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
|
|
|
\begin{funcdesc}{expovariate}{lambd}
|
2001-01-22 14:18:30 -04:00
|
|
|
Exponential distribution. \var{lambd} is 1.0 divided by the desired
|
|
|
|
mean. (The parameter would be called ``lambda'', but that is a
|
2001-01-24 19:06:53 -04:00
|
|
|
reserved word in Python.) Returned values range from 0 to
|
2001-01-22 14:18:30 -04:00
|
|
|
positive infinity.
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
2002-05-13 19:40:38 -03:00
|
|
|
\begin{funcdesc}{gammavariate}{alpha, beta}
|
2001-01-22 14:18:30 -04:00
|
|
|
Gamma distribution. (\emph{Not} the gamma function!) Conditions on
|
2002-05-13 20:49:13 -03:00
|
|
|
the parameters are \code{\var{alpha} > 0} and \code{\var{beta} > 0}.
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
1998-03-08 04:13:53 -04:00
|
|
|
\begin{funcdesc}{gauss}{mu, sigma}
|
2001-01-22 14:18:30 -04:00
|
|
|
Gaussian distribution. \var{mu} is the mean, and \var{sigma} is the
|
|
|
|
standard deviation. This is slightly faster than the
|
|
|
|
\function{normalvariate()} function defined below.
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
1998-03-08 04:13:53 -04:00
|
|
|
\begin{funcdesc}{lognormvariate}{mu, sigma}
|
2001-01-22 14:18:30 -04:00
|
|
|
Log normal distribution. If you take the natural logarithm of this
|
|
|
|
distribution, you'll get a normal distribution with mean \var{mu}
|
|
|
|
and standard deviation \var{sigma}. \var{mu} can have any value,
|
2001-01-24 19:06:53 -04:00
|
|
|
and \var{sigma} must be greater than zero.
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
1998-03-08 04:13:53 -04:00
|
|
|
\begin{funcdesc}{normalvariate}{mu, sigma}
|
2001-01-22 14:18:30 -04:00
|
|
|
Normal distribution. \var{mu} is the mean, and \var{sigma} is the
|
|
|
|
standard deviation.
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
1998-03-08 04:13:53 -04:00
|
|
|
\begin{funcdesc}{vonmisesvariate}{mu, kappa}
|
2001-01-22 14:18:30 -04:00
|
|
|
\var{mu} is the mean angle, expressed in radians between 0 and
|
|
|
|
2*\emph{pi}, and \var{kappa} is the concentration parameter, which
|
|
|
|
must be greater than or equal to zero. If \var{kappa} is equal to
|
|
|
|
zero, this distribution reduces to a uniform random angle over the
|
|
|
|
range 0 to 2*\emph{pi}.
|
1997-04-03 18:41:49 -04:00
|
|
|
\end{funcdesc}
|
1997-07-17 13:34:52 -03:00
|
|
|
|
1997-12-30 13:38:05 -04:00
|
|
|
\begin{funcdesc}{paretovariate}{alpha}
|
2001-01-22 14:18:30 -04:00
|
|
|
Pareto distribution. \var{alpha} is the shape parameter.
|
1997-12-30 13:38:05 -04:00
|
|
|
\end{funcdesc}
|
|
|
|
|
|
|
|
\begin{funcdesc}{weibullvariate}{alpha, beta}
|
2001-01-22 14:18:30 -04:00
|
|
|
Weibull distribution. \var{alpha} is the scale parameter and
|
|
|
|
\var{beta} is the shape parameter.
|
1997-12-30 13:38:05 -04:00
|
|
|
\end{funcdesc}
|
1997-07-17 13:34:52 -03:00
|
|
|
|
2002-12-29 19:03:38 -04:00
|
|
|
Alternative Generator
|
|
|
|
|
|
|
|
\begin{classdesc}{WichmannHill}{\optional{seed}}
|
|
|
|
Class that implements the Wichmann-Hill algorithm as the core generator.
|
|
|
|
Has all of the same methods as \class{Random} plus the \method{whseed}
|
|
|
|
method described below. Because this class is implemented in pure
|
|
|
|
Python, it is not threadsafe and may require locks between calls. The
|
|
|
|
period of the generator is 6,953,607,871,644 which is small enough to
|
|
|
|
require care that two independent random sequences do not overlap.
|
|
|
|
\end{classdesc}
|
|
|
|
|
|
|
|
\begin{funcdesc}{whseed}{\optional{x}}
|
|
|
|
This is obsolete, supplied for bit-level compatibility with versions
|
|
|
|
of Python prior to 2.1.
|
|
|
|
See \function{seed} for details. \function{whseed} does not guarantee
|
|
|
|
that distinct integer arguments yield distinct internal states, and can
|
|
|
|
yield no more than about 2**24 distinct internal states in all.
|
|
|
|
\end{funcdesc}
|
2000-12-15 15:07:17 -04:00
|
|
|
|
1997-07-17 13:34:52 -03:00
|
|
|
\begin{seealso}
|
2002-12-29 19:03:38 -04:00
|
|
|
\seetext{M. Matsumoto and T. Nishimura, ``Mersenne Twister: A
|
|
|
|
623-dimensionally equidistributed uniform pseudorandom
|
|
|
|
number generator'',
|
|
|
|
\citetitle{ACM Transactions on Modeling and Computer Simulation}
|
|
|
|
Vol. 8, No. 1, January pp.3-30 1998.}
|
|
|
|
|
2001-01-24 19:06:53 -04:00
|
|
|
\seetext{Wichmann, B. A. \& Hill, I. D., ``Algorithm AS 183:
|
|
|
|
An efficient and portable pseudo-random number generator'',
|
|
|
|
\citetitle{Applied Statistics} 31 (1982) 188-190.}
|
1997-07-17 13:34:52 -03:00
|
|
|
\end{seealso}
|