This file describes some special Python build types enabled via compile-time
preprocessor defines.

IMPORTANT: if you want to build a debug-enabled Python, it is recommended that
you use ``./configure --with-pydebug``, rather than the options listed here.

However, if you wish to define some of these options individually, it is best
to define them in the EXTRA_CFLAGS make variable, for example
``make EXTRA_CFLAGS="-DPy_REF_DEBUG"``.


Py_REF_DEBUG
------------

Turn on aggregate reference counting.  This arranges that extern _Py_RefTotal
holds a count of all references, the sum of ob_refcnt across all objects.
Passing ``-X showrefcount`` on the command line causes the interactive
interpreter to print the reference count total as well as the number of memory
blocks allocated after each statement:

    >>> 23
    23
    [8288 refs, 14332 blocks]
    >>>

Note that if this count increases when you're not storing away new objects,
there's probably a leak.  Remember, though, that in interactive mode the special
name "_" holds a reference to the last result displayed!

Py_REF_DEBUG also checks after every decref to verify that the refcount hasn't
gone negative, and causes an immediate fatal error if it has.

Special gimmicks:

sys.gettotalrefcount()
    Return current total of all refcounts.


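A minimal sketch of using sys.gettotalrefcount() to look for a leak, along the
lines described above.  The helper names total_refs() and refs_leaked(), and
the warmup/runs parameters, are made up for illustration; the function only
exists on Py_REF_DEBUG builds, so the sketch returns None elsewhere.

```python
import sys


def total_refs():
    """Return sys.gettotalrefcount() on a Py_REF_DEBUG build, else None."""
    getter = getattr(sys, "gettotalrefcount", None)
    return getter() if getter is not None else None


def refs_leaked(func, warmup=3, runs=5):
    """Call func repeatedly and return the change in the total refcount.

    Returns None when the interpreter was not built with Py_REF_DEBUG.
    """
    if total_refs() is None:
        return None
    for _ in range(warmup):      # let caches and free lists settle first
        func()
    before = total_refs()
    for _ in range(runs):
        func()
    return total_refs() - before
```

A result that keeps growing as runs is increased suggests a leak; a small
constant offset usually does not.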
Py_TRACE_REFS
-------------

Turn on heavy reference debugging.  This is major surgery.  Every PyObject grows
two more pointers, to maintain a doubly-linked list of all live heap-allocated
objects.  Most built-in type objects are not in this list, as they're statically
allocated.  Starting in Python 2.3, if COUNT_ALLOCS (see below) is also defined,
a static type object T does appear in this list if at least one object of type T
has been created.

Note that because the fundamental PyObject layout changes, Python modules
compiled with Py_TRACE_REFS are incompatible with modules compiled without it.

Py_TRACE_REFS implies Py_REF_DEBUG.

Special gimmicks:

sys.getobjects(max[, type])
    Return list of the (no more than) max most-recently allocated objects, most
    recently allocated first in the list, least-recently allocated last in the
    list.  max=0 means no limit on list length.  If an optional type object is
    passed, the list is also restricted to objects of that type.  The return
    list itself, and some temp objects created just to call sys.getobjects(),
    are excluded from the return list.  Note that the list returned is just
    another object, though, so may appear in the return list the next time you
    call getobjects(); note that every object in the list is kept alive too,
    simply by virtue of being in the list.

envvar PYTHONDUMPREFS
    If this envvar exists, Py_Finalize() arranges to print a list of all
    still-live heap objects.  This is printed twice, in different formats,
    before and after Py_Finalize has cleaned up everything it can clean up.  The
    first output block produces the repr() of each object so is more
    informative; however, a lot of stuff destined to die is still alive then.
    The second output block is much harder to work with (repr() can't be invoked
    anymore -- the interpreter has been torn down too far), but doesn't list any
    objects that will die.  The tool script combinerefs.py can be run over this
    to combine the info from both output blocks.  The second output block, and
    combinerefs.py, were new in Python 2.3b1.


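As an illustration of sys.getobjects() described above, here is a small sketch
that summarizes live objects by type name.  The helper name
live_object_histogram() and its parameters are hypothetical; the call only
exists on Py_TRACE_REFS builds, so the sketch returns None elsewhere.

```python
import sys
from collections import Counter


def live_object_histogram(limit=0, top=10):
    """Return the `top` most common type names among live heap objects.

    limit is passed to sys.getobjects() as max (0 means no limit).
    Returns None when the interpreter was not built with Py_TRACE_REFS.
    """
    getobjects = getattr(sys, "getobjects", None)
    if getobjects is None:
        return None
    objs = getobjects(limit)
    counts = Counter(type(obj).__name__ for obj in objs)
    del objs   # the list keeps every object in it alive, so drop it promptly
    return counts.most_common(top)
```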
PYMALLOC_DEBUG
--------------

When pymalloc is enabled (WITH_PYMALLOC is defined), calls to the PyObject_
memory routines are handled by Python's own small-object allocator, while calls
to the PyMem_ memory routines are directed to the system malloc/realloc/free.
If PYMALLOC_DEBUG is also defined, calls to both PyObject_ and PyMem_ memory
routines are directed to a special debugging mode of Python's small-object
allocator.

This mode fills dynamically allocated memory blocks with special, recognizable
bit patterns, and adds debugging info on each end of dynamically allocated
memory blocks.  The special bit patterns are:

    #define CLEANBYTE     0xCB   /* clean (newly allocated) memory */
    #define DEADBYTE      0xDB   /* dead (newly freed) memory */
    #define FORBIDDENBYTE 0xFB   /* forbidden -- untouchable bytes */

Strings of these bytes are unlikely to be valid addresses, floats, or 7-bit
ASCII strings.

Let S = sizeof(size_t).  2*S bytes are added at each end of each block of N bytes
requested.  The memory layout is like so, where p represents the address
returned by a malloc-like or realloc-like function (p[i:j] means the slice of
bytes from *(p+i) inclusive up to *(p+j) exclusive; note that the treatment of
negative indices differs from a Python slice):

p[-2*S:-S]
    Number of bytes originally asked for.  This is a size_t, big-endian (easier
    to read in a memory dump).
p[-S]
    API ID.  See PEP 445.  This is a character, but seems undocumented.
p[-S+1:0]
    Copies of FORBIDDENBYTE.  Used to catch writes and reads before the start
    of the block.
p[0:N]
    The requested memory, filled with copies of CLEANBYTE, used to catch
    reference to uninitialized memory.  When a realloc-like function is called
    requesting a larger memory block, the new excess bytes are also filled with
    CLEANBYTE.  When a free-like function is called, these are overwritten with
    DEADBYTE, to catch reference to freed memory.  When a realloc-like function
    is called requesting a smaller memory block, the excess old bytes are also
    filled with DEADBYTE.
p[N:N+S]
    Copies of FORBIDDENBYTE.  Used to catch writes and reads past the end of
    the block.
p[N+S:N+2*S]
    A serial number, incremented by 1 on each call to a malloc-like or
    realloc-like function.  Big-endian size_t.  If "bad memory" is detected
    later, the serial number gives an excellent way to set a breakpoint on the
    next run, to capture the instant at which this block was passed out.  The
    static function bumpserialno() in obmalloc.c is the only place the serial
    number is incremented, and exists so you can set such a breakpoint easily.

A realloc-like or free-like function first checks that the FORBIDDENBYTEs at
each end are intact.  If they've been altered, diagnostic output is written to
stderr, and the program is aborted via Py_FatalError().  The other main failure
mode is provoking a memory error when a program reads one of the special bit
patterns and tries to use it as an address.  If you get in a debugger then and
look at the object, you're likely to see that it's entirely filled with 0xDB
(meaning freed memory is getting used) or 0xCB (meaning uninitialized memory is
getting used).

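The layout and the guard-byte check above can be modelled in a few lines of
Python.  This is only a sketch of the byte layout, not the code in obmalloc.c:
make_block(), free_block() and check_block() are hypothetical names, the block
is a bytearray rather than real heap memory, and the API ID b"o" is just a
placeholder value.  The user pointer p corresponds to offset 2*S in the model.

```python
import struct

CLEANBYTE, DEADBYTE, FORBIDDENBYTE = 0xCB, 0xDB, 0xFB
S = struct.calcsize("N")   # sizeof(size_t) on this platform


def make_block(n, serial, api_id=b"o"):
    """Model a freshly allocated block of n user bytes plus 4*S debug bytes."""
    head = n.to_bytes(S, "big") + api_id + bytes([FORBIDDENBYTE]) * (S - 1)
    tail = bytes([FORBIDDENBYTE]) * S + serial.to_bytes(S, "big")
    return bytearray(head + bytes([CLEANBYTE]) * n + tail)


def check_block(block):
    """Return (n, serial) if both guard regions are intact, else raise."""
    p = 2 * S                                   # offset of the user pointer
    n = int.from_bytes(block[:S], "big")        # p[-2*S:-S]
    if any(b != FORBIDDENBYTE for b in block[p - S + 1:p]):
        raise ValueError("bytes before the block were overwritten")
    if any(b != FORBIDDENBYTE for b in block[p + n:p + n + S]):
        raise ValueError("bytes after the block were overwritten")
    serial = int.from_bytes(block[p + n + S:p + n + 2 * S], "big")
    return n, serial


def free_block(block):
    """Check the guards, then overwrite the user bytes with DEADBYTE."""
    n, _ = check_block(block)
    p = 2 * S
    block[p:p + n] = bytes([DEADBYTE]) * n
```

Writing one byte past the end of the user area, for example
``block[2*S + n] = 0``, makes check_block() raise, which mirrors the check a
realloc-like or free-like function performs.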
Note that PYMALLOC_DEBUG requires WITH_PYMALLOC.

Special gimmicks:

envvar PYTHONMALLOCSTATS
    If this envvar exists, a report of pymalloc summary statistics is printed to
    stderr whenever a new arena is allocated, and also by Py_Finalize().

Changed in 2.5:  The number of extra bytes allocated is 4*sizeof(size_t).
Before it was 16 on all boxes, reflecting that Python couldn't make use of
allocations >= 2**32 bytes even on 64-bit boxes before 2.5.


Py_DEBUG
--------

This is what is generally meant by "a debug build" of Python.

Py_DEBUG implies LLTRACE, Py_REF_DEBUG, Py_TRACE_REFS, and PYMALLOC_DEBUG (if
WITH_PYMALLOC is enabled).  In addition, C assert()s are enabled (via the C way:
by not defining NDEBUG), and some routines do additional sanity checks inside
"#ifdef Py_DEBUG" blocks.


COUNT_ALLOCS
------------

Each type object grows three new members:

    /* Number of times an object of this type was allocated. */
    int tp_allocs;

    /* Number of times an object of this type was deallocated. */
    int tp_frees;

    /* Highwater mark:  the maximum value of tp_allocs - tp_frees so
     * far; or, IOW, the largest number of objects of this type alive at
     * the same time.
     */
    int tp_maxalloc;

Allocation and deallocation code keeps these counts up to date.  Py_Finalize()
displays a summary of the info returned by sys.getcounts() (see below), along
with assorted other special allocation counts (like the number of tuple
allocations satisfied by a tuple free-list, the number of 1-character strings
allocated, etc).

Before Python 2.2, type objects were immortal, and the COUNT_ALLOCS
implementation relies on that.  As of Python 2.2, heap-allocated type/class
objects can go away.  COUNT_ALLOCS can blow up in 2.2 and 2.2.1 because of this;
this was fixed in 2.2.2.  Use of COUNT_ALLOCS makes all heap-allocated type
objects immortal, except for those for which no object of that type is ever
allocated.

Starting with Python 2.3, if Py_TRACE_REFS is also defined, COUNT_ALLOCS
arranges to ensure that the type object for each allocated object appears in the
doubly-linked list of all objects maintained by Py_TRACE_REFS.

Special gimmicks:

sys.getcounts()
    Return a list of 4-tuples, one entry for each type object for which at least
    one object of that type was allocated.  Each tuple is of the form:

        (tp_name, tp_allocs, tp_frees, tp_maxalloc)

    Each distinct type object gets a distinct entry in this list, even if two or
    more type objects have the same tp_name (in which case there's no way to
    distinguish them by looking at this list).  The list is ordered by time of
    first object allocation:  the type object for which the first allocation of
    an object of that type occurred most recently is at the front of the list.


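A short sketch of how the 4-tuples from sys.getcounts() might be turned into a
"most live instances" report.  The helper name live_counts_by_type() is made up
for illustration; the function is only present on COUNT_ALLOCS builds, so the
sketch returns None elsewhere.

```python
import sys


def live_counts_by_type(top=10):
    """Return (tp_name, live, tp_maxalloc) rows, largest live count first.

    live is tp_allocs - tp_frees.  Returns None when the interpreter was
    not built with COUNT_ALLOCS.
    """
    getcounts = getattr(sys, "getcounts", None)
    if getcounts is None:
        return None
    rows = [(name, allocs - frees, maxalloc)
            for name, allocs, frees, maxalloc in getcounts()]
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows[:top]
```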
LLTRACE
-------

Compile in support for Low Level TRACE-ing of the main interpreter loop.

When this preprocessor symbol is defined, before PyEval_EvalFrame (eval_frame in
2.3 and 2.2, eval_code2 before that) executes a frame's code it checks the
frame's global namespace for a variable "__lltrace__".  If such a variable is
found, mounds of information about what the interpreter is doing are sprayed to
stdout, such as every opcode and opcode argument and values pushed onto and
popped off the value stack.

Not useful very often, but very useful when needed.


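One way to limit the trace to a small piece of code is to run it in a namespace
of its own that defines "__lltrace__".  The sketch below does that; run_traced()
is a hypothetical helper, and on a build without LLTRACE the variable is just an
ordinary global with no effect.

```python
def run_traced(source, trace=True):
    """Execute source in a fresh global namespace and return that namespace.

    When trace is true the namespace defines __lltrace__, which an LLTRACE
    build looks for before executing each frame that uses these globals.
    """
    namespace = {}
    if trace:
        namespace["__lltrace__"] = 1
    exec(compile(source, "<lltrace-demo>", "exec"), namespace)
    return namespace
```

For example, ``run_traced("x = 1 + 2")["x"]`` evaluates to 3 on any build, and
on an LLTRACE build the opcodes executed for that statement are also printed.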
CALL_PROFILE
------------

Count the number of function calls executed.

When this symbol is defined, the ceval mainloop and helper functions count the
number of function calls made.  It keeps detailed statistics about what kind of
object was called and whether the call hit any of the special fast paths in the
code.


WITH_TSC
--------

Super-lowlevel profiling of the interpreter.  When enabled, the sys module grows
a new function:

settscdump(bool)
    If true, tell the Python interpreter to dump VM measurements to stderr.  If
    false, turn off dump.  The measurements are based on the processor's
    time-stamp counter.

This build option requires a small amount of platform specific code.  Currently
this code is present for linux/x86 and any PowerPC platform that uses GCC
(i.e. OS X and linux/ppc).

On the PowerPC the rate at which the time base register is incremented is not
defined by the architecture specification, so you'll need to find the manual for
your specific processor.  For the 750CX, 750CXe and 750FX (all sold as the G3)
we find:

    The time base counter is clocked at a frequency that is one-fourth that of
    the bus clock.

This build is enabled by the --with-tsc flag to configure.
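
As a rough illustration of the time base arithmetic for the G3 parts quoted
above, the sketch below converts a tick count to seconds and toggles the dump
where sys.settscdump() is available.  Both helper names are hypothetical, and
the bus clock is whatever your processor manual gives; the 100 MHz figure in
the usage note is only an example value.

```python
import sys


def timebase_ticks_to_seconds(ticks, bus_clock_hz):
    """Convert time base ticks to seconds for a part whose time base runs
    at one-fourth of the bus clock (as quoted for the 750CX/CXe/FX)."""
    timebase_hz = bus_clock_hz / 4
    return ticks / timebase_hz


def enable_tsc_dump(enabled=True):
    """Call sys.settscdump() if this is a WITH_TSC build.

    Returns True if the call was made, False on builds without WITH_TSC.
    """
    setter = getattr(sys, "settscdump", None)
    if setter is None:
        return False
    setter(enabled)
    return True
```

With an assumed 100 MHz bus clock, 25,000,000 ticks correspond to one second:
``timebase_ticks_to_seconds(25_000_000, 100_000_000)`` gives 1.0.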