:mod:`mmap` --- Memory-mapped file support
==========================================

.. module:: mmap
   :synopsis: Interface to memory-mapped files for Unix and Windows.

Memory-mapped file objects behave like both strings and file objects.  Unlike
normal string objects, however, they are mutable.  You can use mmap objects in
most places where strings are expected; for example, you can use the :mod:`re`
module to search through a memory-mapped file.  Since they're mutable, you can
change a single character by doing ``obj[index] = 'a'``, or change a substring
by assigning to a slice: ``obj[i1:i2] = '...'`` (the new content must have the
same size as the slice it replaces).  You can also read and write data starting
at the current file position, and :meth:`seek` through the file to different
positions.

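The following is a minimal sketch of the string-like and file-like behaviour
described above, not a definitive recipe.  The scratch file created with
``tempfile.mkstemp()`` and its contents are assumptions made for illustration.
The sketch uses bytes literals and single-argument ``print()`` calls so that it
should run on Python 2.6 and later as well as on Python 3.

```python
import mmap
import os
import re
import tempfile

# Create a small scratch file to map (name and contents are illustrative).
fd, path = tempfile.mkstemp()
os.write(fd, b"spam=1\neggs=22\nham=333\n")
os.close(fd)

f = open(path, "r+b")
m = mmap.mmap(f.fileno(), 0)       # length 0 maps the whole file

# The re module can search the mapping directly.
match = re.search(b"eggs=([0-9]+)", m)
value = match.group(1)             # the digits that follow "eggs="

# Slice assignment changes bytes in place; the length must stay the same.
start, end = match.span(1)
m[start:end] = b"99"

m.seek(0)
first_line = m.readline()          # file-style access from position 0
contents = m[:]                    # string-style access to the whole map

m.close()
f.close()
os.remove(path)

print(value)
print(first_line)
print(contents)
```
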
A memory-mapped file is created by the :class:`mmap` constructor, which is
different on Unix and on Windows.  In either case you must provide a file
descriptor for a file opened for update.  If you wish to map an existing Python
file object, use its :meth:`fileno` method to obtain the correct value for the
*fileno* parameter.  Otherwise, you can open the file using the
:func:`os.open` function, which returns a file descriptor directly (the file
still needs to be closed when done).

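As a hedged illustration of the :func:`os.open` route, the sketch below maps a
file through a raw descriptor and closes that descriptor explicitly afterwards.
The scratch file and the variable names are assumptions, not part of the
module.

```python
import mmap
import os
import tempfile

# Scratch file for the demonstration; tempfile generates the name.
fd, path = tempfile.mkstemp()
os.write(fd, b"0123456789")
os.close(fd)

# os.open() returns a file descriptor directly; O_RDWR opens it for update.
fd = os.open(path, os.O_RDWR)
try:
    m = mmap.mmap(fd, 0)           # map the whole file
    try:
        m[0:4] = b"abcd"           # modify the file through the mapping
        mapped = m[:]
    finally:
        m.close()
finally:
    os.close(fd)                   # the descriptor still has to be closed

# Read the file back normally to confirm that the change reached it.
f = open(path, "rb")
on_disk = f.read()
f.close()
os.remove(path)

print(mapped)
print(on_disk)
```
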
For both the Unix and Windows versions of the constructor, *access* may be
specified as an optional keyword parameter.  *access* accepts one of three
values: :const:`ACCESS_READ`, :const:`ACCESS_WRITE`, or :const:`ACCESS_COPY`
to specify read-only, write-through or copy-on-write memory respectively.
If *access* is not specified, Windows mmap returns a write-through mapping.
The initial memory values for all three access types are taken from the
specified file.  Assignment to an :const:`ACCESS_READ` memory map raises a
:exc:`TypeError` exception.  Assignment to an :const:`ACCESS_WRITE` memory map
affects both memory and the underlying file.  Assignment to an
:const:`ACCESS_COPY` memory map affects memory but does not update the
underlying file.

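The three access modes can be compared side by side.  This is only a sketch
under the assumption of a small scratch file created with ``tempfile``; the
variable names are hypothetical.

```python
import mmap
import os
import tempfile

fd, path = tempfile.mkstemp()
os.write(fd, b"Hello Python!\n")
os.close(fd)

f = open(path, "r+b")

# ACCESS_READ: the mapping is read-only, so assignment raises TypeError.
ro = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
try:
    ro[0:5] = b"HELLO"
    read_only_rejected = False
except TypeError:
    read_only_rejected = True
ro.close()

# ACCESS_COPY: the change is visible in memory but never reaches the file.
cow = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_COPY)
cow[0:5] = b"HELLO"
copy_view = cow[:]
cow.close()
check = open(path, "rb")
disk_after_copy = check.read()
check.close()

# ACCESS_WRITE: the change goes to memory and to the underlying file.
rw = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_WRITE)
rw[6:12] = b"mmap!!"
rw.flush()
rw.close()
f.close()

check = open(path, "rb")
disk_after_write = check.read()
check.close()
os.remove(path)
```
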
.. versionchanged:: 2.5
   To map anonymous memory, -1 should be passed as the fileno along with the
   length.

.. versionchanged:: 2.6
   ``mmap.mmap`` was formerly a factory function creating mmap objects.  Now
   ``mmap.mmap`` is the class itself.

.. class:: mmap(fileno, length[, tagname[, access[, offset]]])

   **(Windows version)** Maps *length* bytes from the file specified by the
   file handle *fileno*, and creates a mmap object.  If *length* is larger
   than the current size of the file, the file is extended to contain *length*
   bytes.  If *length* is ``0``, the maximum length of the map is the current
   size of the file, except that if the file is empty Windows raises an
   exception (you cannot create an empty mapping on Windows).

   *tagname*, if specified and not ``None``, is a string giving a tag name for
   the mapping.  Windows allows you to have many different mappings against
   the same file.  If you specify the name of an existing tag, that tag is
   opened; otherwise a new tag of this name is created.  If this parameter is
   omitted or ``None``, the mapping is created without a name.  Avoiding the
   use of the tag parameter will assist in keeping your code portable between
   Unix and Windows.

   *offset* may be specified as a non-negative integer offset.  mmap references
   will be relative to the offset from the beginning of the file.  *offset*
   defaults to 0.  *offset* must be a multiple of
   :const:`ALLOCATIONGRANULARITY`.

.. class:: mmap(fileno, length[, flags[, prot[, access[, offset]]]])
   :noindex:

   **(Unix version)** Maps *length* bytes from the file specified by the file
   descriptor *fileno*, and returns a mmap object.  If *length* is ``0``, the
   maximum length of the map will be the current size of the file when
   :class:`mmap` is called.

   *flags* specifies the nature of the mapping.  :const:`MAP_PRIVATE` creates a
   private copy-on-write mapping, so changes to the contents of the mmap
   object will be private to this process, and :const:`MAP_SHARED` creates a
   mapping that's shared with all other processes mapping the same areas of
   the file.  The default value is :const:`MAP_SHARED`.

   *prot*, if specified, gives the desired memory protection; the two most
   useful values are :const:`PROT_READ` and :const:`PROT_WRITE`, to specify
   that the pages may be read or written.  *prot* defaults to
   :const:`PROT_READ \| PROT_WRITE`.

   *access* may be specified in lieu of *flags* and *prot* as an optional
   keyword parameter.  It is an error to specify *access* together with
   *flags* or *prot*.  See the description of *access* above for information
   on how to use this parameter.

   *offset* may be specified as a non-negative integer offset.  mmap references
   will be relative to the offset from the beginning of the file.  *offset*
   defaults to 0.  *offset* must be a multiple of :const:`PAGESIZE` or
   :const:`ALLOCATIONGRANULARITY`.

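To illustrate the *offset* rule, the sketch below maps only the second
granularity-sized block of a file.  The two-block file layout is an assumption
chosen for the example, and the exact exception raised for a misaligned offset
is platform dependent, so the sketch catches both plausible types.

```python
import mmap
import os
import tempfile

gran = mmap.ALLOCATIONGRANULARITY

# Build a file of two granularity-sized blocks: all "A"s, then all "B"s.
fd, path = tempfile.mkstemp()
os.write(fd, b"A" * gran + b"B" * gran)
os.close(fd)

f = open(path, "r+b")

# Map only the second block; index 0 of the map is file offset `gran`.
m = mmap.mmap(f.fileno(), gran, access=mmap.ACCESS_WRITE, offset=gran)
first_bytes = m[:4]
map_length = len(m)
m.close()

# An offset that is not a multiple of the granularity is rejected.
try:
    bad = mmap.mmap(f.fileno(), 16, access=mmap.ACCESS_WRITE, offset=1)
    bad.close()
    misaligned_rejected = False
except (ValueError, EnvironmentError):
    misaligned_rejected = True

f.close()
os.remove(path)
```
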
This example shows a simple way of using :class:`mmap`::

   import mmap

   # write a simple example file (binary mode keeps the size exact)
   with open("hello.txt", "wb") as f:
       f.write("Hello Python!\n")

   with open("hello.txt", "r+b") as f:
       # memory-map the file, size 0 means whole file
       map = mmap.mmap(f.fileno(), 0)
       # read content via standard file methods
       print map.readline()  # prints "Hello Python!"
       # read content via slice notation
       print map[:5]  # prints "Hello"
       # update content using slice notation;
       # note that new content must have same size
       map[6:] = " world!\n"
       # ... and read again using standard file methods
       map.seek(0)
       print map.readline()  # prints "Hello  world!"
       # close the map
       map.close()

The next example demonstrates how to create an anonymous map and exchange
data between the parent and child processes::

   import mmap
   import os

   map = mmap.mmap(-1, 13)
   map.write("Hello world!")

   pid = os.fork()

   if pid == 0:  # In a child process
       map.seek(0)
       print map.readline()

       map.close()

Memory-mapped file objects support the following methods:

.. method:: close()

   Close the file.  Subsequent calls to other methods of the object will
   result in an exception being raised.

.. method:: find(string[, start[, end]])

   Returns the lowest index in the object where the substring *string* is
   found, such that *string* is contained in the range [*start*, *end*].
   Optional arguments *start* and *end* are interpreted as in slice notation.
   Returns ``-1`` on failure.

.. method:: flush([offset, size])

   Flushes changes made to the in-memory copy of a file back to disk.  Without
   use of this call there is no guarantee that changes are written back before
   the object is destroyed.  If *offset* and *size* are specified, only
   changes to the given range of bytes will be flushed to disk; otherwise, the
   whole extent of the mapping is flushed.

   **(Windows version)** A nonzero value returned indicates success; zero
   indicates failure.

   **(Unix version)** A zero value is returned to indicate success.  An
   exception is raised when the call fails.

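A small sketch of both call forms follows.  The return value of ``flush()`` is
deliberately ignored here because, as described above, its meaning differs
between platforms.  The scratch file is an assumption made for the example.

```python
import mmap
import os
import tempfile

fd, path = tempfile.mkstemp()
os.write(fd, b"." * 64)
os.close(fd)

f = open(path, "r+b")
m = mmap.mmap(f.fileno(), 0)

m[0:5] = b"first"
m.flush()                  # write back the whole mapping

m[10:16] = b"second"
m.flush(0, len(m))         # explicit range: offset 0, size of the mapping

m.close()
f.close()

# Read the file back through ordinary file I/O.
f = open(path, "rb")
on_disk = f.read()
f.close()
os.remove(path)
```
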
.. method:: move(dest, src, count)

   Copy the *count* bytes starting at offset *src* to the destination index
   *dest*.  If the mmap was created with :const:`ACCESS_READ`, then calls to
   :meth:`move` will raise a :exc:`TypeError` exception.

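For instance, under the assumption of a 16-byte anonymous map (which starts out
zero-filled), a ``move()`` call might look like this sketch:

```python
import mmap

m = mmap.mmap(-1, 16)      # anonymous memory, initially zero-filled
m[0:5] = b"hello"

m.move(8, 0, 5)            # copy 5 bytes from index 0 to index 8

source = m[0:5]            # the source bytes are left in place
copied = m[8:13]           # the destination now holds a copy
untouched = m[5:8]         # bytes outside both ranges are unchanged
m.close()
```
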
.. method:: read(num)

   Return a string containing up to *num* bytes starting from the current
   file position; the file position is updated to point after the bytes that
   were returned.

.. method:: read_byte()

   Returns a string of length 1 containing the character at the current file
   position, and advances the file position by 1.

.. method:: readline()

   Returns a single line, starting at the current file position and up to the
   next newline.

.. method:: resize(newsize)

   Resizes the map and the underlying file, if any.  If the mmap was created
   with :const:`ACCESS_READ` or :const:`ACCESS_COPY`, resizing the map will
   raise a :exc:`TypeError` exception.

.. method:: rfind(string[, start[, end]])

   Returns the highest index in the object where the substring *string* is
   found, such that *string* is contained in the range [*start*, *end*].
   Optional arguments *start* and *end* are interpreted as in slice notation.
   Returns ``-1`` on failure.

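The sketch below contrasts ``find()`` and ``rfind()`` on an anonymous map.  The
sample data is made up for the example.  Note that, because *start* and *end*
follow slice notation, *end* itself is excluded from the search range.

```python
import mmap

m = mmap.mmap(-1, 32)
data = b"one,two,one,two"
m[0:len(data)] = data

first = m.find(b"two")                   # lowest index
last = m.rfind(b"two")                   # highest index
after_first = m.find(b"two", first + 1)  # search resumes past the first hit
in_prefix = m.rfind(b"two", 0, 8)        # only bytes 0..7 are considered
cut_short = m.find(b"two", 0, 6)         # b"one,tw" does not contain a match
missing = m.find(b"three")               # not present anywhere
m.close()
```
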
.. method:: seek(pos[, whence])

   Set the file's current position.  The *whence* argument is optional and
   defaults to ``os.SEEK_SET`` or ``0`` (absolute file positioning); other
   values are ``os.SEEK_CUR`` or ``1`` (seek relative to the current
   position) and ``os.SEEK_END`` or ``2`` (seek relative to the file's end).

.. method:: size()

   Return the length of the file, which can be larger than the size of the
   memory-mapped area.

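The difference between the file length and the mapped length can be seen by
mapping only part of a file.  The 100-byte scratch file and the 10-byte mapping
are arbitrary choices for this sketch.

```python
import mmap
import os
import tempfile

fd, path = tempfile.mkstemp()
os.write(fd, b"z" * 100)
os.close(fd)

f = open(path, "r+b")
m = mmap.mmap(f.fileno(), 10)    # map only the first 10 bytes

mapped_length = len(m)           # size of the memory-mapped area
file_length = m.size()           # size of the underlying file

m.close()
f.close()
os.remove(path)
```
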
.. method:: tell()

   Returns the current position of the file pointer.

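The interplay of ``seek()``, ``tell()``, ``read()`` and ``write()`` can be
sketched on a 10-byte anonymous map.  The sizes and offsets are arbitrary
values chosen for illustration.

```python
import mmap
import os

m = mmap.mmap(-1, 10)
m.write(b"0123456789")           # writing advances the position
pos_after_write = m.tell()

m.seek(2)                        # absolute positioning (os.SEEK_SET)
a = m.read(3)                    # reads 3 bytes and advances to position 5

m.seek(2, os.SEEK_CUR)           # relative to the current position
b = m.read(1)

m.seek(-4, os.SEEK_END)          # relative to the end of the map
c = m.read(100)                  # returns at most the bytes that remain
end_pos = m.tell()
m.close()
```
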
.. method:: write(string)

   Write the bytes in *string* into memory at the current position of the
   file pointer; the file position is updated to point after the bytes that
   were written.  If the mmap was created with :const:`ACCESS_READ`, then
   writing to it will raise a :exc:`TypeError` exception.

.. method:: write_byte(byte)

   Write the single-character string *byte* into memory at the current
   position of the file pointer; the file position is advanced by ``1``.  If
   the mmap was created with :const:`ACCESS_READ`, then writing to it will
   raise a :exc:`TypeError` exception.