2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
:mod:`cgi` --- Common Gateway Interface support.
|
|
|
|
================================================
|
|
|
|
|
|
|
|
.. module:: cgi
|
|
|
|
:synopsis: Helpers for running Python scripts via the Common Gateway Interface.
|
|
|
|
|
|
|
|
|
|
|
|
.. index::
|
|
|
|
pair: WWW; server
|
|
|
|
pair: CGI; protocol
|
|
|
|
pair: HTTP; protocol
|
|
|
|
pair: MIME; headers
|
|
|
|
single: URL
|
|
|
|
single: Common Gateway Interface
|
|
|
|
|
|
|
|
Support module for Common Gateway Interface (CGI) scripts.
|
|
|
|
|
|
|
|
This module defines a number of utilities for use by CGI scripts written in
|
|
|
|
Python.
|
|
|
|
|
|
|
|
|
|
|
|
Introduction
|
|
|
|
------------
|
|
|
|
|
|
|
|
.. _cgi-intro:
|
|
|
|
|
|
|
|
A CGI script is invoked by an HTTP server, usually to process user input
|
|
|
|
submitted through an HTML ``<FORM>`` or ``<ISINDEX>`` element.
|
|
|
|
|
|
|
|
Most often, CGI scripts live in the server's special :file:`cgi-bin` directory.
|
|
|
|
The HTTP server places all sorts of information about the request (such as the
|
|
|
|
client's hostname, the requested URL, the query string, and lots of other
|
|
|
|
goodies) in the script's shell environment, executes the script, and sends the
|
|
|
|
script's output back to the client.
|
|
|
|
|
|
|
|
The script's input is connected to the client too, and sometimes the form data
|
|
|
|
is read this way; at other times the form data is passed via the "query string"
|
|
|
|
part of the URL. This module is intended to take care of the different cases
|
|
|
|
and provide a simpler interface to the Python script. It also provides a number
|
|
|
|
of utilities that help in debugging scripts, and the latest addition is support
|
|
|
|
for file uploads from a form (if your browser supports it).
|
|
|
|
|
|
|
|
The output of a CGI script should consist of two sections, separated by a blank
|
|
|
|
line. The first section contains a number of headers, telling the client what
|
|
|
|
kind of data is following. Python code to generate a minimal header section
|
|
|
|
looks like this::
|
|
|
|
|
2007-09-04 04:15:32 -03:00
|
|
|
print("Content-Type: text/html") # HTML is following
|
|
|
|
print() # blank line, end of headers
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
The second section is usually HTML, which allows the client software to display
|
|
|
|
nicely formatted text with header, in-line images, etc. Here's Python code that
|
|
|
|
prints a simple piece of HTML::
|
|
|
|
|
2007-09-04 04:15:32 -03:00
|
|
|
print("<TITLE>CGI script output</TITLE>")
|
|
|
|
print("<H1>This is my first CGI script</H1>")
|
|
|
|
print("Hello, world!")
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
.. _using-the-cgi-module:
|
|
|
|
|
|
|
|
Using the cgi module
|
|
|
|
--------------------
|
|
|
|
|
2008-05-11 18:42:51 -03:00
|
|
|
Begin by writing ``import cgi``.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
Merged revisions 69803-69805,69840,69901,69905,69907,69924,69927,69987 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r69803 | georg.brandl | 2009-02-20 01:48:21 -0600 (Fri, 20 Feb 2009) | 1 line
#5327: fix a broken link by joining it.
........
r69804 | georg.brandl | 2009-02-20 02:22:21 -0600 (Fri, 20 Feb 2009) | 1 line
At least separate imports from other statements.
........
r69805 | georg.brandl | 2009-02-20 02:45:47 -0600 (Fri, 20 Feb 2009) | 2 lines
Fix punctuation.
........
r69840 | georg.brandl | 2009-02-21 13:09:40 -0600 (Sat, 21 Feb 2009) | 1 line
#5338, #5339: two types in the API manual.
........
r69901 | georg.brandl | 2009-02-23 05:24:46 -0600 (Mon, 23 Feb 2009) | 2 lines
#5349: C++ pure virtuals can also have an implementation.
........
r69905 | georg.brandl | 2009-02-23 09:51:27 -0600 (Mon, 23 Feb 2009) | 2 lines
#5352: str.count() counts non-overlapping instances.
........
r69907 | georg.brandl | 2009-02-23 12:33:48 -0600 (Mon, 23 Feb 2009) | 1 line
Fix grammar.
........
r69924 | benjamin.peterson | 2009-02-23 20:45:35 -0600 (Mon, 23 Feb 2009) | 1 line
update README on running tests
........
r69927 | neil.schemenauer | 2009-02-23 22:23:25 -0600 (Mon, 23 Feb 2009) | 1 line
Fix call to os.waitpid, it does not take keyword args.
........
r69987 | benjamin.peterson | 2009-02-25 18:30:11 -0600 (Wed, 25 Feb 2009) | 1 line
fix str.format()'s first arg #5371
........
2009-02-25 23:38:59 -04:00
|
|
|
When you write a new script, consider adding these lines::
|
2007-08-15 11:28:22 -03:00
|
|
|
|
Merged revisions 69803-69805,69840,69901,69905,69907,69924,69927,69987 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r69803 | georg.brandl | 2009-02-20 01:48:21 -0600 (Fri, 20 Feb 2009) | 1 line
#5327: fix a broken link by joining it.
........
r69804 | georg.brandl | 2009-02-20 02:22:21 -0600 (Fri, 20 Feb 2009) | 1 line
At least separate imports from other statements.
........
r69805 | georg.brandl | 2009-02-20 02:45:47 -0600 (Fri, 20 Feb 2009) | 2 lines
Fix punctuation.
........
r69840 | georg.brandl | 2009-02-21 13:09:40 -0600 (Sat, 21 Feb 2009) | 1 line
#5338, #5339: two types in the API manual.
........
r69901 | georg.brandl | 2009-02-23 05:24:46 -0600 (Mon, 23 Feb 2009) | 2 lines
#5349: C++ pure virtuals can also have an implementation.
........
r69905 | georg.brandl | 2009-02-23 09:51:27 -0600 (Mon, 23 Feb 2009) | 2 lines
#5352: str.count() counts non-overlapping instances.
........
r69907 | georg.brandl | 2009-02-23 12:33:48 -0600 (Mon, 23 Feb 2009) | 1 line
Fix grammar.
........
r69924 | benjamin.peterson | 2009-02-23 20:45:35 -0600 (Mon, 23 Feb 2009) | 1 line
update README on running tests
........
r69927 | neil.schemenauer | 2009-02-23 22:23:25 -0600 (Mon, 23 Feb 2009) | 1 line
Fix call to os.waitpid, it does not take keyword args.
........
r69987 | benjamin.peterson | 2009-02-25 18:30:11 -0600 (Wed, 25 Feb 2009) | 1 line
fix str.format()'s first arg #5371
........
2009-02-25 23:38:59 -04:00
|
|
|
import cgitb
|
|
|
|
cgitb.enable()
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
This activates a special exception handler that will display detailed reports in
|
|
|
|
the Web browser if any errors occur. If you'd rather not show the guts of your
|
|
|
|
program to users of your script, you can have the reports saved to files
|
Merged revisions 69803-69805,69840,69901,69905,69907,69924,69927,69987 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r69803 | georg.brandl | 2009-02-20 01:48:21 -0600 (Fri, 20 Feb 2009) | 1 line
#5327: fix a broken link by joining it.
........
r69804 | georg.brandl | 2009-02-20 02:22:21 -0600 (Fri, 20 Feb 2009) | 1 line
At least separate imports from other statements.
........
r69805 | georg.brandl | 2009-02-20 02:45:47 -0600 (Fri, 20 Feb 2009) | 2 lines
Fix punctuation.
........
r69840 | georg.brandl | 2009-02-21 13:09:40 -0600 (Sat, 21 Feb 2009) | 1 line
#5338, #5339: two types in the API manual.
........
r69901 | georg.brandl | 2009-02-23 05:24:46 -0600 (Mon, 23 Feb 2009) | 2 lines
#5349: C++ pure virtuals can also have an implementation.
........
r69905 | georg.brandl | 2009-02-23 09:51:27 -0600 (Mon, 23 Feb 2009) | 2 lines
#5352: str.count() counts non-overlapping instances.
........
r69907 | georg.brandl | 2009-02-23 12:33:48 -0600 (Mon, 23 Feb 2009) | 1 line
Fix grammar.
........
r69924 | benjamin.peterson | 2009-02-23 20:45:35 -0600 (Mon, 23 Feb 2009) | 1 line
update README on running tests
........
r69927 | neil.schemenauer | 2009-02-23 22:23:25 -0600 (Mon, 23 Feb 2009) | 1 line
Fix call to os.waitpid, it does not take keyword args.
........
r69987 | benjamin.peterson | 2009-02-25 18:30:11 -0600 (Wed, 25 Feb 2009) | 1 line
fix str.format()'s first arg #5371
........
2009-02-25 23:38:59 -04:00
|
|
|
instead, with code like this::
|
2007-08-15 11:28:22 -03:00
|
|
|
|
Merged revisions 69803-69805,69840,69901,69905,69907,69924,69927,69987 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r69803 | georg.brandl | 2009-02-20 01:48:21 -0600 (Fri, 20 Feb 2009) | 1 line
#5327: fix a broken link by joining it.
........
r69804 | georg.brandl | 2009-02-20 02:22:21 -0600 (Fri, 20 Feb 2009) | 1 line
At least separate imports from other statements.
........
r69805 | georg.brandl | 2009-02-20 02:45:47 -0600 (Fri, 20 Feb 2009) | 2 lines
Fix punctuation.
........
r69840 | georg.brandl | 2009-02-21 13:09:40 -0600 (Sat, 21 Feb 2009) | 1 line
#5338, #5339: two types in the API manual.
........
r69901 | georg.brandl | 2009-02-23 05:24:46 -0600 (Mon, 23 Feb 2009) | 2 lines
#5349: C++ pure virtuals can also have an implementation.
........
r69905 | georg.brandl | 2009-02-23 09:51:27 -0600 (Mon, 23 Feb 2009) | 2 lines
#5352: str.count() counts non-overlapping instances.
........
r69907 | georg.brandl | 2009-02-23 12:33:48 -0600 (Mon, 23 Feb 2009) | 1 line
Fix grammar.
........
r69924 | benjamin.peterson | 2009-02-23 20:45:35 -0600 (Mon, 23 Feb 2009) | 1 line
update README on running tests
........
r69927 | neil.schemenauer | 2009-02-23 22:23:25 -0600 (Mon, 23 Feb 2009) | 1 line
Fix call to os.waitpid, it does not take keyword args.
........
r69987 | benjamin.peterson | 2009-02-25 18:30:11 -0600 (Wed, 25 Feb 2009) | 1 line
fix str.format()'s first arg #5371
........
2009-02-25 23:38:59 -04:00
|
|
|
import cgitb
|
|
|
|
cgitb.enable(display=0, logdir="/tmp")
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
It's very helpful to use this feature during script development. The reports
|
|
|
|
produced by :mod:`cgitb` provide information that can save you a lot of time in
|
|
|
|
tracking down bugs. You can always remove the ``cgitb`` line later when you
|
|
|
|
have tested your script and are confident that it works correctly.
|
|
|
|
|
2008-05-11 18:42:51 -03:00
|
|
|
To get at submitted form data, use the :class:`FieldStorage` class. Instantiate
|
|
|
|
it exactly once, without arguments. This reads the form contents from standard
|
|
|
|
input or the environment (depending on the value of various environment
|
|
|
|
variables set according to the CGI standard). Since it may consume standard
|
|
|
|
input, it should be instantiated only once.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
The :class:`FieldStorage` instance can be indexed like a Python dictionary, and
|
2007-09-01 20:34:30 -03:00
|
|
|
also supports the standard dictionary methods :meth:`__contains__` and
|
|
|
|
:meth:`keys`. The built-in :func:`len` is also supported. Form fields
|
|
|
|
containing empty strings are ignored and do not appear in the dictionary; to
|
|
|
|
keep such values, provide a true value for the optional *keep_blank_values*
|
|
|
|
keyword parameter when creating the :class:`FieldStorage` instance.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
For instance, the following code (which assumes that the
|
|
|
|
:mailheader:`Content-Type` header and blank line have already been printed)
|
|
|
|
checks that the fields ``name`` and ``addr`` are both set to a non-empty
|
|
|
|
string::
|
|
|
|
|
|
|
|
form = cgi.FieldStorage()
|
2007-09-01 20:34:30 -03:00
|
|
|
if not ("name" in form and "addr" in form):
|
2007-09-04 04:15:32 -03:00
|
|
|
print("<H1>Error</H1>")
|
|
|
|
print("Please fill in the name and addr fields.")
|
2007-08-15 11:28:22 -03:00
|
|
|
return
|
2007-09-04 04:15:32 -03:00
|
|
|
print("<p>name:", form["name"].value)
|
|
|
|
print("<p>addr:", form["addr"].value)
|
2007-08-15 11:28:22 -03:00
|
|
|
...further form processing here...
|
|
|
|
|
|
|
|
Here the fields, accessed through ``form[key]``, are themselves instances of
|
|
|
|
:class:`FieldStorage` (or :class:`MiniFieldStorage`, depending on the form
|
|
|
|
encoding). The :attr:`value` attribute of the instance yields the string value
|
|
|
|
of the field. The :meth:`getvalue` method returns this string value directly;
|
|
|
|
it also accepts an optional second argument as a default to return if the
|
|
|
|
requested key is not present.
|
|
|
|
|
|
|
|
If the submitted form data contains more than one field with the same name, the
|
|
|
|
object retrieved by ``form[key]`` is not a :class:`FieldStorage` or
|
|
|
|
:class:`MiniFieldStorage` instance but a list of such instances. Similarly, in
|
|
|
|
this situation, ``form.getvalue(key)`` would return a list of strings. If you
|
|
|
|
expect this possibility (when your HTML form contains multiple fields with the
|
|
|
|
same name), use the :func:`getlist` function, which always returns a list of
|
|
|
|
values (so that you do not need to special-case the single item case). For
|
|
|
|
example, this code concatenates any number of username fields, separated by
|
|
|
|
commas::
|
|
|
|
|
|
|
|
value = form.getlist("username")
|
|
|
|
usernames = ",".join(value)
|
|
|
|
|
|
|
|
If a field represents an uploaded file, accessing the value via the
|
|
|
|
:attr:`value` attribute or the :func:`getvalue` method reads the entire file in
|
|
|
|
memory as a string. This may not be what you want. You can test for an uploaded
|
|
|
|
file by testing either the :attr:`filename` attribute or the :attr:`file`
|
|
|
|
attribute. You can then read the data at leisure from the :attr:`file`
|
|
|
|
attribute::
|
|
|
|
|
|
|
|
fileitem = form["userfile"]
|
|
|
|
if fileitem.file:
|
|
|
|
# It's an uploaded file; count lines
|
|
|
|
linecount = 0
|
2007-09-09 21:49:57 -03:00
|
|
|
while True:
|
2007-08-15 11:28:22 -03:00
|
|
|
line = fileitem.file.readline()
|
|
|
|
if not line: break
|
|
|
|
linecount = linecount + 1
|
|
|
|
|
2007-09-18 20:39:35 -03:00
|
|
|
If an error is encountered when obtaining the contents of an uploaded file
|
|
|
|
(for example, when the user interrupts the form submission by clicking on
|
|
|
|
a Back or Cancel button) the :attr:`done` attribute of the object for the
|
|
|
|
field will be set to the value -1.
|
|
|
|
|
2007-08-15 11:28:22 -03:00
|
|
|
The file upload draft standard entertains the possibility of uploading multiple
|
|
|
|
files from one field (using a recursive :mimetype:`multipart/\*` encoding).
|
|
|
|
When this occurs, the item will be a dictionary-like :class:`FieldStorage` item.
|
|
|
|
This can be determined by testing its :attr:`type` attribute, which should be
|
|
|
|
:mimetype:`multipart/form-data` (or perhaps another MIME type matching
|
|
|
|
:mimetype:`multipart/\*`). In this case, it can be iterated over recursively
|
|
|
|
just like the top-level form object.
|
|
|
|
|
|
|
|
When a form is submitted in the "old" format (as the query string or as a single
|
|
|
|
data part of type :mimetype:`application/x-www-form-urlencoded`), the items will
|
|
|
|
actually be instances of the class :class:`MiniFieldStorage`. In this case, the
|
|
|
|
:attr:`list`, :attr:`file`, and :attr:`filename` attributes are always ``None``.
|
|
|
|
|
Merged revisions 64434-64435,64440-64443,64445,64447-64448,64450,64452,64455,64461,64464,64466,64468 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r64434 | andrew.kuchling | 2008-06-20 18:13:58 -0500 (Fri, 20 Jun 2008) | 1 line
Remove request for e-mail; it's unlikely these classes will be saved
........
r64435 | andrew.kuchling | 2008-06-20 18:14:32 -0500 (Fri, 20 Jun 2008) | 1 line
Grammar fixes
........
r64440 | andrew.kuchling | 2008-06-21 08:29:12 -0500 (Sat, 21 Jun 2008) | 1 line
Docstring typo
........
r64441 | andrew.kuchling | 2008-06-21 08:47:20 -0500 (Sat, 21 Jun 2008) | 1 line
Use repr() for bad input strings; this makes the empty string or binary characters more visible
........
r64442 | andrew.kuchling | 2008-06-21 08:48:38 -0500 (Sat, 21 Jun 2008) | 1 line
Docstring correction
........
r64443 | georg.brandl | 2008-06-21 09:26:19 -0500 (Sat, 21 Jun 2008) | 2 lines
Documentation fix.
........
r64445 | facundo.batista | 2008-06-21 12:30:06 -0500 (Sat, 21 Jun 2008) | 3 lines
Reviewed and updated the documentation. Fixes #3017.
........
r64447 | facundo.batista | 2008-06-21 13:58:04 -0500 (Sat, 21 Jun 2008) | 6 lines
Now a from submitted via POST that also has a query string
will contain both FieldStorage and MiniFieldStorage items.
Fixes #1817.
........
r64448 | facundo.batista | 2008-06-21 14:48:19 -0500 (Sat, 21 Jun 2008) | 5 lines
In the deprecated functions I added an alert to review
specially a section of the subprocess documentation
that helps with the replacing of those functionss.
........
r64450 | georg.brandl | 2008-06-22 04:05:29 -0500 (Sun, 22 Jun 2008) | 2 lines
Turn section references into proper cross-references.
........
r64452 | facundo.batista | 2008-06-22 08:36:20 -0500 (Sun, 22 Jun 2008) | 5 lines
Issue #2722. Now the char buffer to support the path string has
not fixed length, it mallocs memory if needed. As a result, we
don't have a maximum for the getcwd() method.
........
r64455 | facundo.batista | 2008-06-22 10:27:10 -0500 (Sun, 22 Jun 2008) | 4 lines
Issue 3164. Small fix to don't repeat a comparation
without necessity.
........
r64461 | georg.brandl | 2008-06-22 13:11:52 -0500 (Sun, 22 Jun 2008) | 2 lines
#3085: Fix syntax error.
........
r64464 | georg.brandl | 2008-06-22 13:31:54 -0500 (Sun, 22 Jun 2008) | 2 lines
Expand docstrings of sqlite3 functions.
........
r64466 | georg.brandl | 2008-06-22 14:07:59 -0500 (Sun, 22 Jun 2008) | 2 lines
Write out "phi" consistently.
........
r64468 | facundo.batista | 2008-06-22 14:35:24 -0500 (Sun, 22 Jun 2008) | 4 lines
Just returning nothing instead of rising TestSkipped, because
it makes the test fail in the trunk.loewis-sun buildbot.
........
2008-07-02 14:30:14 -03:00
|
|
|
A form submitted via POST that also has a query string will contain both
|
|
|
|
:class:`FieldStorage` and :class:`MiniFieldStorage` items.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
Higher Level Interface
|
|
|
|
----------------------
|
|
|
|
|
|
|
|
The previous section explains how to read CGI form data using the
|
|
|
|
:class:`FieldStorage` class. This section describes a higher level interface
|
|
|
|
which was added to this class to allow one to do it in a more readable and
|
|
|
|
intuitive way. The interface doesn't make the techniques described in previous
|
|
|
|
sections obsolete --- they are still useful to process file uploads efficiently,
|
|
|
|
for example.
|
|
|
|
|
Merged revisions 59605-59624 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r59606 | georg.brandl | 2007-12-29 11:57:00 +0100 (Sat, 29 Dec 2007) | 2 lines
Some cleanup in the docs.
........
r59611 | martin.v.loewis | 2007-12-29 19:49:21 +0100 (Sat, 29 Dec 2007) | 2 lines
Bug #1699: Define _BSD_SOURCE only on OpenBSD.
........
r59612 | raymond.hettinger | 2007-12-29 23:09:34 +0100 (Sat, 29 Dec 2007) | 1 line
Simpler documentation for itertools.tee(). Should be backported.
........
r59613 | raymond.hettinger | 2007-12-29 23:16:24 +0100 (Sat, 29 Dec 2007) | 1 line
Improve docs for itertools.groupby(). The use of xrange(0) to create a unique object is less obvious than object().
........
r59620 | christian.heimes | 2007-12-31 15:47:07 +0100 (Mon, 31 Dec 2007) | 3 lines
Added wininst-9.0.exe executable for VS 2008
Integrated bdist_wininst into PCBuild9 directory
........
r59621 | christian.heimes | 2007-12-31 15:51:18 +0100 (Mon, 31 Dec 2007) | 1 line
Moved PCbuild directory to PC/VS7.1
........
r59622 | christian.heimes | 2007-12-31 15:59:26 +0100 (Mon, 31 Dec 2007) | 1 line
Fix paths for build bot
........
r59623 | christian.heimes | 2007-12-31 16:02:41 +0100 (Mon, 31 Dec 2007) | 1 line
Fix paths for build bot, part 2
........
r59624 | christian.heimes | 2007-12-31 16:18:55 +0100 (Mon, 31 Dec 2007) | 1 line
Renamed PCBuild9 directory to PCBuild
........
2007-12-31 12:14:33 -04:00
|
|
|
.. XXX: Is this true ?
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
The interface consists of two simple methods. Using the methods you can process
|
|
|
|
form data in a generic way, without the need to worry whether only one or more
|
|
|
|
values were posted under one name.
|
|
|
|
|
|
|
|
In the previous section, you learned to write following code anytime you
|
|
|
|
expected a user to post more than one value under one name::
|
|
|
|
|
|
|
|
item = form.getvalue("item")
|
|
|
|
if isinstance(item, list):
|
|
|
|
# The user is requesting more than one item.
|
|
|
|
else:
|
|
|
|
# The user is requesting only one item.
|
|
|
|
|
|
|
|
This situation is common for example when a form contains a group of multiple
|
|
|
|
checkboxes with the same name::
|
|
|
|
|
|
|
|
<input type="checkbox" name="item" value="1" />
|
|
|
|
<input type="checkbox" name="item" value="2" />
|
|
|
|
|
|
|
|
In most situations, however, there's only one form control with a particular
|
|
|
|
name in a form and then you expect and need only one value associated with this
|
|
|
|
name. So you write a script containing for example this code::
|
|
|
|
|
|
|
|
user = form.getvalue("user").upper()
|
|
|
|
|
|
|
|
The problem with the code is that you should never expect that a client will
|
|
|
|
provide valid input to your scripts. For example, if a curious user appends
|
|
|
|
another ``user=foo`` pair to the query string, then the script would crash,
|
|
|
|
because in this situation the ``getvalue("user")`` method call returns a list
|
|
|
|
instead of a string. Calling the :meth:`toupper` method on a list is not valid
|
|
|
|
(since lists do not have a method of this name) and results in an
|
|
|
|
:exc:`AttributeError` exception.
|
|
|
|
|
|
|
|
Therefore, the appropriate way to read form data values was to always use the
|
|
|
|
code which checks whether the obtained value is a single value or a list of
|
|
|
|
values. That's annoying and leads to less readable scripts.
|
|
|
|
|
|
|
|
A more convenient approach is to use the methods :meth:`getfirst` and
|
|
|
|
:meth:`getlist` provided by this higher level interface.
|
|
|
|
|
|
|
|
|
|
|
|
.. method:: FieldStorage.getfirst(name[, default])
|
|
|
|
|
|
|
|
This method always returns only one value associated with form field *name*.
|
|
|
|
The method returns only the first value in case that more values were posted
|
|
|
|
under such name. Please note that the order in which the values are received
|
|
|
|
may vary from browser to browser and should not be counted on. [#]_ If no such
|
|
|
|
form field or value exists then the method returns the value specified by the
|
|
|
|
optional parameter *default*. This parameter defaults to ``None`` if not
|
|
|
|
specified.
|
|
|
|
|
|
|
|
|
|
|
|
.. method:: FieldStorage.getlist(name)
|
|
|
|
|
|
|
|
This method always returns a list of values associated with form field *name*.
|
|
|
|
The method returns an empty list if no such form field or value exists for
|
|
|
|
*name*. It returns a list consisting of one item if only one such value exists.
|
|
|
|
|
|
|
|
Using these methods you can write nice compact code::
|
|
|
|
|
|
|
|
import cgi
|
|
|
|
form = cgi.FieldStorage()
|
|
|
|
user = form.getfirst("user", "").upper() # This way it's safe.
|
|
|
|
for item in form.getlist("item"):
|
|
|
|
do_something(item)
|
|
|
|
|
|
|
|
|
|
|
|
.. _functions-in-cgi-module:
|
|
|
|
|
|
|
|
Functions
|
|
|
|
---------
|
|
|
|
|
|
|
|
These are useful if you want more control, or if you want to employ some of the
|
|
|
|
algorithms implemented in this module in other circumstances.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: parse(fp[, keep_blank_values[, strict_parsing]])
|
|
|
|
|
|
|
|
Parse a query in the environment or from a file (the file defaults to
|
|
|
|
``sys.stdin``). The *keep_blank_values* and *strict_parsing* parameters are
|
2008-09-03 19:49:01 -03:00
|
|
|
passed to :func:`urllib.parse.parse_qs` unchanged.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
|
|
|
|
.. function:: parse_qs(qs[, keep_blank_values[, strict_parsing]])
|
|
|
|
|
2008-09-03 19:49:01 -03:00
|
|
|
This function is deprecated in this module. Use :func:`urllib.parse.parse_qs`
|
2009-03-13 16:04:40 -03:00
|
|
|
instead. It is maintained here only for backward compatibility.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
.. function:: parse_qsl(qs[, keep_blank_values[, strict_parsing]])
|
|
|
|
|
2008-09-03 19:49:01 -03:00
|
|
|
This function is deprecated in this module. Use :func:`urllib.parse.parse_qs`
|
2009-03-13 16:04:40 -03:00
|
|
|
instead. It is maintained here only for backward compatibility.
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
.. function:: parse_multipart(fp, pdict)
|
|
|
|
|
|
|
|
Parse input of type :mimetype:`multipart/form-data` (for file uploads).
|
|
|
|
Arguments are *fp* for the input file and *pdict* for a dictionary containing
|
|
|
|
other parameters in the :mailheader:`Content-Type` header.
|
|
|
|
|
2008-09-03 19:49:01 -03:00
|
|
|
Returns a dictionary just like :func:`urllib.parse.parse_qs` keys are the field names, each
|
2007-08-15 11:28:22 -03:00
|
|
|
value is a list of values for that field. This is easy to use but not much good
|
|
|
|
if you are expecting megabytes to be uploaded --- in that case, use the
|
|
|
|
:class:`FieldStorage` class instead which is much more flexible.
|
|
|
|
|
|
|
|
Note that this does not parse nested multipart parts --- use
|
|
|
|
:class:`FieldStorage` for that.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: parse_header(string)
|
|
|
|
|
|
|
|
Parse a MIME header (such as :mailheader:`Content-Type`) into a main value and a
|
|
|
|
dictionary of parameters.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: test()
|
|
|
|
|
|
|
|
Robust test CGI script, usable as main program. Writes minimal HTTP headers and
|
|
|
|
formats all information provided to the script in HTML form.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: print_environ()
|
|
|
|
|
|
|
|
Format the shell environment in HTML.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: print_form(form)
|
|
|
|
|
|
|
|
Format a form in HTML.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: print_directory()
|
|
|
|
|
|
|
|
Format the current directory in HTML.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: print_environ_usage()
|
|
|
|
|
|
|
|
Print a list of useful (used by CGI) environment variables in HTML.
|
|
|
|
|
|
|
|
|
|
|
|
.. function:: escape(s[, quote])
|
|
|
|
|
|
|
|
Convert the characters ``'&'``, ``'<'`` and ``'>'`` in string *s* to HTML-safe
|
|
|
|
sequences. Use this if you need to display text that might contain such
|
|
|
|
characters in HTML. If the optional flag *quote* is true, the quotation mark
|
|
|
|
character (``'"'``) is also translated; this helps for inclusion in an HTML
|
|
|
|
attribute value, as in ``<A HREF="...">``. If the value to be quoted might
|
|
|
|
include single- or double-quote characters, or both, consider using the
|
|
|
|
:func:`quoteattr` function in the :mod:`xml.sax.saxutils` module instead.
|
|
|
|
|
|
|
|
|
|
|
|
.. _cgi-security:
|
|
|
|
|
|
|
|
Caring about security
|
|
|
|
---------------------
|
|
|
|
|
|
|
|
.. index:: pair: CGI; security
|
|
|
|
|
|
|
|
There's one important rule: if you invoke an external program (via the
|
|
|
|
:func:`os.system` or :func:`os.popen` functions. or others with similar
|
|
|
|
functionality), make very sure you don't pass arbitrary strings received from
|
|
|
|
the client to the shell. This is a well-known security hole whereby clever
|
|
|
|
hackers anywhere on the Web can exploit a gullible CGI script to invoke
|
|
|
|
arbitrary shell commands. Even parts of the URL or field names cannot be
|
|
|
|
trusted, since the request doesn't have to come from your form!
|
|
|
|
|
|
|
|
To be on the safe side, if you must pass a string gotten from a form to a shell
|
|
|
|
command, you should make sure the string contains only alphanumeric characters,
|
|
|
|
dashes, underscores, and periods.
|
|
|
|
|
|
|
|
|
|
|
|
Installing your CGI script on a Unix system
|
|
|
|
-------------------------------------------
|
|
|
|
|
|
|
|
Read the documentation for your HTTP server and check with your local system
|
|
|
|
administrator to find the directory where CGI scripts should be installed;
|
|
|
|
usually this is in a directory :file:`cgi-bin` in the server tree.
|
|
|
|
|
|
|
|
Make sure that your script is readable and executable by "others"; the Unix file
|
2008-05-26 14:55:52 -03:00
|
|
|
mode should be ``0o755`` octal (use ``chmod 0755 filename``). Make sure that the
|
2007-08-15 11:28:22 -03:00
|
|
|
first line of the script contains ``#!`` starting in column 1 followed by the
|
|
|
|
pathname of the Python interpreter, for instance::
|
|
|
|
|
|
|
|
#!/usr/local/bin/python
|
|
|
|
|
|
|
|
Make sure the Python interpreter exists and is executable by "others".
|
|
|
|
|
|
|
|
Make sure that any files your script needs to read or write are readable or
|
2008-05-26 14:55:52 -03:00
|
|
|
writable, respectively, by "others" --- their mode should be ``0o644`` for
|
|
|
|
readable and ``0o666`` for writable. This is because, for security reasons, the
|
2007-08-15 11:28:22 -03:00
|
|
|
HTTP server executes your script as user "nobody", without any special
|
|
|
|
privileges. It can only read (write, execute) files that everybody can read
|
|
|
|
(write, execute). The current directory at execution time is also different (it
|
|
|
|
is usually the server's cgi-bin directory) and the set of environment variables
|
|
|
|
is also different from what you get when you log in. In particular, don't count
|
|
|
|
on the shell's search path for executables (:envvar:`PATH`) or the Python module
|
|
|
|
search path (:envvar:`PYTHONPATH`) to be set to anything interesting.
|
|
|
|
|
|
|
|
If you need to load modules from a directory which is not on Python's default
|
|
|
|
module search path, you can change the path in your script, before importing
|
|
|
|
other modules. For example::
|
|
|
|
|
|
|
|
import sys
|
|
|
|
sys.path.insert(0, "/usr/home/joe/lib/python")
|
|
|
|
sys.path.insert(0, "/usr/local/lib/python")
|
|
|
|
|
|
|
|
(This way, the directory inserted last will be searched first!)
|
|
|
|
|
|
|
|
Instructions for non-Unix systems will vary; check your HTTP server's
|
|
|
|
documentation (it will usually have a section on CGI scripts).
|
|
|
|
|
|
|
|
|
|
|
|
Testing your CGI script
|
|
|
|
-----------------------
|
|
|
|
|
|
|
|
Unfortunately, a CGI script will generally not run when you try it from the
|
|
|
|
command line, and a script that works perfectly from the command line may fail
|
|
|
|
mysteriously when run from the server. There's one reason why you should still
|
|
|
|
test your script from the command line: if it contains a syntax error, the
|
|
|
|
Python interpreter won't execute it at all, and the HTTP server will most likely
|
|
|
|
send a cryptic error to the client.
|
|
|
|
|
|
|
|
Assuming your script has no syntax errors, yet it does not work, you have no
|
|
|
|
choice but to read the next section.
|
|
|
|
|
|
|
|
|
|
|
|
Debugging CGI scripts
|
|
|
|
---------------------
|
|
|
|
|
|
|
|
.. index:: pair: CGI; debugging
|
|
|
|
|
|
|
|
First of all, check for trivial installation errors --- reading the section
|
|
|
|
above on installing your CGI script carefully can save you a lot of time. If
|
|
|
|
you wonder whether you have understood the installation procedure correctly, try
|
|
|
|
installing a copy of this module file (:file:`cgi.py`) as a CGI script. When
|
|
|
|
invoked as a script, the file will dump its environment and the contents of the
|
|
|
|
form in HTML form. Give it the right mode etc, and send it a request. If it's
|
|
|
|
installed in the standard :file:`cgi-bin` directory, it should be possible to
|
|
|
|
send it a request by entering a URL into your browser of the form::
|
|
|
|
|
|
|
|
http://yourhostname/cgi-bin/cgi.py?name=Joe+Blow&addr=At+Home
|
|
|
|
|
|
|
|
If this gives an error of type 404, the server cannot find the script -- perhaps
|
|
|
|
you need to install it in a different directory. If it gives another error,
|
|
|
|
there's an installation problem that you should fix before trying to go any
|
|
|
|
further. If you get a nicely formatted listing of the environment and form
|
|
|
|
content (in this example, the fields should be listed as "addr" with value "At
|
|
|
|
Home" and "name" with value "Joe Blow"), the :file:`cgi.py` script has been
|
|
|
|
installed correctly. If you follow the same procedure for your own script, you
|
|
|
|
should now be able to debug it.
|
|
|
|
|
|
|
|
The next step could be to call the :mod:`cgi` module's :func:`test` function
|
|
|
|
from your script: replace its main code with the single statement ::
|
|
|
|
|
|
|
|
cgi.test()
|
|
|
|
|
|
|
|
This should produce the same results as those gotten from installing the
|
|
|
|
:file:`cgi.py` file itself.
|
|
|
|
|
|
|
|
When an ordinary Python script raises an unhandled exception (for whatever
|
|
|
|
reason: of a typo in a module name, a file that can't be opened, etc.), the
|
|
|
|
Python interpreter prints a nice traceback and exits. While the Python
|
|
|
|
interpreter will still do this when your CGI script raises an exception, most
|
|
|
|
likely the traceback will end up in one of the HTTP server's log files, or be
|
|
|
|
discarded altogether.
|
|
|
|
|
|
|
|
Fortunately, once you have managed to get your script to execute *some* code,
|
|
|
|
you can easily send tracebacks to the Web browser using the :mod:`cgitb` module.
|
Merged revisions 69803-69805,69840,69901,69905,69907,69924,69927,69987 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r69803 | georg.brandl | 2009-02-20 01:48:21 -0600 (Fri, 20 Feb 2009) | 1 line
#5327: fix a broken link by joining it.
........
r69804 | georg.brandl | 2009-02-20 02:22:21 -0600 (Fri, 20 Feb 2009) | 1 line
At least separate imports from other statements.
........
r69805 | georg.brandl | 2009-02-20 02:45:47 -0600 (Fri, 20 Feb 2009) | 2 lines
Fix punctuation.
........
r69840 | georg.brandl | 2009-02-21 13:09:40 -0600 (Sat, 21 Feb 2009) | 1 line
#5338, #5339: two types in the API manual.
........
r69901 | georg.brandl | 2009-02-23 05:24:46 -0600 (Mon, 23 Feb 2009) | 2 lines
#5349: C++ pure virtuals can also have an implementation.
........
r69905 | georg.brandl | 2009-02-23 09:51:27 -0600 (Mon, 23 Feb 2009) | 2 lines
#5352: str.count() counts non-overlapping instances.
........
r69907 | georg.brandl | 2009-02-23 12:33:48 -0600 (Mon, 23 Feb 2009) | 1 line
Fix grammar.
........
r69924 | benjamin.peterson | 2009-02-23 20:45:35 -0600 (Mon, 23 Feb 2009) | 1 line
update README on running tests
........
r69927 | neil.schemenauer | 2009-02-23 22:23:25 -0600 (Mon, 23 Feb 2009) | 1 line
Fix call to os.waitpid, it does not take keyword args.
........
r69987 | benjamin.peterson | 2009-02-25 18:30:11 -0600 (Wed, 25 Feb 2009) | 1 line
fix str.format()'s first arg #5371
........
2009-02-25 23:38:59 -04:00
|
|
|
If you haven't done so already, just add the lines::
|
2007-08-15 11:28:22 -03:00
|
|
|
|
Merged revisions 69803-69805,69840,69901,69905,69907,69924,69927,69987 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r69803 | georg.brandl | 2009-02-20 01:48:21 -0600 (Fri, 20 Feb 2009) | 1 line
#5327: fix a broken link by joining it.
........
r69804 | georg.brandl | 2009-02-20 02:22:21 -0600 (Fri, 20 Feb 2009) | 1 line
At least separate imports from other statements.
........
r69805 | georg.brandl | 2009-02-20 02:45:47 -0600 (Fri, 20 Feb 2009) | 2 lines
Fix punctuation.
........
r69840 | georg.brandl | 2009-02-21 13:09:40 -0600 (Sat, 21 Feb 2009) | 1 line
#5338, #5339: two types in the API manual.
........
r69901 | georg.brandl | 2009-02-23 05:24:46 -0600 (Mon, 23 Feb 2009) | 2 lines
#5349: C++ pure virtuals can also have an implementation.
........
r69905 | georg.brandl | 2009-02-23 09:51:27 -0600 (Mon, 23 Feb 2009) | 2 lines
#5352: str.count() counts non-overlapping instances.
........
r69907 | georg.brandl | 2009-02-23 12:33:48 -0600 (Mon, 23 Feb 2009) | 1 line
Fix grammar.
........
r69924 | benjamin.peterson | 2009-02-23 20:45:35 -0600 (Mon, 23 Feb 2009) | 1 line
update README on running tests
........
r69927 | neil.schemenauer | 2009-02-23 22:23:25 -0600 (Mon, 23 Feb 2009) | 1 line
Fix call to os.waitpid, it does not take keyword args.
........
r69987 | benjamin.peterson | 2009-02-25 18:30:11 -0600 (Wed, 25 Feb 2009) | 1 line
fix str.format()'s first arg #5371
........
2009-02-25 23:38:59 -04:00
|
|
|
import cgitb
|
|
|
|
cgitb.enable()
|
2007-08-15 11:28:22 -03:00
|
|
|
|
|
|
|
to the top of your script. Then try running it again; when a problem occurs,
|
|
|
|
you should see a detailed report that will likely make apparent the cause of the
|
|
|
|
crash.
|
|
|
|
|
|
|
|
If you suspect that there may be a problem in importing the :mod:`cgitb` module,
|
|
|
|
you can use an even more robust approach (which only uses built-in modules)::
|
|
|
|
|
|
|
|
import sys
|
|
|
|
sys.stderr = sys.stdout
|
2007-09-04 04:15:32 -03:00
|
|
|
print("Content-Type: text/plain")
|
|
|
|
print()
|
2007-08-15 11:28:22 -03:00
|
|
|
...your code here...
|
|
|
|
|
|
|
|
This relies on the Python interpreter to print the traceback. The content type
|
|
|
|
of the output is set to plain text, which disables all HTML processing. If your
|
|
|
|
script works, the raw HTML will be displayed by your client. If it raises an
|
|
|
|
exception, most likely after the first two lines have been printed, a traceback
|
|
|
|
will be displayed. Because no HTML interpretation is going on, the traceback
|
|
|
|
will be readable.
|
|
|
|
|
|
|
|
|
|
|
|
Common problems and solutions
|
|
|
|
-----------------------------
|
|
|
|
|
|
|
|
* Most HTTP servers buffer the output from CGI scripts until the script is
|
|
|
|
completed. This means that it is not possible to display a progress report on
|
|
|
|
the client's display while the script is running.
|
|
|
|
|
|
|
|
* Check the installation instructions above.
|
|
|
|
|
|
|
|
* Check the HTTP server's log files. (``tail -f logfile`` in a separate window
|
|
|
|
may be useful!)
|
|
|
|
|
|
|
|
* Always check a script for syntax errors first, by doing something like
|
|
|
|
``python script.py``.
|
|
|
|
|
|
|
|
* If your script does not have any syntax errors, try adding ``import cgitb;
|
|
|
|
cgitb.enable()`` to the top of the script.
|
|
|
|
|
|
|
|
* When invoking external programs, make sure they can be found. Usually, this
|
|
|
|
means using absolute path names --- :envvar:`PATH` is usually not set to a very
|
|
|
|
useful value in a CGI script.
|
|
|
|
|
|
|
|
* When reading or writing external files, make sure they can be read or written
|
|
|
|
by the userid under which your CGI script will be running: this is typically the
|
|
|
|
userid under which the web server is running, or some explicitly specified
|
|
|
|
userid for a web server's ``suexec`` feature.
|
|
|
|
|
|
|
|
* Don't try to give a CGI script a set-uid mode. This doesn't work on most
|
|
|
|
systems, and is a security liability as well.
|
|
|
|
|
|
|
|
.. rubric:: Footnotes
|
|
|
|
|
|
|
|
.. [#] Note that some recent versions of the HTML specification do state what order the
|
|
|
|
field values should be supplied in, but knowing whether a request was
|
|
|
|
received from a conforming browser, or even from a browser at all, is tedious
|
|
|
|
and error-prone.
|
|
|
|
|