cpython/Doc/lib/libhttplib.tex

229 lines
7.0 KiB
TeX
Raw Normal View History

\section{\module{httplib} ---
HTTP protocol client}
\declaremodule{standard}{httplib}
2001-09-25 13:32:02 -03:00
\modulesynopsis{HTTP and HTTPS protocol client (requires sockets).}
\indexii{HTTP}{protocol}
2001-12-26 15:48:43 -04:00
\index{HTTP!\module{httplib} (standard module)}
1995-02-27 13:53:25 -04:00
2001-09-25 13:32:02 -03:00
This module defines classes which implement the client side of the
HTTP and HTTPS protocols. It is normally not used directly --- the
module \refmodule{urllib}\refstmodindex{urllib} uses it to handle URLs
that use HTTP and HTTPS. \note{HTTPS support is only
2001-09-25 13:32:02 -03:00
available if the \refmodule{socket} module was compiled with SSL
support.}
1995-02-27 13:53:25 -04:00
The constants defined in this module are:
\begin{datadesc}{HTTP_PORT}
The default port for the HTTP protocol (always \code{80}).
\end{datadesc}
\begin{datadesc}{HTTPS_PORT}
The default port for the HTTPS protocol (always \code{443}).
\end{datadesc}
The module provides the following classes:
\begin{classdesc}{HTTPConnection}{host\optional{, port}}
An \class{HTTPConnection} instance represents one transaction with an HTTP
server. It should be instantiated passing it a host and optional port number.
If no port number is passed, the port is extracted from the host string if it
has the form \code{\var{host}:\var{port}}, else the default HTTP port (80) is
used. For example, the following calls all create instances that connect to
the server at the same host and port:
\begin{verbatim}
>>> h1 = httplib.HTTPConnection('www.cwi.nl')
>>> h2 = httplib.HTTPConnection('www.cwi.nl:80')
>>> h3 = httplib.HTTPConnection('www.cwi.nl', 80)
\end{verbatim}
\end{classdesc}
\begin{classdesc}{HTTPSConnection}{host\optional{, port}}
A subclass of \class{HTTPConnection} that uses SSL for communication with
secure servers. Default port is \code{443}.
\end{classdesc}
1995-02-27 13:53:25 -04:00
The following exceptions are raised as appropriate:
1995-02-27 13:53:25 -04:00
\begin{excdesc}{HTTPException}
The base class of the other exceptions in this module. It is a
subclass of \exception{Exception}.
\end{excdesc}
1995-02-27 13:53:25 -04:00
\begin{excdesc}{NotConnected}
A subclass of \exception{HTTPException}.
\end{excdesc}
1995-02-27 13:53:25 -04:00
2002-03-24 12:55:57 -04:00
\begin{excdesc}{InvalidURL}
A subclass of \exception{HTTPException}, raised if a port is given and is
either non-numeric or empty.
\end{excdesc}
\begin{excdesc}{UnknownProtocol}
A subclass of \exception{HTTPException}.
\end{excdesc}
1995-02-27 13:53:25 -04:00
\begin{excdesc}{UnknownTransferEncoding}
A subclass of \exception{HTTPException}.
\end{excdesc}
1995-02-27 13:53:25 -04:00
\begin{excdesc}{IllegalKeywordArgument}
A subclass of \exception{HTTPException}.
\end{excdesc}
1995-02-27 13:53:25 -04:00
\begin{excdesc}{UnimplementedFileMode}
A subclass of \exception{HTTPException}.
\end{excdesc}
1995-02-27 13:53:25 -04:00
\begin{excdesc}{IncompleteRead}
A subclass of \exception{HTTPException}.
\end{excdesc}
1995-02-27 13:53:25 -04:00
\begin{excdesc}{ImproperConnectionState}
A subclass of \exception{HTTPException}.
\end{excdesc}
\begin{excdesc}{CannotSendRequest}
A subclass of \exception{ImproperConnectionState}.
\end{excdesc}
\begin{excdesc}{CannotSendHeader}
A subclass of \exception{ImproperConnectionState}.
\end{excdesc}
\begin{excdesc}{ResponseNotReady}
A subclass of \exception{ImproperConnectionState}.
\end{excdesc}
\begin{excdesc}{BadStatusLine}
A subclass of \exception{HTTPException}. Raised if a server responds with a
HTTP status code that we don't understand.
\end{excdesc}
1995-02-27 13:53:25 -04:00
\subsection{HTTPConnection Objects \label{httpconnection-objects}}
\class{HTTPConnection} instances have the following methods:
\begin{methoddesc}{request}{method, url\optional{, body\optional{, headers}}}
This will send a request to the server using the HTTP request method
\var{method} and the selector \var{url}. If the \var{body} argument is
present, it should be a string of data to send after the headers are finished.
The header Content-Length is automatically set to the correct value.
The \var{headers} argument should be a mapping of extra HTTP headers to send
with the request.
\end{methoddesc}
\begin{methoddesc}{getresponse}{}
Should be called after a request is sent to get the response from the server.
Returns an \class{HTTPResponse} instance.
\end{methoddesc}
\begin{methoddesc}{set_debuglevel}{level}
1995-02-27 13:53:25 -04:00
Set the debugging level (the amount of debugging output printed).
The default debug level is \code{0}, meaning no debugging output is
printed.
\end{methoddesc}
1995-02-27 13:53:25 -04:00
\begin{methoddesc}{connect}{}
Connect to the server specified when the object was created.
\end{methoddesc}
\begin{methoddesc}{close}{}
Close the connection to the server.
\end{methoddesc}
1995-02-27 13:53:25 -04:00
\begin{methoddesc}{send}{data}
1995-02-27 13:53:25 -04:00
Send data to the server. This should be used directly only after the
\method{endheaders()} method has been called and before
\method{getreply()} has been called.
\end{methoddesc}
1995-02-27 13:53:25 -04:00
\begin{methoddesc}{putrequest}{request, selector}
1995-02-27 13:53:25 -04:00
This should be the first call after the connection to the server has
been made. It sends a line to the server consisting of the
\var{request} string, the \var{selector} string, and the HTTP version
(\code{HTTP/1.1}).
\end{methoddesc}
1995-02-27 13:53:25 -04:00
\begin{methoddesc}{putheader}{header, argument\optional{, ...}}
Send an \rfc{822}-style header to the server. It sends a line to the
1995-02-27 13:53:25 -04:00
server consisting of the header, a colon and a space, and the first
argument. If more arguments are given, continuation lines are sent,
each consisting of a tab and an argument.
\end{methoddesc}
1995-02-27 13:53:25 -04:00
\begin{methoddesc}{endheaders}{}
1995-02-27 13:53:25 -04:00
Send a blank line to the server, signalling the end of the headers.
\end{methoddesc}
1995-02-27 13:53:25 -04:00
\subsection{HTTPResponse Objects \label{httpresponse-objects}}
\class{HTTPResponse} instances have the following methods and attributes:
\begin{methoddesc}{read}{}
Reads and returns the response body.
\end{methoddesc}
1995-02-27 13:53:25 -04:00
\begin{methoddesc}{getheader}{name\optional{, default}}
Get the contents of the header \var{name}, or \var{default} if there is no
matching header.
\end{methoddesc}
\begin{datadesc}{msg}
A \class{mimetools.Message} instance containing the response headers.
\end{datadesc}
\begin{datadesc}{version}
HTTP protocol version used by server. 10 for HTTP/1.0, 11 for HTTP/1.1.
\end{datadesc}
\begin{datadesc}{status}
Status code returned by server.
\end{datadesc}
\begin{datadesc}{reason}
Reason phrase returned by server.
\end{datadesc}
2001-09-25 13:32:02 -03:00
2001-01-22 13:42:32 -04:00
\subsection{Examples \label{httplib-examples}}
Here is an example session that uses the \samp{GET} method:
\begin{verbatim}
>>> import httplib
>>> conn = httplib.HTTPConnection("www.python.org")
>>> conn.request("GET", "/index.html")
>>> r1 = conn.getresponse()
>>> print r1.status, r1.reason
200 OK
>>> data1 = r1.read()
>>> conn.request("GET", "/parrot.spam")
>>> r2 = conn.getresponse()
>>> print r2.status, r2.reason
404 Not Found
>>> data2 = r2.read()
>>> conn.close()
\end{verbatim}
Here is an example session that shows how to \samp{POST} requests:
\begin{verbatim}
>>> import httplib, urllib
>>> params = urllib.urlencode({'spam': 1, 'eggs': 2, 'bacon': 0})
>>> headers = {"Content-type": "application/x-www-form-urlencoded",
... "Accept": "text/plain"}
>>> conn = httplib.HTTPConnection("musi-cal.mojam.com:80")
>>> conn.request("POST", "/cgi-bin/query", params, headers)
>>> response = conn.getresponse()
>>> print response.status, response.reason
200 OK
>>> data = response.read()
>>> conn.close()
\end{verbatim}