I'm using urllib2 to pull pages for a custom version of a web proxy
and am having issues with 404 errors. urllib2 does a great job of
letting me know that a 404 happened with the following code:
import urllib2

url = 'http://cnn.com/asfsdafsadfasdf/'
try:
    page = urllib2.urlopen(url)
except urllib2.URLError, e:
    print e
This prints: HTTP Error 404: Not Found
From the urllib2 docs: HTTPError is also a valid HTTP response, so you can treat an HTTP error as an exceptional event or a valid response:
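import urllib2

url = 'http://cnn.com/asfsdafsadfasdf/'
try:
    page = urllib2.urlopen(url)
except urllib2.HTTPError, e:
    # Only HTTPError carries a response body; a plain URLError
    # (e.g. a failed connection) has no read() method.
    print e.read()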
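For a proxy you probably want the status code as well as the body, so you can pass both through. Here is a rough sketch of that idea, written against Python 3's urllib.request and urllib.error (the successors to urllib2) so it runs on a current interpreter. The names fetch and fake_404 are made up for illustration, and fake_404 only exists so the error path can be tried without touching the network.

```python
import io
import urllib.error
import urllib.request


def fetch(url, opener=urllib.request.urlopen):
    """Return (status_code, body_bytes), also for HTTP error responses.

    `opener` is a hypothetical hook so the function can be exercised
    offline; by default it is the real urlopen.
    """
    try:
        response = opener(url)
    except urllib.error.HTTPError as e:
        # HTTPError doubles as a response object: it carries the status
        # code and whatever error page the server sent back.
        return e.code, e.read()
    # Success path: read the body and close the response.
    with response:
        return response.getcode(), response.read()


def fake_404(url):
    """Stand-in opener (illustration only) that mimics a server 404."""
    raise urllib.error.HTTPError(
        url, 404, "Not Found", None, io.BytesIO(b"<html>custom 404</html>")
    )


status, body = fetch("http://example.invalid/missing", opener=fake_404)
```

Note that only urllib.error.HTTPError is caught here. A plain URLError (DNS failure, refused connection) has no body to forward, so it is left to propagate and the proxy can decide separately how to report it.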
http://cnn.com/asdfasdfadsf