Okay, so I am having issues figuring anything out about this and have
read the "missing manual" about it so please don't send me that link
again. To put it simply I want to be able to input a word and get the
definition from dictionary.com. Now I found a work-around for
searching for the word, I just make it in the actual address. For
example I want to search for cheese, I can just do a:
urllib2.urlopen("http://dictionary.reference.com/browse/cheese")
However, the actual definition is in javascript on the page. I used
firebug to see it, and the first def, looks like this:
<table class="luna-Ent">
<tbody>
<tr>
<td class="dn" valign="top">1.</td>
<td valign="top">the curd of milk separated from the whey and prepared
in many ways as a food. </td>
the problem being that if I use code like this to get the html of that
page in python:
response = urllib2.urlopen("the webiste....")
html = response.read()
print html
I get the html source of the page, but no table with my definitions.
So what can I do? Also, is there a book or a better tutorial or
explanation of this urllib2, and urllib? If so, PLEASE let me know
about it; I will be eternally grateful. 1 1157
On 28 Jun., 01:42, Alex Bryan <alexnbr...@gmail.comwrote:
Okay, so I am having issues figuring anything out about this and have *
read the "missing manual" about it so please don't send me that link *
again. To put it simply I want to be able to input a word and get the *
definition from dictionary.com. Now I found a work-around for *
searching for the word, I just make it in the actual address. For *
example I want to search for cheese, I can just do a:
urllib2.urlopen("http://dictionary.reference.com/browse/cheese")
However, the actual definition is in javascript on the page. I used *
firebug to see it, and the first def, looks like this:
<table class="luna-Ent">
<tbody>
<tr>
<td class="dn" valign="top">1.</td>
<td valign="top">the curd of milk separated from the whey and prepared *
in many ways as a food. </td>
the problem being that if I use code like this to get the html of that *
page in python:
response = urllib2.urlopen("the webiste....")
html = response.read()
print html
I get the html source of the page, but no table with my definitions. *
So what can I do? Also, is there a book or a better tutorial or *
explanation of this urllib2, and urllib? If so, PLEASE let me know *
about it; I will be eternally grateful.
It would probably be a good idea to take a look at mechanize: http://wwwsearch.sourceforge.net/mechanize/
and at BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/
Greetz,
CracKPod This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics
by: Max Khesin |
last post by:
Is there a way to access an htaccess-protected directory with urllib,
password being known?
thanks,
max
--
========================================
Max Khesin, software developer -...
|
by: Gary Feldman |
last post by:
I think I've found a deficiency in the design of urllib related to https.
In order to complete an https connection, it appears that URLOpener and
hence FancyURLOpener require the key and cert...
|
by: Chris Lyon |
last post by:
Could somebody please explain the difference between these two modules
and explain why they are both required, and if there will ever be a
unification of them?
|
by: Mike Zupan |
last post by:
I had some problems with urllib and py2exe under pyton2.3. I works fine
until i try to use the exe file.
Here is the error
Error: 1
LookupError Exception in Tk callback
Function: <bound...
|
by: Monty |
last post by:
Hello,
Sorry for this maybe stupid newbie question but I didn't find any
answer in all my readings about python:
With urllib, using urlretrieve, it's possible to get the number of
blocks...
|
by: Pieter Edelman |
last post by:
Hi all,
I'm trying to submit some data using a POST request to a HTTP server with
BASIC authentication with python, but I can't get it to work. Since it's
driving me completely nuts, so here's...
|
by: Ali.Sabil |
last post by:
hello all,
I just maybe hit a bug in both urllib and urllib2, actually urllib
doesn't support proxy authentication, and if you setup the http_proxy
env var to...
|
by: John Nagle |
last post by:
I thought I had all the timeout problems with urllib worked around,
but no.
socket.setdefaulttimeout is useful, but not always effective.
I'm setting that to 15 seconds.
If the host end won't...
|
by: O.R.Senthil Kumaran |
last post by:
Hi,
There is an Open Tracker item against urllib2 library python.org/sf/735515
which states that.
urllib / urllib2 should cache the results of 301 (permanent) redirections.
This shouldn't break...
|
by: johnpollard |
last post by:
For some reason this script isn't working and I dont know what it is.
I believe the problem lies in the following lines of code since the script works with a different website and username/password...
|
by: emmanuelkatto |
last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud.
Please let me know.
Thanks!
Emmanuel
|
by: BarryA |
last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
|
by: nemocccc |
last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
|
by: Sonnysonu |
last post by:
This is the data of csv file
1 2 3
1 2 3
1 2 3
1 2 3
2 3
2 3
3
the lengths should be different i have to store the data by column-wise with in the specific length.
suppose the i have to...
|
by: Hystou |
last post by:
There are some requirements for setting up RAID:
1. The motherboard and BIOS support RAID configuration.
2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
|
by: marktang |
last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
|
by: Oralloy |
last post by:
Hello folks,
I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>".
The problem is that using the GNU compilers,...
|
by: agi2029 |
last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...
|
by: isladogs |
last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM).
In this session, we are pleased to welcome a new...
| | |