Unable to download some HTML files

Hello.

I am trying to download individual HTML pages using a program called Wget
on a Linux machine. Wget fetches the URL I type in and stores the page source
in a directory. This works OK for older, plain HTML pages,
but I am having trouble with a newer one.
The link is as follows.
<a class="dropDown"
href="http://www.foosite.com/foo?action=go_types&class_id=2">

I send the & as hex.
I know everything after the ? is the query string, but I am not sure what that means.
Is it sent as is, or is it manipulated by the browser software
beforehand?
Wget is sending everything between the second pair of quotes.
I have found no reference in the HTML literature to the class="dropDown"
item in the link, and would appreciate a pointer in the right direction.

Thanks in anticipation.
Barry.
Replies to pe******@uko2.com

xxx=pan


Jul 20 '05 #1
2 Replies


BAZZER wrote:
> <a class="dropDown"
> href="http://www.foosite.com/foo?action=go_types&class_id=2">
>
> I know everything after the ? is the query string, but I am not sure what that
> means.

It is information sent to the server as part of the URL.

> Is it sent as is, or is it manipulated by the browser software
> beforehand?

It may be URL-encoded by the browser before being sent, as with any
URL. Otherwise, the browser does nothing with it. The server typically
does something with it, such as querying a database and producing a
document, conducting a search, or performing some other task.
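
To make the query string idea concrete, here is a small Python sketch (not something from the thread itself) that pulls the query string out of the URL in the question and splits it into name/value pairs. It uses only the standard urllib.parse module. The last step is an assumption about what "sending & as hex" means, namely writing the & as %26; if that is what happened, the server would see one parameter instead of two.

```python
from urllib.parse import parse_qs, urlsplit

url = "http://www.foosite.com/foo?action=go_types&class_id=2"

# Everything after the "?" is the query string.
query = urlsplit(url).query
print(query)

# A raw "&" separates one name=value pair from the next.
params = parse_qs(query)
print(params)

# Assumption: "sending & as hex" means writing it as %26.
# The "&" then becomes part of the first value instead of a separator.
hexed = parse_qs("action=go_types%26class_id=2")
print(hexed)
```

So the & in this link probably needs to reach the server as a plain &, since it is acting as a separator rather than as data.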

--
Brian (remove "invalid" from my address to email me)
http://www.tsmchughs.com/
Jul 20 '05 #2

BAZZER wrote:
> Hello.
>
> I am trying to download individual HTML pages using a program called Wget
> on a Linux machine. Wget fetches the URL I type in and stores the page source
> in a directory. This works OK for older, plain HTML pages,
> but I am having trouble with a newer one.
> The link is as follows.
> <a class="dropDown"
> href="http://www.foosite.com/foo?action=go_types&class_id=2">
>
> I send the & as hex.

Don't. Escape the characters that are special to the shell with backslashes, like this:
http://www.foosite.com/foo\?action=g...s\&class_id\=2

If you plan on looking at this document locally, I'd suggest using this
command:

wget -pE http://www.foosite.com/foo\?action=g...s\&class_id\=2

If you want it to follow the links in that page, I'd suggest this:

wget -np -mrpKE http://www.foosite.com/foo\?action=g...s\&class_id\=2
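
As a rough illustration of why the escaping matters, the Python sketch below uses the standard shlex module, which follows POSIX shell quoting rules. The URLs in the commands above are truncated, so the full URL here is taken from the original question and is an assumption about what was meant. The sketch shows that single-quoting the URL and backslash-escaping it should hand wget the same argument, with the & intact.

```python
import shlex

# Assumption: the full URL from the original question.
url = "http://www.foosite.com/foo?action=go_types&class_id=2"

# shlex.quote wraps the URL in single quotes so the shell leaves "?" and "&" alone.
quoted = shlex.quote(url)
command = "wget -pE " + quoted
print(command)

# The backslash-escaped form, written out in full for this example.
escaped = r"wget -pE http://www.foosite.com/foo\?action=go_types\&class_id\=2"

# shlex.split applies POSIX quoting rules, so this is the argument list
# a shell would pass on to wget.
print(shlex.split(escaped))
```

Note that shlex only models quoting, not everything a real shell does, so treat this as a way to check the escaping rather than a full simulation. Without any quoting, a real shell would treat the & as "run in the background" and cut the URL off at that point.
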
> I know everything after the ? is the query string, but I am not sure what that means.
> Is it sent as is, or is it manipulated by the browser software
> beforehand?

Generally, it is sent to the server as part of the GET request. The RFC for HTTP
would answer your question more specifically.
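
For a rough picture of what "part of the GET request" looks like, here is a minimal Python sketch that builds the first two lines of an HTTP/1.1 request for the URL in the question. The function name request_head is made up for this example, and real clients such as wget or a browser send more headers than this.

```python
from urllib.parse import urlsplit


def request_head(url: str) -> str:
    """Build a minimal HTTP/1.1 GET request head for url (illustrative only)."""
    parts = urlsplit(url)
    # The request target is the path plus the query string, unchanged.
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    return f"GET {target} HTTP/1.1\r\nHost: {parts.netloc}\r\n\r\n"


head = request_head("http://www.foosite.com/foo?action=go_types&class_id=2")
print(head.splitlines()[0])  # GET /foo?action=go_types&class_id=2 HTTP/1.1
print(head.splitlines()[1])  # Host: www.foosite.com
```

The host name goes into the Host header, and the path and query string go into the request line as they are.
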
> Wget is sending everything between the second pair of quotes.
> I have found no reference in the HTML literature to the class="dropDown"
> item in the link, and would appreciate a pointer in the right direction.

It's the CSS class specified to act on the <a> tag. It has to do with
the way the link looks in your browser. Wget couldn't care less what it is.
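
To show that class and href are separate attributes, here is a small Python sketch using the standard html.parser module. The class name LinkCollector is made up for this example, and this is only a picture of how a link extractor can treat the tag, not a description of wget's internals.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, class) pairs from <a> tags (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            attr = dict(attrs)
            if "href" in attr:
                # href is the address to fetch; class is only a styling hook.
                self.links.append((attr["href"], attr.get("class")))


snippet = (
    '<a class="dropDown" '
    'href="http://www.foosite.com/foo?action=go_types&class_id=2">'
)
collector = LinkCollector()
collector.feed(snippet)
print(collector.links)
```

Only the href value is a URL; the class value is just a label that a stylesheet can attach rules to.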
Jul 20 '05 #3
