Hi,
I've been working on an application to do some 'scraping' of web
content, using the WebClient class. I'm using code rather like the
following:
' Requires: Imports System.Net and Imports System.Text
Dim objWebClient As New WebClient
Dim strURL As String = CType(URL, String)
Dim aRequestedHTML() As Byte
Dim objUTF8 As New UTF8Encoding
Dim strRequestedHTML As String
' Download the raw bytes of the page, then decode them as UTF-8
aRequestedHTML = objWebClient.DownloadData(strURL)
strRequestedHTML = objUTF8.GetString(aRequestedHTML)
Return strRequestedHTML
However, if I enter a URL such as
http://web.archive.org/web/200306221...odhouse.co.uk/
the returned HTML is not the page that I requested (as viewed in the
browser), but rather an error page saying that the page has not been
found.
Does anyone know why this may be the case? My idea so far is that the
web browser (and server) can work out between them which page to send,
while the WebClient class isn't actually specifying a page at the end
of the URL.
If anyone has an idea about this problem I would like to hear it. The
deadline for this project is coming up fast and this is a stumbling
block that I need to overcome. Once this is done, it's (hopefully!)
all plain sailing from here.
Thanks for your time
Chris Williams (chris (at) oxymoron-failsafe.com)