473,322 Members | 1,259 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,322 software developers and data experts.

website/webclient

I have a website, and I need to programmatically find out all the
pages that the first page referes to, and then goto those pages and
retrive some information. The problem is that the links could be
relative or absolute. I know how to use webclient. I just don't know
how to parse out the relative or absolute links.

Does anybody know how? I could have many links on the first page.

Thanks
Nov 20 '05 #1
3 969
Cor
Hi Sehboo,

You can use mshtml to go to your document and find all the anchors <A> tags,
than with that you can find the references to other pages..

It is no easy stuff to do.

You have to set a reference to it, but do not set an import to mshtml,
because your IDE will freeze.

Cor
I have a website, and I need to programmatically find out all the
pages that the first page referes to, and then goto those pages and
retrive some information. The problem is that the links could be
relative or absolute. I know how to use webclient. I just don't know
how to parse out the relative or absolute links.

Nov 20 '05 #2
Sehboo,
Matthew MacDonald's book "Microsoft Visual basic .NET Programmer's Cookbook"
from MS Press has a sample of using the System.Net.WebResponse class
combined with a System.Text.RegularExpressions.RegEx class to look for all
the links on a page.

Hope this helps
Jay

"Sehboo" <ma*********@hotmail.com> wrote in message
news:7b**************************@posting.google.c om...
I have a website, and I need to programmatically find out all the
pages that the first page referes to, and then goto those pages and
retrive some information. The problem is that the links could be
relative or absolute. I know how to use webclient. I just don't know
how to parse out the relative or absolute links.

Does anybody know how? I could have many links on the first page.

Thanks

Nov 20 '05 #3
Cor
to = through
You can use mshtml to go to your document and find all the anchors <A> tags, than with that you can find the references to other pages..

Nov 20 '05 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

3
by: Ray Price | last post by:
I want to use a server-side C# script to copy an image from a remote URL to a local file in the web directory. I tried using File.Copy from the System.IO namespace but it doesn't support URL. ...
1
by: Sam | last post by:
Hello everyone, WebClient object works very well with the no-need-to-authentication website. Can it also work also with authentication website? Thanks in advance. Sam
6
by: Alan | last post by:
Hi All, I am trying to make a few calls in succession to a website to allow me to login and then perform a search, in a c# windows program. wcResponse =...
6
by: A.M-SG | last post by:
Hi, I have an aspx page at the web server that provides PDF documents for smart client applications. Here is the code in aspx page that defines content type: Response.ContentType =...
3
by: MJ | last post by:
Automatically download and save pdf files from a website. Is there a way to do this? I would like to automate a daily procedure for downloading and saving a couple of dozen pdf from several...
2
by: Stefan Kiryazov (MCAD) | last post by:
Hi all, I'm facing the following problem. I am trying to programmatically retrieve data from a website that needs a login through HTTPS. Originally, as I am not familiar with HTTPS and SSL, i...
0
by: tom | last post by:
What I want to do is upload a zip file to an https website. The site itself uses secure File transport. Now webclient has the simplicity to upload a file, but I cannot attach the neccessary cert to...
3
by: Smokey Grindle | last post by:
I want to make a manual HTTP processor, but need to do GET/POST requests that could include post data such as button clicks or text fields... i know i read about this once before in straight .NET...
5
by: Bill Nguyen | last post by:
I have the URL, loginID, and password. How do I login the website within my VB app? Thanks Bill
0
by: DolphinDB | last post by:
Tired of spending countless mintues downsampling your data? Look no further! In this article, you’ll learn how to efficiently downsample 6.48 billion high-frequency records to 61 million...
0
by: ryjfgjl | last post by:
ExcelToDatabase: batch import excel into database automatically...
0
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
0
by: Vimpel783 | last post by:
Hello! Guys, I found this code on the Internet, but I need to modify it a little. It works well, the problem is this: Data is sent from only one cell, in this case B5, but it is necessary that data...
0
by: ArrayDB | last post by:
The error message I've encountered is; ERROR:root:Error generating model response: exception: access violation writing 0x0000000000005140, which seems to be indicative of an access violation...
1
by: PapaRatzi | last post by:
Hello, I am teaching myself MS Access forms design and Visual Basic. I've created a table to capture a list of Top 30 singles and forms to capture new entries. The final step is a form (unbound)...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
1
by: Defcon1945 | last post by:
I'm trying to learn Python using Pycharm but import shutil doesn't work
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.