Bytes | Software Development & Data Engineering Community

Help: a tool to extract content and images from an HTML page

Hello everyone,
Do you have any idea how to build a tool that extracts the content and
images from an HTML page? If you know of any resources, please let me
know. I am using VS 2005 with C#.NET and ASP.NET to develop this system.
Please reply as soon as possible.
Thanks,
Dhananjay

Nov 29 '06 #1
Perhaps the HTML agility pack can help you

http://www.codeplex.com/Wiki/View.as...tmlagilitypack
--
Regards

John Timney (MVP)
VISIT MY WEBSITE:
http://www.johntimney.com
http://www.johntimney.com/blog

Nov 29 '06 #2

Hi John,
You understood what I was trying to ask.
Now I am looking for some free ASP.NET source code so that I can easily
develop the web scraper.
If you have any information, please send me the URLs and I will check them out.

Thanks,
Dhananjay

Nov 30 '06 #3
The QuickStart networking section will show you how to make a GET request,
which will get you the page as HTML.

The HTML Agility Pack should allow you to parse it and locate the image tags.

There's an example on Channel 9 that shows how to download an actual image:
http://channel9.msdn.com/ShowPost.aspx?PostID=14793
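
As a rough illustration of those three steps (fetch the page, locate the image tags, download each image), here is a minimal sketch. It uses only Python's standard library rather than .NET and the HTML Agility Pack, so nothing in it is the Agility Pack API. The names PageExtractor, extract, fetch_html and download_images are made up for this example, and it assumes the page is reachable with a plain GET request and that images are referenced through ordinary img src attributes.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen, urlretrieve


class PageExtractor(HTMLParser):
    """Collects visible text and absolute image URLs from an HTML document."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []
        self.text_parts = []
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "img":
            src = dict(attrs).get("src")
            if src:
                # Resolve relative paths against the URL of the page itself.
                self.image_urls.append(urljoin(self.base_url, src))

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside script and style blocks.
        if self._skip_depth == 0 and data.strip():
            self.text_parts.append(data.strip())


def extract(html, base_url):
    """Step 2: parse the HTML and return (text, list_of_image_urls)."""
    parser = PageExtractor(base_url)
    parser.feed(html)
    parser.close()
    return " ".join(parser.text_parts), parser.image_urls


def fetch_html(url):
    """Step 1: issue a GET request and return the page body as a string."""
    with urlopen(url) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def download_images(image_urls, folder):
    """Step 3: save each image into folder and return the paths written."""
    os.makedirs(folder, exist_ok=True)
    saved = []
    for index, url in enumerate(image_urls):
        # Fall back to a numbered name when the URL path has no file name.
        name = os.path.basename(urlparse(url).path) or f"image_{index}"
        path = os.path.join(folder, name)
        urlretrieve(url, path)
        saved.append(path)
    return saved
```

To use it, call fetch_html(page_url), pass the result and the same page_url to extract, then hand the returned image list to download_images together with a target folder. In C# the same structure would map onto a web request for step 1, the HTML Agility Pack for step 2, and a file download per image for step 3.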

--
Regards

John Timney (MVP)

Nov 30 '06 #4
