473,416 Members | 1,657 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,416 software developers and data experts.

extract URLs using HtmlUnit

12
I want to extract URLs and Titles of them from a page results. I could extract URLs using with gethrefAttribute() in htmluni library but I don't know how extract the titles of them?
could you help me?
Jun 14 '10 #1
1 2557
jkmyoung
2,057 Expert 2GB
Assuming you have an HTMLAnchor don't forget that this is also a DomElement. You should be able to use getNodeValue().
Jun 15 '10 #2

Sign in to post your reply or Sign up for a free account.

Similar topics

12
by: Brian Henry | last post by:
I made a header control in asp.net and it references images in the /images/ folder in the virtural path's root folder... How can I get the header control to show the images at any level of the web...
2
by: | last post by:
Apache has a look-back feature that lets it scan back down the URL if it doesn't find what it is looking for. For example: http://www.domain.com/article.php/999/12 In this above case there is no...
1
by: csgraham74 | last post by:
Hi Guys, I want to populate a nodelist so that i can extract various details. The xml document i have is similar to the one below. baiscally i want to extract the first instance of <PP>...
4
by: Jeeran | last post by:
We use an ISAPI filter to convert long urls into short clean ones. For example: "Site.com/user/john/" Is re-written as: "Site.com/user/userinfo.aspx?uid=john" Now, "userinfo.aspx" contains a...
1
by: brahatha | last post by:
I am trying to come up with a Java code where I can read a xml file and extract particular element/attribute. This is the structure of xml I have <RootElement> <colAttempt> <Attempt1 a="1"...
1
by: NitinSawant | last post by:
Hi pls tell me, how to extract urls from webpage using c#, and adding them to a listbox??
5
by: dhruvjha | last post by:
I'd like to take the following URL: http://www.website.com/index.php?url=about-us and turn it into : http://www.website.com/about-us and aso http://www.website.com/about-us.php I have...
3
by: manishabh77 | last post by:
I want to extract some info from the following input line using perl regular expression. I will appreciate any help in doing so. input line: hg19_ensGene_ENST00000237247...
6
by: Nikkhah | last post by:
I want to write a program in java that it can parse search engine results and save URLs and titles in a file, but I don't know how can I do this?
3
by: Nikkhah | last post by:
Hello I want to Extract URLs and descriptions of them from a search result. I use Jericho library and I can extract URLs but I don't know how could I extract descriptions of them. please help me....
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...
0
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.