473,386 Members | 1,720 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,386 software developers and data experts.

Scan web pages and compose summary

Hello.

I am looking for a way to read html file and create
a short summary (like that shows in google results for example)
which ought to be the first few lines of welcome text or so.

Does any got any idea on how to do this? (I searched allot,
but all I found was simply extracting meta tags).

Thanks
Jan 17 '08 #1
2 1176
Well, the tricky part is that you'll need to decide what text to grab
and show from the file - which is why there's a meta description tag
for the purpose. I believe google grabs the text surrounding a search
term and displays that if there's no meta description tag to use - so
if you're actually searching for a term you could do something like
that.

---
www.NEXCESS.NET - Shared/Reseller Hosting
www.EliteRax.com - Dedicated Servers, Server Clusters
www.MaxVPS.com - Virtual Private Servers
- Great prices, Great service - check us out!

On Jan 17, 3:48 pm, solk <rikibl...@gmail.comwrote:
Hello.

I am looking for a way to read html file and create
a short summary (like that shows in google results for example)
which ought to be the first few lines of welcome text or so.

Does any got any idea on how to do this? (I searched allot,
but all I found was simply extracting meta tags).

Thanks
Jan 17 '08 #2
Hello,

solk wrote:
Hello.

I am looking for a way to read html file and create
a short summary (like that shows in google results for example)
which ought to be the first few lines of welcome text or so.

Does any got any idea on how to do this? (I searched allot,
but all I found was simply extracting meta tags).

Thanks
I can recommend Snoopy (http://snoopy.sourceforge.net/). It is able to
retrieve an entire web page, follow links and so on. The result will be
the HTML source output you can see if you do a view source in your web
browser. From there you can strip HTML tags, use substr() to jump to
certain sections in the source (eg: jump to right after the body tag,
remove all HTML tags and save the text output).

- Jensen
Jan 18 '08 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

35
by: Dirk Bruere at Neopax | last post by:
Is it possible to have one table somehow 'called' to appear on different pages? I want something to appear on all pages, yet only want to (re)edit one item. -- Dirk The Consensus:- The...
7
by: PaulThomas | last post by:
I am fighting with XP-Pro and VS.Net trying to allow some of the pages in my application to be accessable by 'all' I am using <authentication mode="Forms" /> and if I Login - everything works...
3
by: selma | last post by:
I am puzzled by visual explain for a simple query ( select * from mytable ) chooses an index scan. That index has only a single column and the table has 30 columns not covered by the index. When...
11
by: ajikoe | last post by:
Hello, I used Visual C# Standard Edition. I want to comment my program using xml commentary method, I don't know why if I use value and example tag, it is not working / showed in the html...
0
by: Joachim | last post by:
When using the "Build comment web pages" function I only get the first page like this: Project Description KontoDBConnector And when I click on KontoDBConnector...
4
by: Carl Gilbert | last post by:
Hi I have a few asp pages that I plan to burn to a CD so the pages can be navigated without an internet connection. I have just realised that the pages might not run that well, if at all, as...
5
by: Michael Herman \(Parallelspace\) | last post by:
1. What are some compelling solutions for using Master/Content pages with Web Pages? 2. If a content area has a web part zone with web parts, what is the user experience like when "editting" the...
7
by: SHC | last post by:
I'm in need of some javascript to load two pages into two seperate iframes which are on two seperate and different pages. Rather complicated I know (and easier done in one frameset), but caused...
8
by: kumarboston | last post by:
Hi All, I have a mysql database and 3 pages which queries and returns the data. 1st page(main.html) is just a form in html which takes query from the user and is connected to "form...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.