473,395 Members | 1,696 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,395 software developers and data experts.

Seeking Tool to Archive and Mark Up Websites for Searches

Greetings,

I do a LOT of research on the web. It would be extremely handy if I
had a tool that would allow me, with few clicks and little effort, to:

* Archive the site locally then;

* highlight or in some way isolate important parts of the text which
would then be;
* searchable.

For example, suppose I find a document with "Bush" and "Cheney" and
"Iraq" and a sentence about their WMD program. I'd like to be able to
save this page locally within the tool, be able to highlight these
words as "keywords." If I later searched for "Bush," this would be
included in the search result with the other keywords or key phrases
listed below the link the whole page.

More graphically, here is what might be returned from a search of
"Bush" in this example.

1) d:\\archives\websites\cnn-transcript-2005-02-14.php

"...in North Korea today, <b>Bush</b> outlined his strategic..."
"...defending his assertion that Al Queda was in it's "death throes,"
<b>Cheney</b> again reiterated..." "...prior <b>to 1991, Saddam's WMD
program was well-understood. By 2006 the intelligence community</b>
had lost sight..."

2) etc ...

3) etc...

[I realize that you are probably seeing <b></b> instead of bold text
here but it probably gets the idea across a bit better.]

In a way it would be quite like a combination of a wiki and google, but
with the ability to import new material directly from the web.

Any suggestions? I'm not even sure this is the best group to post in,
feel free to suggest better groups.

Thank much in advance for any help.

Regards,
Jason

Dec 5 '05 #1
3 1375
bo*******@gmail.com wrote:

I do a LOT of research on the web. It would be extremely handy if I
had a tool that would allow me, with few clicks and little effort, to:

* Archive the site locally then;

* highlight or in some way isolate important parts of the text which
would then be;
* searchable.

Ah. Then you can leverage your web searching skills to locate searching
and indexing engines, tools, web-bots, etc.
You could use Google, for instance, since they have a number of
"personalization" options. They also have a searching and indexing program
for Windows and Linux. And their competitors are thinking of doing the
same thing.

--
jmm (hyphen) list (at) sohnen-moe (dot) com
(Remove .AXSPAMGN for email)
Dec 6 '05 #2
That would work for some of my features but not others.

I want to "mark up" the pages I cache locally so that keywords that I
consider important come up, not everything in the page. Right now when
I search for WMD and Bush, I come up with ...obvious a lot of pages.
But I want to store pages that have been useful for me, quotes, etc, so
that I'm only searching pages that I've indicated are "good" in the
past.

Any ideas? =/

Thanks again in advance,
Jason

Dec 6 '05 #3
bo*******@gmail.com wrote:
Any ideas? =/


Annotea.

--
not me guv
Dec 6 '05 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
by: Mark | last post by:
I'm looking for a set of php scripts that can read an nntp server, pull all the posts from a newsgroup (text only, no images) and convert them to html pages. Perfect example is :...
42
by: Steven O. | last post by:
I am seeking some kind of tool that I can use for GUI prototyping. I know how to use Visual Basic, but since a lot of software is being coded in Java or C++, I'd like to learn a Java or C++ -based...
12
by: David | last post by:
I am a full-time freelance writer. I am seeking an established, professional web designer who has designed more than one successful website for freelance writers. The individual needs to be able...
2
by: Christoph Wienands | last post by:
Hello everybody, a while ago on one of the "big" DotNet websites (like GotDotNet) I stumbled across a description of a tool that enables developers to work with features like "Declarative...
29
by: Jim | last post by:
I want to extract data from several websites that I visit daily. I'd like to condense the info into a single web page that I can visit (instead of the multiple websites I have to visit now to get...
0
by: larry | last post by:
IDE starting out any good text editor is enough, get one that has PHP syntax highlighting to save a lot of debugging. If you were on Linux (which your ASP reference probably means you are a...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.