473,378 Members | 1,449 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,378 software developers and data experts.

web searching scripts

Does anyone know of a freely available script that can take a given URL
and follow every link within it?

Ideally, I would like to start with this to build a quick application
to grab all the content off a website to publish it to a CD.

Thanks,

jul

Aug 4 '06 #1
2 1034
ju*********@gmail.com wrote:
Does anyone know of a freely available script that can take a given URL
and follow every link within it?

Ideally, I would like to start with this to build a quick application
to grab all the content off a website to publish it to a CD.

Thanks,

jul

If you just want to download websites (i.e. not necessarily writing a
program yourself to do that), you may try Httrack, it might suite your
needs.

http://www.httrack.com/

There even seem to be some sort of python bindings ...

http://www.satzbau-gmbh.de/staff/abel/httrack-py/

But there might be some more pythonic solution around ... i would start
looking at twisted or cherrypy, but i never used them myself ...

HIH

regards

Avell
Aug 4 '06 #2
On Fri, 04 Aug 2006 18:11:18 +0200, Avell Diroll <av*********@yahoo.frwrote:
ju*********@gmail.com wrote:
>Does anyone know of a freely available script that can take a given URL
and follow every link within it?

Ideally, I would like to start with this to build a quick application
to grab all the content off a website to publish it to a CD.
....
If you just want to download websites (i.e. not necessarily writing a
program yourself to do that), you may try Httrack, it might suite your
needs.
The well-known Gnu wget is what I always use.

(IMHO, this is a situation where it's a /good/ idea to glue together existing
software, rather than joining many bits of code to a mirror-from-http-to-cdr
application.)

/Jorgen

--
// Jorgen Grahn <grahn@ Ph'nglui mglw'nafh Cthulhu
\X/ snipabacken.dyndns.org R'lyeh wgah'nagl fhtagn!
Aug 4 '06 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

41
by: Richard James | last post by:
Are we looking at the scripting world through Python colored glasses? Has Python development been sleeping while the world of scripting languages has passed us Pythonista's by? On Saturday...
1
by: Rene Ruppert | last post by:
Hi, We've been using Index Server and IIS4 and the corresponding objects to search our sites. Everything fine. Now we have set up a new server running IIS6 and the search results always return...
4
by: James | last post by:
We have a need to search through an entire drive for a specific file name. The process is currently written with recursive loops through each directory and the Scripting.FileSystemObject. Problem...
4
by: anton | last post by:
Hi, I am googeling some hours now ... still without result. So I have a question: Does somebody know a filemanager: - which looks like Norton Commander/7-Zip Filemanager
2
by: Hans Georg Krauthaeuser | last post by:
Dear all, for the measurements in our labs we have developed python scripts that are pretty fine for our needs. Basically, we have classes and call the appropriate methods from the command line...
1
by: Psapg | last post by:
Hi! I'm new to javasript, and i must confess to have borowed a few free scripts from the net to satisfie my needs.... Still i can't find even an idea of ascipt to do this... Please Help!!! ...
1
by: ponsibabu | last post by:
We have several scripts for sale. We are selling them at reasonable prices and willing to work around your budget. For more information please contact totascriptz@yahoo.com with "Scripts" as...
5
Nepomuk
by: Nepomuk | last post by:
Hi everybody! I'm working on a website and would like to add a contact form. However, I don't want to have an email-address in my documents code. (So, no mailto:name@domain.com!) I've heard, that...
3
by: Morf | last post by:
Greetings, I am having a little issue with my project at work. I am an apprentice in a pretty big company for my education and my task is to make an asset management system with asp.net in C#. ...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.