473,397 Members | 2,028 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,397 software developers and data experts.

How to batch download files from web page?

sj
I wish to download hundreds of files from the University of Iowa sound
archive. Doing it manually would be a daunting task especially since the
files are each a few mega bytes long. Is there a standard way of using
Python for such a task? I have a fair amount of programming experiance but
very little of it relates to networks.

For those who are intrested the University of Iowa's sound archive may be
found at http://theremin.music.uiowa.edu/MIS.html

Thanks in advance
Steven Jones
Jul 18 '05 #1
3 4515
sj <no****@forgetit.com> writes:
I wish to download hundreds of files from the University of Iowa sound
archive. Doing it manually would be a daunting task especially since the
files are each a few mega bytes long. Is there a standard way of using
Python for such a task? I have a fair amount of programming experiance but
very little of it relates to networks.


You could use the urllib module, or maybe you could use wget:

http://www.gnu.org/software/wget/wget.html
Jul 18 '05 #2
N.K
Consider http://harvestman.freezope.org
( Python)

sj <no****@forgetit.com> wrote in message news:<wy*******************@newssvr22.news.prodigy .com>...
I wish to download hundreds of files from the University of Iowa sound
archive. Doing it manually would be a daunting task especially since the
files are each a few mega bytes long. Is there a standard way of using
Python for such a task? I have a fair amount of programming experiance but
very little of it relates to networks.

For those who are intrested the University of Iowa's sound archive may be
found at http://theremin.music.uiowa.edu/MIS.html

Thanks in advance
Steven Jones

Jul 18 '05 #3
> I wish to download hundreds of files from the University of Iowa
sound archive. Doing it manually would be a daunting task
especially since the files are each a few mega bytes long. Is there
a standard way of using Python for such a task? I have a fair
amount of programming experiance but very little of it relates
to networks.

For those who are intrested the University of Iowa's sound archive
may be found at http://theremin.music.uiowa.edu/MIS.html


The easiest way to download the individual files is with
urllib.urlretrieve().

You can parse the HTML files using either htmllib.HTMLParser or the
HTMLParser module, combined with urllib.urlopen().

If you do this Google search:

urlopen htmlparser

The first several matches have some nice code samples showing how to find
the links in a web page. Add urlretrieve() to download the files and you'll
be off and running.

-Mike
Jul 18 '05 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

10
by: scrimp | last post by:
Ive been using PAMIE 1.4 to try to automate web page processes. The one thing I cannot do with it is upload files and download files. With uploading files, the file input box does not allow PAMIE...
0
by: Sachit | last post by:
I am working on an BIT application to download files from remote server using C#, There are an issue related with performance, I am downloading 3 files 28Kb, 79KB, 121KB respectivly on my...
2
by: Grant | last post by:
Hi, I'm trying to download files using asp.net (vb) and it seems to take forever to actually bring up the Save As dialog box. The files are Academic Software Downloads so are quite large in...
6
by: Michael Tissington | last post by:
I have setup up Forms Authentication on my website and added configuration/mappings for exe, pdf and zip files (using C:\WINDOWS\Microsoft.NET\Framework\v2.0.50727\aspnet_isapi.dll) However when...
2
by: Roger Withnell | last post by:
I'm using the Response object download files from an SQL Server table to the client browser. Having set the content type and AddHeader, I use Response.Write and the file opens in the browser. ...
0
by: tanyali | last post by:
using php5, I download files : ******** downloadfile.php $fileContent = @mysql_result($result1,$i,"gdata"); ...
12
nathj
by: nathj | last post by:
Hi, I am working on a system, as you may have seen from my other posts, that allows members to download files. When they download a file I want to store the user ID and the file ID in a table...
5
by: FAQ server | last post by:
----------------------------------------------------------------------- FAQ Topic - How do I download a page to a variable? ----------------------------------------------------------------------- ...
5
by: nagmvs | last post by:
Hello, Can anyone tell me how can i create a button in asp and link to download files from some folder one by one Simultaneously? I wrote some code to download one file,i don't no...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.