Converting links in HTML documents to relative URLs?

Is there a command-line program for Unix that makes all links
relative in HTML documents saved in wget -x fashion
(http://foo.com/a/b.html saved as ./foo.com/a/b.html)?

For example,

- if ./foo.com/a/b.html contains <img src="/images/d.jpg">
and ./foo.com/images/d.jpg
exists, replace that tag with <img src="../images/d.jpg">

- if ./foo.com/a/b.html contains <a href="http://bar.org/c.html">
and ./bar.org/c.html
exists, replace that tag with <a href="../../bar.org/c.html">
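
The path arithmetic behind these two examples can be sketched in Python. This is only an illustration of the rule described above, not an existing tool: the function name relativize and its parameters are made up here, and it assumes the mirror uses the wget -x layout (one directory per host). Links with a query string, a port, or a protocol-relative form are left untouched for simplicity.

```python
import os
import posixpath
from urllib.parse import urlsplit, unquote


def relativize(link, page_path, mirror_root="."):
    """Return `link` rewritten relative to `page_path`, or `link` unchanged.

    `page_path` is the saved page relative to `mirror_root`, in the
    wget -x layout, e.g. "foo.com/a/b.html" (hypothetical example name).
    """
    host = page_path.split("/", 1)[0]
    parts = urlsplit(link)

    # Assumption: links carrying a query string are not handled here.
    if parts.query:
        return link

    if parts.scheme in ("http", "https") and parts.netloc:
        # Absolute URL: the host becomes the top-level directory.
        target = parts.netloc + parts.path
    elif not parts.scheme and not parts.netloc and parts.path.startswith("/"):
        # Site-absolute path: resolve it under the page's own host.
        target = host + parts.path
    else:
        # Already relative, or a form this sketch does not cover.
        return link

    target = posixpath.normpath(unquote(target))
    if target.startswith(".."):
        # Refuse anything that would escape the mirror directory.
        return link

    # Only rewrite when the target was actually mirrored.
    if not os.path.isfile(os.path.join(mirror_root, *target.split("/"))):
        return link

    rel = posixpath.relpath(target, posixpath.dirname(page_path))
    if parts.fragment:
        rel += "#" + parts.fragment
    return rel
```

Calling relativize("/images/d.jpg", "foo.com/a/b.html") from the directory that holds the mirror would give "../images/d.jpg" if the image was saved. Applying this to every src and href attribute still needs an HTML parser on top, which the sketch leaves out.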

I know about wget -k and it doesn't do what I need. My goal is
to use wget or some such to have an exact mirror of the web site
and then make a _copy_ of the mirror that can be navigated

Thanks in advance.

André Majorel <URL:http://www.teaser.fr/~amajorel/>
"I miss the Concorde. At least it took you straight to
the hotel." -- Cyrano
Sep 4 '08 #1