Converting links in HTML documents to relative URLs?

Is there a command-line program for Unix that makes all links
relative in HTML documents saved in wget -x fashion?
(That is, http://foo.com/a/b.html is saved as ./foo.com/a/b.html.)

For example,

- if ./foo.com/a/b.html contains <img src="/images/d.jpg">
and ./foo.com/images/d.jpg
exists, replace that tag by <img src="../images/d.jpg">

- if ./foo.com/a/b.html contains <a href="http://bar.org/c.html">
and ./bar.org/c.html
exists, replace that tag by <a href="../../bar.org/c.html">
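
To make the two cases above concrete, here is a minimal Python sketch of
the path computation only. It is not an existing tool, and the names
relativize, doc_path and root are made up for illustration. It assumes
the mirror uses the wget -x layout (host directory, then the URL path),
that the documents were fetched over http, and that links carrying a
query string should be left alone. Rewriting the tags themselves would
still need an HTML parser on top of this.

```python
import os
import posixpath
from urllib.parse import urljoin, urlsplit


def relativize(link, doc_path, root="."):
    """Return a relative link to the mirrored copy of `link`, or `link`
    unchanged if the target is not present in the mirror.

    doc_path is the document's path relative to the mirror root,
    e.g. "foo.com/a/b.html" (hypothetical layout, as produced by wget -x).
    """
    # Assumption: the document's original URL is http:// + its mirror path.
    base_url = "http://" + doc_path
    target = urlsplit(urljoin(base_url, link))

    # Leave mailto:, javascript:, etc. and query-string links untouched.
    if target.scheme not in ("http", "https") or not target.netloc:
        return link
    if target.query:
        return link

    # Where the target would live inside the mirror.
    local = posixpath.join(target.netloc, target.path.lstrip("/"))
    if not os.path.isfile(os.path.join(root, local)):
        return link

    # Path from the document's directory to the target file.
    rel = posixpath.relpath(local, posixpath.dirname(doc_path))
    if target.fragment:
        rel += "#" + target.fragment
    return rel
```

With ./foo.com/images/d.jpg and ./bar.org/c.html present in the mirror,
relativize("/images/d.jpg", "foo.com/a/b.html") gives "../images/d.jpg"
and relativize("http://bar.org/c.html", "foo.com/a/b.html") gives
"../../bar.org/c.html". A link whose target is missing from the mirror
comes back unchanged.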

I know about wget -k and it doesn't do what I need. My goal is
to use wget or some such to have an exact mirror of the web site
and then make a _copy_ of the mirror that can be navigated

Thanks in advance.

André Majorel <URL:http://www.teaser.fr/~amajorel/>
"I miss the Concorde. At least it took you straight to the
hotel." -- Cyrano
Sep 4 '08 #1