Connecting Tech Pros Worldwide Help | Site Map

Converting links in HTML documents to relative URL ?

 
LinkBack Thread Tools Search this Thread
  #1  
Old September 4th, 2008, 09:35 AM
Andre Majorel
Guest
 
Posts: n/a
Default Converting links in HTML documents to relative URL ?

Is there some command-line program for Unix to make all links
relative in HTML documents saved in wget -x fashion ?
(http://foo.com/a/b.html saved as ./foo.com/a/b.html.)

For example,

- if ./foo.com/a/b.html contains <img src="/images/d.jpg">
and ./foo.com/images/d.jpg
exists, replace that tag by <img src="../images/d.jpg">

- if ./foo.com/a/b.html contains <a href="http://bar.org/c.html">
and ./bar.org/c.html
exists, replace that tag by <a href="../../bar.org/c.html">

I know about wget -k and it doesn't do what I need. My goal is
to use wget or some such to have an exact mirror of the web site
and then make a _copy_ of the mirror that can be navigated
off-line.

Thanks in advance.

--
André Majorel <URL:http://www.teaser.fr/~amajorel/>
"Je regrette le Concorde. Au moins il vous amenait directement à
l'hôtel." -- Cyrano

 

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Popular Articles

What is Bytes?

We are a network of experts and professionals in IT and software development that help one another with answers to tough questions and share insights. Get the best answers to your questions from over 220,989 network members.