By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
459,527 Members | 1,210 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 459,527 IT Pros & Developers. It's quick & easy.

javascript shell

P: n/a
Hello all,

i want to ask a question about retrieving embedded javascript pages from
website:

my problem is that i'm currently working on possibility of mirroring some
websites whith wget - but when the page contains javascript i just cannot
fetch recursively all links from that page... and this is normal behaviour.

what i thought to do is something like

RECURSIVELY='-m'
OUTPUT_TO='-O'

fetch(url) {
wget url $OUTPUT_TO page.local

if contains_javascript(page.local); then
javascript_shell_fetches_all_possible_links page.local > links.pool

for i in `cat links.pool`; do
wget $RECURSIVELY $i
done

else
wget $RECURSIVELY page.local
fi

mv page.local page.html
}

as you see the real problem is javascript_shell_fetches_all_links: but i
thought that this point could be solved thanks to some existing javascript
shell as jython or rhine.

first of all can someone among you tell me if this solution could be
feasible (or codable in a reasonable period of time)...

.... any suggestion is appreciated (besides solutions that imply use of
commercial products)
--
Hekaton Keires,
En to phronein maedhen aedistos bios.

Jul 23 '05 #1
Share this Question
Share on Google+
1 Reply


P: n/a


Hekaton Keires wrote:

i want to ask a question about retrieving embedded javascript pages from
website:

my problem is that i'm currently working on possibility of mirroring some
websites whith wget - but when the page contains javascript i just cannot
fetch recursively all links from that page... and this is normal behaviour.


If you need help with wget then it seems asking in a group or mailing
list about that tool is more likely to get you help.

As for other tools, HttpUnit from
<http://www.httpunit.org/>
is a tool meant to allow automatic testing of web sites, it has (some)
JavaScript support (based on the Rhino engine), so perhaps it can assist
you in downloading pages and following links where JavaScript is used.
Check the web site for details and use their mailing list if needed.
--

Martin Honnen
http://JavaScript.FAQTs.com/
Jul 23 '05 #2

This discussion thread is closed

Replies have been disabled for this discussion.