473,406 Members | 2,745 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,406 software developers and data experts.

JS Web Robot

Hi,

I'm trying to write a web robot using JavaScript.
It's objective would be to surf around and look
for patterns in the way web pages link to each
other or in the text they contain. Data would be
returned in a web box which could later be copied
into another application.

That's not to tough a challenge. I can make a
JS application surf around my hard drive or
web site with ease. I simply click an html into
a second window and wait for the document
readyState to be complete, then grab the
document.links array and point the window
at a new location. Off it goes.

But when it tries to surf from my drive to
my web site, or from my web site to another
web site, it gets an error. It crashes the first
time it tries to check the readyState of a
document from a different server.

I think that maybe JS has been designed to foil
attempts to build web robots with it. If so, is there
any way around it? Or maybe I'm just missing a
critical JS detail or two. So, does anyone know
what's going on here? Can anyone help me out?

-Paul Dennis.
Jul 20 '05 #1
1 1932
"Paul Dennis" <pb******@telus.net> writes:
But when it tries to surf from my drive to
my web site, or from my web site to another
web site, it gets an error. It crashes the first
time it tries to check the readyState of a
document from a different server.

I think that maybe JS has been designed to foil
attempts to build web robots with it.
The browser security model has. If you try to access the content of a
page from a different domain, you are stopped - the hard way.
If so, is there any way around it?


Not in any browser, but if it is just your own browser you might be
able to give it extended permissions. If the browser is IE, you can
look into HTML Applications (google for "HTML application HTA").

/L
--
Lasse Reichstein Nielsen - lr*@hotpop.com
DHTML Death Colors: <URL:http://www.infimum.dk/HTML/rasterTriangleDOM.html>
'Faith without judgement merely degrades the spirit divine.'
Jul 20 '05 #2

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

2
by: Adrian Lumsden | last post by:
Hello, I have an app where I have to screen scrape to capture an image from a JMF film player. The user is given a dialog with a list of frames that can be exported as images. If the one they...
3
by: Christopher Koppler | last post by:
I need to simulate mouseclicks (in Win2K) in another application's window. Googling threw up java.awt.Robot, and a bit of trial and error showed that it could do what I needed (simulate a...
0
by: Robert | last post by:
I have python related stuff on some of my web pages. This month "The Python Robot" is going for over 400 MB / 4000 accesses downloads in my stats ! increasing frequency the last months!? thats...
0
by: Jonathan Vance | last post by:
I am looking for a python robot that Van Rossum released with python 0.9.8. It may have been the first web robot (see http://www.webhistory.org/www.lists/www-talk.1993q1/0060.html). I've had no...
11
by: Marcelo | last post by:
Hi, I need to make a robot that fills forms. I already made that with php sites, but now I need that in a javascript pages site ( the robot can be php,asp, asp.net ), anyone knows how can I do...
1
by: nnobakht | last post by:
Hi, I'm working on an assignment for school which i am a bit stuck on. The assignment is to make robot which i have been given the library for move around different boards and collecting "coins" and...
0
by: Shiv Kumar | last post by:
Rational Robot is a complete set of components for automating the testing of Microsoft Windows client/server and Internet applications running under Windows NT 4.0, Windows XP, Windows 2000, and...
11
by: socialanxiety | last post by:
i hope someone here can help me. basically, me and my friend have a summer project. in this project, we need something that would basically function as a blender. we know we'll need to buy a...
1
by: John Nagle | last post by:
I just discovered that the "robotparser" module interprets a 403 ("Forbidden") status on a "robots.txt" file as meaning "all access disallowed". That's unexpected behavior. A major site...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.