Enhancing SpiderInNet

Hi,

I'm primarily a web developer and have recently moved on to .NET, and I'm
slowly realising its enormous potential, both through ASP.NET and through
integrating all the different services.

While teaching myself .NET I decided to start building a spider to pull a
load of information off other sites (thanks for the pointers on here, guys),
and one of the spiders that seems to suit my needs is the Spider in
.NET article on MSDN
(http://msdn.microsoft.com/msdnmag/is...T/default.aspx).
So far I have simply restricted the spider to URLs matching certain
parameters. I would now like to extend it to cope with loss of connection
and to pause the process. Is there any way of doing this? I have been
reading about pausing processes and needing other threads and so on, and
have got a little confused, so I would appreciate any advice.

Many thanks in advance.

Tim

Nov 21 '05 #1
You can kick off a "watcher" thread that monitors connection status,
socket traffic over time, and so on. This thread could then suspend, resume,
or kill the "spidering" threads for you.
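
A minimal sketch of the watcher idea, in Python for brevity. It is not code from the MSDN article, and it swaps in a different technique from the one suggested above: the worker is not suspended from outside but waits on a signal that the watcher sets and clears. The names PausableSpider, watch, fetch and is_online are hypothetical. In .NET, the closest counterpart to threading.Event is ManualResetEvent (Set, Reset, WaitOne).

```python
import threading
from collections import deque


class PausableSpider:
    """Crawls a queue of URLs; can be paused, resumed, or stopped by another thread."""

    def __init__(self, urls, fetch):
        self._queue = deque(urls)
        self._fetch = fetch                  # hypothetical callable; raises ConnectionError on failure
        self._running = threading.Event()    # set = allowed to crawl, cleared = paused
        self._running.set()
        self._stopped = threading.Event()
        self.done = []

    def pause(self):
        self._running.clear()

    def resume(self):
        self._running.set()

    def stop(self):
        self._stopped.set()
        self._running.set()                  # wake the worker so it can see the stop flag

    def run(self):
        while self._queue and not self._stopped.is_set():
            self._running.wait()             # blocks here while paused
            if self._stopped.is_set():
                break
            url = self._queue[0]
            try:
                self._fetch(url)
            except ConnectionError:
                self.pause()                 # keep the URL queued and wait for the watcher
                continue
            self._queue.popleft()
            self.done.append(url)


def watch(spider, is_online, stop_watching, interval=1.0):
    """Watcher loop: pause the spider while offline, resume it when back online."""
    while not stop_watching.is_set():
        if is_online():
            spider.resume()
        else:
            spider.pause()
        stop_watching.wait(interval)         # sleep, but wake early if asked to stop
```

Each loop would run on its own thread, for example threading.Thread(target=spider.run) and threading.Thread(target=watch, args=(spider, is_online, stop_flag)). Because the worker only pauses between fetches, it never stops halfway through a request, and a URL that failed stays at the head of the queue to be retried once the connection returns. One limitation of this sketch: the watcher calls resume() whenever the connection is up, so a manual pause would need its own separate flag.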

Nov 21 '05 #2
Ah, great, thanks. I'll look into that now. For some reason, though, the
spider just stops partway through spidering, which I have yet to figure out
:) Thanks again.

Tim

Darious Snell wrote:
You can kick off a "watcher" type thread that looks at connection
status / socket traffic-over-time etc. This thread could suspend/unsuspend
or kill "spidering" threads for you.

Nov 21 '05 #3
