C++ Crawlers

Hi
Does anyone here have a good recommendation for an open source crawler
that I could get my hands on? It doesn't have to be C++ based. I am
interested in learning how crawling works. I think Python-based
crawlers will ensure a high degree of flexibility, but at the same time
I am torn between looking for open source crawlers in Python vs. C++,
because the latter is much more efficient (or so I heard; I will be
crawling on very cheap hardware).

I am definitely open to suggestions. Please suggest some names if you
could.

Thx
Jul 5 '08 #1
di***********@gmail.com wrote:
> Hi
> Does anyone here have a good recommendation for an open source crawler
> that I could get my hands on?
That depends on what "an open source crawler" is. However, your question is
off-topic here, since this newsgroup deals with questions about the C++
language itself. So if you have a specific problem with the language while
implementing or compiling an "open source crawler", this would be the right
newsgroup.

Jul 5 '08 #2
di***********@gmail.com wrote:
> [redacted]
>
> I am definitely open to suggestions. Please suggest some names if you
> could.
Here's a suggestion -- ask where it's on topic.

C++ as defined by ISO/IEC 14882:2003 doesn't discuss "crawlers"
(whatever the heck they are). The very fact that your post said that
Python-based is acceptable indicates that it's OT here.
Jul 5 '08 #3
