473,385 Members | 1,492 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,385 software developers and data experts.

open source .NET search engine?

does anyone know of a framework; or tools; or something-- that
describes an open source VB.net search engine / spider?

anyone want to trade notes?

I want to build something a lot more focused than google; for example;
I want to spider Home Depot websites and sell it to Lowes.

Does anyone want to help?

-Susie

Oct 17 '06 #1
11 2574
"su******@hotmail.com" <su******@hotmail.comwrote in
news:11**********************@i3g2000cwc.googlegro ups.com:
does anyone know of a framework; or tools; or something-- that
describes an open source VB.net search engine / spider?

anyone want to trade notes?

I want to build something a lot more focused than google; for example;
I want to spider Home Depot websites and sell it to Lowes.

Lucene.NET or Microsoft Index Server or SQL Server Full Text Search Engine.
Oct 17 '06 #2
Susie,

As for your subject: open source .NET search engine try
http://www.dotlucene.net/

As for spidering, there are many website copiers out there, try HTTrack
http://www.httrack.com/.

I am not condoning unethical copyright infringement. However, I'm so sure
that you will be unable to sell a competitor's website to a large company
like Lowes that I give this information. Besides, I'm sure they have many
talented people in their IT department that could get them this info in no
time at all.

As far as building something more focused than google--good luck. Google,
just like Microsoft, has top computer scientist working on some crazy stuff
i.e. natural language processing, query analysis, best bets, controlled
vocabularies and a little artificial intelligence. Most of which involves a
fair amount of some pretty advanced mathematics. I'm not trying to
discourage you to not build something better; there are lots of brilliant
people in this world. Just use your brilliance for something good!

As far as making money, why not use the Amazon E-commerce Web Service. It
gives you access to all of Amazon's products, images, reviews, pricing, and
a remote shopping cart system. You can just mark up the prices a bit to make
money for not doing much of anything besides coding a website.
http://aws.amazon.com
http://www.google.com/search?hl=en&q=amazon+web+service

Chris
<su******@hotmail.comwrote in message
news:11**********************@i3g2000cwc.googlegro ups.com...
does anyone know of a framework; or tools; or something-- that
describes an open source VB.net search engine / spider?

anyone want to trade notes?

I want to build something a lot more focused than google; for example;
I want to spider Home Depot websites and sell it to Lowes.

Does anyone want to help?

-Susie

Oct 17 '06 #3
thanks guys; does anyone else have any ideas??
Chris wrote:
Susie,

As for your subject: open source .NET search engine try
http://www.dotlucene.net/

As for spidering, there are many website copiers out there, try HTTrack
http://www.httrack.com/.

I am not condoning unethical copyright infringement. However, I'm so sure
that you will be unable to sell a competitor's website to a large company
like Lowes that I give this information. Besides, I'm sure they have many
talented people in their IT department that could get them this info in no
time at all.

As far as building something more focused than google--good luck. Google,
just like Microsoft, has top computer scientist working on some crazy stuff
i.e. natural language processing, query analysis, best bets, controlled
vocabularies and a little artificial intelligence. Most of which involves a
fair amount of some pretty advanced mathematics. I'm not trying to
discourage you to not build something better; there are lots of brilliant
people in this world. Just use your brilliance for something good!

As far as making money, why not use the Amazon E-commerce Web Service. It
gives you access to all of Amazon's products, images, reviews, pricing, and
a remote shopping cart system. You can just mark up the prices a bit to make
money for not doing much of anything besides coding a website.
http://aws.amazon.com
http://www.google.com/search?hl=en&q=amazon+web+service

Chris
<su******@hotmail.comwrote in message
news:11**********************@i3g2000cwc.googlegro ups.com...
does anyone know of a framework; or tools; or something-- that
describes an open source VB.net search engine / spider?

anyone want to trade notes?

I want to build something a lot more focused than google; for example;
I want to spider Home Depot websites and sell it to Lowes.

Does anyone want to help?

-Susie
Oct 17 '06 #4
I really do think that there is room for a new service.
I just am going to have some sort of scope to my project-- instead of
pulling a napoleon- like google does-- and try to enter EVERY MARKET at
the same time.

moving from search engines to spreadsheets; IM; Email; Usenet; Books;
eCommerce-- I just dont think that google is 'big enough' to be
successful in any of these new markets.

which means that there is room for innovation.

I've had many customers ask me to pull XYZ off of site ABC

I personally think that a simple search engine should consist of a
couple of Olap Servers and a couple of relational boxes.. and a couple
of crawlers... not too complex at all.

-Susie


Chris wrote:
Susie,

As for your subject: open source .NET search engine try
http://www.dotlucene.net/

As for spidering, there are many website copiers out there, try HTTrack
http://www.httrack.com/.

I am not condoning unethical copyright infringement. However, I'm so sure
that you will be unable to sell a competitor's website to a large company
like Lowes that I give this information. Besides, I'm sure they have many
talented people in their IT department that could get them this info in no
time at all.

As far as building something more focused than google--good luck. Google,
just like Microsoft, has top computer scientist working on some crazy stuff
i.e. natural language processing, query analysis, best bets, controlled
vocabularies and a little artificial intelligence. Most of which involves a
fair amount of some pretty advanced mathematics. I'm not trying to
discourage you to not build something better; there are lots of brilliant
people in this world. Just use your brilliance for something good!

As far as making money, why not use the Amazon E-commerce Web Service. It
gives you access to all of Amazon's products, images, reviews, pricing, and
a remote shopping cart system. You can just mark up the prices a bit to make
money for not doing much of anything besides coding a website.
http://aws.amazon.com
http://www.google.com/search?hl=en&q=amazon+web+service

Chris
<su******@hotmail.comwrote in message
news:11**********************@i3g2000cwc.googlegro ups.com...
does anyone know of a framework; or tools; or something-- that
describes an open source VB.net search engine / spider?

anyone want to trade notes?

I want to build something a lot more focused than google; for example;
I want to spider Home Depot websites and sell it to Lowes.

Does anyone want to help?

-Susie
Oct 17 '06 #5
and it goes without saying that I think that Microsoft is completely
and utterly incompetent.

they're pulling a napoleon also.. they need to sell their Xbox and MSN
division and fold it back into their core competencies.

supposedly they have 5,000 developers and testers working on vista AND
office 2007.

what the hell are the other 60,000 employees doing?


su******@hotmail.com wrote:
thanks guys; does anyone else have any ideas??
Chris wrote:
Susie,

As for your subject: open source .NET search engine try
http://www.dotlucene.net/

As for spidering, there are many website copiers out there, try HTTrack
http://www.httrack.com/.

I am not condoning unethical copyright infringement. However, I'm so sure
that you will be unable to sell a competitor's website to a large company
like Lowes that I give this information. Besides, I'm sure they have many
talented people in their IT department that could get them this info in no
time at all.

As far as building something more focused than google--good luck. Google,
just like Microsoft, has top computer scientist working on some crazy stuff
i.e. natural language processing, query analysis, best bets, controlled
vocabularies and a little artificial intelligence. Most of which involves a
fair amount of some pretty advanced mathematics. I'm not trying to
discourage you to not build something better; there are lots of brilliant
people in this world. Just use your brilliance for something good!

As far as making money, why not use the Amazon E-commerce Web Service. It
gives you access to all of Amazon's products, images, reviews, pricing, and
a remote shopping cart system. You can just mark up the prices a bit to make
money for not doing much of anything besides coding a website.
http://aws.amazon.com
http://www.google.com/search?hl=en&q=amazon+web+service

Chris
<su******@hotmail.comwrote in message
news:11**********************@i3g2000cwc.googlegro ups.com...
does anyone know of a framework; or tools; or something-- that
describes an open source VB.net search engine / spider?
>
anyone want to trade notes?
>
I want to build something a lot more focused than google; for example;
I want to spider Home Depot websites and sell it to Lowes.
>
Does anyone want to help?
>
-Susie
>
Oct 17 '06 #6
"su******@hotmail.com" <su******@hotmail.comwrote in
news:11*********************@e3g2000cwe.googlegrou ps.com:
supposedly they have 5,000 developers and testers working on vista AND
office 2007.

what the hell are the other 60,000 employees doing?
Microsoft does have more than 2 products. .NET, VS.NET, SQL Server :-)
Oct 17 '06 #7
"su******@hotmail.com" <su******@hotmail.comwrote in
news:11*********************@e3g2000cwe.googlegrou ps.com:
I personally think that a simple search engine should consist of a
couple of Olap Servers and a couple of relational boxes.. and a couple
of crawlers... not too complex at all.
Good luck if you don't think it's complex - there's a reason why Google
hires a lot of PhDs!
Oct 17 '06 #8
not really; .NET VS.NET and SQL Server sure dont take more people than
Office and Windows.

what about revenue.

80% of their revenue comes from Office and Windows???

Spam Catcher wrote:
"su******@hotmail.com" <su******@hotmail.comwrote in
news:11*********************@e3g2000cwe.googlegrou ps.com:
supposedly they have 5,000 developers and testers working on vista AND
office 2007.

what the hell are the other 60,000 employees doing?

Microsoft does have more than 2 products. .NET, VS.NET, SQL Server :-)
Oct 17 '06 #9
the reason that they hire a lot of PhD is because they reinvent the
wheel.

I dont see a need to write your own Operating System, Web Browser;
Database Engine.
I dont see a need to write everything in crazy-ass-complex AJAX.

I just think that they're too trendy and not functional enough.

I've also had a half dozen clients ask me if I can crawl website X and
give them data XYZ.

And I dont think that there is an enterprise level search engine.. I
mean-- store it in a database and allow simple queries against a
database.

I just think that if i had a good datamart and a couple of olap servers
I could run circles around google.


Spam Catcher wrote:
"su******@hotmail.com" <su******@hotmail.comwrote in
news:11*********************@e3g2000cwe.googlegrou ps.com:
I personally think that a simple search engine should consist of a
couple of Olap Servers and a couple of relational boxes.. and a couple
of crawlers... not too complex at all.

Good luck if you don't think it's complex - there's a reason why Google
hires a lot of PhDs!
Oct 17 '06 #10
I mean seriously

if you can use Olap to look for similiar words; etc
I just dont think that it would be that complex.

and you people that sit around and think that google is worth more than
IBM?

LAUGHABLE.

I just think that it's a shame.. I would love to help you susie--
because I think that there IS a better solution.

Instead of using 100 different languges from Perl to PHP to BigTable to
GoogleOS- I just call hogwash on their ass.

maybe we should build a project on sourceforge.. does anyone have an
idea?

literally-- database driven search engine; most everything lives in
mySql with a couple of SQL Server boxes at the top of the equation.

Oct 17 '06 #11
wow; aaron.. I agree.

why dont you drop me an email at susiedba

AT
hotmale.com
aa*********@gmail.com wrote:
the reason that they hire a lot of PhD is because they reinvent the
wheel.

I dont see a need to write your own Operating System, Web Browser;
Database Engine.
I dont see a need to write everything in crazy-ass-complex AJAX.

I just think that they're too trendy and not functional enough.

I've also had a half dozen clients ask me if I can crawl website X and
give them data XYZ.

And I dont think that there is an enterprise level search engine.. I
mean-- store it in a database and allow simple queries against a
database.

I just think that if i had a good datamart and a couple of olap servers
I could run circles around google.


Spam Catcher wrote:
"su******@hotmail.com" <su******@hotmail.comwrote in
news:11*********************@e3g2000cwe.googlegrou ps.com:
I personally think that a simple search engine should consist of a
couple of Olap Servers and a couple of relational boxes.. and a couple
of crawlers... not too complex at all.
Good luck if you don't think it's complex - there's a reason why Google
hires a lot of PhDs!
Oct 18 '06 #12

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

0
by: Unigroup of New York | last post by:
Content-Type: multipart/mixed; boundary="------------C465DF38DCB38DD2AF7117E0" Lines: 327 Date: Tue, 15 Feb 2005 23:36:38 -0500 NNTP-Posting-Host: 24.46.113.251 X-Complaints-To: abuse@cv.net...
5
by: George | last post by:
Hi, Anyone has the background for explaining? I have made a search on my name and I have got a link to another search engine. The link's title was the search phrase for the other search engine...
13
by: sembiance | last post by:
Hi folks :) I've been working on a C/C++ Source Code search engine website for over a year now, and just thought I'd let you all know that I just put it live a few minutes ago. It searches 99...
7
by: Brandon J. Van Every | last post by:
Anyone know of any "good" open source C# game projects out there? Something that actually has a game engine and some content done, so I can just fiddle with it and do interesting / goofy things. ...
7
by: ABC | last post by:
Hi, All Is there any search engine source code for reference? Thanks
158
by: Giovanni Bajo | last post by:
Hello, I just read this mail by Brett Cannon: http://mail.python.org/pipermail/python-dev/2006-October/069139.html where the "PSF infrastracture committee", after weeks of evaluation, recommends...
1
by: cglewis03 | last post by:
Hello, I am trying to build a search form with several different options to choose from. Currently it is set up to open within the same window if a single option is selected and open within a...
4
by: a | last post by:
hallo I have the input box of the internal custom "Google Search Engine" in a page of mine. The page with all the result appears on the same page, ok. Now if I click on a result, I'd like...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...
0
by: ryjfgjl | last post by:
In our work, we often need to import Excel data into databases (such as MySQL, SQL Server, Oracle) for data analysis and processing. Usually, we use database tools like Navicat or the Excel import...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.