473,406 Members | 2,378 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,406 software developers and data experts.

Search engines crawling our .NET site

Our site gets searched by robots all the time. This is great. However,
many of our pages that we want to be cataloged are data driven, so we end up
with pages like:

www.ourdomain.com/products.aspx?productid=356

Let's assume that we stop selling productid 356. This means that the url
above is invalid. If a general user has bookmarked this page or pastes in a
url into a browser that isn't quite right, we want them to get a 'pretty'
error message. However, using this approach means that a search engine like
google will interpret this page as still being "OK" and will continue to
catalog it.

Off the top of my head, I see two solutions:

1. Redirect to a bogus page that doesn't exist, so a 404 message displays.
Search engines would remove the link in their catalog(hopefully????), but
users would not have a "pleasing" experience.
2. Throw an unhandled exception. A ugly ASP.NET error message page is
displayed. This ticks off the user, but hopefully the search engine
realizes that the page does not actually exist? Probably not.

Suggestions on how to handle this?

Thanks in advance.

Mark
Nov 19 '05 #1
3 1273
Option 3:
Don't worry about it and send the consumer to a page that says "we no longer
carry that product, but suggest this instead" and keep old products in a
"dead product" table with cross sell possibilities.
---

Gregory A. Beamer
MVP; MCP: +I, SE, SD, DBA

***************************
Think Outside the Box!
***************************

"Mark" wrote:
Our site gets searched by robots all the time. This is great. However,
many of our pages that we want to be cataloged are data driven, so we end up
with pages like:

www.ourdomain.com/products.aspx?productid=356

Let's assume that we stop selling productid 356. This means that the url
above is invalid. If a general user has bookmarked this page or pastes in a
url into a browser that isn't quite right, we want them to get a 'pretty'
error message. However, using this approach means that a search engine like
google will interpret this page as still being "OK" and will continue to
catalog it.

Off the top of my head, I see two solutions:

1. Redirect to a bogus page that doesn't exist, so a 404 message displays.
Search engines would remove the link in their catalog(hopefully????), but
users would not have a "pleasing" experience.
2. Throw an unhandled exception. A ugly ASP.NET error message page is
displayed. This ticks off the user, but hopefully the search engine
realizes that the page does not actually exist? Probably not.

Suggestions on how to handle this?

Thanks in advance.

Mark

Nov 19 '05 #2
most search robots support the robot headers, just include on your friendly
page:

<META name="ROBOTS" content="NOINDEX">

-- bruce (sqlwork.com)
"Mark" <Ma**@nowhere.com> wrote in message
news:u7**************@TK2MSFTNGP12.phx.gbl...
| Our site gets searched by robots all the time. This is great. However,
| many of our pages that we want to be cataloged are data driven, so we end
up
| with pages like:
|
| www.ourdomain.com/products.aspx?productid=356
|
| Let's assume that we stop selling productid 356. This means that the url
| above is invalid. If a general user has bookmarked this page or pastes in
a
| url into a browser that isn't quite right, we want them to get a 'pretty'
| error message. However, using this approach means that a search engine
like
| google will interpret this page as still being "OK" and will continue to
| catalog it.
|
| Off the top of my head, I see two solutions:
|
| 1. Redirect to a bogus page that doesn't exist, so a 404 message displays.
| Search engines would remove the link in their catalog(hopefully????), but
| users would not have a "pleasing" experience.
| 2. Throw an unhandled exception. A ugly ASP.NET error message page is
| displayed. This ticks off the user, but hopefully the search engine
| realizes that the page does not actually exist? Probably not.
|
| Suggestions on how to handle this?
|
| Thanks in advance.
|
| Mark
|
|
Nov 19 '05 #3
Hi Mark,

You could also place a "robots.txt" text file in the root directory of
your app, containing this :

User-agent:*
Disallow:products.aspx

Although I find Gregory's solution very elegant, user-friendly and
commercialy sound. I'd go that way. Out of the box...

HTH,

Michel

"Mark" <Ma**@nowhere.com> wrote in message news:<u7**************@TK2MSFTNGP12.phx.gbl>...
Our site gets searched by robots all the time. This is great. However,
many of our pages that we want to be cataloged are data driven, so we end up
with pages like:

www.ourdomain.com/products.aspx?productid=356

Let's assume that we stop selling productid 356. This means that the url
above is invalid. If a general user has bookmarked this page or pastes in a
url into a browser that isn't quite right, we want them to get a 'pretty'
error message. However, using this approach means that a search engine like
google will interpret this page as still being "OK" and will continue to
catalog it.

Off the top of my head, I see two solutions:

1. Redirect to a bogus page that doesn't exist, so a 404 message displays.
Search engines would remove the link in their catalog(hopefully????), but
users would not have a "pleasing" experience.
2. Throw an unhandled exception. A ugly ASP.NET error message page is
displayed. This ticks off the user, but hopefully the search engine
realizes that the page does not actually exist? Probably not.

Suggestions on how to handle this?

Thanks in advance.

Mark

Nov 19 '05 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

0
by: R. Rajesh Jeba Anbiah | last post by:
Q: Is PHP search engine friendly? Q: Will search engine spiders crawl my PHP pages? A: Spiders should crawl anything provided they're accessible. Since, nowadays most of the websites are been...
11
by: Petre Huile | last post by:
I have designed a site for a client, but they have hired an internet marketing person to incrase their search engine ranking and traffic. He wants to put extra-large fonts on every page which will...
2
by: Patrick | last post by:
Are the differences between a search engine, a subject directory and a meta search engine significant for an ebusiness web site owner? A meta search engine merely uses ordinary existing search...
8
by: Steve | last post by:
I notice that search engines are now finding robots.txt files and catalogue their contents. Is this wise I wonder? Is it a possible security risk? I even found the White House robots.txt file on...
67
by: Sandy.Pittendrigh | last post by:
Here's a question I don't know the answer to: I have a friend who makes very expensive, hand-made bamboo flyrods. He's widely recognized (in the fishing industry) as one of the 3-5 'best' rod...
4
by: MDW | last post by:
Posted this on another board, but evidently it was off-topic there...hope you folks will be able to provide some guidance. I've been working on a Web site for a business (my first non-personal...
2
by: Griff | last post by:
Hi We have an eCommerce site that was designed as a BusinessToBusiness system. When anyone accesses a page, the site checks to see whether they have a current session (i.e. already...
3
by: MZ | last post by:
Hello! I have registered my web page on www.google.pl 7 days ago. Till now my web page is not being found when I type a few keywords declared in meta elements. My web page is not being found...
0
by: passion | last post by:
"Specialized Search Engines" along with Google Search Capability (2 in 1): http://specialized-search-engines.blogspot.com/ Billions of websites are available on the web and plenty of extremely...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.