page id to the database. It is the main content which is stored as HTML
so that when it is displayed it has the required formatting for the
display (i.e., bold text, italics, bullet points, etc.)
I am assuming that none of the dynamic HTML content contains form elements
that could submit the page's main form (the [runat='server'] form), if there
is one, because that may break ASP.NET processing.
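If you want to verify that assumption rather than take it on trust, a rough
check over the stored HTML could look like the sketch below. It is Python
only for brevity, not ASP.NET code; the tag list and the names FORM_TAGS,
FormTagFinder and find_form_tags are my own assumptions for illustration.

```python
from html.parser import HTMLParser

# Tags that could submit or interfere with the page's main form.
# This list is an assumption for illustration; adjust it to your content.
FORM_TAGS = {"form", "input", "button", "select", "textarea"}


class FormTagFinder(HTMLParser):
    """Collects any form-related tags seen while parsing a fragment."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this method,
        # and self-closing tags such as <input /> are routed here too.
        if tag in FORM_TAGS:
            self.found.append(tag)


def find_form_tags(html):
    """Return the form-related tags present in an HTML fragment, in order."""
    finder = FormTagFinder()
    finder.feed(html)
    finder.close()
    return finder.found
```

An empty list means the fragment should be safe to drop inside the main
form; anything else is worth a look before you render it.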
Also, I am assuming that what you want to do is grab the HTML from the
row that the search matched and display its contents.
Since you are after a free-text match rather than a 'WHERE' or 'LIKE' match,
I would suggest one of these options:
1. Use Microsoft Indexing Service (MIS) to index your files. Instead of
putting the HTML content in the db, put it in files and let MIS do the job.
The results are fairly good. We have been using it for our internal
purposes... basically to test our KM product.
2. Buy Google MiniSearch and let it index the documents. You query it using
APIs (Web Service enabled). You trust Google? I do... at least for search.
Do look at the cost figures... MiniSearch can index up to around 1 million
documents of any type (you'd be interested in HTML content only) but costs
around $x,000 (don't recall if x = 2 or x = 5 :D).
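For option 1, getting the HTML out of the database and into files could be
sketched roughly as below. This is Python with SQLite only to keep the
example self-contained; the table and column names (pages, page_id, content)
and the function name export_pages are hypothetical, so substitute your own
schema and data access code.

```python
import sqlite3
from pathlib import Path


def export_pages(conn, out_dir):
    """Write each row's HTML to <out_dir>/<page_id>.html and return the paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    # Hypothetical schema: pages(page_id INTEGER, content TEXT).
    query = "SELECT page_id, content FROM pages ORDER BY page_id"
    for page_id, content in conn.execute(query):
        # Naming the file after the page id lets a search hit on the file
        # be mapped straight back to the page it came from.
        path = out / f"{int(page_id)}.html"
        path.write_text(content, encoding="utf-8")
        written.append(path)
    return written
```

Point the indexer at the output directory; since each file is named after
its page id, a matching file name tells you which page to display.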
If you are looking to scale up your operations and have control over the
hosting environment, my personal recommendation would be Google MiniSearch,
since MIS tends to get much slower as the repository grows (at around
100-200k documents or more; ours is a mix of text, HTML, and Office [doc,
ppt, xls] documents).
Hope that helps!
--
Happy Hacking,
Gaurav Vaish |
www.mastergaurav.com www.edujinionline.com http://eduzine.edujinionline.com