473,394 Members | 1,761 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,394 software developers and data experts.

Searching TEXT fields

Hello,

I am creating a databse with a large number of text (or blob?)
entries. I want users to be able to search these fields.

An example would be a forum or journal. Indexing every word in the
post seems costly. If I have 100,000 posts over a few months, and each
one has 50 words in it, that is a 500,000 match entries. The databse I
will be creating will be even larger. For longer posts, there is a
huge overhead of dozens of INSERT commands, which would then further
slow any search attempts (write locking).

Is there a better method for indexing free-style text entries so they
are readibly searchable? Also, the above mentioned method doesn't
allow much wiggle room for "exact phrase" searching. What is the best
way to approach this problem for a large scale database (500,000
rows+)?
Jul 20 '05 #1
2 1864
Michi wrote:
Hello,

I am creating a databse with a large number of text (or blob?)
entries. I want users to be able to search these fields.

An example would be a forum or journal. Indexing every word in the
post seems costly. If I have 100,000 posts over a few months, and each
one has 50 words in it, that is a 500,000 match entries. The databse I
will be creating will be even larger. For longer posts, there is a
huge overhead of dozens of INSERT commands, which would then further
slow any search attempts (write locking).

Is there a better method for indexing free-style text entries so they
are readibly searchable? Also, the above mentioned method doesn't
allow much wiggle room for "exact phrase" searching. What is the best
way to approach this problem for a large scale database (500,000
rows+)?


Use regular expressions :)
SELECT * FROM forum WHERE message REGEXP 'text'
Jul 20 '05 #2
Lüpher Cypher <lu***********@verizon.net> wrote in message news:<N1kcd.868$WN5.765@trndny08>...
Michi wrote:
Hello,

I am creating a databse with a large number of text (or blob?)
entries. I want users to be able to search these fields.

An example would be a forum or journal. Indexing every word in the
post seems costly. If I have 100,000 posts over a few months, and each
one has 50 words in it, that is a 500,000 match entries. The databse I
will be creating will be even larger. For longer posts, there is a
huge overhead of dozens of INSERT commands, which would then further
slow any search attempts (write locking).

Is there a better method for indexing free-style text entries so they
are readibly searchable? Also, the above mentioned method doesn't
allow much wiggle room for "exact phrase" searching. What is the best
way to approach this problem for a large scale database (500,000
rows+)?


Use regular expressions :)
SELECT * FROM forum WHERE message REGEXP 'text'


That seems highly inefficient. If I were to search for a word through
500,000 rows, it would take the full time of having to read and
process every row. I might as well not index anything. I am trying to
see what is the best enterprise solution to search text fields. I
could use match() AGAINST fields and do full text searching...?
Jul 20 '05 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

4
by: Michi | last post by:
I was wondering what the best solution is for making large numbers of TEXT (or BLOB?) fields searchable. For example, if I have a forum, what is the best way to be able to search for specific...
0
by: Chris Chandler | last post by:
Hello I am developing a web application that uses full text searching quite extensively. This is ideal where I am searching a large number of large varchar fields for a set of key words. My...
3
by: Stewart Allen | last post by:
Hi there I'm trying to find part serial numbers between 2 numbers. The user selects a part number from a combo box and then enters a range of serial numbers into 2 text boxes and the resulting...
3
by: Paul H | last post by:
I have a text file that contains the following: ******************** __StartCustomerID_41 Name: Fred Smith Address: 57 Pew Road Croydon
3
by: ANDY AIYER | last post by:
Guru's! Your time and guidance is much appreciated in this task that i am trying to get done. Background I have a SQL Server 2000 database table which contains 2 Fields (RecordID, XMLData...
2
by: Spencer | last post by:
We have z/OS and DB2 v7. I have various fields on tables that I need to perform searching through using the LIKE operator and wildcards such as %. Currently on our million row plus tables the...
7
by: john | last post by:
In my form I have a master table and a details table linked 1xM. I can search through the whole parent table but I also like to be able to search through the child table fields to find parent...
4
by: dodjem | last post by:
Hi all, I am facing an issue which I really don't know how to solve after googling for quite some time. I have 2 tables: one is for my new articles and one is for the website content; I would...
0
by: Kassimu | last post by:
Hi guys out there, There is this database Iam creating, I have a table with 40 fields among which there are Date/time, Text, Number, Memo and Yes/No fields and I have created the form bound to that...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.