473,480 Members | 2,094 Online
Bytes | Software Development & Data Engineering Community
Create Post

Home Posts Topics Members FAQ

spelling errors in query terms

Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

Anyone know what I'm talking about ?

Cheers,
Joel


__________________________________________________ ___________________
Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postgresql.org so that your
message can get through to the mailing list cleanly

Nov 12 '05 #1
5 1819
Hi Joel,

If your using java, and full text, fuzzy searching is an important part of
your app, you might want to check out Lucene. I evaluated it for a
project, and it was very good (the project was a fuzzy search of huge
pages of text).

http://jakarta.apache.org/lucene/docs/index.html
On Mon, 8 Dec 2003, Joel Rodrigues wrote:
Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.

Anyone know what I'm talking about ?

Cheers,
Joel


__________________________________________________ ___________________
Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postgresql.org so that your
message can get through to the mailing list cleanly

---------------------------(end of broadcast)---------------------------
TIP 1: subscribe and unsubscribe commands go to ma*******@postgresql.org

Nov 12 '05 #2
On Mon, 8 Dec 2003, Joel Rodrigues wrote:
Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.
You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
which uses trigram.


Anyone know what I'm talking about ?

Cheers,
Joel


__________________________________________________ ___________________
Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postgresql.org so that your
message can get through to the mailing list cleanly


Regards,
Oleg
__________________________________________________ ___________
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83

---------------------------(end of broadcast)---------------------------
TIP 8: explain analyze is your friend

Nov 12 '05 #3
Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

Also, there's something wrong with 'trgm.tgz'. This is what happens if I
try to extract it's contents :

gzip: stdin is encrypted -- get newer version of gzip
tar: End of archive volume 1 reached
tar: Sorry, unable to determine archive format.
Cheers ! - Joel
On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:
On Mon, 8 Dec 2003, Joel Rodrigues wrote:
Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.


You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
which uses trigram.


__________________________________________________ ___________________
Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
---------------------------(end of broadcast)---------------------------
TIP 5: Have you checked our extensive FAQ?

http://www.postgresql.org/docs/faqs/FAQ.html

Nov 12 '05 #4
Thanks Oleg, I'll take a look at it. Too bad there is no documentation.

Also, there's something wrong with 'trgm.tgz'. This is what happens if I
try to extract it's contents :

gzip: stdin is encrypted -- get newer version of gzip
tar: End of archive volume 1 reached
tar: Sorry, unable to determine archive format.
Cheers ! - Joel
On Monday, December 8, 2003, at 05:39 , Oleg Bartunov wrote:
On Mon, 8 Dec 2003, Joel Rodrigues wrote:
Hi,

I seem to recall once coming across & using functionality in PostgreSQL
that allowed for some flexibility in the spelling of a query term. For
example, if one meant to look for 'Honda', but typed in 'Zonda'. There
was even a 'looseness' factor of sorts available. I've spent a lot of
time trying to find it in the docs and various articles & tutorials on
my hard drive, even an hour on Google in vain. It was not
contrib/fuzzystrmatch.


You might try http://www.sai.msu.su/~megera/postgres/gist/trgm/
which uses trigram.


__________________________________________________ ___________________
Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
---------------------------(end of broadcast)---------------------------
TIP 2: you can get off all lists at once with the unregister command
(send "unregister YourEmailAddressHere" to ma*******@postgresql.org)

Nov 12 '05 #5
Thanks for the tip. However I use Python with psycopg to interface with
PostgreSQL, so unfortunately it's not an option.

It's really strange, I could swear I used a function within PostgreSQL
that could handle small spelling errors in the query term.

Cheers,
Joel

On Monday, December 8, 2003, at 11:34 , Craig O'Shannessy wrote:
Hi Joel,

If your using java, and full text, fuzzy searching is an important part
of
your app, you might want to check out Lucene. I evaluated it for a
project, and it was very good (the project was a fuzzy search of huge
pages of text).

http://jakarta.apache.org/lucene/docs/index.html


__________________________________________________ ___________________
Envie de discuter en "live" avec vos amis ? Télécharger MSN Messenger
http://www.ifrance.com/_reloc/m la 1ère messagerie instantanée de France
---------------------------(end of broadcast)---------------------------
TIP 8: explain analyze is your friend

Nov 12 '05 #6

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
4269
by: Paul | last post by:
Assume you have two varchar (or Text) columns named L and U which are identical except that the charset for L is latin1 and the charset for U is utf8. All the records in L and U are identical in...
2
2574
by: Kavitha Rao | last post by:
Hi, I am getting the following errors while trying to run this snippet in Microsoft Visual C++.Can't seem to print the crc value stored. /* +++Date last modified: 05-Jul-1997 */ /* Crc - 32...
2
5759
by: Smartin | last post by:
Using Access 97 I am trying to create a form that allows for user entry of search terms and displays a table of results. I thought I could accomplish this using a subform but it isn't quite...
3
1672
by: Jim in Arizona | last post by:
I'm going insane! I don't know if it's just that the .net 2.0 framework is buggy or if it really is my code. This is pretty hard to explain since I can't even begin to nail down why this is...
53
3907
by: KraftDiner | last post by:
I've spent hours trying to find a bug that was a simple spelling mistake. in an init method I declare a variable self.someLongName later in a different method of the class I use...
2
1775
by: Arnold | last post by:
Greetings Gurus, In a report showing the names of students and their progress, I am getting an error in the name field (Name: #Error). The report gets its data from an unbound form containing...
3
3432
by: Marc | last post by:
Hi everyone, I'm trying to build a Thesaurus. In my thesaurus i've created the following tables: terms, btnt (broader terms/narrower terms), rt (related terms) The structures of the tables...
1
3027
by: Don Li | last post by:
Hi, Env: MS SQL Server 2000 DB Info (sorry no DDL nor sample data): tblA has 147249 rows -- clustered index on pk (one key of datatype(int)) and has two clumns, both are being used in joins;...
15
3225
by: Lawrence Krubner | last post by:
Does anything about this script look expensive, in terms of resources or execution time? This script dies after processing about 20 or 25 numbers, yet it leaves no errors in the error logs. This is...
0
7051
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
6915
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
7054
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
1
6750
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
6993
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
4493
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...
0
3003
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The...
1
567
muto222
php
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
0
193
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.