473,699 Members | 2,008 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Any Fuzzy Software ( Other than SOUNDEX / NYSIIS )?

Hi All
We are using soundex (and later tried Nysiis) for fuzzy name search
software. But we faced a lot of problems the search accuracy was not very
good also we saw a lot of misses of relevant names.

There are many problems other than precision and accuracy, with soundex and
NYSIIS.

e.g.

Look for Smith and it will come-up with around 250 very popular last names.
That dones not help much when a user is searching for "John Smith". Also, it
does not return Creighton for Kryton as the search string.

I googled a little and saw soming called NamiX. Without contacting the
company ( arizcon.com ) directly, I wanted to get feedback from newsgroups
or people who are experts at this. Has anyone used this software? If so, Is
it as good as they claim?

Thanks a million in advance.

Steve Creighton
(please remove .antispam from email address) or post back your answers to
this group

Jul 23 '05 #1
1 3777
Ray
Steve,

This is not really the group for this kind of question but Name match
software or simply text matching software can be reasonably cheap or very
expensive. The best I've used (~$100K price tag for prod software only) was
Ascentials Quality Stage Server. Just so I related this to SQL Server after
all what group is this posted in?

A company I consulted for had a need to match names address pairs to members
in thier SQL DB. They already owned and used Quality Server for other
purposes. They eventually decided to write an extensive TSQL stored
procedure that checked for about 20 different combinations of things like
soundex of last name plus state plus first three letters of street address.
The client was statisfied with the match rate and they could understand it
well. The moral of the story here is if absolute highest accurate match
rate is essential then use a package writen by someone with a PHD in
statistics. If over 99% is all your looking for, write it in something you
can support and maintain.

Ray

"Evaluating Fuzzy SW" <ev*******@hotm ail.com> wrote in message
news:Ze******** *********@bgtns c04-news.ops.worldn et.att.net...
Hi All
We are using soundex (and later tried Nysiis) for fuzzy name search
software. But we faced a lot of problems the search accuracy was not very
good also we saw a lot of misses of relevant names.

There are many problems other than precision and accuracy, with soundex
and
NYSIIS.

e.g.

Look for Smith and it will come-up with around 250 very popular last
names.
That dones not help much when a user is searching for "John Smith". Also,
it
does not return Creighton for Kryton as the search string.

I googled a little and saw soming called NamiX. Without contacting the
company ( arizcon.com ) directly, I wanted to get feedback from newsgroups
or people who are experts at this. Has anyone used this software? If so,
Is
it as good as they claim?

Thanks a million in advance.

Steve Creighton
(please remove .antispam from email address) or post back your answers to
this group

Jul 23 '05 #2

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

6
3342
by: Ricky Romaya | last post by:
Hi, I'm curious about soundex. All I know that it's a way for making spelling- error-tolerant word matching. What I want to know is whether the soundex algorithm are made exclusively for english language, or can it be used for any arbitrary language with satisfactory performance (by 'satisfactory performance' I meant that it can detect at least 80% spelling-errors). What about PHP soundex support? TIA
17
14057
by: Andrew McLean | last post by:
I have a problem that is suspect isn't unusual and I'm looking to see if there is any code available to help. I've Googled without success. Basically, I have two databases containing lists of postal addresses and need to look for matching addresses in the two databases. More precisely, for each address in database A I want to find a single matching address in database B. I'm 90% of the way there, in the sense that I have a simplistic...
2
2256
by: William Morris | last post by:
Apologies in advance for the cross-post. Not sure if this is better handled in ASP code or TransactSQL. Windows2000 Server SQL 2000 tblPeople contactid int firstname varchar(25) lastname varchar(25)
24
2676
by: BBands | last post by:
I have some CDs and have been archiving them on a PC. I wrote a Python script that spans the archive and returns a list of its contents: ...]. I wanted to add a search function to locate all the versions of a particular song. This is harder than you might think. For example the Cajun "national anthem" is Jolie Blond, but it can be spelled several different ways jolie, joli, blon, blond, etc... In addition the various online services that...
24
14409
by: cassetti | last post by:
Here's the issue: I have roughly 20 MS excel spreadsheets, each row contains a record. These records were hand entered by people in call centers. The problem is, there can and are duplicate phone numbers, and emails and addresses even person names. I need to sift through all this data (roughly 300,000+ records and use fuzzy logic to break it down, so that i have only unique records.
32
8596
by: vonclausowitz | last post by:
Hi All, I have database with names on which I want to use the soundex option. So I have created two seperate fields for the Lastname and Firstname in which I save the Soundex version of a new name I save in the database. I have the soundex code with the 6 numeric option. So I save for example in the field LastnameSE = 600192 and in the FirstnameSE = 545910.
14
13488
by: Steve Bergman | last post by:
I'm looking for a module to do fuzzy comparison of strings. I have 2 item master files which are supposed to be identical, but they have thousands of records where the item numbers don't match in various ways. One might include a '-' or have leading zeros, or have a single character missing, or a zero that is typed as a letter 'O'. That kind of thing. These tables currently reside in a mysql database. I was wondering if there is a...
0
2359
by: jojo41300000 | last post by:
Hi all, Is there any ideas how to implement the fuzzy search using soundex in SQL server 2005? Could you please post some suggestions, ideas, or source codes if you have already implemented? Thanks alot. Jason
2
7086
by: Yashgt | last post by:
Hi, We have a table of users with about 10000 rows. We have to import a large text file into this table. While doing so we would like to ensure that if a row already exists, it should not be imported. Rather that True/false logic, we need to apply Fuzzy logic here so that if we find similarities in the name, address, ad a few other columns, we treat the row as a duplicate. Such possibly duplicate rows should be identified and logged...
0
8697
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look ! Part I. Meaning of...
0
9184
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. Here is my compilation command: g++-12 -std=c++20 -Wnarrowing bit_field.cpp Here is the code in...
1
8929
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
0
8891
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
0
7759
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
1
6538
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
4380
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
1
3061
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
2
2357
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.