Andy Hassall wrote:
Soundex is for English words, based on English pronunciation rules.
See: http://en.wikipedia.org/wiki/Soundex
You *can* of course cook up your own Soundex-functions with values
created based on other languages the algorithm is very easy. For some
languages it might be rather easy, but possibly not worth the effort;
though the original algorithm is for english, it will work "quite
well" for many other languages too.
It's worthwhile to note that soundex (and similar functions) only work
for individual words, and that by using it you aren't supposed to
detect spelling errors. The best use for soundex is when you're
searching for names, addresses or the like and don't know how it is
actually written, but know what it sounds like - you can have the
soundex values stored in the database with other data and when you do
a search, you first look for the exact string the user entered. If
this doesn't return enough results, you count the soundex value for
the user input and try with that. This way you get results that "sound
same" ... so they're propably close to what you really were looking
for. I think a similar approach is used on the search engine at
www.php.net (I can't be certain though, but it seems like that - see
http://fi.php.net/manual-lookup.php?pattern=sundeks for example:)
--
Markku Uttula