Chung Leong wrote:[color=blue]
> Some guy was working on creating a PHP extension for the ICU library. I
> don't know what the status is at this point. Worth googling.[/color]
Can't help much on collation currently, but I've written a partial PHP
extension wrapper for ICU's normalizer and a pure-PHP equivalent for
validation and normalization of UTF-8 input. It's under GPL license and
is bundled with MediaWiki 1.4 (
www.mediawiki.org)
I ended up writing that rather than trying to wade through the full ICU
extension; I had a hard enough time trying to track it down, and didn't
want to rely on ICU and a custom PHP extension as they aren't always
available.
[color=blue]
> If MySQL doesn't support Unicode collation, I guess there's no much you can
> do. Perhaps time to consider a commercial database? I know MSSQL can handle
> sorting in a number of languages. Oracle can too probably.[/color]
MySQL 4.1 and higher do have UTF-8 support. I don't know how well it
works at this stage or whether the collation support is suitable for the
original poster's needs.
-- brion vibber (brion @ pobox.com)