By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
454,445 Members | 1,329 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 454,445 IT Pros & Developers. It's quick & easy.

ICU Unicode library

P: n/a
Hi
I am studying the ICU library. Has anyone here
any experience with it?

It is a freely available internationalization library
provided by IBM, with full UNICODE 2.0 support.

It provides support for C, and it looks very impressive.
I was wondering if that would be a good replacement
for the string support in C.

jacob
May 30 '07 #1
Share this Question
Share on Google+
6 Replies


P: n/a
On May 30, 6:04 am, jacob navia <j...@jacob.remcomp.frwrote:
I am studying the ICU library. Has anyone here
any experience with it?

It is a freely available internationalization library
provided by IBM, with full UNICODE 2.0 support.

It provides support for C, and it looks very impressive.
I was wondering if that would be a good replacement
for the string support in C.
It duplicates the idiotic '\0' semantics from the C world. As far as
I can tell, its really meant as a replacement for wchar_t based
strings.

--
Paul Hsieh
http://www.pobox.com/~qed/
http://bstring.sf.net/

May 30 '07 #2

P: n/a
we******@gmail.com wrote:
On May 30, 6:04 am, jacob navia <j...@jacob.remcomp.frwrote:
>I am studying the ICU library. Has anyone here
any experience with it?

It is a freely available internationalization library
provided by IBM, with full UNICODE 2.0 support.

It provides support for C, and it looks very impressive.
I was wondering if that would be a good replacement
for the string support in C.

It duplicates the idiotic '\0' semantics from the C world.
It has length delimited strings too.
What is impressive is the Unicode and internationalization support.
What I am thinking is replacing most of the string library
like strlower() or strupper() and several others with calls to that
library.

As far as
I can tell, its really meant as a replacement for wchar_t based
strings.
It supports UTF8, UTF16 and UTF32, what is quite a feat.
May 30 '07 #3

P: n/a
jacob navia wrote:
>
I am studying the ICU library. Has anyone here
any experience with it?

It is a freely available internationalization library
provided by IBM, with full UNICODE 2.0 support.

It provides support for C, and it looks very impressive.
I was wondering if that would be a good replacement
for the string support in C.
This newsgroup discusses the standard C language as defined in the
various C standards and (earlier versions) in K&R. Not
miscellaneous libraries. You are off-topic.

--
<http://www.cs.auckland.ac.nz/~pgut001/pubs/vista_cost.txt>
<http://www.securityfocus.com/columnists/423>
<http://www.aaxnet.com/editor/edit043.html>
<http://kadaitcha.cx/vista/dogsbreakfast/index.html>
cbfalconer at maineline dot net

--
Posted via a free Usenet account from http://www.teranews.com

May 30 '07 #4

P: n/a
in comp.lang.c i read:
[re: icu as in <url:http://www.icu-project.org>]
>full UNICODE 2.0 support.

It provides support for C, and it looks very impressive. I was wondering
if that would be a good replacement for the string support in C.
so only 3 major revisions out of date? sounds like you are setting your
sights fairly low.

also, it forces c code into forms normal for java, and so you might find
some resistance to the changes you would demand of user's code to use your
add-on string library's str_lower function.

that aside, with the right wrappers icu isn't too annoying.

--
a signature
May 31 '07 #5

P: n/a
those who know me have no need of my name wrote:
in comp.lang.c i read:
[re: icu as in <url:http://www.icu-project.org>]
>full UNICODE 2.0 support.

It provides support for C, and it looks very impressive. I was wondering
if that would be a good replacement for the string support in C.

so only 3 major revisions out of date? sounds like you are setting your
sights fairly low.

also, it forces c code into forms normal for java, and so you might find
some resistance to the changes you would demand of user's code to use your
add-on string library's str_lower function.

that aside, with the right wrappers icu isn't too annoying.
I was always thinking in writing wrappers for it. But the idea
of allowing to specify the language where you want str_lower
to work is obviously a good one, since for instance german sz
is uppercase SS, and similar problems.
May 31 '07 #6

P: n/a
On May 30, 7:47 am, jacob navia <j...@jacob.remcomp.frwrote:
websn...@gmail.com wrote:
On May 30, 6:04 am, jacob navia <j...@jacob.remcomp.frwrote:
I am studying the ICU library. Has anyone here
any experience with it?
It is a freely available internationalization library
provided by IBM, with full UNICODE 2.0 support.
It provides support for C, and it looks very impressive.
I was wondering if that would be a good replacement
for the string support in C.
It duplicates the idiotic '\0' semantics from the C world.

It has length delimited strings too.
I was not aware of this. Do you know if they are dynamic or if \U+0
is necessarily treated special? Either way, perhaps I should take
another look at it.
What is impressive is the Unicode and internationalization support.
Well obviously; its a full implementation of Unicode strings.
What I am thinking is replacing most of the string library
like strlower() or strupper() and several others with calls to that
library.
Well sorting strings is where the real problems begin. A different
*kind* of strcmp() function has to be designed. You can't just
replace C's str* functions with ICU. You want to just support it as a
addition to char * strings, and deprecate the whole wchar_t nonsense.
As far as
I can tell, its really meant as a replacement for wchar_t based
strings.

It supports UTF8, UTF16 and UTF32, what is quite a feat.
The transfer formats are a triviality. I'd be slightly more impressed
if they supported UTF-7 and UTF-1 as well.

--
Paul Hsieh
http://www.pobox.com/~qed/
http://bstring.sf.net/

May 31 '07 #7

This discussion thread is closed

Replies have been disabled for this discussion.