Python / Chinese Encodings

Hi,

I need to convert Big5- or GB-encoded Chinese strings to Unicode. It would
also be nice to be able to detect the encoding of the original string.
Searching groups.google.com, I found links to several projects, but
none of them looks very active. Can somebody give me a short overview of the
status of processing Chinese text with Python?

regards,
Achim
Jul 18 '05 #1
1 Reply
"Achim Domma" <do***@procoders.net> writes:
I need to convert Big5- or GB-encoded Chinese strings to Unicode. It would
also be nice to be able to detect the encoding of the original string.
Searching groups.google.com, I found links to several projects, but
none of them looks very active. Can somebody give me a short overview of the
status of processing Chinese text with Python?


The very short summary: Use the CJK codecs package; it supports all
encodings you might encounter, and it is actively maintained.
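
For illustration, here is a minimal sketch of the conversion itself. It assumes a Python version in which the Big5 and GB codecs are available under the names "big5" and "gb2312" (as far as I know, the CJK codecs were merged into the standard library in Python 2.4, so no separate install is needed there). The helper name to_unicode and the sample string are made up for this example.

```python
# Sketch: decode Big5 / GB bytes into Unicode text.
# Assumption: the codec names "big5" and "gb2312" are registered,
# which is the case once the CJK codecs are installed or built in.

def to_unicode(raw, declared_encoding):
    """Decode raw bytes using the encoding the sender declared."""
    return raw.decode(declared_encoding)

# Hypothetical sample text: the two characters for "Chinese (language)".
sample = u"\u4e2d\u6587"

# Simulate input arriving in each legacy encoding.
big5_bytes = sample.encode("big5")
gb_bytes = sample.encode("gb2312")

# The same text comes back from both, although the bytes differ.
text_from_big5 = to_unicode(big5_bytes, "big5")
text_from_gb = to_unicode(gb_bytes, "gb2312")
```

The encoding is passed in explicitly instead of being guessed, which matches the advice about declaring it.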

As for detecting the encoding of the original string: Forget it. Tell
your communication partners to always properly declare the encoding.
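
To make the detection problem concrete, here is a rough sketch that simply tries several candidate encodings and reports which ones decode the bytes without raising an error. The function name decodable_as and the candidate list are assumptions made for this example, not part of any package. More than one candidate usually succeeds, so a clean decode alone does not tell you which encoding was actually used.

```python
# Sketch: show that "it decodes without error" is not proof of the encoding.
# Assumption: the listed codec names are available in your Python install.

def decodable_as(raw, candidates=("big5", "gb2312", "gbk", "gb18030")):
    """Return the candidate encodings under which raw decodes without error."""
    accepted = []
    for name in candidates:
        try:
            raw.decode(name)
        except UnicodeDecodeError:
            continue  # this candidate is ruled out
        accepted.append(name)
    return accepted

# Hypothetical input: text that was really encoded as GB2312.
gb_bytes = u"\u4e2d\u6587".encode("gb2312")
matches = decodable_as(gb_bytes)

# Plain ASCII is valid in every candidate, so it says nothing at all.
ascii_matches = decodable_as(b"hello")
```

Printing matches for real-world data typically shows several names, and the decoded text can differ between them, which is why an explicit declaration from the sender is the only dependable source.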

Regards,
Martin

Jul 18 '05 #2
