Connecting Tech Pros Worldwide Forums | Help | Site Map

Python / Chinese Encodings

Achim Domma
Guest
 
Posts: n/a
#1: Jul 18 '05
Hi,

I need to convert Big5 or GB encoded chinese strings to unicode. It would
be also nice to be able to detect the encoding of the original string.
Search with groups.google.com I found some links to different projects but
they all look not very active. Can somebody give me a short overview of the
status of processing chinese texts with python?

regards,
Achim



Martin v. Löwis
Guest
 
Posts: n/a
#2: Jul 18 '05

re: Python / Chinese Encodings


"Achim Domma" <domma@procoders.net> writes:
[color=blue]
> I need to convert Big5 or GB encoded chinese strings to unicode. It would
> be also nice to be able to detect the encoding of the original string.
> Search with groups.google.com I found some links to different projects but
> they all look not very active. Can somebody give me a short overview of the
> status of processing chinese texts with python?[/color]

The very short summary: Use the CJK codecs package; it supports all
encodings you might encounter, and it is actively maintained.

As for detecting the encoding of the original string: Forget it. Tell
your communication partners to always properly declare the encoding.

Regards,
Martin

Closed Thread