473,385 Members | 1,798 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,385 software developers and data experts.

What are character sets?

All I know is that there are 8 bit numbers from 0 to 255 mapped to
characters like A, B, C, D and some strange looking ones (like the ones
used to make boxes in old PC text modes) all these being called ASCII
characters.

So, what in a nutshell does "character sets" mean and how are they
helpful in displaying multiple languages ?
Any simple explanation?

Mike

Jul 23 '05 #1
4 1378
siliconmike wrote:
All I know is that there are 8 bit numbers from 0 to 255 mapped to
characters like A, B, C, D and some strange looking ones (like the ones
used to make boxes in old PC text modes) all these being called ASCII
characters.

So, what in a nutshell does "character sets" mean and how are they
helpful in displaying multiple languages ?
Any simple explanation?

Mike

Someone else here might correct me but I'm pretty sure I know... and
you're near enough correct.

A standard character set (at least where the western world is concerned)
uses values 1 to 127 for the characters on your keyboard. An example is
A is 65, B is 66, C is 67 and so on... a (lowercase A) is 97... b is 98
and so on... All the numbers, alphabetic characters (upper/lower case)
and the characters above your numbers (on a QWERTY keyboard) are, among
a few others, included in this character set.

In order to allow for 'foreign' character sets, there is room for
additional characters from 128 to 255. (By foreign I don't mean to
offend anyone but I cannot think of any other word forit - sorry). This
128-255 in the character set I believe is used for characters that
don't follow what I've mentioned. For example, some of the Nordic
countries in Europe I believe have a letter that looks like a numeric
zero (I think they have an O with a slash that goes through it).
France, Spain and Germany are a few too which have dashes, apostraphes
or carats (^) above some of their characters. In order to allow for
this, they would have a value stored somewhere from 128 to 255 in the
appropriate character set. If you include the extra dashes, apostraphes
or whatever above some alphabets in foreign countries as being seperate
characters then you could *almost* say that some of these countries have
more than 26 letters in their alphabet...

Does this help / answer your question?

randelld
Jul 23 '05 #2
yes, then how to make sure that a field of a column called
"description" could contain text in any global language ?

Should I have an extra column called "char-set"
and another extra column called "language" ?
to complement the "description" column ?

Any other mechanisms?

Jul 23 '05 #3
siliconmike wrote:
yes, then how to make sure that a field of a column called
"description" could contain text in any global language ?


Read about Unicode. MySQL 4.1 supports this character set. I'm no
expert, but it appear that Unicode allows you to store different
language encodings in one column.

Overview of Unicode:
http://en.wikipedia.org/wiki/Unicode

Unicode support in MySQL 4.1:
http://dev.mysql.com/doc/mysql/en/charset-unicode.html
(read the example provided in the user comments)

Regards,
Bill K.
Jul 23 '05 #4
Unicode solves it. Thank you.

Mike

Jul 23 '05 #5

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

7
by: WindAndWaves | last post by:
Hi Folk Here I am writing my first php / mysql site, almost ready, and now this... charactersets.... The encoding that I use on my webpage is: <META HTTP-EQUIV="content-type"...
0
by: Thiko | last post by:
Hi According to the official mysql manual: http://www.mysql.com/doc/en/Charset-SHOW-CHARSET.html The syntax to show all available character sets is the SHOW CHARACTER SET command. It takes...
5
by: PEK | last post by:
I need some code that convert a multi-byte string to a Unicode string, and Unicode to multi-byte. I work mostly in Windows and know how to solve it there, but I would like to have some platform...
1
by: Vishal | last post by:
Hello! My client has a need to be able to store Japanese characters in their PeopleSoft database. So we need to change the character set from from Latin1_General (1252) to Japanese character set...
37
by: chandy | last post by:
Hi, I have an Html document that declares that it uses the utf-8 character set. As this document is editable via a web interface I need to make sure than high-ascii characters that may be...
64
by: ng5000 | last post by:
Hi, What's the point of a signed char? As I see it a char represents a character (not an integer, use an int type e.g. short int if you want an 8 bit number, or one of the new types, uint8 I...
21
by: aegis | last post by:
7.4#1 states The header <ctype.h> declares several functions useful for classifying and mapping characters.166) In all cases the argument is an int, the value of which shall be representable as an...
12
by: semut | last post by:
Given that the string is of null terminated type. What could be the possible causes (by experience) the string to have no null terminated and cause buffer overflow later. I know it is quite broad,...
0
by: peridian | last post by:
Hi, Can I do a bulk "find and replace" on data in SQL Server to convert character set data? I have data coming in from multiple external sources. That data is not always in UTF-8 or ASCII...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.