472,374 Members | 1,376 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 472,374 software developers and data experts.

which unicode encoding to use for saving ?

THY
Hi,

I am developing a website in english & chinese both language. whenever I
save, it required I set the encoding in advanced save options. But I found
there are 4 related to unicode, can anyone tell me what's the different and
which to choose ?

Unicode (UTF-8 without signature) - Codepage 65001
Unicode (UTF-8 with signature) - Codepage 65001
Unicode - Codepage 1200
Unicode (Big-Endian) - Codepage 1201

thanks,
Tee
Nov 17 '05 #1
2 2552
"THY" wrote:
Hi,

I am developing a website in english & chinese both language.
whenever I save, it required I set the encoding in advanced save
options. But I found there are 4 related to unicode, can anyone tell
me what's the different and which to choose ?

Unicode (UTF-8 without signature) - Codepage 65001
Unicode (UTF-8 with signature) - Codepage 65001
Unicode - Codepage 1200
Unicode (Big-Endian) - Codepage 1201


Check out this very nice article:

http://www.joelonsoftware.com/articles/Unicode.html

Cheers,
--
Joerg Jooss
jo*********@gmx.net
Nov 17 '05 #2
"Joerg Jooss" <jo*********@gmx.net> wrote in message
news:ep**************@TK2MSFTNGP12.phx.gbl...

Check out this very nice article:

http://www.joelonsoftware.com/articles/Unicode.html


That is, indeed, a very nice article. It does have one problem, however. It
implies that using UTF-8 for all web pages is an OK thing to do because most
browsers have UTF-8 support. This is true, but UTF-8 causes huge bloat in
the byte count for some languages. Chinese is a great example. In my opinion
page size still matters, and you can greatly optimize the page size in many
cases if you customize the encoding to match the primary language of the
page.

For example, a typical block of Chinese text will take three times as much
space in UTF-8 as it will using Big5. Characters that don't exist in Big5
can be encoded as &# entities. Browsers that people use to read Chinese are
very likely to support Big5, so in my opinion you should use Big5 encoding
for Chinese pages. ASP.NET makes this very easy to do. This conserves
Internet bandwidth, saves space in proxy servers, saves space in your local
cache, reduces download times for those unfortunate modem and ISDN users,
etc.

In web pages there's going to be a lot of ASCII characters (HTML tags and so
forth) mixed in with the Chinese, so your actual savings will be less than
3-to-1, but for the bulk of Chinese content pages there will be a
significant savings. I'm just using Chinese as an example--pick any
non-European language and the result will likely be similar.

Nov 17 '05 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

15
by: | last post by:
The data file is a simple Unicode file with lines of text. BCP apparently doesn't guarantee this ordering, and neither does the import tool. I want to be able to load the data either sequentially...
10
by: Nikolay Petrov | last post by:
How can I convert DOS cyrillic text to Unicode
40
by: apprentice | last post by:
Hello, I'm writing an class library that I imagine people from different countries might be interested in using, so I'm considering what needs to be provided to support foreign languages,...
3
by: ravan1234 | last post by:
Hello Guys, I am having some problem with Unicode string in KATAKANA format. I am decoding string and then storing in db (SYbase). Also while retrieving data I am encoding and then displaying....
2
by: Kemmylinns12 | last post by:
Blockchain technology has emerged as a transformative force in the business world, offering unprecedented opportunities for innovation and efficiency. While initially associated with cryptocurrencies...
0
by: Naresh1 | last post by:
What is WebLogic Admin Training? WebLogic Admin Training is a specialized program designed to equip individuals with the skills and knowledge required to effectively administer and manage Oracle...
0
hi
by: WisdomUfot | last post by:
It's an interesting question you've got about how Gmail hides the HTTP referrer when a link in an email is clicked. While I don't have the specific technical details, Gmail likely implements measures...
0
Oralloy
by: Oralloy | last post by:
Hello Folks, I am trying to hook up a CPU which I designed using SystemC to I/O pins on an FPGA. My problem (spelled failure) is with the synthesis of my design into a bitstream, not the C++...
0
by: Carina712 | last post by:
Setting background colors for Excel documents can help to improve the visual appeal of the document and make it easier to read and understand. Background colors can be used to highlight important...
0
by: Rahul1995seven | last post by:
Introduction: In the realm of programming languages, Python has emerged as a powerhouse. With its simplicity, versatility, and robustness, Python has gained popularity among beginners and experts...
2
by: Ricardo de Mila | last post by:
Dear people, good afternoon... I have a form in msAccess with lots of controls and a specific routine must be triggered if the mouse_down event happens in any control. Than I need to discover what...
0
by: jack2019x | last post by:
hello, Is there code or static lib for hook swapchain present? I wanna hook dxgi swapchain present for dx11 and dx9.
0
DizelArs
by: DizelArs | last post by:
Hi all) Faced with a problem, element.click() event doesn't work in Safari browser. Tried various tricks like emulating touch event through a function: let clickEvent = new Event('click', {...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.