467,166 Members | 1,069 Online
Bytes | Developer Community
Ask Question

Home New Posts Topics Members FAQ

Post your question to a community of 467,166 developers. It's quick & easy.

Help filling in missing symbol representations?

SwissProgrammer
128KB
What are the missing [] parts in the following?

Thank you.



I have been working on writing a converter in C++ (not Visual Studio. Not .net) for Unicode. I am currently building a reference to test my program's results against it.

I have put together the following, but it is missing some parts and I am not certain if it is all correct as it is.

Therefore, please help me fill in the missing [] parts. Then I can run my program later and put in "A" or "办" and check against this table to see if my program returns the correct information.





For example:
I see that for "办" I have found
UTF-16 = [10011110 01010010]
UTF-16 = [\u529e] or [529e]

I looked at "A" and am not certain how to have the two types of representations there. Please tell me where that is to be found on the internet so that I may reference it and verify that I am using the correct information.




Thank you.


This is what I have so far:



[Symbol] to [Binary or UTF-8 or UTF-16 or etc.]


For the symbol "A"
Binary = [01000001]
UTF-8 Hexadecimal = [41]
Unicode Code Point = [41]

UTF-8 = [41]
UTF-8 = [] ?

UTF-16 = [01000001 00000000]
UTF-16 = [] ?

UTF-32 = [] ?
UTF-32 = [] ?

Decimal = [00065]
HTML = [A]
url-encoded Unicode = [A]
base64 = [QQ==]
data url = [data:text/plain;charset=utf-8;base64,QQ==]
bytes = [] ?
string literal = [\u0041]





For the symbol "办"
Binary = [101001010011110]
UTF-8 Hexadecimal = [529e] = [\xe5\x8a\x9e]
Unicode Code Point = [529e]

UTF-8 Binary = [11100101 10001010 10011110]
UTF-8 Hexadecimal = [\xe5\x8a\x9e] or [ e5 8a 9e] or [529e]

UTF-16 = [10011110 01010010]
UTF-16 = [\u529e] or [529e]

UTF-32 = [11100101 10001010 10011110]
UTF-32 = [0000529e]

Decimal = [21150]
url-encoded Unicode = [%e5%8a%9e]
base64 = [5Yqe]
HTML = [办]
data url = [data:text/plain;charset=utf-8;base64,5Yqe]
bytes = [0xe5 0x8a 0x9e]
string literal = [\u529e]
Dec 19 '20 #1

✓ answered by SioSio

If your PC OS is Windows 10, you can use the calculator's programmer mode to enter the binary, hexadecimal, and decimal numbers in this table.

You can check the character code at the following site.
https://www.compart.com/fr/unicode/U+529E

  • viewed: 2091
Share:
2 Replies
128KB
If your PC OS is Windows 10, you can use the calculator's programmer mode to enter the binary, hexadecimal, and decimal numbers in this table.

You can check the character code at the following site.
https://www.compart.com/fr/unicode/U+529E
4 Weeks Ago #2
SwissProgrammer
128KB
Thank you SioSio,

From various referenced, I have received different values, and I do not know which is correct.


For Unicode code point = [529E]


Should the following be called "UTF-8 Hexadecimal" or simply "UTF-8" or simply "Hexidecimal"?
UTF-8 Hexadecimal = [\xe5\x8a\x9E]


Or, should it be like this?
UTF-8 Hexadecimal = [529E]



Should the following be called "UTF-8 Binary" or simply "Binary"?
UTF-8 Binary = [11100101 10001010 10011110]



Is this labeled correctly?
UTF-8 Encoding = [\xe5 \x8a \x9E]


I want to make certain that my wording is correct.


Thank you SioSio.
4 Weeks Ago #3

Post your reply

Sign in to post your reply or Sign up for a free account.

Similar topics

reply views Thread by Michael Wang | last post: by
1 post views Thread by Warwick | last post: by
2 posts views Thread by Peter Smithson | last post: by
By using this site, you agree to our Privacy Policy and Terms of Use.