Help | Site Map
Connecting Tech Pros Worldwide
 
 
LinkBack Thread Tools
  #1  
Old July 26th, 2006, 01:45 PM
stil
Guest
 
Posts: n/a
Default xerces and UTF-16 encoding

hi,

i've got a little problem with encoding in UTF_16.
i create my document, giving him to my writer to write in a string,
when i set an encoding in UTF-8:

DOMWriter* L_poWriter = L_poImplementation->createDOMWriter();
XMLCh* L_poEncoding = XMLString::transcode( "UTF-8" );
L_poWriter->setEncoding(L_poEncoding);

and then write and get the result:

L_poWriter->writeNode(L_poTarget, *P_poNode);

// WARNING
// using reinterpret cast because the other cast didn't work:
// one day, it's will explode
P_rsResult =
string(
reinterpret_cast<const char*>(
L_poTarget->getRawBuffer()
)
);

the resulting string is valid and contains my Xml.
when i do the same except i encode in UTF-16 the resulting string is
empty

is use xerces v.2.4.0-1.7.0
if you have any idea of what append ?

thanks in advance.
stil

  #2  
Old July 26th, 2006, 03:45 PM
Andreas Prilop
Guest
 
Posts: n/a
Default Re: xerces and UTF-16 encoding

On 26 Jul 2006, stil wrote:
Quote:
Organization: http://groups.google.com
>
i've got a little problem with encoding in UTF_16.
You are funny! You are posting over Google's Usenet interface
and Google itself cannot read UTF-16. Most hits from
http://www.google.com/search?q=%22UTF+1+6%22
are webpages encoded in UTF-16, but not recognized by Google.
Go to the "Cached" versions!

Why do you want UTF-16 in the first place? UTF-16 and UTF-32
are no good for markup languages with ASCII markup.
UTF-8 is much better suited for XML and HTML.
http://ppewww.ph.gla.ac.uk/~flavell/...checklist.html

  #3  
Old July 26th, 2006, 05:05 PM
Richard Tobin
Guest
 
Posts: n/a
Default Re: xerces and UTF-16 encoding

In article <1153918203.562491.137650@p79g2000cwp.googlegroups .com>,
stil <daligny@gmail.comwrote:
Quote:
>the resulting string is valid and contains my Xml.
>when i do the same except i encode in UTF-16 the resulting string is
>empty
I'm not a C++ programmer, but are you using a char * string to store
your UTF-16 result? If so, it will probably look empty (or just have
one character) because each character will occupy two chars and either
the first of second char will be zero.

-- Richard
  #4  
Old July 26th, 2006, 05:25 PM
stil
Guest
 
Posts: n/a
Default Re: xerces and UTF-16 encoding

you're correct, when i check the resulting raw buffer, it contains the
value but the translation into a string lost the result as the first
character is a \0.

stil

ps: i'm going to try a wstring, i hope it will be better ;)

Richard Tobin a écrit :
Quote:
In article <1153918203.562491.137650@p79g2000cwp.googlegroups .com>,
stil <daligny@gmail.comwrote:
>
Quote:
the resulting string is valid and contains my Xml.
when i do the same except i encode in UTF-16 the resulting string is
empty
>
I'm not a C++ programmer, but are you using a char * string to store
your UTF-16 result? If so, it will probably look empty (or just have
one character) because each character will occupy two chars and either
the first of second char will be zero.

-- Richard
 

Bookmarks


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are Off
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

What is Bytes?

We are a network of experts and professionals in IT and software development that help one another with answers to tough questions and share insights. Get the best answers to your questions from over network members.
Post your question now . . .
It's fast and it's free

Popular Articles