Saving XML as UTF-8?

How do I load and save a UTF-8 document in XML in ASP/VBS?
Well, the loading* is not the problem actually -- the file is in UTF-8,
and understood correctly -- but once saved, the UTF-8 is replaced by
what seems to be iso-8859-1 (which Flash doesn't understand, but that's
another problem). Any help greatly appreciated.
* Something like this...
set xDoc = server.createObject("Msxml2.DOMDocument")
xDoc.async = false
xDoc.load sPath
Jul 22 '05 #1


Philipp Lenssen wrote:
How do I load and save a UTF-8 document in XML in ASP/VBS?

Well, the loading* is not the problem actually -- the file is in UTF-8,
and understood correctly -- but once saved, the UTF-8 is replaced by
what seems to be iso-8859-1
* Something like this...
set xDoc = server.createObject("Msxml2.DOMDocument")
xDoc.async = false
xDoc.load sPath


I am pretty sure that if you then use
xDoc.save Server.MapPath(filename)
later, the encoding is preserved.
Are you by chance saving by writing xDoc.xml with the FileSystemObject instead?

The MSXML 4 docs say about the save method:

"Character encoding is based on the encoding attribute in the XML
declaration, such as <?xml version="1.0" encoding="windows-1252"?>. When
no encoding attribute is specified, the default setting is UTF-8."

which supports my view that the encoding the document has when being
loaded is preserved when saving.
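
One way to see which encoding a saved file actually declares is to read the XML declaration straight from its bytes. The following is only a minimal sketch in Python (chosen because it is easy to run outside ASP, not because the thread uses it). `declared_encoding` is a hypothetical helper, and it assumes an ASCII-compatible encoding so that the declaration can be matched as bytes. The fallback to UTF-8 mirrors the default described in the quoted docs.

```python
import re

# Matches an XML declaration and captures the optional encoding attribute.
_DECL = re.compile(
    rb'^<\?xml\s+version\s*=\s*["\'][^"\']*["\']'
    rb'(?:\s+encoding\s*=\s*["\']([A-Za-z][A-Za-z0-9._-]*)["\'])?'
)

UTF8_BOM = b"\xef\xbb\xbf"


def declared_encoding(data: bytes) -> str:
    """Return the encoding named in the XML declaration, or 'UTF-8' if none."""
    if data.startswith(UTF8_BOM):
        data = data[len(UTF8_BOM):]  # skip an optional UTF-8 byte order mark
    match = _DECL.match(data)
    if match and match.group(1):
        return match.group(1).decode("ascii")
    return "UTF-8"  # default when no encoding attribute is present


if __name__ == "__main__":
    print(declared_encoding(b'<?xml version="1.0" encoding="windows-1252"?><a/>'))
    print(declared_encoding(b'<?xml version="1.0"?><a/>'))
```

Reading the original and the saved file in binary mode and passing both to this helper would show whether the declaration changed between load and save.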


--

Martin Honnen
http://JavaScript.FAQTs.com/
Jul 22 '05 #2
Martin Honnen wrote:
Philipp Lenssen wrote:
How do I load and save a UTF-8 document in XML in ASP/VBS?

I am pretty sure if you then use
xDoc.save Server.MapPath(filename)
later then the encoding is preserved.
Are you by chance saving by writing xDoc.xml with the
FileSystemObject?


Thanks so far Martin, this is my save method:

xDoc.save server.mapPath(sPath)

So no, I'm not using the FSO...
Any idea what's happening?

--
Google Blogoscoped
http://blog.outer-court.com
Jul 22 '05 #3


Philipp Lenssen wrote:

Philipp Lenssen wrote:

How do I load and save a UTF-8 document in XML in ASP/VBS?
this is my save method:

xDoc.save server.mapPath(sPath)


You say the file is saved as iso-8859-1, does MSXML really save it with
that encoding and put a
<?xml version="1.0" encoding="iso-8859-1"?>
in there, or why do you think that MSXML saves as iso-8859-1?

--

Martin Honnen
http://JavaScript.FAQTs.com/
Jul 22 '05 #4
Martin Honnen wrote:
Philipp Lenssen wrote:

Philipp Lenssen wrote:
> How do I load and save a UTF-8 document in XML in ASP/VBS?
>

this is my save method:

xDoc.save server.mapPath(sPath)


You say the file is saved as iso-8859-1, does MSXML really save it
with that encoding and put a <?xml version="1.0"
encoding="iso-8859-1"?> in there, or why do you think that MSXML
saves as iso-8859-1?


Let me put it this way. I use my own editor, Netpadd, which doesn't
support UTF-8. I know because whenever I open a UTF-8 file, I see "ï»¿"
as the first characters. So when I want to open UTF-8, I use Notepad.
However, the files that *were* UTF-8 when I put them into the tool
I'm programming (a simple text translation tool) come out looking
"fine" in my non-UTF-8 Netpadd once they are saved. So they lost their
"UTF-8ness" without me saying so in ASP!

Thanks so far, and hope you have more hints!
--
Google Blogoscoped
http://blog.outer-court.com
Jul 22 '05 #5
UTF-8 does not by itself add special characters to the start of a file. If
the files are plain XML, the first non-whitespace character should be "<".
Unicode (UTF-16) files do have a 2-byte byte order mark at the beginning.
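
For reference, the byte order marks can be inspected with Python's standard `codecs` module. This is only a sketch for checking the sizes, and `bom_lengths` and `encode_xml` are hypothetical helper names. The UTF-8 mark is three bytes and optional, while the UTF-16 marks are two bytes; whether a given tool writes one is up to that tool.

```python
import codecs

# Byte order marks as defined in the standard library.
BOMS = {
    "UTF-8": codecs.BOM_UTF8,          # EF BB BF
    "UTF-16 LE": codecs.BOM_UTF16_LE,  # FF FE
    "UTF-16 BE": codecs.BOM_UTF16_BE,  # FE FF
}


def bom_lengths() -> dict:
    """Map each encoding name to the length of its byte order mark in bytes."""
    return {name: len(bom) for name, bom in BOMS.items()}


def encode_xml(text: str, with_bom: bool) -> bytes:
    """Encode text as UTF-8, optionally prefixed with a byte order mark."""
    # "utf-8-sig" writes the BOM on encode; plain "utf-8" never does.
    return text.encode("utf-8-sig" if with_bom else "utf-8")


if __name__ == "__main__":
    for name, size in bom_lengths().items():
        print(name, size)
    print(encode_xml("<a/>", with_bom=False)[:1])
    print(encode_xml("<a/>", with_bom=True)[:3])
```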

What operating system are you running when you open the files in Notepad? The
version of Notepad included with NT, Win2000, and WinXP Pro is capable of
saving files in ANSI, Unicode, or UTF-8.

How are you opening the files from the ASP script? If possible show the
simplest *working* code (just read and then write the file) that duplicates
the problem along with a sample XML file.
--
--Mark Schupp
Head of Development
Integrity eLearning
www.ielearning.com
Jul 22 '05 #6


Philipp Lenssen wrote:
Martin Honnen wrote:
You say the file is saved as iso-8859-1, does MSXML really save it
with that encoding and put a <?xml version="1.0"
encoding="iso-8859-1"?> in there, or why do you think that MSXML
saves as iso-8859-1?


Let me put it this way. I use my own editor, Netpadd, which doesn't
support UTF-8. I know because whenever I open a UTF-8 file, I see "ï»¿"
as the first characters. So when I want to open UTF-8, I use Notepad.
However, the files that *were* UTF-8 when I put them into the tool
I'm programming (a simple text translation tool) come out looking
"fine" in my non-UTF-8 Netpadd once they are saved. So they lost their
"UTF-8ness" without me saying so in ASP!


Frankly, using a tool that doesn't understand UTF-8 to check whether a
file is UTF-8 encoded doesn't sound reliable. What you see might simply
be a byte order mark at the beginning of the file, and that mark is
optional in UTF-8.
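
A short Python sketch of why a non-UTF-8 editor is a weak test: it decodes UTF-8 bytes as ISO-8859-1, the way such an editor presumably would. `view_as_latin1` and `has_utf8_bom` are hypothetical helpers, and nothing here is taken from the actual files discussed in this thread.

```python
UTF8_BOM = b"\xef\xbb\xbf"


def view_as_latin1(data: bytes) -> str:
    """Show bytes the way an ISO-8859-1-only editor would display them."""
    return data.decode("iso-8859-1")  # every byte maps to exactly one character


def has_utf8_bom(data: bytes) -> bool:
    """Report whether the data starts with the UTF-8 byte order mark."""
    return data.startswith(UTF8_BOM)
```

Passing `UTF8_BOM + b"<a/>"` to `view_as_latin1` yields three stray characters (ï»¿) in front of the markup, whereas the same markup without the mark comes back unchanged. So a file that contains only ASCII characters and merely lost its byte order mark would look "fine" in such an editor while still being valid UTF-8. A non-ASCII character such as "é" would instead show up as two wrong characters ("Ã©").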

I don't really know how to help with that. I would use an XML parser to
check whether the file is properly encoded; simply loading the file in
IE/Win should do to check that.
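
If IE is not at hand, a similar check can be sketched in Python with the standard library: decode the bytes strictly as UTF-8, then let an XML parser confirm the document is well-formed. `check_utf8_xml` is a hypothetical name, and this is a rough diagnostic under those assumptions rather than a full validator.

```python
import xml.etree.ElementTree as ET


def check_utf8_xml(data: bytes) -> str:
    """Classify bytes as 'ok', 'not utf-8', or 'not well-formed'."""
    try:
        data.decode("utf-8")  # strict by default: invalid sequences raise
    except UnicodeDecodeError:
        return "not utf-8"
    try:
        ET.fromstring(data)  # raises ParseError on malformed XML
    except ET.ParseError:
        return "not well-formed"
    return "ok"
```

A file that was really written as ISO-8859-1 and contains accented characters fails the first step, because a lone byte such as 0xE9 is not a valid UTF-8 sequence. A pure-ASCII file passes either way, so it cannot distinguish the two encodings.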

If you have the application online, post a URL (or better two: one to
the original, one to the saved XML) so that someone here can check
whether what you get is really UTF-8 or ISO-8859-1.

--

Martin Honnen
http://JavaScript.FAQTs.com/
Jul 22 '05 #7
Martin Honnen wrote:
Philipp Lenssen wrote:

If you have the application online, post a URL (or better two: one
to the original, one to the saved XML) so that someone here can
check whether what you get is really UTF-8 or ISO-8859-1.


It's already solved, IIRC I posted this here already.

--
Google Blogoscoped
http://blog.outer-court.com
Jul 22 '05 #8
