473,787 Members | 2,928 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Opera guesses encoding for "applicatio n/xml"

Hi!

Okay, so positions on "text/html" XHTML are totally contradicting. Anyway!
I hope there's more consensus about "applicatio n/xml" XHTML.

I've recently learned that Opera 9.0b2 does not only evaluate HTTP header,
BOM and XML declaration to determine the character encoding of an XHTML
document sent as "applicatio n/xml", but also the "meta" element. For
example, <http://schneegans.de/sv/test-cases/?case=meta-only-encoding> is
rendered as "Česká republika". In contrast, Firefox displays "?esk?
republika", and IE even aborts parsing.

If you agree with me and think that this behavior is wrong, you might
want to post a follow-up to <news:op******* ***********@new s.opera.com>.

--
All free men, wherever they may live, are citizens of Denmark. And
therefore, as a free man, I take pride in the words "Jeg er dansker!"
Jun 8 '06
43 3176
Henri Sivonen wrote:
Omitting whitespace between attribute specifications is almost always
a typo, not a deliberate decision.


But with HTML it is harmless--like making the typo of putting two spaces
instead of one between attributes.


It's definitely harmless for HTML parsers. But what about tag soup
parsers? I will put a document containing

<a href="foo"class ="bar">

somewhere and check if spiders request 'foo"class="bar ' or similar.

--
All free men, wherever they may live, are citizens of Denmark. And
therefore, as a free man, I take pride in the words "Jeg er dansker!"
Jun 10 '06 #21
Christoph Schneegans <Ch*******@Schn eegans.de> wrote:
Do you dispute the fact that XML Schema validation is more powerful than
DTD validation?
Before changing to another topic, first let's finish off your original
claim;


My original claim was that XHTML allows more powerful validation. I really
don't know why you assume I was referring to XML DTD based validation.


I assumed no such thing. You attempted to change the subject from DTD
based validation to Schema or RELAX NG validation.
do you agree that XHTML DTD based validation is not "more powerful"
than HTML DTD validation?


Sure. This has no relevance at all.


I seem to remember that you claimed this as an advantage of XHTML.
If you want meaningful validation
results, use an XML Schema validator such as <http://schneegans.de/sv/>.


A different subject not related to the alleged advantages of XHTML.

FYI Schema and/or RELAX NG validation is also available for HTML.
All additional XHTML constraints can be emulated for HTML validation.

Using freaky regular expressions?


By editing the files used by the validator, this doesn't involve regular
expressions.


If I wanted to disallow

<p title=""style=" "></p>

in favor of

<p title="" style=""></p>

in my HTML documents, how would I do that?


Ask someone who knows SGML. As referenced in my resource on using a
custom DTD, Eric B. Bednarz once helped me to emulate some extra
constraints. He has posted in this thread, maybe he is willing to help
with that if you are serious about using it.
This is noted on the quoted resource, and it links to an explanation of
how to change non DTD constraints, again using a common example.

Most authors don't want to create a custom DTD or a custom SGML
declaration.


No-one has anything to gain from checking text/html documents for the
extra constraints that XHTML requires, by definition this includes "most
authors".


Far from it! Many authors wonder why e.g.

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN">
<html>
<head><title> </title></head>
<body>
<p>foo</p>
<!------>
<p>bar</p>
<!------>
</body>
</html>

is not displayed in Firefox as intended, although this document is valid
according to the W3C Validator. <http://valet.webthing. com/page/> doesn't
output a warning either, BTW.


The above code also validates (using a DTD validator) if I change it to
XHTML.
Again before having another change of subject, I'm still trying to find
out what you meant by your original claim that XHTML syntax is simpler
than HTML syntax.


'<p ltr<span></span</p>' is valid HTML syntax. It's not simple at all.
What you refer to as "valid HTML" is an error that isn't picked up by a
validator using the public DTD.


Which specification says that '<p ltr<span></span</p>' is not valid HTML?


As I said it's an error, but valid (assuming DTD validation using the
public validation profile). I've pointed you to a resource that allows
this error the be picked up by a DTD validator.
Like almost all other HTML parsers IE's error recovery mechanism
allows XHTML served as text/html to be parsed as pseudo HTML without
necessarily causing problems.


IE supports anything that is labelled "text/html" and has some angle
brackets in it. In this sense, it also supports HTML or XHTML. IE doesn't
parse "text/html" XHTML using an XML parser, and it doesn't parse
"text/html" HTML using an HTML parser either. The difference is that I
don't suggest the former, while you do suggest the latter.


By that measure there are no CSS or DOM capable browsers because they
all have bugs. An potential obscure issue versus a fundamental parsing
model does not compare.
You are avoiding the point made that contrary to your claim that the
media type used made no difference, that a document served as
application/xml may not be recognized as XHTML.

That's what the W3C says. It does not happen nevertheless.


You mean that you haven't seen it happen, excuse me for attaching little
to no value to that.


You have't seen it happen either. Otherwise you would happily disclose the
name of that user agent.


I haven't tested for it, nor am I about to for your education
considering that application/xml should not be used to serve XHTML,
which itself has virtually no value at this point in time.

--
Spartanicus
Jun 10 '06 #22
"Spartanicu s" wrote:
Do you dispute the fact that XML Schema validation is more powerful than
DTD validation?

Before changing to another topic, first let's finish off your original
claim;
My original claim was that XHTML allows more powerful validation. I really
don't know why you assume I was referring to XML DTD based validation.


I assumed no such thing. You attempted to change the subject from DTD
based validation to Schema or RELAX NG validation.


The subject never was "DTD based validation", it was "validation ". XML
validation includes XML Schema validation and RELAX NG validation, of
course.
FYI Schema and/or RELAX NG validation is also available for HTML.


I doubt it.
> All additional XHTML constraints can be emulated for HTML validation.

Using freaky regular expressions?

By editing the files used by the validator, this doesn't involve regular
expressions.


If I wanted to disallow

<p title=""style=" "></p>

in favor of

<p title="" style=""></p>

in my HTML documents, how would I do that?


Ask someone who knows SGML.


How can you claim that "all additional XHTML constraints can be emulated"
with HTML if you don't know how to do that?
Many authors wonder why e.g.

<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01//EN">
<html>
<head><title> </title></head>
<body>
<p>foo</p>
<!------>
<p>bar</p>
<!------>
</body>
</html>

is not displayed in Firefox as intended, although this document is valid
according to the W3C Validator.


The above code also validates (using a DTD validator) if I change it to
XHTML.


'<!------>' is not even well-formed XML, and all XML validators know that.
Of course, SGML validators such as OpenSP that try to act like XML
validators don't, see <http://esw.w3.org/topic/MarkupValidator/XML_Limitations >.
Which specification says that '<p ltr<span></span</p>' is not valid HTML?


As I said it's an error, but valid (assuming DTD validation using the
public validation profile).


So which specification says that '<p ltr<span></span</p>' is erroneous?

--
All free men, wherever they may live, are citizens of Denmark. And
therefore, as a free man, I take pride in the words "Jeg er dansker!"
Jun 10 '06 #23
In article <e6**********@n ews.christoph.s chneegans.de>,
Christoph Schneegans <Ch*******@Schn eegans.de> wrote:
Henri Sivonen wrote:
Omitting whitespace between attribute specifications is almost always
a typo, not a deliberate decision.
But with HTML it is harmless--like making the typo of putting two spaces
instead of one between attributes.


It's definitely harmless for HTML parsers. But what about tag soup
parsers?


What kind of difference is there, according to your definitions, between
an HTML parser and a tag soup parser?

Anyway, AFAIK, it is harmless for the text/html parsers deployed in
notable browsers.
I will put a document containing

<a href="foo"class ="bar">

somewhere and check if spiders request 'foo"class="bar ' or similar.


Well, I can't know if there's a broken spider somewhere out there that
does not support syntax that is OK according to browsers, according to
HTML5 and according to SGML.

--
Henri Sivonen
hs******@iki.fi
http://hsivonen.iki.fi/
Mozilla Web Author FAQ: http://mozilla.org/docs/web-developer/faq.html
Jun 10 '06 #24
In article <e6**********@n ews.christoph.s chneegans.de>,
Christoph Schneegans <Ch*******@Schn eegans.de> wrote:
FYI Schema and/or RELAX NG validation is also available for HTML.


I doubt it.


http://hsivonen.iki.fi/validator/
provides RELAX NG validation for HTML.

There's no reason why the same approach couldn't be taken with XSD. The
reason why XSD does not work with my validation service is that the
Xerces XSD validator has a crasher bug (null dereference) and I don't
care enough about XSD to fix the bug myself.

--
Henri Sivonen
hs******@iki.fi
http://hsivonen.iki.fi/
Mozilla Web Author FAQ: http://mozilla.org/docs/web-developer/faq.html
Jun 10 '06 #25
Christoph Schneegans <Ch*******@Schn eegans.de> wrote:
My original claim was that XHTML allows more powerful validation. I really
don't know why you assume I was referring to XML DTD based validation.


I assumed no such thing. You attempted to change the subject from DTD
based validation to Schema or RELAX NG validation.


The subject never was "DTD based validation", it was "validation ". XML
validation includes XML Schema validation and RELAX NG validation, of
course.


You should have made that clear from the start. There are different
definitions of validation. Afaik in SGML it is defined as "checking a
document against a DTD", nothing more. Given the topic of this group
this is the definition I apply unless the poster mentions that he is
referring to XML validation.
FYI Schema and/or RELAX NG validation is also available for HTML.


I doubt it.


One that was announced in this group: http://badame.vse.cz/validator/
there are likely to be others.
If I wanted to disallow

<p title=""style=" "></p>

in favor of

<p title="" style=""></p>

in my HTML documents, how would I do that?


Ask someone who knows SGML.


How can you claim that "all additional XHTML constraints can be emulated"
with HTML if you don't know how to do that?


Due to your failure to note that you were referring to XML validation my
responses until now assumed DTD validation.

So now I have to ask why you'd want to disallow "<p
title=""style=" "></p>" in HTML in the first place, given that it's valid
and a brief test doesn't demonstrate it causing a problem.
The above code also validates (using a DTD validator) if I change it to
XHTML.


'<!------>' is not even well-formed XML, and all XML validators know that.
Of course, SGML validators such as OpenSP that try to act like XML
validators don't, see <http://esw.w3.org/topic/MarkupValidator/XML_Limitations >.


Checking for XML constraints that are missed by SGML validators can be
achieved by using an XML validator and internally transcoding HTML to
XHTML (http://badame.vse.cz/validator/ does this). No need to author
XHTML because of "more powerful means for validation".
Which specification says that '<p ltr<span></span</p>' is not valid HTML?


As I said it's an error, but valid (assuming DTD validation using the
public validation profile).


So which specification says that '<p ltr<span></span</p>' is erroneous?


Presumably the SGML spec (to which I don't have access). I assume that a
left angled bracket is not allowed inside a tag unless it is surrounded
by quotes.

--
Spartanicus
Jun 10 '06 #26
Henri Sivonen wrote:
FYI Schema and/or RELAX NG validation is also available for HTML.


I doubt it.


http://hsivonen.iki.fi/validator/
provides RELAX NG validation for HTML.


How do you validate HTML 4.01 documents?

--
All free men, wherever they may live, are citizens of Denmark. And
therefore, as a free man, I take pride in the words "Jeg er dansker!"
Jun 10 '06 #27
"Spartanicu s" wrote:
There are different definitions of validation. Afaik in SGML it is
defined as "checking a document against a DTD", nothing more. Given
the topic of this group this is the definition I apply unless the
poster mentions that he is referring to XML validation.
It's not my problem when you jump to conclusions.
FYI Schema and/or RELAX NG validation is also available for HTML.


I doubt it.


One that was announced in this group: http://badame.vse.cz/validator/
there are likely to be others.


<http://badame.vse.cz/validator/validate?uri=ht tp://www.spartanicus .utvinternet.ie/no-xhtml.htm>
seems to complain about a missing system identifier, which is perfectly
legal in HTML...
So now I have to ask why you'd want to disallow "<p
title=""style=" "></p>" in HTML in the first place, given that it's valid
and a brief test doesn't demonstrate it causing a problem.


Omitting whitespace between attribute specifications is almost always
a typo, not a deliberate decision. I'd like to spot typos.
So which specification says that '<p ltr<span></span</p>' is erroneous?


Presumably the SGML spec (to which I don't have access).


So public availability of all relevant specifications is obviously another
advantage of XHTML over HTML.

--
All free men, wherever they may live, are citizens of Denmark. And
therefore, as a free man, I take pride in the words "Jeg er dansker!"
Jun 10 '06 #28
Henri Sivonen wrote:
What kind of difference is there, according to your definitions, between
an HTML parser and a tag soup parser?


'<p ltr<span></span</p>' and '<p dir="ltr"><span ></span></p>' are
equivalent for an HTML parser, but probably not for a tag soup parser.

--
All free men, wherever they may live, are citizens of Denmark. And
therefore, as a free man, I take pride in the words "Jeg er dansker!"
Jun 10 '06 #29
In article <e6**********@n ews.christoph.s chneegans.de>,
Christoph Schneegans <Ch*******@Schn eegans.de> wrote:
Henri Sivonen wrote:
FYI Schema and/or RELAX NG validation is also available for HTML.

I doubt it.


http://hsivonen.iki.fi/validator/
provides RELAX NG validation for HTML.


How do you validate HTML 4.01 documents?


The parser works as in the HTML5 case except:
* An HTML 4.01 doctype is required instead of the HTML5 doctype.
* Extra entities from HTML5 are not allowed.

The XHTML 1.0 Strict and Transitional schemas are used for HTML 4.01
Strict and Transitional, respectively.

--
Henri Sivonen
hs******@iki.fi
http://hsivonen.iki.fi/
Mozilla Web Author FAQ: http://mozilla.org/docs/web-developer/faq.html
Jun 10 '06 #30

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

3
8608
by: Tjerk Wolterink | last post by:
Hello, I've xml code that looks like this: <?xml version="1.0" encoding="ISO-8859-1"?> <xc:xcontent xmlns:xc="http://www.wolterinkwebdesign.com/xml/xcontent" xmlns="http://www.w3.org/1999/xhtml">
2
1949
by: Ferd Biffle | last post by:
Hello! I have an Excel XML feed from an Enterprise PDM system that is basically a flat text file with XML tags: <?xml version="1.0" encoding="UTF-8"?> <Import> <Row> <ID>1</ID> <Level>0</Level>
1
1172
by: ouioui2000 | last post by:
Hi, I try to generate xml flow with javascript in a new windows. Here is my javascritp code : <code> fenetrePopUp = window.open('','_blank','width=500,height=300,scrollbars=yes,status=yes,resizable=yes'); fenetrePopUp.document.open("application/xml+xsl"); fenetrePopUp.document.write(myCodeXML); </code>
3
11816
by: Gustavo L. Fabro | last post by:
Greetings! I'm testing VS 2005 Beta 2, and I've compiled a program that runs just fine on my computer (where VS is installed). I've tried to run this same program on another computer (after having installed MS .NET Framework 2.0 Beta 2 on it, from http://www.microsoft.com/downloads/details.aspx?FamilyId=7ABD8C8F-287E-4C7E-9A4A-A4ECFF40FC8E&displaylang=en), but got the following message:
4
1663
by: Keith Chadwick | last post by:
I am having some trouble referencing an Application("myVar") variable from within a module.vb file on my ASP.NET site. According to the documentation I should be able to reference System.Web.HttpApplication but it does not seem to like that one bit. I also have a MSXML2.DOMDocument30 defined as a public object within the vb module which is loaded during the application start up with a bunch of XML. What is weird is that the data contained...
2
1618
by: Joel Vazquez | last post by:
Visual Basic.NET Application RunTime Crashes and Stalls Im a newbie if you could say in .NET ive been working with it the past 3 months and have done lots of things with it, without any prior knowledge. I have a few question regarding the stability of my application. I have a synchronization application that connects to a service and send the results to SQL Server 2000. I got all my code in just one form.
9
5113
by: MrSpock | last post by:
1. Create a new Windows Application project. 2. Open the project properties and check "Make single instance application". 3. Build. 4. Go to the release folder and run the application. 5. Try to open a second instance of the application. This will cause an unhandled exception and the "Send Error Report" box shows up. Does this happen to anyone else, or is it just me? Debug info: "Unhandled exception at 0x00e149fd in...
0
1908
by: Phil | last post by:
I am writing an application (using Visual Studio 2003) that consumes a third party Web Service, but I receive the following error : Client found response content type of 'application/xml; charset=utf-8', but expected 'text/xml'. I can see the XML Document and it is properly structured. Is there any way to overide this and get .NET to process the response XML regardless of the Content Type Setting in the HTTP Response ?
1
1265
by: wingsganesh | last post by:
Hi guys, I know how load XMl file in Windows application.But comparing windows application , device application have some special feature. When i use the same code in device application i have bug called"Web Exception" Any body can help My code is:
0
10169
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth. The Art of Business Website Design Your website is...
0
9964
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the choice of these technologies. I'm particularly interested in Zigbee because I've heard it does some...
0
8993
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
0
6749
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
0
5398
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
0
5534
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
1
4067
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
2
3670
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
3
2894
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.