473,385 Members | 1,973 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,385 software developers and data experts.

When did IE stop recognizing entity references without ";"?

I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.

I'm pretty sure that previous versions of IE rendered them by the
specifications. I first thought this has something to do with XML (i.e.
maybe IE pretends to play a little bit of XML game when processing HTML,
even when I serve it as text/html and use an HTML 4.01 DOCTYPE). But on
closer look, I notice that
&auml &eacute
work correctly.

It seems that ISO Latin 1 entities work without a semicolon, newer
entities don't. Am I mistaken, i.e. was this always the case on IE?

(Of course, it has always been recommendable to use the terminating
semicolon in entity references, but it is not required by SGML, hence by
classic HTML rules, when the reference is followed by a character that
is not a name character, e.g. by a space. There must be _lots_ of legacy
documents that play fast with entity references without semicolons.)
Oct 11 '05 #1
9 2168
Jukka K. Korpela wrote:
I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.


Internet Explorer 5.5 seems to behave the same way.
Oct 11 '05 #2
Els
Leif K-Brooks wrote:
Jukka K. Korpela wrote:
I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.


Internet Explorer 5.5 seems to behave the same way.


Mine too, as does my IE4.0. But I reckon it maybe due to those being
installed on WinXP with SP2. For a real test, you should look at an
installation on Win98 or Win95 I think.

--
Els http://locusmeus.com/
Sonhos vem. Sonhos vão. O resto é imperfeito.
- Renato Russo -
Oct 11 '05 #3
"Jukka K. Korpela" <jk******@cs.tut.fi> wrote:
I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.

I'm pretty sure that previous versions of IE rendered them by the
specifications. I first thought this has something to do with XML (i.e.
maybe IE pretends to play a little bit of XML game when processing HTML,
even when I serve it as text/html and use an HTML 4.01 DOCTYPE). But on
closer look, I notice that
&auml &eacute
work correctly.


I get the same rendering on IE6 6.0.2800.1106, the about dialog says
that SP1 is installed.

--
Spartanicus
Oct 11 '05 #4
On Tue, 11 Oct 2005, Jukka K. Korpela wrote:
I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.
Google does the same.
http://www.google.com/search?q=%26om...te:helsinki.fi
There must be _lots_ of legacy
documents that play fast with entity references without semicolons.)


They deserve it to be punished. ;-)
Actually, I doubt that there a lots of such documents.

Oct 11 '05 #5
Andreas Prilop wrote:
On Tue, 11 Oct 2005, Jukka K. Korpela wrote:
I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.
Google does the same.
http://www.google.com/search?q=%26om...te:helsinki.fi


Most interesting. Do you know whether Google treats any other
non-letters than "&" as something that it looks for, instead of treating
it just separators between "words"?
There must be _lots_ of legacy
documents that play fast with entity references without semicolons.)


They deserve it to be punished. ;-)


Maybe. The world is cruel but unjust. I still wonder why Microsoft
thinks they can afford that - and do it for non-Latin-1 characters only.

(I checked on Windows 98 with IE 6.0, and noted that it too displays
&euro literally. Maybe my memory does not serve me right, or maybe I'm
just getting mad. Maybe I just imagined that some IE got this by the specs.)
Actually, I doubt that there a lots of such documents.


You're right in the sense that most documents that sloppily (or, in rare
cases, intentionally) omit the semicolon usually do that for ISO Latin 1
characters only. But search for &omega without site restriction gives
43,500 hits, and although many of them actually discuss the &omega;
or &Omega; entity reference, there are lots of documents (beyond the
pages of my alma mater too) that really try to present the Greek letter
in a manner that fully conforms to HTML specifications, though not to
good style.

With &euro, I get 786,000 hits, and nausea. (It's not _that_ bad, since
most people who know what "euro" means probably figure out what
"42 &euro" is supposed to mean. I'm not so sure about &harr for example.
Not to mention &lang, which is mostly just a syntax error.)
Oct 11 '05 #6
"Jukka K. Korpela" wrote:

I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.

I'm pretty sure that previous versions of IE rendered them by the
specifications. I first thought this has something to do with XML (i.e.
maybe IE pretends to play a little bit of XML game when processing HTML,
even when I serve it as text/html and use an HTML 4.01 DOCTYPE). But on
closer look, I notice that
&auml &eacute
work correctly.

It seems that ISO Latin 1 entities work without a semicolon, newer
entities don't. Am I mistaken, i.e. was this always the case on IE?

(Of course, it has always been recommendable to use the terminating
semicolon in entity references, but it is not required by SGML, hence by
classic HTML rules, when the reference is followed by a character that
is not a name character, e.g. by a space. There must be _lots_ of legacy
documents that play fast with entity references without semicolons.)


Section 5.3 of the HTML 4.01 specification says:
"Note. In SGML, it is possible to eliminate the final ";" after a
character reference in some cases (e.g., at a line break or
immediately before a tag). In other circumstances it may not be
eliminated (e.g., in the middle of a word). We strongly suggest
using the ";" in all cases to avoid problems with user agents that
require this character to be present."

Thus, the absence of a semi-colon is not necessarily wrong even if
it is contrary to a strong suggestion to always have the
semi-colon.

--

David E. Ross
<URL:http://www.rossde.com/>

I use Mozilla as my Web browser because I want a browser that
complies with Web standards. See <URL:http://www.mozilla.org/>.
Oct 12 '05 #7
On Tue, 11 Oct 2005, Jukka K. Korpela wrote:
Do you know whether Google treats any other
non-letters than "&" as something that it looks for, instead of treating
it just separators between "words"?


Period and comma inside numbers. Others I don't know.

Oct 12 '05 #8


Andreas Prilop wrote:

Jukka K. Korpela wrote:
Do you know whether Google treats any other
non-letters than "&" as something that it looks for, instead
of treating it just separators between "words"?


Period and comma inside numbers. Others I don't know.


It just so happens that I do know... <grin> It is something that
a lot of folks like to reference, so I will post it under a more
descriptive subject line.


Oct 12 '05 #9
Jukka K. Korpela wrote:
I noticed that Internet Explorer (6.0, on Win XP SP 2, all fixes
installed) incorrectly renders e.g.
&harr &euro &Omega
literally and not as characters denoted by the entities, but if a
semicolon is appended to each of the entity references, they work.
...
It seems that ISO Latin 1 entities work without a semicolon, newer
entities don't. Am I mistaken, i.e. was this always the case on IE?


Support in IE actually looks a little worse than that. It is, indeed,
as you mentioned, IE only supports ISO-8859-1 entity references without
the semi-colon, plus &amp, &lt, &gt and &quot from the markup
significant category. All others are only supported with the semi-colon.

However, it also fails to support any hex character reference without
the semi-colon, but does support numeric (decimal) character references
just fine with or without.

See the tests:
http://lachy.id.au/dev/markup/tests/...charref/syntax

--
Lachlan Hunt
http://lachy.id.au/
http://GetFirefox.com/ Rediscover the Web
http://GetThunderbird.com/ Reclaim your Inbox
Oct 16 '05 #10

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

3
by: Phil | last post by:
Hi everybody, I am a XSLT beginner and the following problem really makes me crazy ! I have a main "contacts.xml" document which contains references to several contact data XML files. My aim...
3
by: Tjerk Wolterink | last post by:
Hello, I've xml code that looks like this: <?xml version="1.0" encoding="ISO-8859-1"?> <xc:xcontent xmlns:xc="http://www.wolterinkwebdesign.com/xml/xcontent"...
8
by: Ayende Rahien | last post by:
I've a really strange problem, in some part of my code I compare two strings (through object), and while I *know* that they equal each other, and in the watch window they do equal each other, then...
2
by: Robin S. | last post by:
I don't have the "references" selection in my "tools" menu. Obviously this is kind of important. I have "Always show full menus" checked. Program has been updated to SP3. Any ideas as to why...
11
by: comp.lang.php | last post by:
function blah($item) { if (!isset($baseDir)) { static $baseDir = ''; $baseDir = $item; print_r("baseDir = $baseDir\n"); } $dirID = opendir($item); while (($fyl = readdir($dirID)) !== false)...
2
by: Chameleon | last post by:
I have a BIG php output and I start to transmit after client's request. What happen on server when client press "Stop"? PHP continues to send data? Where? Until the end?
5
by: John Nagle | last post by:
This, which is from a real web site, went into BeautifulSoup: <param name="movie" value="/images/offersBanners/sw04.swf?binfot=We offer fantastic rates for selected weeks or days!!&blinkt=Click...
4
by: SpaceMarine | last post by:
hello, i am trying to figure something out. the facts: - i have an Access 2000 app that ive converted to Access 2007 (windows Vista) - it uses several external references, DLLs and an EXE....
1
by: ziycon | last post by:
Can anyone tell me what this error means in tomcat logs?
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.