Theory question: U+000C in HTML 4.01

This question is fairly theoretical (even for me), but it started to
puzzle me:

According to the SGML declaration for HTML 4.01, at
http://www.w3.org/TR/REC-html40/sgml...cl.html#h-20.1
the Form Feed character, U+000C (12 in decimal), is UNUSED, i.e. forbidden:

DESCSET  0   9   UNUSED
         9   2   9
         11  2   UNUSED
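
To make the reading of those rows concrete, here is a minimal Python sketch. It assumes each DESCSET row means (first character number, number of characters, mapping), and it models only the three rows quoted above; the name `descset_status` is a hypothetical helper, not anything from the specification.

```python
# Rows of the DESCSET excerpt quoted above: (first code, count, mapping).
# Only the three quoted rows are modelled; the rest of the declaration is not.
DESCSET = [
    (0, 9, "UNUSED"),
    (9, 2, 9),
    (11, 2, "UNUSED"),
]


def descset_status(code, rows=DESCSET):
    """Return the mapping for `code`, or None if no quoted row covers it."""
    for start, count, target in rows:
        if start <= code < start + count:
            if target == "UNUSED":
                return "UNUSED"
            # A numeric target maps the range onto consecutive base-set codes.
            return target + (code - start)
    return None


for code in (9, 10, 11, 12):
    print(f"U+{code:04X}: {descset_status(code)}")
# U+0009: 9
# U+000A: 10
# U+000B: UNUSED
# U+000C: UNUSED
```

Under that reading, the row starting at 11 with a count of 2 covers both 11 and 12, which is why form feed ends up UNUSED.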

Yet, the prose of the specification discusses it as if it were an
allowed character. Section 9.1 White space says:

"In HTML, only the following characters are defined as white space
characters:

- ASCII space (&#x0020;)
- ASCII tab (&#x0009;)
- ASCII form feed (&#x000C;)
- Zero-width space (&#x200B;)"

( http://www.w3.org/TR/REC-html40/struct/text.html#h-9.1 )

Is this just a slip in the SGML declaration, or in the prose? I'd
suppose the latter, since the formal rule was the same in HTML 3.2,
which did not mention U+000C at all in the prose. So when people wrote
the HTML 4.01 prose, they just didn't check what's in the formal
declaration.

The W3C validator and the WDG validator seem to report U+000C as an
error ("Non-SGML character number 12"), apparently playing by the SGML
declaration for HTML 4.01.
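
For illustration only, a check of that kind could be sketched as below. This is not how either validator is implemented; `find_non_sgml` and `UNUSED_CODES` are hypothetical names, the set covers only the character numbers that the quoted DESCSET rows mark UNUSED, and the message text is copied from the validator output quoted above.

```python
# Code points the quoted DESCSET rows mark as UNUSED: 0-8 and 11-12.
# Rows beyond the quoted excerpt are deliberately not modelled here.
UNUSED_CODES = set(range(0, 9)) | {11, 12}


def find_non_sgml(text):
    """Yield (line, column, message) for each character in UNUSED_CODES."""
    line, col = 1, 0
    for ch in text:
        if ch == "\n":
            line, col = line + 1, 0
            continue
        col += 1
        if ord(ch) in UNUSED_CODES:
            yield line, col, f"Non-SGML character number {ord(ch)}"


sample = "<p>page one</p>\n\x0c<p>page two</p>\n"
for problem in find_non_sgml(sample):
    print(problem)
# (2, 1, 'Non-SGML character number 12')
```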

(XHTML, like XML in general, explicitly forbids U+000C. And U+000C is not
useful in HTML anyway: it is just another whitespace character, not a page
eject character, as one might naively expect.)
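
The XML side is easy to try out. A small sketch using Python's standard `xml.etree.ElementTree` parser, where `is_well_formed` is a hypothetical helper name: a literal form feed in character data makes the document not well-formed.

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text):
    """Return True if the parser accepts xml_text, False on a parse error."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


print(is_well_formed("<p>a b</p>"))      # True
print(is_well_formed("<p>a\x0cb</p>"))   # False: U+000C is not an XML 1.0 character
```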
Oct 27 '05 #1
Jukka K. Korpela wrote:
> According to the SGML declaration for HTML 4.01, at
> http://www.w3.org/TR/REC-html40/sgml...cl.html#h-20.1
> the Form Feed character, U+000C (12 in decimal), is UNUSED, i.e.
> forbidden:
>
> DESCSET  0   9   UNUSED
>          9   2   9
>          11  2   UNUSED
>
> Yet, the prose of the specification discusses it as if it were an
> allowed character. Section 9.1 White space says:
>
> "In HTML, only the following characters are defined as white space
> characters:
>
> - ASCII space (&#x0020;)
> - ASCII tab (&#x0009;)
> - ASCII form feed (&#x000C;)
> - Zero-width space (&#x200B;)"
>
> ( http://www.w3.org/TR/REC-html40/struct/text.html#h-9.1 )
>
> Is this just a slip in the SGML declaration, or in the prose? I'd
> suppose the latter, since the formal rule was the same in HTML 3.2,
> which did not mention U+000C at all in the prose. So when people wrote
> the HTML 4.01 prose, they just didn't check what's in the formal
> declaration.


It certainly does not make much sense to allow the form feed in HTML. But it
is rather common to treat it as whitespace in languages where whitespace is
not significant (e.g. C); this seems to be some kind of traditional
convention. The CSS 2.1 spec (I did not check the other CSS versions) also
includes it in its list of whitespace characters, so there seems to be _some_
consistency across the different specs.
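
As a quick illustration of that convention, here is a sketch in Python rather than C or CSS (a swapped-in language, chosen only because it is easy to run): its standard library also counts form feed as ordinary whitespace.

```python
import string

FORM_FEED = "\x0c"

# Form feed is part of the ASCII whitespace set in the standard library.
in_whitespace_set = FORM_FEED in string.whitespace
is_space = FORM_FEED.isspace()

# Whitespace splitting treats it like a space or a tab.
tokens = "first\x0csecond".split()

print(in_whitespace_set, is_space, tokens)
# True True ['first', 'second']
```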

I would bet (but not much money ;) that the authors added the form feed in
order to conform to some tradition, but forgot to think about the SGML
declaration.

And the form feed _could_ be used to manually prepare HTML source code for
printing, so it might be useful for one or two people on this planet.

--
Benjamin Niemann
Email: pink at odahoda dot de
WWW: http://www.odahoda.de/
Oct 28 '05 #2
Jukka K. Korpela wrote:
> This question is fairly theoretical (even for me), but it started to
> puzzle me:


btw: you are not the first one:
http://lists.w3.org/Archives/Public/...lSep/0131.html
But no one seems to care...

--
Benjamin Niemann
Email: pink at odahoda dot de
WWW: http://www.odahoda.de/
Oct 28 '05 #3
