473,553 Members | 3,084 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Minidom empty script element bug

Hello All,

I ran into a problem while dynamically constructing XHTML documents using
minidom. If you create a script tag such as:

script_node_0 = self.doc.create Element("script ")
script_node_0.s etAttribute("ty pe", "text/javascript")
script_node_0.s etAttribute("sr c", "../test.js")

minidom renders it as:

<script src='../test.js' type='text/javascript'/>

Which is incorrect because:

XHTML 1.0 specs, Appendix C
~~~~@~~~~
C.3 Element Minimization and Empty Element Content

Given an empty instance of an element whose content model is not EMPTY (for
example, an empty title or paragraph) do not use the minimized form (e.g.
use <p> </p> and not <p />)
~~~~@~~~~

reference for further explanation:
http://lists.evolt.org/archive/Week-...04/105951.html

So, the rendered page completely fails on IE6 because it actually handles the
empty script element correctly. Mozilla handles the element incorrectly and
instantiates the javascript.

How do I get minidom to NOT render an empty script element? Should I submit a
bug report?

Thanks for the help,
Derek Basch


_______________ _______________ ____
Do you Yahoo!?
Yahoo! Small Business - Try our new resources site!
http://smallbusiness.yahoo.com/resources/
Jul 18 '05 #1
4 2858
Derek Basch wrote:
XHTML 1.0 specs, Appendix C
~~~~@~~~~
C.3 Element Minimization and Empty Element Content

Given an empty instance of an element whose content model is not EMPTY (for
example, an empty title or paragraph) do not use the minimized form (e.g.
use <p> </p> and not <p />)
~~~~@~~~~
I'd like to point out that this is *not* a minidom bug. minidom cannot
possibly know that the document type is XHTML, and that strange, non-XML
rules apply to XHTML (i.e. rules which are not present in XML itself).

I'd also like to point out that XHTML Appendix C is informative (i.e.
non-normative), meaning that failure to comply to it does not imply
non-compliance with XHTML. An XML file which uses the minimized form
for the script element is still proper, well-formed, valid XHTML.
How do I get minidom to NOT render an empty script element? Should I submit a
bug report?


That said, I think there is a simple solution: add an empty Text node
to the script element:

script_node_0.a ppendChild(doc. createText(u"") )

[Disclaimer: this is untested; from reading the source, I think it
should work]

Regards,
Martin
Jul 18 '05 #2

Martin v. Löwis wrote:
Derek Basch wrote:
XHTML 1.0 specs, Appendix C
~~~~@~~~~
C.3 Element Minimization and Empty Element Content

Given an empty instance of an element whose content model is not EMPTY (for example, an empty title or paragraph) do not use the minimized form (e.g. use <p> </p> and not <p />)
~~~~@~~~~
I'd like to point out that this is *not* a minidom bug. minidom

cannot possibly know that the document type is XHTML, and that strange, non-XML rules apply to XHTML (i.e. rules which are not present in XML itself).
I'd also like to point out that XHTML Appendix C is informative (i.e.
non-normative), meaning that failure to comply to it does not imply
non-compliance with XHTML. An XML file which uses the minimized form
for the script element is still proper, well-formed, valid XHTML.
How do I get minidom to NOT render an empty script element? Should I submit a bug report?


That said, I think there is a simple solution: add an empty Text node
to the script element:

script_node_0.a ppendChild(doc. createText(u"") )

[Disclaimer: this is untested; from reading the source, I think it
should work]

Regards,
Martin

Thanks Martin. That fixed it. I had to change your code a bit to this:

script_node_0.a ppendChild(self .doc.createText Node(""))

maybe you meant createTextNode?

I started digging through the dom modules on this path:

XHTMLPrettyPrin t -> XHTMLPrinter -> Printer

and found this comment:

try:
#The following stanza courtesy Martin von Loewis
import codecs # Python 1.6+ only
from types import UnicodeType

So I guess you are pretty qualified to answer my question! You are
correct that this is not a minidom bug now that I think about it.

However, it seems proper that XHTMLPrinter (or some other module)
should allow the developer to use either normative or non-normative
XHTML design guidlines to achieve some sane degree of HTML user agent
compatablilty. Maybe something like this in Printer.py:

def visitElement(se lf, node):
...........
if len(node.childN odes):
self._write('>' )
self._depth = self._depth + 1
self.visitNodeL ist(node.childN odes)
self._depth = self._depth - 1
if not self._html or (node.tagName not in
HTML_FORBIDDEN_ END):
not (self._inText and inline) and self._tryIndent ()
self._write('</%s>' % node.tagName)
elif not self._html and node.tagName not in
XHTML_NON_NORMA TIVES:
self._write('/>')
elif node.tagName not in HTML_FORBIDDEN_ END:
self._write('></%s>' % node.tagName)
else:
self._write('>' )

of course this would only take care of the "C.3. Element Minimization
and Empty Element Content" guideline but you get the general idea.

Anyways, thanks for the help again and feel free to shoot down my
suggestions :)

Derek Basch

Jul 18 '05 #3
Derek Basch wrote:
maybe you meant createTextNode?
Yes, that's what I meant :-)
However, it seems proper that XHTMLPrinter (or some other module)
should allow the developer to use either normative or non-normative
XHTML design guidlines to achieve some sane degree of HTML user agent
compatablilty.
This is now PyXML, right? I also maintain PyXML...
Yes, XHtmlPrinter would be the right place to deal with XHTML
idiosyncrasies.
Anyways, thanks for the help again and feel free to shoot down my
suggestions :)


The general approach sounds good; feel free to submit a patch
to sf.net/projects/pyxml. I would recommend to implement Annex C
to the letter, i.e. only avoid the minimized form if the content
model is not EMPTY.

Regards,
Martin
Jul 18 '05 #4
Cross post from XML-SIG:

--- Walter Dörwald <wa****@livingl ogic.de> wrote:
Martin v. Löwis sagte:
Derek Basch wrote:
[...]
How do I get minidom to NOT render an empty script element? Should
I
submit a bug report?

That said, I think there is a simple solution: add an empty Text
node to the script element:

script_node_0.a ppendChild(doc. createText(u"") )

[Disclaimer: this is untested; from reading the source, I think it
should work]

If this doesn't work, you might want to try XIST
(http://www.livinglogic.de/Python/xist)
instead of minidom. XIST knows that the script element is not EMPTY, and when the
output is in HTML compatible XML an end tag will be produced:
from ll.xist.ns import html
print html.script(typ e="text/javascript", src="../test.js").asByt es(xhtml=1)
<script src="../test.js" type="text/javascript"></script>

Using pure XML mode gives:
print html.script(typ e="text/javascript",

src="../test.js").asByt es(xhtml=2)
<script src="../test.js" type="text/javascript"/>

Bye,
Walter Dörwald


Wow! XIST is very elegant. Perfectly designed for what it is supposed
to do.

"XIST is an extensible HTML/XML generator written in Python."

I guess there isn't much point in "fixing" the pyXML XHTMLPrinter when
something as cool as XIST exists (pun intended).

Kid also seems really neat. I like the TAL like features. However, it
seems less mature than XIST.

There seems to be lots of functionality crossover between the two but
it is good that there is enough demand for XML output functionality in
python to support two distinct modules.

Thanks Everyone!,
Derek Basch

Jul 18 '05 #5

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

0
7486
by: xtian | last post by:
Hi - I'm doing some data conversion with minidom (turning a csv file into a specific xml format), and I've hit a couple of small problems. 1: The output format has a header with some xml that looks something like this: <item xmlns="" xmlns:thing="http://www.blah.com"> <thing:child name="smith"/> </item>
2
3226
by: Vincent De Baere | last post by:
Hi I am playing around a little with python (I've started using python at 1 PM today, so I guess I could be considered a newbie :-)), and have come across a little problem regarding xml.dom.minidom. this code snippet: menudoc = xml.dom.minidom.parse("menu.xml") menurootchildren = menudoc.documentElement.childNodes
0
1895
by: Scott F | last post by:
Hi all, Today's embarassingly simple question goes like this. I have a file, testdoc.xml, to parse. Inside the file is an element <refrain>Yo, Ho, Ho</refrain> So, starting with
4
6047
by: webdev | last post by:
lo all, some of the questions i'll ask below have most certainly been discussed already, i just hope someone's kind enough to answer them again to help me out.. so i started a python 2.3 script that grabs some web pages from the web, regex parse the data and stores it localy to xml file for further use.. at first i had no problem using...
0
1887
by: Sullivan WxPyQtKinter | last post by:
Hi, I am now using minidom for my current development. I use cloneNode method in Element object, but it just does not work. The test code is very simple as follows: =========CODE============== from xml.dom.minidom import * a=Element('see') print a.toprettyxml() b=a.cloneNode(True)
1
1626
by: Dean Card | last post by:
I am using minidom to parse a 20,000 line XML file. I have a few instances where the number of child nodes of a particular node can be variable in number. To access them I am doing something like the following... xmldoc = minidom.parseString(r) results = xmldoc.childNodes for myNode in results.childNodes.childNodes: do Stuff with...
1
1538
by: JoReiners | last post by:
Hello, I have a really strange problem. I'm unable to figure it out on my own. I parse very simple xml documents, without any check for their form. These files look very similar and are encoded in UTF-8. Now minidom is always able to parse these files with minidom.parse("file") . Now when fetching I use this expression:...
6
4461
by: Dan | last post by:
I'm using python's xml.dom.minidom module to generate xml files, and I'm running into memory problems. The xml files I'm trying to create are relatively flat, with one root node which may have millions of direct child nodes. Here's an example script: #!/usr/bin/env python import xml.dom.minidom
0
1508
by: Gary | last post by:
Howdy I ran into a difference between Python on Windows XP and Linux Fedora 6. Writing a dom to xml with minidom works on Linux. It gives an error on XP if there is an empty namespace. The problem was handled in CVS a while ago. http://mail.python.org/pipermail/xml-sig/2003-October/009904.html
0
7568
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main...
0
7772
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. ...
0
8007
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that...
1
7526
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For...
0
6109
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then...
0
5133
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert...
1
1990
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
1
1106
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
0
808
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.