473,695 Members | 3,220 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

pyXML beginner questions



hi,

id like to do the following:

(1) open a .xml, change something and save it.
problem is: how to save/serialize?
i tried xml.dom.ext.Pri nt, but this resolves all entities and serializes
the xml with resolved entities. (see example below)

(2) id also like to load external parsed entities referenced in the xml.
MSXML provides an extension(?) to DOM which returns the uri to an
entityReference-NODE.
any similar in pyXML. actually, is nodetype entityReference implemented
in pyXML. i always get the nodeType of the resolved entity, ie 3
(NODE_TEXT) with a internal unparsed entity.
#############
<?xml version="1.0" encoding="iso-8859-1"?>
<!DOCTYPE xbel [
<!ENTITY intTxt 'GIGI'>
<!ENTITY intMarkup '<entIntern>tex t</entIntern>'>
<!ENTITY extParsed SYSTEM "ent.xml">
]>
<root>
<text>&intTxt ;</text>
&intMarkup;
&extParsed;
</root>
############

becomes:

#############
<?xml version="1.0" encoding="iso-8859-1"?>
<root>
<text>some text</text>
<entIntern>text </entIntern>'
<entIntern>text </entIntern>'
</root>
#############

thx,
Sebastian


Jul 18 '05 #1
2 2017
Sebastian Fey <fe*@parsytec.d e> wrote in message news:<c2******* ******@ID-190842.news.uni-berlin.de>...
hi,

id like to do the following:

(1) open a .xml, change something and save it.
problem is: how to save/serialize?
i tried xml.dom.ext.Pri nt, but this resolves all entities and serializes
the xml with resolved entities. (see example below)
Sounds as if you want a lexical round-trip. Very few XML processing
packages allow for this. I'd check whether pxdom supports this. If
not, I don't expect you'll find it in Python.

http://www.doxdesk.com/software/py/pxdom.html
(2) id also like to load external parsed entities referenced in the xml.
MSXML provides an extension(?) to DOM which returns the uri to an
entityReference-NODE.
any similar in pyXML. actually, is nodetype entityReference implemented
in pyXML. i always get the nodeType of the resolved entity, ie 3
(NODE_TEXT) with a internal unparsed entity.


Again pxdom will get you closest.

--Uche
Jul 18 '05 #2
Sebastian Fey <fe*@parsytec.d e> wrote:
actually, is nodetype entityReference implemented in pyXML.
Yes, but you won't ever see them from a parse operation.

Print() will happily serialise an entity reference as &e; providing you
can get one into the document in the first place. Using
Document.create EntityReference () is the only way I know.
(2) id also like to load external parsed entities referenced in the xml.
MSXML provides an extension(?) to DOM which returns the uri to an
entityReference-NODE. any similar in pyXML.


The standard DOM way of doing it is to use the DocumentType.en tities
interface:

doctype= entref.ownerDoc ument.doctype
entdecl= doctype.entitie s.getNamedItem( entref.nodeName )
uri= entdecl.systemI d # see also baseURI if using DOM Level 3 Core

This isn't any use for 4DOM as you won't get any Entity objects from its
parse stage and you can't create your own.

In DOM Level 3 Load/Save, control of whether Entity and EntityReference
objects are kept in the document is achieved with the DOMConfiguratio n
parameter 'entities':

parser= implementation. createLSParser( 1, None)
parser.domConfi g.setParameter( 'entities', True) # False by default
doc= parser.parseURI ('file:///in.xml')
serialiser= implementation. createLSSeriali zer()
serialiser.domC onfig.setParame ter('entities', True)
serialiser.writ eToURI(doc, 'file:///out.xml')

DOM 3 LS is still at Proposed Recommendation stage and isn't supported
by 4DOM yet. (Insert customary pxdom plug here.)

--
Andrew Clover
mailto:an*@doxd esk.com
http://www.doxdesk.com/
Jul 18 '05 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

0
1432
by: leo | last post by:
hi there i want to write a little SOAP client and thought about using ZSI. now i have read that ZSI only works with PyXML versions later than 0.6 and earlier than 0.7. unfortunaltly i can find that old versioons of PyXML on sourceforge any more...
2
3523
by: David Dorward | last post by:
I'm attempting to read an XHTML 1.1 file, perform some DOM manipulation, then write the results to a different file. I've found myself rather stuck at the first hurdle. I have the following: from xml.dom.ext.reader import Sax2 reader = Sax2.Reader() f = open('dorward.me.uk/sitemap.html', 'r')
1
1797
by: Alain | last post by:
Hi ! I'm looking for an XPath implementation (even partial) on PyXML. I'm using libxml2 on Windows XP. It works well but I have to port my application on RISCOS (OS on ARM processor) where only PyXML is ported. Thanks. Alain
2
2077
by: Stefan Behnel | last post by:
Hi! I'm using PyXML 0.8.3 on a number of machines. When I now run import xml.sax.saxlib python touches both the python/xml/sax and python/site-packages/_xmlplus modules but not _xmlplus/sax (I see that from strace), and then only searches for saxlib in python/xml/sax, i.e. in stdlib. I therefore get an ImportError though saxlib exists in _xmlplus/sax.
1
1879
by: Dan | last post by:
I'm writing a Python program that does some XML parsing, though nothing heavy, and I want to avoid requiring the user to install additional libraries like PyXML. The documentation for my version of Python (2.3.5) mentions PyXML as an additional library while discussing the DOM module <http://www.python.org/doc/2.3.5/lib/module-xml.dom.html>. I don't seem to have PyXML installed on my system, but: >>> import xml.dom.minidom
1
1572
by: PyPK | last post by:
How do I add a new attribute to the existing xml Document tree???
5
2128
by: Matthias Kaeppler | last post by:
Hi, I have to say I am confused about the documentation on pyxml.sf.net. When I want to use DOM, I effectively am using a class called Sax2? ^^ I also have to catch SAXExceptions, which reside in xml.sax._exceptions. I thought DOM and SAX are two completely different things. Why is PyXML mixing them up like this? Thanks,
3
4028
by: Thomas W | last post by:
I've tried to install the pyxml-package, available at http://pyxml.sourceforge.net/, both from source and using some hack on the win32-binary package available for python2.4, but without luck. After install the module xml.dom.ext.reader is not available, eventhough the folder, __init__py and everything is in place. Why? And could someone please make a win32-binary package of pyxml for python2.5? Thanks!
4
1764
by: kdwyer | last post by:
Hello Everyone, I've been looking into writing a utility to compare/analyse xml files, and thought I'd have a look at PyXml, but the Sourceforge page says it's no longer maintained. Two questions: What's the story with PyXml? Is it stable/complete or has effort moved elsewhere?
0
8582
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can effortlessly switch the default language on Windows 10 without reinstalling. I'll walk you through it. First, let's disable language synchronization. With a Microsoft account, language settings sync across devices. To prevent any complications,...
0
9122
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. Here is my compilation command: g++-12 -std=c++20 -Wnarrowing bit_field.cpp Here is the code in...
0
9001
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth. The Art of Business Website Design Your website is...
0
7672
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
1
6498
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
5841
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
1
3018
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
2
2281
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
3
1984
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.