ElementTree, how to get the whole content of a tag

Damjan

Given the folowing XML snippet, I build an ElementTree instance with
et=ElementTree.fromstring(..). Now et.text returns just '\n text\n some
other text'.
Is there any way I could get everything between the <div> and </div> tag?

<div>
text
some other text<br/>
and then some more
</div>
--
damjan

Jul 18 '05 #1

Subscribe Post Reply

3157

Fredrik Lundh

Damjan <gd*****@gmail.com> wrote:

Given the folowing XML snippet, I build an ElementTree instance with
et=ElementTree.fromstring(..). Now et.text returns just '\n text\n some
other text'.
Is there any way I could get everything between the <div> and </div> tag?

<div>
text
some other text<br/>
and then some more
</div>

def gettext(elem):
text = elem.text or ""
for subelem in elem:
text = text + gettext(subelem)
if subelem.tail:
text = text + subelem.tail
return text

gettext(et)

'\n text\n some other text\n and then some more\n'

</F>

Jul 18 '05 #2

Damjan

>> Is there any way I could get everything between the <div> and </div> tag?

<div>
text
some other text<br/>
and then some more
</div>
gettext(et)

'\n text\n some other text\n and then some more\n'

I acctually need to get
'\n text\n some other text<br/>\n and then some more\n'

And if there were attributes in <br/> I'd want them too where they were.
Can't I just get ALL the text between the <div> tags?

--
damjan

Jul 18 '05 #3

Fredrik Lundh

Damjan wrote:

Is there any way I could get everything between the <div> and </div> tag?

<div>
text
some other text<br/>
and then some more
</div> gettext(et)

'\n text\n some other text\n and then some more\n'

I acctually need to get
'\n text\n some other text<br/>\n and then some more\n'

that's not the tree content, that's a serialized XML fragment.

the quickest way to do that is to serialize the entire element, and
strip off the start and end tags:

text = ElementTree.tostring(elem)
text = text.split(">", 1)[1].rsplit("<", 1)[0]

alternatively, you can serialize the subelements, and add in properly
encoded text and tail attributes:

def innersource(elem, encoding="ascii"):
text = ElementTree._encode(elem.text or "", encoding)
for subelem in elem:
text = text + ElementTree.tostring(subelem)
if subelem.tail:
text = text + ElementTree._encode(subelem.tail, encoding)
return text

(but _encode is not an official part of the elementtree API, so this code
may not work in post-1.2 releases)

</F>

Jul 18 '05 #4

by: Stewart Midwinter | last post by:

I want to parse a file with ElementTree. My file has the following format:  <?xml version='1.0' encoding='utf-8'?> <population> <person><name="joe" sex="male"...

Python

effbot ElementTree question

by: dayzman | last post by:

Hi, Is anyone here familiar with ElementTree by effbot? With <html><body>hello</body></html> how is "hello" stored in the element tree? Which node is it under? Similarly, with: foo <a href =...

Python

ElementTree Namespace Prefixes

by: Chris Spencer | last post by:

Does anyone know how to make ElementTree preserve namespace prefixes in parsed xml files? The default behavior is to strip a document of all prefixes and then replace them autogenerated prefixes...

Python

import statement / ElementTree

by: mirandacascade | last post by:

O/S: Windows 2K Vsn of Python: 2.4 Currently: 1) Folder structure: \workarea\ <- ElementTree files reside here \xml\ \dom\

Python

ElementTree - Why not part of the core?

by: doug.bromley | last post by:

Why is the ElementTree API not a part of the Python core? I've recently been developing a script for accessing the Miva API only to find all the core API's provided by Python for parsing XML is...

Python

the tostring and XML methods in ElementTree

by: mirandacascade | last post by:

O/S: Windows XP Home Vsn of Python: 2.4 Copy/paste of interactive window is immediately below; the text/questions toward the bottom of this post will refer to the content of the copy/paste ...

Python

request for advice - possible ElementTree nexus

by: mirandacascade | last post by:

Situation is this: 1) I have inherited some python code that accepts a string object, the contents of which is an XML document, and produces a data structure that represents some of the content of...

Python

lxml/ElementTree and .tail

by: Chas Emerick | last post by:

I looked around for an ElementTree-specific mailing list, but found none -- my apologies if this is too broad a forum for this question. I've been using the lxml variant of the ElementTree API,...

Python

Re: Using ElementTree as backend for a chat web application issues

by: Gabriel Genellina | last post by:

En Mon, 09 Jun 2008 15:32:00 -0300, Marcelo de Moraes Serpa <celoserpa@gmail.comescribió: I don't think it's a problem with ElementTree. Perhaps you are writing the same (global) configuration...

Python

How to turn on java script in a villaon keypad mobile phone

by: Charles Arthur | last post by:

How do i turn on java script on a villaon, callus and itel keypad mobile phone

Java

Basic Javascript concepts

by: aa123db | last post by:

Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...

Javascript

Merging data from multiple Excel files

by: ryjfgjl | last post by:

In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...

Data Management

Migrating Website to Cloud - Emmanuel Katto

by: emmanuelkatto | last post by:

Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel

General

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

Windows Server

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

ElementTree, how to get the whole content of a tag

Similar topics