Quick question on the presence of CDATA

Dilip

I have been out of the XML world for a while and have sort of forgotten
the exact difference between:

<Symbol><![CDATA[IBM]]></Symbol>

and just:

<Symbol>IBM</Symbol>

Can anyone tell me why one is preferred over the other?

thanks!

Oct 25 '06 #1

Subscribe Post Reply

1435

Joseph Kesselman

Followup to the Microsoft list doesn't work through my servers, so
answering here...
Dilip wrote:

<Symbol><![CDATA[IBM]]></Symbol>
<Symbol>IBM</Symbol>

Identical meaning, since there aren't any special characters in the value.

<!CDATA[]]sections are an alternative to character-by-character
escaping of characters that would otherwise confuse XML syntax (such as
"<" and "&"). It escapes its entire contents -- with the exception of
any ]]sequences, which require special handling.

Generally the only time you care about this is when you're hand-editing
XML, want to drop non-XML text into the value of an XML element (note
that you can't use this kluge for attribute values), and are too lazy to
fix it up by hand. If you build your XML using any XML-aware tool, it
should take care of the escaping for you and you don't have to care
whether it escapes individual characters or uses <!CDATA[]]>
--
Joe Kesselman / Beware the fury of a patient man. -- John Dryden

Oct 25 '06 #2

Dilip

Joseph Kesselman wrote:

Followup to the Microsoft list doesn't work through my servers, so
answering here...
Dilip wrote:
<Symbol><![CDATA[IBM]]></Symbol>
<Symbol>IBM</Symbol>

Identical meaning, since there aren't any special characters in the value.

<!CDATA[]]sections are an alternative to character-by-character
escaping of characters that would otherwise confuse XML syntax (such as
"<" and "&"). It escapes its entire contents -- with the exception of
any ]]sequences, which require special handling.

Generally the only time you care about this is when you're hand-editing
XML, want to drop non-XML text into the value of an XML element (note
that you can't use this kluge for attribute values), and are too lazy to
fix it up by hand. If you build your XML using any XML-aware tool, it
should take care of the escaping for you and you don't have to care
whether it escapes individual characters or uses <!CDATA[]]>

Just so that I got this straight, from the standpoint of the XML parser
does the 2 forms of elements make a difference? I mean, if I use XPath
to locate that element to retrieve its value, will I get back IBM or
something else?

Sorry if the question sounds stupid. I remember what CDATA is about
but I have forgotten what happens when a parser encounters it. (It
probably just treats whatever is inside as plain text, right?)

Oct 25 '06 #3

Joseph Kesselman

Dilip wrote:

Just so that I got this straight, from the standpoint of the XML parser
does the 2 forms of elements make a difference? I mean, if I use XPath
to locate that element to retrieve its value, will I get back IBM or
something else?

XPath doesn't distinguish the two; both yield IBM.

Parsers *CAN* distinguish the two, for the convenience of editors and
other tools which want to be able to display syntax as well as semantics
-- but aren't required to and often don't unless you ask them to.

probably just treats whatever is inside as plain text, right?)

Modulo the difference in how escaping is handled, yes, pretty much. A
SAX parser may tell the application that it's now inside the bounds of a
CDATA section; the app needs to decide whether to listen for lexical
events and whether it cares about this one. A DOM (depending on how the
builder is configured) may display the data using a CDATASection Node
rather than a Text Node, but the former is a subclass of the latter so
again that doesn't matter unless the application cares about the difference.

As far as the XML Infoset is concerned, <![CDATA[&a<]]is just a
representation of the character sequence &a< and is identical to
&a< or &a< or &a< or any of the other possible
combinations. The Infoset considers the differences between these to be
No Difference.

--
Joe Kesselman / Beware the fury of a patient man. -- John Dryden

Oct 25 '06 #4

Similar topics

Strip CDATA with regex

by: Balaras | last post by:

Hi, Can sombody here please help me a bit with a regular expression. I have a xml file where I need to strip the CDATA sections of any contained data. Eg. <xml> <tag><]></tag>...

Javascript

style tag with CDATA... @import...

by: Xah Lee | last post by:

what does it mean when a style tag gives something like the following? <style type="text/css" media="screen,projection">/*<!]>*/</style> is this standard? Xah xah@xahlee.org âˆ‘...

HTML / CSS

CDATA delimiter within CDATA Section

by: Cade Perkins | last post by:

How can the CDATA ending delimiter "]]>" be represented within a CDATA section itself? Consider an XML document that is intended to contain an embedded, uninterpreted XML example. Generally,...

.NET Framework

May a CDATA section appear in an attribute value?

by: Jon Noring | last post by:

Out of curiosity, may a CDATA section appear within an attribute value with datatype CDATA? And if so, how about other attribute value datatypes which accept the XML markup characters? To me,...

.NET Framework

scriptalicious documentation suggests wrapping code in CDATA blocks - why?

by: Jake Barnes | last post by:

I'm reading over this page: http://wiki.script.aculo.us/scriptaculous/show/Usage I'm struck by this code example +++++++++++++++++++++++++++++++ 3. Use

Javascript

How to insert a CDATA section using XPathNavigator ?

by: ericms | last post by:

Can anybody show me how to insert a CDATA section using XPathNavigator ? I have tried the follwing with no luck: XmlDocument docNav = new XmlDocument(); docNav.LoadXml(xmlString);...

.NET Framework

HTML and CDATA produced by Rails

by: Peter Michaux | last post by:

Hi, I am experimenting with some of the Ruby on Rails JavaScript generators and see something I haven't before. Maybe it is worthwhile? In the page below the script is enclosed in //<!]> ...

Javascript

Convert CDATA expression to Javascript RegExp

by: Max | last post by:

Hello everyone! Can anyone help me to convert the CDATA expression "CDATA ::= (Char* - (Char* ']]>' Char*)" to Javascript Regular Expression? Thanks, Max

.NET Framework

display the value from xml file when user click the drop down menu

by: dkyadav80 | last post by:

Hi sir, I'm new about xml, javascript. I have two selection field(html) first is city and second is state. the city and state values should be store in xml file. when user select city then all...

Javascript

How to turn on java script in a villaon keypad mobile phone

by: Charles Arthur | last post by:

How do i turn on java script on a villaon, callus and itel keypad mobile phone

Java

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

Is that possible of reading the .csv file in column wise and the column have different lengths ?

by: Sonnysonu | last post by:

This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

C / C++

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

General

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

Windows Server

The easy way to turn off automatic updates for Windows 10/11

by: Hystou | last post by:

Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

Windows Server

AI Job Threat for Devs

by: agi2029 | last post by:

Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

Career Advice