XML CDATA etc - .NET Framework

JohnAD

Hello NG,

I am getting some information from a DB, and the data has a mix of HTML and
XML tags in the content (e.g. detail on a country).

Basically, CDATA sections are mixed with regular strings. Also, the HTML tags
are in escaped form (e.g. &gt; for >). When I display the string I see those
tags.

I am getting all this data in XML form, and I want to find out how I can
change those escaped HTML tags into regular tags, and also how to remove CDATA
or any instructions in the string. Is there a quick way to do that? My
problem is made worse by the fact that I don't know XML.

Thank you,
Po

Jan 6 '07 #1

Peter Flynn

JohnAD wrote:

> Hello NG,
>
> I am getting some information from DB, and that data has mix html and
> XML tags in the content (e.g. detail on country).
>
> Basically CDATA types are mixed with regular string. Also, html tags are
> in escape form (e.g. &gt; is >). When I display that string I see those
> tags.

This sounds like someone has interfered with the file.

> Basically I am getting all this data as xml form and I want to find out
> how can I change those html tags into regular tags,

What does that mean? Change &lt;p&gt; back into <p>?

> and also how to
> remove CDATA or any instructions in the string. Is there a quick way to
> do that? My problem is increased as I don't know XML.

It sounds like whoever supplied you with the file doesn't know any XML
either.

a) Best move is to ask them for valid (or at least well-formed) XML to
start with. Unless you're working with well-formed data at the very
least, you don't stand much chance of using XML. If you don't know
if what you've got is well-formed or not, install a reliable
standalone XML parser like rxp and use it to test the file[s].

b) To change the escaped pointy brackets back into real ones you'll need
to write and run some non-XML script, but the risk is that they were
escaped for a reason (usually ignorance, sometimes laziness) and that
by putting them back the way they were, you'll break the data model.
By restoring them, you are essentially adding new elements to a file
which wasn't designed to hold them (which is why they were escaped to
begin with). It *is* possible to repair the damage with XSLT, but its
string-handling isn't very sophisticated.

c) CDATA markup is used along with HTML escapement to allow the remains
of the elements to be embedded in XML, in the (usually) forlorn hope
that someone (you) will struggle to restore them at a later stage in
the process. This is often done by people with little understanding
of markup or XML (your supplier). Running the document through any
parsing XML processor will automatically remove the CDATA markup and
pass the content through to whatever the next stage is. However, if
doing so reveals pointy-bracket markup that doesn't fit the document
model (DTD, Schema,...) then the process will halt (as it's supposed
to).
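
As a rough illustration of (b) and (c), here is a minimal Python sketch using only the standard library. The sample document, its element names (country, name, detail) and the helper name extract_detail are invented for the example and are not taken from the poster's data. It shows that a parser hands back the content of a CDATA section without the CDATA markup, and that a separate, non-XML step (here html.unescape) is what turns escaped brackets back into real ones.

```python
import html
import xml.etree.ElementTree as ET

# Hypothetical sample: escaped HTML wrapped in a CDATA section.
SAMPLE = """<country>
  <name>Ireland</name>
  <detail><![CDATA[&lt;p&gt;Capital: &lt;b&gt;Dublin&lt;/b&gt;&lt;/p&gt;]]></detail>
</country>"""


def extract_detail(xml_text: str) -> str:
    """Return the text of <detail> with escaped HTML turned back into tags.

    The element name "detail" is an assumption made for this sketch.
    """
    root = ET.fromstring(xml_text)             # raises ParseError if not well-formed
    raw = root.findtext("detail", default="")  # CDATA markup is already gone here
    return html.unescape(raw)                  # &lt;p&gt; becomes <p>


# CDATA mixed with ordinary text arrives as one plain string.
mixed = ET.fromstring("<d>Intro <![CDATA[<b>bold</b>]]> tail</d>").text

print(extract_detail(SAMPLE))
print(mixed)
```

The result is only a string: the restored tags are not elements of the XML document, which is the data-model risk described in (b). Applying html.unescape to text that was never escaped can also alter literal ampersand sequences, so it is safest to run it only on fields known to hold escaped HTML.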

Have a look at http://xml.silmaril.ie/authors/cdata/ and
http://xml.silmaril.ie/authors/html/

And do try (a) if at all possible: it will make your life, your
supplier's life, and the life of the information very much easier.
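
If installing rxp is not convenient, a well-formedness check along the lines of (a) can be sketched in Python with the standard library parser. This is a stand-in for the standalone parser recommended above, not a replacement for it: the function name is_well_formed is made up for the example, and the check covers well-formedness only, not validity against a DTD or Schema.

```python
import xml.etree.ElementTree as ET
from typing import Tuple


def is_well_formed(xml_text: str) -> Tuple[bool, str]:
    """Return (True, "") if the text parses, else (False, parser message).

    Hypothetical helper for illustration; it does not validate against
    a DTD or Schema.
    """
    try:
        ET.fromstring(xml_text)
    except ET.ParseError as err:
        return False, str(err)
    return True, ""


print(is_well_formed("<a><b/></a>"))
print(is_well_formed("<a><b></a>"))
```

The second call fails because the b element is never closed; the message returned by the parser includes the line and column where it gave up, which is usually enough to show a supplier what needs fixing.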

///Peter
--
XML FAQ: http://xml.silmaril.ie/

Jan 6 '07 #2

JohnAD

Thanks, Peter, that is a really good reply you gave me. That helps in many ways.
Thanks again.

Jan 7 '07 #3
