Navigate and/or update an existing XML file

yinjennytam

Hi all,

I'm new to .NET and XML and I have a question. Given an XML file, I want to
navigate its content and look for one or two particular elements to get their
values. At this point, it suffices to open the XML file for read-only access.

Once I have processed these values, I might need to update a bunch of
subelements of a certain element. For example, I may need to update the
Field Name attribute plus the DataField element value in Fields.

<Fields>
  <Field Name="EMPLOYEE_ID">
    <DataField>EMPLOYEE_ID</DataField>
    <rd:TypeName>System.Int64</rd:TypeName>
  </Field>
  <Field Name="LAST_NAME">
    <DataField>LAST_NAME</DataField>
    <rd:TypeName>System.String</rd:TypeName>
  </Field>
  <Field Name="FIRST_NAME">
    <DataField>FIRST_NAME</DataField>
    <rd:TypeName>System.String</rd:TypeName>
  </Field>
</Fields>

The thing is that I may not need to update this XML file. However, when I
need to, this XML file may be big and my question is whether I should use
XmlDocument for read and write purpose together with XPath expressions? Or
should I use XPathDocument for better performance?

Any other suggestions or ideas?

Thank you very much!

Nov 12 '05 #1

Pascal Schmitt

Hello!

> The thing is that I may not need to update this XML file. However, when I
> need to, this XML file may be big and my question is whether I should use
> XmlDocument for read and write purpose together with XPath expressions? Or
> should I use XPathDocument for better performance?

That depends. XmlDocument is the slowest way to do it. If you can do it
with XPath and read-only access, use XPathDocument. But if the file is
/really/ big (several megabytes), it's best to use XmlTextReader: it is not
as flexible as evaluating a simple XPath expression, but it is the fastest
way (and your document does not look that complex, so it should be easy).

<http://msdn2.microsoft.com/library/System.Xml.XmlTextReader>
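
For the read-only XPath case, a minimal sketch could look like the following. It is only an illustration: the class and method names are made up, the XML is loaded from a string so the sample runs on its own, and "urn:example:designer" is a placeholder for whatever namespace URI your file really binds to the rd prefix.

```csharp
using System;
using System.IO;
using System.Xml;
using System.Xml.XPath;

static class ReadOnlyLookup
{
    // Hypothetical sample data; the rd namespace URI is a placeholder.
    public const string Sample =
        "<Fields xmlns:rd='urn:example:designer'>" +
        "<Field Name='EMPLOYEE_ID'><DataField>EMPLOYEE_ID</DataField>" +
        "<rd:TypeName>System.Int64</rd:TypeName></Field>" +
        "<Field Name='LAST_NAME'><DataField>LAST_NAME</DataField>" +
        "<rd:TypeName>System.String</rd:TypeName></Field>" +
        "</Fields>";

    // Returns the rd:TypeName text of the named field, or null if it is missing.
    public static string FindTypeName(TextReader xml, string fieldName)
    {
        // XPathDocument is a read-only store optimized for XPath queries.
        XPathDocument doc = new XPathDocument(xml);
        XPathNavigator nav = doc.CreateNavigator();

        // Prefixes used in the expression must be registered explicitly.
        XmlNamespaceManager ns = new XmlNamespaceManager(nav.NameTable);
        ns.AddNamespace("rd", "urn:example:designer");

        // fieldName is spliced into the expression, so it must not contain
        // a single quote.
        XPathNavigator hit = nav.SelectSingleNode(
            "/Fields/Field[@Name='" + fieldName + "']/rd:TypeName", ns);
        return hit == null ? null : hit.Value;
    }

    static void Main()
    {
        Console.WriteLine(FindTypeName(new StringReader(Sample), "LAST_NAME"));
    }
}
```

With a real file you would pass the file name to the XPathDocument constructor instead of a StringReader.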

If you need fast reading & writing and the data you edit does not depend
on anything else in the document (like "change last name of employee
42") you can use XmlTextReader and XmlTextWriter to "stream process" the
document:

<http://msdn2.microsoft.com/library/System.Xml.XmlTextWriter>
This could look like this (untested):

\\\

XmlTextReader r = new XmlTextReader("file");
// XmlTextWriter has no file-name-only constructor; an encoding is required
XmlTextWriter w = new XmlTextWriter("tempfile", System.Text.Encoding.UTF8);

string id = "42";
bool isField = false, isEmployeeField = false,
     isTargetDataField = false, isTargetEmployee = false;

while( r.Read() ){
    switch( r.NodeType ){
        ...
        case XmlNodeType.Element:
            isField = r.LocalName.Equals( "Field" );
            // Read() never stops on attribute nodes, so check the
            // Name attribute while positioned on the element
            if( isField )
                isEmployeeField = "EMPLOYEE_ID".Equals( r.GetAttribute("Name") );
            isTargetDataField = isEmployeeField &&
                r.LocalName.Equals("DataField");
            w.WriteStartElement( r.Prefix, r.LocalName, r.NamespaceURI );
            w.WriteAttributes( r, false ); // copy the element's attributes
            if( r.IsEmptyElement ) w.WriteEndElement();
            break;
        ...
        case XmlNodeType.Text:
            isTargetEmployee = isTargetDataField && r.Value.Equals( id );
            ....

    }
}

w.Close();
r.Close();

///

--
Pascal Schmitt

Nov 12 '05 #2

yinjennytam

Thank you for your help! Really appreciated!

The thing is that it is not always read-only access and the file may be a
few hundred kilobytes. I'm not sure how slow XmlDocument can be.

Would you recommend that I simply use XmlTextReader to search for some
elements while reading in the file instead of using XPath and/or
XPathDocument? Then when I realize that I need to update the Xml file,
simply create a new Xml file using XmlTextWriter?

What I'm worried about is that what I need to read (parse) first is near the
end of the Xml file and what I need to update is actually near the top of the
Xml file ...

Any suggestions are welcome. Thank you!

Nov 12 '05 #3

Pascal Schmitt

Hello!

> Would you recommend that I simply use XmlTextReader to search for some
> elements while reading in the file instead of using XPath and/or
> XPathDocument? Then when I realize that I need to update the Xml file,
> simply create a new Xml file using XmlTextWriter?

No. The XmlReaders do not cache anything: you have to use the information
as you read it, or save it yourself. If you want to "skip" back to the
beginning of a file, you have to read it again from the start with a new
XmlReader.
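
To illustrate what "save it on your own" means, here is a rough sketch of a single forward pass that remembers the DataField text of every Field in a dictionary. The class name, method name and sample data are invented for the example, and "urn:example:designer" is only a stand-in for the real rd namespace URI.

```csharp
using System;
using System.Collections.Generic;
using System.IO;
using System.Xml;

static class SinglePassCollector
{
    // Hypothetical sample data; the rd namespace URI is a placeholder.
    public const string Sample =
        "<Fields xmlns:rd='urn:example:designer'>" +
        "<Field Name='EMPLOYEE_ID'><DataField>EMP_ID</DataField>" +
        "<rd:TypeName>System.Int64</rd:TypeName></Field>" +
        "<Field Name='LAST_NAME'><DataField>SURNAME</DataField></Field>" +
        "</Fields>";

    // Reads the document once and returns Field Name -> DataField text.
    public static Dictionary<string, string> CollectDataFields(TextReader xml)
    {
        Dictionary<string, string> result = new Dictionary<string, string>();
        string currentField = null;  // Name of the Field we are inside, if any
        bool inDataField = false;    // true between <DataField> and </DataField>

        using (XmlTextReader r = new XmlTextReader(xml))
        {
            while (r.Read())
            {
                switch (r.NodeType)
                {
                    case XmlNodeType.Element:
                        if (r.LocalName == "Field")
                            currentField = r.GetAttribute("Name");
                        inDataField = r.LocalName == "DataField" && !r.IsEmptyElement;
                        break;
                    case XmlNodeType.Text:
                        // The reader will not come back here, so store the value now.
                        if (inDataField && currentField != null)
                            result[currentField] = r.Value;
                        break;
                    case XmlNodeType.EndElement:
                        inDataField = false;
                        if (r.LocalName == "Field")
                            currentField = null;
                        break;
                }
            }
        }
        return result;
    }

    static void Main()
    {
        foreach (KeyValuePair<string, string> kv in CollectDataFields(new StringReader(Sample)))
            Console.WriteLine(kv.Key + " -> " + kv.Value);
    }
}
```

Anything not stored during the pass is gone once the reader has moved on, which is why a second reader is needed to look at earlier parts of the file again.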

> The thing is that it is not always read-only access and the file may be a
> few hundred kilo bytes. I'm not sure how slow XmlDocument can be.
>
> What I'm worried is that what I need to read (parse) first is near the end
> of the Xml file and what I need to update is actually near the top of the
> Xml file ...

The "slow" bit of XmlDocument is that it parses the XML completely and
creates a full DOM tree in memory. This is no problem if your file is
only in the range of kilobytes and you need random access.
So for your case I would recommend using XmlDocument with its XPath
support, for more flexibility and shorter code (untested):

XmlDocument doc = new XmlDocument();
doc.Load("file");

XmlElement e = doc.SelectSingleNode(
    "/Fields/Field[@Name='EMPLOYEE_ID']/DataField[.='42']") as XmlElement;
if( e == null ) throw new Exception();

XmlElement n = doc.SelectSingleNode(
    "/Fields/Field[@Name='FIRST_NAME']/DataField") as XmlElement;
if( n == null ) throw new Exception();

// do something with the nodes

doc.Save("file");
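
As an example of "do something with the nodes", the update from the original question (changing the Name attribute and the DataField value of one Field) might be sketched like this. RenameField and the sample XML are made up for the illustration, and the sketch assumes the Fields element is the root and has no default namespace; if your file declares one, the XPath needs a prefix registered through an XmlNamespaceManager.

```csharp
using System;
using System.Xml;

static class FieldRenamer
{
    // Hypothetical sample data with no namespaces at all.
    public const string Sample =
        "<Fields>" +
        "<Field Name='EMPLOYEE_ID'><DataField>EMPLOYEE_ID</DataField></Field>" +
        "<Field Name='LAST_NAME'><DataField>LAST_NAME</DataField></Field>" +
        "</Fields>";

    // Renames one field in place; returns false if no such field exists.
    public static bool RenameField(XmlDocument doc, string oldName, string newName)
    {
        // oldName is spliced into the XPath, so it must not contain a single quote.
        XmlElement field = doc.SelectSingleNode(
            "/Fields/Field[@Name='" + oldName + "']") as XmlElement;
        if (field == null) return false;

        // Update the attribute on the existing element.
        field.SetAttribute("Name", newName);

        // Update the text of the DataField child, if there is one.
        XmlElement dataField = field.SelectSingleNode("DataField") as XmlElement;
        if (dataField != null) dataField.InnerText = newName;
        return true;
    }

    static void Main()
    {
        XmlDocument doc = new XmlDocument();
        doc.LoadXml(Sample);
        RenameField(doc, "LAST_NAME", "SURNAME");
        Console.WriteLine(doc.OuterXml);
    }
}
```

Because the whole tree is in memory, it does not matter whether the node to change comes before or after the values you read first.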

--
Pascal Schmitt

Nov 12 '05 #4

yinjennytam

Your help and sample code are greatly appreciated and I will go ahead
experimenting with XmlDocument and XPath as suggested.

Thank you!
Jenny

Nov 12 '05 #5

yinjennytam

Hi Pascal

I've played with XmlDocument and XPath expressions and I seem to get what I
wanted so far. Thanks for your help again.

However, when I reach an Xml node where I'd like to create a new Xml element
with an attribute and a child element, I do the following:

XmlElement fieldElem = myXmlDocument.CreateElement("Field");
XmlElement dataField = myXmlDocument.CreateElement("DataField");
XmlText text = myXmlDocument.CreateTextNode("Field 1");
fieldElem.AppendChild(dataField);
dataField.AppendChild(text);
fields.AppendChild(fieldElem);

fieldElem.SetAttribute("Name", "Field 1");

------------

The XML file shows

<Field Name="Field 1" xmlns="">
  <DataField>Field 1</DataField>
</Field>

Is there any way that I could get rid of the xmlns part? In other words, I
want it to be just <Field Name="Field 1"> ... </Field>

I found that if I call SetAttribute() on an existing Xml element, it does
not add this annoying xmlns part. For some reason, in my case, I can't seem
to get rid of it.

Please help! Thanks again!
Jenny

Nov 12 '05 #6

Kevin Yu [MSFT]

Hi Jenny,

The xmlns attribute declares the default namespace for this element. Since
you created the element without a namespace while the parent node has one,
xmlns="" is generated automatically to indicate that the default namespace
for this element is empty.

You can specify the qualified name and namespace URI for the element when
creating it, as follows.

XmlElement fieldElem = myXmlDocument.CreateElement("Field",
"www.parentnodens.com");

If the namespace "www.parentnodens.com" is the same as the parent node's
namespace, the xmlns attribute is omitted automatically.
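
If you would rather not hard-code the URI, one option is to take it from the parent element itself. The following is only a sketch: AppendField is a made-up helper, and "urn:example:report" stands in for whatever default namespace your document really declares.

```csharp
using System;
using System.Xml;

static class NamespaceDemo
{
    // Appends <Field Name="..."><DataField>...</DataField></Field> to parent,
    // creating both new elements in the parent's namespace.
    public static XmlElement AppendField(XmlElement parent, string name)
    {
        XmlDocument doc = parent.OwnerDocument;
        string ns = parent.NamespaceURI;  // reuse the parent's namespace

        XmlElement field = doc.CreateElement("Field", ns);
        field.SetAttribute("Name", name);

        // The child needs the namespace too, or it gets xmlns="" itself.
        XmlElement dataField = doc.CreateElement("DataField", ns);
        dataField.InnerText = name;

        field.AppendChild(dataField);
        parent.AppendChild(field);
        return field;
    }

    static void Main()
    {
        XmlDocument doc = new XmlDocument();
        // Placeholder default namespace, assumed for the example.
        doc.LoadXml("<Fields xmlns='urn:example:report'/>");
        AppendField(doc.DocumentElement, "Field 1");
        Console.WriteLine(doc.OuterXml);
    }
}
```

Note that every element you create below the parent has to be given the namespace, not only the outermost one.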

Kevin Yu
=======
"This posting is provided "AS IS" with no warranties, and confers no
rights."

Nov 12 '05 #7

yinjennytam

Thanks, Kevin. I actually figured it out myself right after I posted my
problem. Thanks anyway though because this confirms what I think. :-)

Jenny

Nov 12 '05 #8

Kevin Yu [MSFT]

You're welcome, Jenny.

Kevin Yu
=======
"This posting is provided "AS IS" with no warranties, and confers no
rights."

Nov 12 '05 #9
