473,395 Members | 2,713 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,395 software developers and data experts.

High Performance Xml parser

Hi,
I am looking for component which allows me to parse my xml file.
the reason i am asking this, is because my xml files are huge it can
reach as far as 1GB more or less.
the time to parse such a file is something like 5 Hours.
Now i am using the XmlRead, XmlNode ... (I do not load the file to the
memory).
Can you suggest better components to use?

** I tried SAX but i couldn't understand how it works, because there is
no examples for .net , and very bad documentation.
p.s : I am writing in C#.

Regards, Rony

Nov 27 '06 #1
3 4539
Hello rony_16,

rI am looking for component which allows me to parse my xml file.
rthe reason i am asking this, is because my xml files are huge it can
rreach as far as 1GB more or less.
rthe time to parse such a file is something like 5 Hours.
rNow i am using the XmlRead, XmlNode ... (I do not load the file to
rthe
rmemory).
rCan you suggest better components to use?

For .NET Xml text reader is the fastest .NET way, especially in .NET 2.0
The statistic of the unmanaged parsers are there http://piccolo.sourceforge.net/bench.html

r** I tried SAX but i couldn't understand how it works, because there
ris
rno examples for .net , and very bad documentation.
rp.s : I am writing in C#.

..NET 2.0 has no SAX realization. XmlTextReader is the pull based model, not
the push (SAX).
Btw, see there http://search.live.com/results.aspx?q=SAX+parser+.NET some
of SAX realizations for .NET

---
WBR,
Michael Nemtsev [C# MVP] :: blog: http://spaces.live.com/laflour

"At times one remains faithful to a cause only because its opponents do not
cease to be insipid." (c) Friedrich Nietzsche

Nov 27 '06 #2
On Mon, 27 Nov 2006 18:46:12 +0000 (UTC), Michael Nemtsev
<ne*****@msn.comwrote:
>
For .NET Xml text reader is the fastest .NET way, especially in .NET 2.0
The statistic of the unmanaged parsers are there http://piccolo.sourceforge.net/bench.html
Is there a benchmark comparing these parsers to .NET's XML parser?

Thanks,

Sam
------------------------------------------------------------
We're hiring! B-Line Medical is seeking Mid/Sr. .NET
Developers for exciting positions in medical product
development in MD/DC. Work with a variety of technologies
in a relaxed team environment. See ads on Dice.com.

Nov 27 '06 #3
Hello Samuel,

Have never met this, but eager to see results.

If nobody gives u the answer, I think that the best idea is to ask http://www.stylusstudio.com/xmldev/
guys directly or via their mailgroup
I've been working with their XML/XSLT tools which allow to select the any
kind of parser (.NET/C++/Java) for you work
I think that they performed such test and can give u some info about it

SNOn Mon, 27 Nov 2006 18:46:12 +0000 (UTC), Michael Nemtsev
SN<ne*****@msn.comwrote:
SN>
>For .NET Xml text reader is the fastest .NET way, especially in .NET
2.0 The statistic of the unmanaged parsers are there
http://piccolo.sourceforge.net/bench.html
SNIs there a benchmark comparing these parsers to .NET's XML parser?
SN>
SNThanks,
SN>
SNSam
SN>
SN------------------------------------------------------------ We're
SNhiring! B-Line Medical is seeking Mid/Sr. .NET Developers for
SNexciting positions in medical product development in MD/DC. Work
SNwith a variety of technologies in a relaxed team environment. See
SNads on Dice.com.
SN>
---
WBR,
Michael Nemtsev [C# MVP] :: blog: http://spaces.live.com/laflour

"At times one remains faithful to a cause only because its opponents do not
cease to be insipid." (c) Friedrich Nietzsche
Nov 27 '06 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

6
by: Andreas Lauffer | last post by:
I changed from Access97 to AccessXP and I have immense performance problems. Details: - Access XP MDB with Jet 4.0 ( no ADP-Project ) - Linked Tables to SQL-Server 2000 over ODBC I used...
2
by: sree | last post by:
hello, I am working on a project that requires improving the performance of xml to reduce the access time . I use xml to take the values from a database located in US and store the values and...
2
by: Tom Kerigan | last post by:
I know that longer element names increase the size of an XML document, ultimately resulting in a larger amount of data at parse-time. Is there anything else, specifically related to an element name...
3
by: rony | last post by:
Hi, I am looking for component which allows me to parse my xml file. the reason i am asking this, is because my xml files are huge it can reach as far as 1GB more or less. the time to parse such...
9
by: starlight | last post by:
Hallo, there were some posts about this, but nothing I could find useful. I have a large XML file (80MB) and need certain information out of it. I though I could use XSLT with an fairy simple...
1
by: Robert Strickland | last post by:
I have a .Net web service (written for 1.1 Framework using C#) running on Windows 2003 with all latest patches. To help monitor the service, the code creates several performance counters. One...
2
by: Jay Loden | last post by:
All, In studying Python, I have predictably run across quite a bit of talk about the GIL and threading in Python. As my day job, I work with a (mostly Java) application that is heavily threaded....
0
by: dotnetrocks | last post by:
Hi, I'm writing a high performance tcp/ip server using IOCP. Recently I found XF.Server component at http://www.kodart.com They claim that it is the fastest server implementation. Is it possible?...
2
by: Paul McGuire | last post by:
I just ran my pyparsing unit tests with the latest Python 2.6b1 (labeled internally as Python 2.6a3 - ???), and the current 1.5.0 version of pyparsing runs with no warnings or regressions. I was...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.