473,395 Members | 1,680 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,395 software developers and data experts.

"well-former" tool (Linux)

Hi!

I need to parse large XML files which don't terminate tags without body
(i.e. <TAG> when it had to be <TAG />). I'm also not sure about whether
the tags are correctly case-sensitive. I do have an XSD - is there a
tool that can "repair" those XML files?

Timo
Jul 20 '05 #1
6 1628
In article <2i************@uni-berlin.de>,
Timo Nentwig <tc*@spamgourmet.org> writes:
I need to parse large XML files which don't terminate tags without body
(i.e. <TAG> when it had to be <TAG />). I'm also not sure about whether
the tags are correctly case-sensitive. I do have an XSD - is there a
tool that can "repair" those XML files?


These are SGML, not XML files :-) Search about SGML.

Well, SGML processors do not support XML Schema...
Search for the DTD or convert XML Schema -> DTD (converters exist).
Then use an SGML normalizer.
Jul 20 '05 #2
Victor Porton wrote:
These are SGML, not XML files :-) Search about SGML.


Well, a file that uses
http://www.bibletechnologies.net/osisCore.1.1.1.xsd is an XML document,
isn't it?
Jul 20 '05 #3
Timo Nentwig wrote:
Victor Porton wrote:
These are SGML, not XML files :-) Search about SGML.

Well, a file that uses
http://www.bibletechnologies.net/osisCore.1.1.1.xsd is an XML document,
isn't it?


What do you mean by "uses"? If it does not conform to the XML production
rules, it isn't. Your description of the file sounds like SGML, not XML.
--
Johannes Koch
In te domine speravi; non confundar in aeternum.
(Te Deum, 4th cent.)
Jul 20 '05 #4
Johannes Koch wrote:
What do you mean by "uses"? If it does not conform to the XML production
rules, it isn't. Your description of the file sounds like SGML, not XML.


sgml2xml can't correct it anyway.
Jul 20 '05 #5
Timo Nentwig wrote:
Johannes Koch wrote:
What do you mean by "uses"? If it does not conform to the XML
production rules, it isn't. Your description of the file sounds like
SGML, not XML.


sgml2xml can't correct it anyway.


Maybe "xmllint --recover" helps.

Jul 20 '05 #6
Rolf Magnus wrote:
Maybe "xmllint --recover" helps.


Unfortunately not :-(

Jul 20 '05 #7

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

99
by: Jim Hubbard | last post by:
It seems that Microsoft not only does not need the classic Visual Basic developer army (the largest army of developers the world has ever seen), but now they don't need ANY Windows developer at a...
6
by: M Welinder | last post by:
The title more or less says it all: in C99, is the value of INT_MIN % -1 well defined (when performed as signed integers) under the assumption of two-complement representation. Note, that...
2
by: Alfonso Morra | last post by:
Hi, I am writing a timer class that I want to be able to get to notify me (via a callback func), when a specified interval has elapsed. I have most of the timer functionality figured - however,...
6
by: Bryan Dickerson | last post by:
Still considering myself a newbie to the world of XML, etc., 'cause my program is still not working (not that I'll be an expert when it does, but...). When I try to apply XPath statements to the...
169
by: JohnQ | last post by:
(The "C++ Grammer" thread in comp.lang.c++.moderated prompted this post). It would be more than a little bit nice if C++ was much "cleaner" (less complex) so that it wasn't a major world wide...
7
by: pike | last post by:
db2 8.1 FP11 on AIX 5.3.0.0 . The db2diag.log is intermittently reporting EMFILE (24) "Too many open files" errors. The culprit is always db2hmon. Sample db2diag.log output follows: ...
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.