473,320 Members | 1,691 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,320 software developers and data experts.

XML and its Uses

How do other things layer on to xml?

I mean like dtd, wsdl, soap, etc.?

Thanks.

-g
Sep 23 '06 #1
17 1536
Geoff wrote:
How do other things layer on to xml?
I mean like dtd, wsdl, soap, etc.?
Could you clarify what question you're asking?

DTDs are part of the XML spec. They're one way of defining a document
type. Schemas are another, more recent and more powerful, solution.

WSDL and SOAP are applications of XML -- document types, or languages if
you prefer, implemented in XML.

--
() ASCII Ribbon Campaign | Joe Kesselman
/\ Stamp out HTML e-mail! | System architexture and kinetic poetry
Sep 23 '06 #2
Could you clarify what question you're asking?

More specifically, for example, if I have an xml spec that says:

</lastname>smith</lastname>

.. . . and continue on expressing everything about a person, first name, age,
height, address, etc. What does a dtd do or what additional info does it
add to that? Same question for schemas.

Wsdl has to do with web services but besides an xml file, etc. one might see
a wsdl file as well, what info does a wsdl file add to the above?

Thanks.

-g
Sep 23 '06 #3
Geoff wrote:
. . . and continue on expressing everything about a person, first name, age,
height, address, etc. What does a dtd do or what additional info does it
add to that? Same question for schemas.
The DTD, or schema, provides a formal description of what the proper
structure is for that file, what values are acceptable, what default
values are.... Part documentation, part error-checking (since validation
will check to make sure the document matches this description). Schema
can also specify data types and value ranges, to further constrain this.

(DTDs are arguably obsolete since -- unlike schemas -- they don't
understand how to work with XML Namespaces. Some folks will challange
that assertion.)
Wsdl has to do with web services but besides an xml file, etc. one might see
a wsdl file as well, what info does a wsdl file add to the above?
Additional document structure to describe what service you're talking to
and what you want it to do for you.

(BTW, you should try to think in terms of documents, not files. A lot of
XML never actually gets stored in a file, existing only on the network
and in memory buffers.)
--
() ASCII Ribbon Campaign | Joe Kesselman
/\ Stamp out HTML e-mail! | System architexture and kinetic poetry
Sep 23 '06 #4
Geoff wrote:
>Could you clarify what question you're asking?

More specifically, for example, if I have an xml spec that says:

</lastname>smith</lastname>

...and continue on expressing everything about a person, first name, age,
height, address, etc. What does a dtd do or what additional info does it
add to that?
Whatever you specify. A DTD is a description of the names for your
element types, and how they fit together. You can write it, so you
get to decide what information you want to describe, and how.

Alternatively, you use one someone else has written.
Same question for schemas.
Ditto, except that schemas allow much tighter specification of the
types of data you want to allow (eg surname must be alphabetic; or
date must be numeric; or whatever), which is useful for data-based
applications, not so useful for normal text documents.

///Peter
--
XML FAQ: http://xml.silmaril.ie/
Sep 24 '06 #5
Joe Kesselman wrote:
(DTDs are arguably obsolete since -- unlike schemas -- they don't
understand how to work with XML Namespaces. Some folks will challange
that assertion.)
It was very clear from this summer's Extreme Markup Conference that
DTDs are anything but obsolete. Namespaces are irrelevant for large
classes of documents, especially in the publishing field, where
schemas have little if anything to offer apart from enforcing the
format of dates. Many publishing applications also require the
facilities offered by declared entities, which are not available in
W3C Schemas.

///Peter
--
XML FAQ: http://xml.silmaril.ie/
Sep 24 '06 #6
Peter Flynn wrote:
It was very clear from this summer's Extreme Markup Conference that
DTDs are anything but obsolete.
I did say "arguably". I agree they're still being used; there's less
consensus on whether that's a good thing.
Namespaces are irrelevant for large
classes of documents, especially in the publishing field
Definitely disagree. That's true initially when you aren't exchanging
data with other applications or working with applications smart enough
to allow plug-in extensions. As soon as those assertions start breaking
down, namespaces become valuable and DTDs start falling apart. (I've
been involved in this since before namespaces existed, so I've had the
experience of trying to design namespaces into a DTD. Even if you kluge
like mad, it's still putting lipstick on a pig -- the results are not
pretty and the pig doesn't enjoy it at all.)
Many publishing applications also require the
facilities offered by declared entities, which are not available in
W3C Schemas.
Correction: They require macro/import capabilities. Entities are
certainly the traditional way of doing that, inherited from SGML. and I
agree that schema doesn't support either. XML uses other tools --
XInclude, simple XSLT transformations, XPointer and so on -- to provide
similar capabilities.

The problem here is that the document world, understandably given their
SGML heritage. wants to treat XML as much like SGML as possible. They're
among the few who really like the DTD syntax, having "grown up with
it". They like entities because they're SGMLish. They dislike namespaces
because SGML didn't have them. I understand the resistance to change,
but I disagree that this resistance is well founded, and I feel that we
should be gently encouraging them to accept that XML is *not* SGML and
has its own (equivalent) solutions... for exactly the same reasons we
made the effort to teach C++ coders that just because they can write "as
if it was C" doesn't mean those are still the best answers in the new
environment.

I'm not dogmatic about this. People should do what's needed to get the
job done. But they should also design with an eye toward the future, and
I really don't think DTDs are it.

Your milage may vary.

--
() ASCII Ribbon Campaign | Joe Kesselman
/\ Stamp out HTML e-mail! | System architexture and kinetic poetry
Sep 24 '06 #7
Peter Flynn wrote:
Namespaces are irrelevant for large
classes of documents, especially in the publishing field,
Assume some witty rebutal of that on the general theme of "bollocks"

Sep 24 '06 #8

Peter Flynn wrote:
Namespaces are irrelevant for large
classes of documents, especially in the publishing field,
OK, I've got back to a keyboad where all the keys work (I was on an
evil Sony Vaio wireless thing!)

Namespaces are useful. Namespaces are _especially_ useful in a metadata
processing context, and that is particularly relevant to document
publishing.

Taking the "sliding windows" view of data and metadata, we have a
layered view of our data in terms of abstraction. The stuff "in the
window" is treated as relevant data that's needed by the operation
currently being performed. Anything "below" this abstraction is opaque
data that's merely transported unchanged, anything "above" it is
metadata that's not immediately of relevance.

The power of this model is that it allows us to build
application-independent tools. A tool that understands email (or
internal document routing) can route documents around as unopened black
boxes, just using the RFC822 headers. An indexing tool can use Dublin
Core properties, without needing to know how to edit it. The editor can
display the Dublin Core metadata, and even permit some generalised
processing of it, because it's visible, it knows crude data typing of
it (just as strings or URLs) and it leaves the rest (and the meaning)
to the human user.

Gaining this sort of layered metadata handling is very easily done, if
you reference standard namespaces, such as Dublin Core or some shared
distribution vocabulary. Namespaces are ideal for providing that as a
simple implementation that allows complex documents to be aggregated
from simple known prototcols.

Sep 25 '06 #9
Andy Dingley wrote:
Peter Flynn wrote:
>Namespaces are irrelevant for large
classes of documents, especially in the publishing field,
Peter is talking about the observed situation, de-facto.
Namespaces are useful. Namespaces are _especially_ useful in a metadata
processing context, and that is particularly relevant to document
publishing.
Peter is not expressing doubts about conceptual usefulness of namespaces.
Sep 25 '06 #10
Juergen Kahrs wrote:
Peter is not expressing doubts about conceptual usefulness of namespaces.
"Namespaces are irrelevant for large classes of documents, especially
in the publishing field, where schemas have little if anything to offer
apart from enforcing the
format of dates. "

"Irrelevant for" and "little [...] to offer" (future tense) seem, IMHO,
to be expressing strong doubts about the conceptual usefulness, and the
future potential of, namespaces. I don't read these statements as
merely applying to a current situation.

Sep 25 '06 #11
Andy Dingley wrote:
Juergen Kahrs wrote:
>Peter is not expressing doubts about conceptual usefulness of namespaces.

"Namespaces are irrelevant for large classes of documents, especially
in the publishing field, where schemas have little if anything to offer
apart from enforcing the
format of dates. "

"Irrelevant for" and "little [...] to offer" (future tense) seem, IMHO,
to be expressing strong doubts about the conceptual usefulness, and the
future potential of, namespaces. I don't read these statements as
merely applying to a current situation.
Peter talked about the relevance of namespaces in a special context.
In this special context (in the publishing field), did he mention
his doubts. But he did not talk about namespaces in general or in
all fields.

To summarize: I see no contradiction between his statement and yours.
Sep 25 '06 #12
Juergen Kahrs wrote:
Peter talked about the relevance of namespaces in a special context.
In this special context (in the publishing field), did he mention
his doubts.
Agreed. I think he's accurately reporting the current state of practice
(for the reasons I discussed, largely that publishings adoption of XML
has largely been by minimal-effort migration from SGML), but I still
believe that his implication that this is a *desirable* state of affairs
is unjustified and misguided.

Namespaces do have value in that space, whether they're being applied or
not. Schemas could also have value in those places where you want to
constrain a value (eg, detect cases where a page-layout attribute is
outside the meaningful range) or where you're working with a document
(eg a textbook?) with structured formalisms that are outside DTD's
checking ability but within that of schema.

I agree that people have been getting by without schemas, and many are
getting by without validation at all. Heck, most HTML users have been
getting by with flagrant violation of HTML's (SGML-based) DTD, mostly
because browsers have been coded to tolerate broken documents -- but
those of us who have to write that guess-the-user's-intent code know how
much it has cost them in lost interoperability, code bloat, and so on.
--
() ASCII Ribbon Campaign | Joe Kesselman
/\ Stamp out HTML e-mail! | System architexture and kinetic poetry
Sep 25 '06 #13

Juergen Kahrs wrote:
Peter talked about the relevance of namespaces in a special context.
"It was very clear from this summer's Extreme Markup Conference that
DTDs are anything but obsolete. Namespaces are irrelevant for large
classes of documents, especially in the publishing field, where
schemas have little if anything to offer apart from enforcing the
format of dates. "

This makes a number of statements:

1. DTD are not obsolete
2. Data typing by Schema is not significantly useful
3. Schema is not useful (in this area)
4. Namespacing is not useful.

5. These current observations are unlikely to change in the
forseeable future.
6. All of these observations are noted for, and apply to the
publishing field.

Obviously 1 is true generally if it is true for any special case, so {
1 } is just as valid as { 1 if and only if 6 }. Also if any of 1..4 are
already untrue by now, then 5 would be irrelevant

I don't happen to agree with any of these 1..4. I see 1 and 3 as untrue
simply because Schema supersedes DTD for simple usability reasons.
However I don't feel particularly strongly over it.

My specific point is that 4 is emphatically incorrect, even in the case
of 6. My own background is in publishing, in particular the use of
complicated and powerful content assembly engines that can work
independently of an application-specific content vocabulary [Dingley &
Shabajee, 2003]. These work almost entirely by taking three common and
"well known" vocabularies (RDF for a structural model, HTML / DocBook
for text structure and Dublin Core / MPEG-7 for metadata
representation) and combining them with custom application-specific
vocabularies to instantiate a powerful content-domain-aware publishing
engine "from nowhere". Namespaces are very powerful and publishing is a
central target field that benefits from them already.

Sep 25 '06 #14
Andy Dingley wrote:
Juergen Kahrs wrote:
>Peter is not expressing doubts about conceptual usefulness of namespaces.

"Namespaces are irrelevant for large classes of documents, especially
in the publishing field, where schemas have little if anything to offer
apart from enforcing the format of dates. "

"Irrelevant for" and "little [...] to offer" (future tense) seem, IMHO,
to be expressing strong doubts about the conceptual usefulness, and the
future potential of, namespaces. I don't read these statements as
merely applying to a current situation.
I should perhaps clarify this (thanks to both of you for picking up on
what I wrote). I was looking at it from the position of editing a text
document for publication (either authoring or editing). During this
process, namespaces are not particularly relevant, and while a schema
offers some more validation, there is often little to validate except
ID/IDREF links, and perhaps a few data formats (dates are a known
problem). In deeply technical documents (math is one obvious example),
schema-level validation may offer more. In preparing a novel for press,
less.

After publication, where you are preparing other derivative material,
I don't dispute that namespaces offer more, and metadata is a good
example. But at this stage the text is fixed (by definition, in most
cases, since the document has now been published), at least for the
time being, and if the document structure management has been done
with (for example) RNG, then a schema can replace the DTD for these
tasks -- Andy's second example is a case in point (the first can be
done just as easily with effectivities).

(And yes, it would be nice if the author had access to namespaces and
would use them to add value to the work being written, but we're not
yet even able to provide authors with a usable XML editor to write
with, so such aspirations are pie in the sky for the moment.)

In any event, in my business we have to deal with things as they are,
and hope that we can persuade the client to move gently towards things
as we might wish them to be. I have three current clients with their
text in SGML and no intention of moving it to XML yet (one day) because
(as Joe pointed out) it means moving from something they know to some
new state, and that's a business decision, with costs and benefits.
I had one client who moved prematurely from SGML to XML against the
recommendation of one senior editor (and mine) and is now paying
a stiff price, not just in cash but in human terms.

The discussion on DTDs in Montreal was interesting. The argument for
entities is unfortunately not completely resolved by the tools Joe
mentions, although they help. Images are a case in point. The problem
with XSLT 1.0 is that in the headlong rush to support data-based
applications it ignored some major features which publishers rely on
(such as the handling of white-space in Mixed Content, and the
processing of ENTITIES attribute values). Some of the other arguments
were regarded as non-prohibitative, but still constitute significant
cost-bearing blockages to using XML to its fullest for document
publishing.

Some of the blame must also rest with publishers: if they are so
anxious to have things done a certain way, they need to get together,
join the W3C, nominate people to the relevant committees, and
ultimately cut some code. Sitting there and whining about it post
hoc is unconvincing, when those who cared were trying to argue
the case unsupported and unfunded.

The dislike of namespaces is not because they weren't there in SGML
(actually, CONCUR, LINK and entities between them provide a large
amount of their functionality) but because they were implemented as
static labels, and the implementations don't provide for proper
inheritance or persistence. But we argued long over this at the time,
and the view was ultimately that the namespace URI did not need to
be dereferenced, nor that their should be anything useful at it to
resolve, which may turn out to be a mistake.

Joe's comparison between C and C++ coding style raised a smile: at
one stage mid-way through the discussions about XML Data, as it was
at the time, I saw a post from someone wishing "that XML would have
a GOTO statement" :-)

///Peter
Sep 25 '06 #15
Peter Flynn wrote:
But we argued long over this at the time,
and the view was ultimately that the namespace URI did not need to
be dereferenced, nor that their should be anything useful at it to
resolve, which may turn out to be a mistake.
Having been involved in this debate:

The conclusion was that there was no consensus on what, if anything,
should be retrieved via a dereferenced namespace URI. There were too
many different things people wanted to hang off that reference. It
quickly became evident that what was needed was some way to associate
*multiple* data items with a namespace URI, and that this fan-out was
going to have to be addressed by a separate specification. Tim B-L is
supposed to be tackling that as part of the Semantic Web effort, or at
least that was where we left it at the close of the namespace debate.

--
Joe Kesselman / Beware the fury of a patient man. -- John Dryden
Sep 25 '06 #16
Joseph Kesselman wrote:
Peter Flynn wrote:
> But we argued long over this at the time,
and the view was ultimately that the namespace URI did not need to
be dereferenced, nor that their should be anything useful at it to
resolve, which may turn out to be a mistake.

Having been involved in this debate:

The conclusion was that there was no consensus on what, if anything,
should be retrieved via a dereferenced namespace URI. There were too
many different things people wanted to hang off that reference. It
quickly became evident that what was needed was some way to associate
*multiple* data items with a namespace URI, and that this fan-out
was going to have to be addressed by a separate specification.
XML Link might have resolved this issue...
Tim B-L is supposed to be tackling that as part of the Semantic Web
effort, or at least that was where we left it at the close of the
namespace debate.
We'll get there one day :-)

///Peter

Sep 26 '06 #17
Peter Flynn wrote:
Joseph Kesselman wrote:
>Peter Flynn wrote:

The conclusion was that there was no consensus on what, if anything,
should be retrieved via a dereferenced namespace URI. There were too
many different things people wanted to hang off that reference. It
quickly became evident that what was needed was some way to associate
*multiple* data items with a namespace URI, and that this fan-out
was going to have to be addressed by a separate specification.

XML Link might have resolved this issue...
Well, RDDL (http://www.rddl.org/) is one attempt at using XLinks to
identify various resources that can be associated with a namespace. RDDL
makes it possible to, for example, automatically identify one or schemas
that can be used to validate documents (and much more).

However, I've never seen a tool or API that makes use of RDDL in this way.

// Magnus
Sep 27 '06 #18

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

3
by: Erik Lechak | last post by:
Hello all, I wrote the code below. It is simply a dictionary that uses regular expressions to match keys. A quick look at _test() will give you an example. Is there a module that already...
4
by: Richard Davies | last post by:
Hi I have a ASP and SQL Server website which uses a couple of Stored Procedures during the Shopping Cart process. 1 inserts the Customer information into a table and the next inserts the Payment...
8
by: b83503104 | last post by:
Hi, I want to accept the user's answer yes or no. If I do this: answer = input('y or n?') and type y on the keyboard, python complains Traceback (most recent call last):
5
by: | last post by:
We have this situation: We have a unmanaged c++ executable located in a certain folder (a.exe) It loads a mixed mode managed/unmanaged dll using MFC dynamically from a completely different path...
2
by: ChasW | last post by:
Greetings, I have a form that uses a query as its record source. In the form I have a text box that uses this as its control source: =DCount("", "qry_Search_by_Name") The DCount function...
4
by: Rob Dob | last post by:
Hi, I have both a Customer and a Orders Table, the key field is CustomerID. I also have a DataGrid that uses a BindingSource which uses a dataset which uses a join between the Customers table...
2
by: Serg | last post by:
Hi, all I have a trouble. I have a WebService that uses MCpp assembly that uses some MFC classes. Everything perfectly works on my machine. But when i install web site on clients machine it...
1
by: napi | last post by:
Does anyone know of any public domain C++ compiler that uses CFront as its front end? If you do would appreciate the links or other related info. p.s. Info on any commercial C++ compiler that...
1
by: AvniDani | last post by:
i am writing a web project running under IIS on VS 2008. on a standard exe if i write a line that uses one of the new language features of VB.NET 2008, the IDE uses the .net 3.5 compiler to compile...
0
MikeJaskov
by: MikeJaskov | last post by:
Hi all. Like you know I'm working on some project so I need something to know. I was serching on internet and found some uses of Kernel32,advapi32 and rundll32. So I use API Text Viwer but there are...
0
by: DolphinDB | last post by:
Tired of spending countless mintues downsampling your data? Look no further! In this article, you’ll learn how to efficiently downsample 6.48 billion high-frequency records to 61 million...
0
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
1
isladogs
by: isladogs | last post by:
The next Access Europe meeting will be on Wednesday 6 Mar 2024 starting at 18:00 UK time (6PM UTC) and finishing at about 19:15 (7.15PM). In this month's session, we are pleased to welcome back...
0
by: Vimpel783 | last post by:
Hello! Guys, I found this code on the Internet, but I need to modify it a little. It works well, the problem is this: Data is sent from only one cell, in this case B5, but it is necessary that data...
0
by: jfyes | last post by:
As a hardware engineer, after seeing that CEIWEI recently released a new tool for Modbus RTU Over TCP/UDP filtering and monitoring, I actively went to its official website to take a look. It turned...
0
by: ArrayDB | last post by:
The error message I've encountered is; ERROR:root:Error generating model response: exception: access violation writing 0x0000000000005140, which seems to be indicative of an access violation...
1
by: Defcon1945 | last post by:
I'm trying to learn Python using Pycharm but import shutil doesn't work
0
by: af34tf | last post by:
Hi Guys, I have a domain whose name is BytesLimited.com, and I want to sell it. Does anyone know about platforms that allow me to list my domain in auction for free. Thank you
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.