473,395 Members | 1,941 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,395 software developers and data experts.

Programming language + XML + XSL = compiler ?

Hi folks

New around here. This post may be a duplicate, I think I bungled the
previous one.

I've recently started out with XML and XSL and I'm very impressed with
both. My use is non-web in nature. In fact, I'm writing a compiler of
sorts, from high-level spec in XML, with eventual generation of VHDL.

I've made the following observations:

1. Given pure source code (Eg:Java,C++), one could generate an abstract
syntax tree in XML.
2. XML is also apposite for symbol tables and intermediate froms.
3. XSLT can do powerful transformations of said internediate forms.
4. XSL can generate plain text.

I'm wondering wondering what it would take to implement a compiler for
a given language in XML/XSL. More specifically, are there any XSL or
XML based systems that assist in scanning pure source into XML so that
the transformations can begin? Grammar could be provided in marked-up
EBNF. This is probably over-optimistic could the process would need
knowledge of LL/LALR etc.

The alternative approach is to use your standard scanner/parser
generator with it's own (non-interchangable) EBNF and have the action
code generate tagged tokens, giving you an AST in XML.

My questions:
1. Are there any decent XML shema's for EBNF?
2. Are there any tools / XSL extensions to do the above?
3. What are your thoughts on the whole idea?

Best regards,
Richard Brady

Jun 23 '06 #1
19 1658
rn*****@gmail.com wrote:
I'm wondering wondering what it would take to implement a compiler for
a given language in XML/XSL.
Remember, XSL is a pure functional langauge. That may make some of the
manipulations a serious compiler would want to perform difficult.

But, yes, a basic old-fashioned just-replace-with-equivalent-code
transformation ought to be possible. This is not the set of tools I
would choose to do it in -- custom data structures are likely to be much
more efficient than trying to model everything in XML -- but if you're
looking for a pedagogical exercise, it could be a fun thing to try.

Were I you, I'd start with a language that has simpler syntax and
semantics than Java or C++, and use that to establish your framework and
understand the issues (including performance and memory space). Then, if
you're still feeling ambitious, go for it.

(Personally, I've stopped writing compilers for the sake of writing
compilers. I'd rather wait until I'm reasonably sure there's something
mine will do better than any existing alternative.)

More specifically, are there any XSL or XML based systems that assist in scanning pure source into XML so that
the transformations can begin? Grammar could be provided in marked-up
EBNF. This is probably over-optimistic could the process would need
knowledge of LL/LALR etc.

The alternative approach is to use your standard scanner/parser
generator with it's own (non-interchangable) EBNF and have the action
code generate tagged tokens, giving you an AST in XML.

My questions:
1. Are there any decent XML shema's for EBNF?
2. Are there any tools / XSL extensions to do the above?
3. What are your thoughts on the whole idea?

Best regards,
Richard Brady

Jun 23 '06 #2

rn*****@gmail.com wrote:
3. XSLT can do powerful transformations of said internediate forms.


IMHE, XSLT can't do "powerful", for interesting values of powerful. It
does "Bulky but simple" very well indeed and is excellent for text
processing with blocks of text. If you try and do anything involving
semantics though, it starts to fall flat.

Jun 23 '06 #3
rn*****@gmail.com wrote:
I'm wondering wondering what it would take to implement a compiler for
a given language in XML/XSL. More specifically, are there any XSL or
This has been suggested 5 years ago in

http://www.idealliance.org/papers/xm.../03-05-04.html

and also in comp.compilers late last year.
Google for "xml as intermediate representation"
3. What are your thoughts on the whole idea?


Teaching compiler construction could be an area
where this idea might be useful. But in real life,
I would _not_ like to write my source code in XML.
Jun 23 '06 #4
Then there's always the inverse: a programming language whose syntax is
based on XML, so it can be styled by XSLT (or generated from other XML
sources by XSLT). IBM's Bean Markup Language and BML Scripting Pages,
which I was involved in some years ago, were one such example, though in
that case the language was designed for configuration and assembly of
existing Java components (and BSF scripts, also something that I helped
develop) rather than being intended as a completely stand-alone
programming environment.
--
() ASCII Ribbon Campaign | Joe Kesselman
/\ Stamp out HTML e-mail! | System architexture and kinetic poetry
Jun 24 '06 #5
On Fri, 23 Jun 2006 23:46:38 -0400, Joe Kesselman
<ke************@comcast.net> wrote:
Then there's always the inverse: a programming language whose syntax is
based on XML, so it can be styled by XSLT (or generated from other XML
sources by XSLT).


Ant

SQL DDL, as managed by the Apache Torque project

Jun 24 '06 #6
Hm. I don't usually think of Ant as a programming language, and I don't
think I've heard of anyone styling it, but that's an interesting
observation -- stylesheets to adapt Ant builds to different
environments, perhaps...
--
() ASCII Ribbon Campaign | Joe Kesselman
/\ Stamp out HTML e-mail! | System architexture and kinetic poetry
Jun 24 '06 #7
Hi folks

Thanks for the input. I'm not actually thinking of doing this as a
project, just curious about whether any investigation has been done on
the topic. I currently use this technique to transform digital signal
processing (DSP) algorithms specified in XML into VHDL implementations
of those algorithms.

Perhaps I'm off the mark with suggesting it for C++ or Java, but it's a
particularly interesting concept when considering higher-level
specifications such as in model driven engineering (like UML in MDA). A
large emphasis becomes interoperability/exchangability/portability of
models, but at the core of it all is transformations between models.

It's interesting because we all have different backgrounds and
therefore unique perspectives on an concept like this.

Thanks again,
Richard

Jun 26 '06 #8
rn*****@gmail.com wrote:
the topic. I currently use this technique to transform digital signal
processing (DSP) algorithms specified in XML into VHDL implementations
of those algorithms.


Where did you get the DSP algorithms in XML ?
Is there any standard for this ?
Jun 26 '06 #9

Joe Kesselman wrote:
Hm. I don't usually think of Ant as a programming language,
In its limited little way, it's a declarative programming language (as
was Prolog). You can use it as a convenient dependency resolver.
I don't think I've heard of anyone styling it,


I have an XSLT stylesheet that auto-documents Ant external targets.
Handy when I'm shipping deployment scripts to users.

I also auto-generate Ant from a vile combination of our bug tracker,
change control, and Torque repository. This generates database patch
scripts for upgrading customer databases from arbitrary start points to
new product versions.

Jun 26 '06 #10
Another interesting reference on this topic: The folks working on a test
suite for the W3C's DOM APIs were developing a limited XML-based
"programming language" for binding/language-independent description of
the tests, together with stylesheets which would render those into
executable form. Again, that's a fairly primitive "compiler", but it's
yet another instance of XML-based programming languages.
Jun 26 '06 #11
Juergen
Where did you get the DSP algorithms in XML ?
That's a good question. I'm writing them myself and learning as I go
along.
Is there any standard for this ?


Not that I know of, but I wouldn't be surprised.

So, now that I have these XML descriptions being converted into VHDL,
the next step is to layer something more interesting on top. No DSP
designer wants to type their spec in XML ... yuk!

There are several options: text (something more readable than XML),
visual, or both. My application is Software Defined Radio so I'm
looking at Waveform Description Language (WDL) which has a textual form
with equivalent representations in visual and XML forms. Another option
would be UML2.

For WDL I have a Lex/Yacc grammar, so I can just create a little
compiler to mark-up the code. I thought there'd be plenty other folk
with the same issue and some standard solution might have been
developed. I'm a big fan of standard solutions because of the re-use
which they encourage.

Richard

Jun 26 '06 #12
Juergen

PS: Your links (both the article and the comp.compiler discussion) were
very useful. Thank you.

Jun 26 '06 #13
rn*****@gmail.com wrote:
So, now that I have these XML descriptions being converted into VHDL,
the next step is to layer something more interesting on top. No DSP
designer wants to type their spec in XML ... yuk!
Two years ago, someone posted here about a VHDL
application. Use Google Newsgroups to search for
"Use variables to get unique nodes". He posted
this stuff:

<?xml version="1.0" encoding="UTF-8"?>
<Root>
<E>
<!-- If we have the key match with pin only
the xslt does not work. If the maych is at least TEST/Pin
it is OK
-->
<Pin PinName="A" PinClass="T"/>
<!--<Pin PinName="AX" PinClass="T"/> -->
</E>
<x>
<!-- in case the TEST node below is not selected by Index
AND PinName="A" and CellType="M" matches
the result is wrong
-->
<TEST Index="42" CellType="M">
<Pin PinName="z" PinClass="R"/>
<!--<Pin PinName="AX" PinClass="T"/>-->
<Pin PinName="A" PinClass="T"/>
</TEST>
<TEST Index="43" CellType="M">
<Pin PinName="A" PinClass="T"/>
<Pin PinName="D" PinClass="R"/>
<Pin PinName="C" PinClass="R"/>
<Pin PinName="X" PinClass="R"/>
</TEST>
<TEST Index="44" CellType="M">
<Pin PinName="D" PinClass="R"/>
<Pin PinName="C" PinClass="R"/>
<Pin PinName="X" PinClass="R"/>
<Pin PinName="A" PinClass="T"/>
</TEST>
<TEST Index="45" CellType="M">
<Pin PinName="z" PinClass="R"/>
</TEST>
</x>
</Root>

There are several options: text (something more readable than XML),
visual, or both. My application is Software Defined Radio so I'm
looking at Waveform Description Language (WDL) which has a textual form
with equivalent representations in visual and XML forms. Another option
would be UML2.


This is rather abstract. You should post an example.
Jun 26 '06 #14
rn*****@gmail.com wrote:
PS: Your links (both the article and the comp.compiler discussion) were
very useful. Thank you.


This paper about lcc may also be interesting for you:

http://www.cs.princeton.edu/software/lcc/asdl.html

This is not strictly involving XML, but they also
use a tree-representation.
Jun 26 '06 #15
Thanks!

Jun 28 '06 #16
I recently came across the str:tokenize() function at the EXSLT
project. It takes a string and converts it into several tokens.

QUOTE:

The string given by the first argument is split at any occurrence of
any of these characters. For example:

str:tokenize('2001-06-03T11:40:23', '-T:')
Gives the node set consisting of:

<token>2001</token>
<token>06</token>
<token>03</token>
<token>11</token>
<token>40</token>
<token>23</token>

UNQUOTE.

This is close to what I had in mind. If you could extend the idea so
that it takes an EBNF in XML form instead of a list of delimiters,
wouldn't you have part of a scanner and parser?

Jun 29 '06 #17
rn*****@gmail.com wrote:
This is close to what I had in mind. If you could extend the idea so
that it takes an EBNF in XML form instead of a list of delimiters,
wouldn't you have part of a scanner and parser?


Yes, but what is it good for ?
Do you want to write a universal Parser which
takes an EBNF and parses according to the EBNF ?
Such universal parsers have been suggested and
implemented much earlier. For example, Niklaus
Wirth presented such a parser in 1976:

http://www.inf.ethz.ch/personal/wirt.../Compilerbau0/
http://www.oberon.ethz.ch/WirthPubl/CBEAll.pdf
Jun 30 '06 #18
Do you want to write a universal Parser which
takes an EBNF and parses according to the EBNF ?
It doesn't need to be written from scratch, just integrated as an
extension to XSL, so that XSL can take structured text (in a non-XML
form) as input and parse it, and generate xml/html/text as output.
Yes, but what is it good for ?
1. Compiler-compiler not needed as separate step in the whole process.
2. Definition of a standard syntax for EBNF using a DTD or XSD.
3. Entire compiler could be written in XSL.
4. An interesting platform for academic exploration of compiler design.

Jul 3 '06 #19
Andy Dingley <di*****@codesmiths.comwrote:
Joe Kesselman wrote:
>>Hm. I don't usually think of Ant as a programming language,


In its limited little way, it's a declarative programming language (as
was Prolog). You can use it as a convenient dependency resolver.

>>I don't think I've heard of anyone styling it,


I have an XSLT stylesheet that auto-documents Ant external targets.
Handy when I'm shipping deployment scripts to users.

I also auto-generate Ant from a vile combination of our bug tracker,
change control, and Torque repository. This generates database patch
scripts for upgrading customer databases from arbitrary start points to
new product versions.
hi,

I have designed and implemented a general-purpose XML system that I
called "Active Tags" which allows to make programs with XML tags ; it
allows to mix declarative tags with imperative instructions, uses XPath
to access object's properties, and much more...

it is not "yet another markup language" but a framework that allows
several markup languages to run together

the implementation (Java) :
http://reflex.gforge.inria.fr
the specs :
http://disc.inria.fr/perso/philippe....tive-tags.html
http://disc.inria.fr/perso/philippe....l/active-tags/

--
Cordialement,

///
(. .)
--------ooO--(_)--Ooo--------
| Philippe Poulard |
-----------------------------
http://reflex.gforge.inria.fr/
Have the RefleX !
Jul 3 '06 #20

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

34
by: nobody | last post by:
This article is posted at the request of C.W. Yang who asked me to detail my opinion of Lisp, and for the benefit of people like him, who may find themselves intrigued by this language. The...
32
by: cat_dog_ass | last post by:
I am used to programming in DOS mode via Borland C++. I would now like to create programs for the Windows envirinment. Is it absoultely necessary to use Visual C++ to do this? Are there other tools...
134
by: evolnet.regular | last post by:
I've been utilising C for lots of small and a few medium-sized personal projects over the course of the past decade, and I've realised lately just how little progress it's made since then. I've...
22
by: Matt | last post by:
Some people identify Microsoft C# is Proprietary programming language. What is Proprietary programming language then? How does it differ from other languages such as C++, or Java?? Please...
3
by: Visually Seen # | last post by:
Hey everybody, I want to create my own programming language, using C#. Could anyone give me some tips, advice on this goal? Let's say that in my programming language uses the keyword, integer...
29
by: SG | last post by:
Hi everyone, I am a complete novice at computers and programming and right now, all i need to know is that why do many people prefer C to C++? Is it just because they are used to using C and are...
139
by: Joe Mayo | last post by:
I think I become more and more alone... Everybody tells me that C++ is better, because once a project becomes very large, I should be happy that it has been written in C++ and not C. I'm the only...
84
by: aarklon | last post by:
Hi all, I found an interesting article here:- http://en.wikipedia.org/wiki/Criticism_of_the_C_programming_language well what do you guys think of this article....??? Is it constructive...
53
by: Vicent Giner | last post by:
Hello. I am new to Python. It seems a very interesting language to me. Its simplicity is very attractive. However, it is usually said that Python is not a compiled but interpreted programming...
43
by: Adem24 | last post by:
The World Joint Programming Language Standardization Committe (WJPLSC) hereby proclaims to the people of the world that a new programming language is needed for the benefit of the whole mankind in...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.