Semantic Structure of HTML page

Hi,

Does anyone know of an application, or any ongoing development work,
for extracting the semantic structure of HTML pages?

Cheers,
Michael

Jul 23 '05 #1
da*****@hotmail.com wrote:
Does anyone know of an application, or any ongoing development work,
for extracting the semantic structure of HTML pages?


View / Source...

Explain what sort of output you're expecting to see.

--
Mark.
http://tranchant.plus.com/
Jul 23 '05 #2
Hi,

Well, I'm expecting it to group paragraphs under their headings, etc.,
with the markup syntax hidden. The output could take the form of a
directory or a tree, where nodes branch out from a particular
heading/parent.

Cheers,
Michael

Jul 23 '05 #3
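
The grouping described above can be sketched in a few lines of Python. This is only a rough illustration using the standard library's html.parser; the names OutlineBuilder, outline and render are made up for this example, and it assumes every <p> and heading has an explicit closing tag and that paragraphs are not nested inside other captured elements.

```python
from html.parser import HTMLParser

HEADINGS = {"h1": 1, "h2": 2, "h3": 3, "h4": 4, "h5": 5, "h6": 6}


class OutlineBuilder(HTMLParser):
    """Hypothetical helper: files <p> text under the nearest preceding heading."""

    def __init__(self):
        super().__init__()
        # Each node holds a heading level, its title, its paragraphs and sub-sections.
        self.root = {"level": 0, "title": "(document)", "paragraphs": [], "children": []}
        self.stack = [self.root]  # currently open sections, outermost first
        self.capture = None       # tag whose text is being collected, if any
        self.buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in HEADINGS or tag == "p":
            self.capture = tag
            self.buffer = []

    def handle_data(self, data):
        if self.capture is not None:
            self.buffer.append(data)

    def handle_endtag(self, tag):
        if tag != self.capture:
            return  # e.g. </em> inside a paragraph: keep collecting text
        text = " ".join("".join(self.buffer).split())
        self.capture = None
        if tag == "p":
            self.stack[-1]["paragraphs"].append(text)
            return
        level = HEADINGS[tag]
        # A new heading closes every open section at the same or a deeper level.
        while self.stack[-1]["level"] >= level:
            self.stack.pop()
        node = {"level": level, "title": text, "paragraphs": [], "children": []}
        self.stack[-1]["children"].append(node)
        self.stack.append(node)


def outline(html):
    """Parse an HTML string and return the root of the heading tree."""
    builder = OutlineBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def render(node, depth=0):
    """Return the tree as indented text lines, with the markup hidden."""
    lines = ["  " * depth + node["title"]]
    for para in node["paragraphs"]:
        lines.append("  " * (depth + 1) + "- " + para)
    for child in node["children"]:
        lines.extend(render(child, depth + 1))
    return lines
```

Calling print("\n".join(render(outline(page_source)))) on a page's source would print each heading with its paragraphs indented beneath it. Real pages that lean on nested divs instead of headings would need more than this.
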
da*****@hotmail.com wrote:
Hi,

Well, I'm expecting it to group paragraphs under their headings, etc.,
with the markup syntax hidden. The output could take the form of a
directory or a tree, where nodes branch out from a particular
heading/parent.


The W3 validator's parse tree option does a crude job of this:

http://shorl.com/belihanafafu

--
Mark.
http://tranchant.plus.com/
Jul 23 '05 #4
In article <11**********************@z14g2000cwz.googlegroups.com>,
da*****@hotmail.com wrote:
Well, I'm expecting it to group paragraphs under their headings, etc.,
with the markup syntax hidden. The output could take the form of a
directory or a tree, where nodes branch out from a particular
heading/parent.


Firefox has the DOM inspector that does something like that.

--
Kris
<kr*******@xs4all.netherlands> (nl)
Jul 23 '05 #5
Hi,

The DOM inspector seems to output the parse tree, though, and doesn't
seem to convey any semantic meaning.

Michael

Jul 23 '05 #6
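
To illustrate the distinction being drawn here, a plain element-nesting dump (roughly what a parse-tree view shows) can be sketched as follows. This is a simplified stand-in, not how the DOM inspector itself works; TreeDumper and dump are hypothetical names, the void-element list is partial, and the sketch assumes well-formed markup with explicit closing tags.

```python
from html.parser import HTMLParser

# Elements that never take a closing tag (partial list, enough for a sketch).
VOID = {"br", "hr", "img", "input", "link", "meta"}


class TreeDumper(HTMLParser):
    """Hypothetical helper: records element names indented by nesting depth."""

    def __init__(self):
        super().__init__()
        self.depth = 0
        self.lines = []

    def handle_starttag(self, tag, attrs):
        self.lines.append("  " * self.depth + tag)
        if tag not in VOID:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag not in VOID and self.depth > 0:
            self.depth -= 1


def dump(html):
    """Return the element nesting of an HTML string as indented lines."""
    dumper = TreeDumper()
    dumper.feed(html)
    dumper.close()
    return dumper.lines
```

For a body containing h1, p, h2, p in sequence, every element comes out at the same depth: the parse tree records that they are siblings, but not that each paragraph belongs to the heading before it.
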
In article <11**********************@c13g2000cwb.googlegroups.com>,
da*****@hotmail.com wrote:
The DOM inspector seems to output the parse tree, though, and doesn't
seem to convey any semantic meaning.


Semantics are in the eye of the beholder.

--
Kris
<kr*******@xs4all.netherlands> (nl)
Jul 23 '05 #7
Kris <kr*******@xs4all.netherlands> wrote:
da*****@hotmail.com wrote:
The DOM inspector seems to output the parse tree, though, and doesn't
seem to convey any semantic meaning.


Semantics are in the eye of the beholder.


Boy, I hope that's not true in the general case, or there are a lot of
people wasting a lot of time with HTML and its cousins.

--
Joel.
Jul 23 '05 #8
