By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
443,660 Members | 1,102 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 443,660 IT Pros & Developers. It's quick & easy.

HTML tags optimization [ interesting problem]

P: n/a

hi all,

i use SGMLParser to process HTML files, in order to do some
optimizations,

something like this:

<i><b>TEXT1</b></i><b><i><u>TEXT2</u></i></b>

optimise to

<i><b>TEXT1<u>TEXT2</u></b><i>
at the very beginning, i was thinking of analysing each text-block, to
know their color, size, if is bold or italic, but i found it was too
complicated.

e.g

<font color=red><font size=6>TEXT1</font></font>

optimise to

<font color=red size=6>TEXT1</font>
but if there is TEXT2 exist

<font color=red><font size=6>TEXT1</font>TEXT2</font>

we can not do any optimization.

my problem is I can not find a method to treat all those situation, I
had too much thinking and get fool now
anyone can give me some advices?

thanks


PS:

other examples:

1
<font size=5><font size=7>TEXT</font></font>
=>
<font size=7>TEXT</font>

2
<i>TEXT </i><i>TEXT</i>
=>
<i>TEXT TEXT</i>

3
<i>TEXT<i>TEXT</i></i>
=>
<i>TEXT</i>

etc...

Aug 31 '05 #1
Share this Question
Share on Google+
6 Replies


P: n/a
DENG a écrit :
hi all,

i use SGMLParser to process HTML files, in order to do some
optimizations,

something like this:

<i><b>TEXT1</b></i><b><i><u>TEXT2</u></i></b>

optimise to

<i><b>TEXT1<u>TEXT2</u></b><i>
Doesn't Tidy do this already ?

at the very beginning, i was thinking of analysing each text-block, to
know their color, size, if is bold or italic, but i found it was too
complicated.

e.g

<font color=red><font size=6>TEXT1</font></font>
Whoever writes such a thing in 2005 ought to be shot down for the sake
of mankind !-)
optimise to

<font color=red size=6>TEXT1</font>
You call this "optimized HTML" ? Even MS Word produces better code...

but if there is TEXT2 exist

<font color=red><font size=6>TEXT1</font>TEXT2</font>

we can not do any optimization.

my problem is I can not find a method to treat all those situation,
Tidy is open source AFAICT.
I
had too much thinking and get fool now
anyone can give me some advices?


Learn to use Tidy and CSS.
Aug 31 '05 #2

P: n/a
I know very well Tidy, sir

Tidy do a nice job but it is writen in Java, and have Python ported

my aim is to learn Python, learn how to program

I know many people write "hello the world" in 2005, why I can not write
this program in 2005?

you are french, right? peut etre we can talk about it in ecole
polytechnique? i'll be there waiting for you

thanks

Sep 1 '05 #3

P: n/a
I know very well Tidy, sir

Tidy do a nice job but it is writen in Java, and have Python ported

my aim is to learn Python, learn how to program

I know many people write "hello the world" in 2005, why I can not write
this program in 2005?

you are french, right? peut etre we can talk about it in ecole
polytechnique? i'll be there waiting for you

thanks

Sep 1 '05 #4

P: n/a
DENG wrote:
I know very well Tidy, sir

Tidy do a nice job but it is writen in Java,
Seems like we're not talking about the same program here. Tidy (aka
HTMLTidy) is written in C. You must be talking about it's Java port JTidy.
and have Python ported

my aim is to learn Python, learn how to program
No one could have guess from your post, and I dont have psychic powers,
ok ?

From a professional POV, using existing tools that have proven to be
reliable is far better than reinventing the square wheel, hence my answer.
you are french, right? peut etre we can talk about it in ecole
polytechnique?
Peut-être pas, je ne mets jamais les pieds chez les polytechniciens.
i'll be there waiting for you


I'm afraid you'll learn a looooong time.

Regards,
--
bruno desthuilliers
python -c "print '@'.join(['.'.join([w[::-1] for w in p.split('.')]) for
p in 'o****@xiludom.gro'.split('@')])"
Sep 1 '05 #5

P: n/a
DENG enlightened us with:
i use SGMLParser to process HTML files, in order to do some
optimizations,

something like this:

<i><b>TEXT1</b></i><b><i><u>TEXT2</u></i></b>

optimise to

<i><b>TEXT1<u>TEXT2</u></b><i>
Why not optimize it to:

<span class='onerole'>TEXT1</span>
<span class='secondrole'>TEXT2</span>?
[ snipped stuff about <font> tags ]
If you're serious about using HTML, I suggest you read a book that's
not ten years old.
anyone can give me some advices?


Yes, read the following:

http://www.w3.org/QA/Tips/
http://www.w3.org/TR/CSS21/
http://www.w3.org/TR/xhtml1/
http://validator.w3.org/

Sybren
--
The problem with the world is stupidity. Not saying there should be a
capital punishment for stupidity, but why don't we just take the
safety labels off of everything and let the problem solve itself?
Frank Zappa
Sep 1 '05 #6

P: n/a
hi, Sybren,

thanks for your reply, if use CSS:

<span class=type1>text<span class=type2>text</span></span><span
class=type1>text</span>

optimise to:

<span class=type1>text<span class=type2>text</span>text</span>

what i need is the METHOD to do optimization, in fact, i have ready
write a program to analyse the syntax of CSS, to make it works with all
situation

Sep 2 '05 #7

This discussion thread is closed

Replies have been disabled for this discussion.