472,127 Members | 1,949 Online
Bytes | Software Development & Data Engineering Community
Post +

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 472,127 software developers and data experts.

Clean up Word HTML

Hi,

I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them use word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the word html and
just leave the basic formatting intact?

Thanks.
Jul 19 '05 #1
8 5416
Not to my knowledge.... perhaps one of the more "experienced" NG users can
help......?

--
Regards

Steven Burn
Ur I.T. Mate Group
www.it-mate.co.uk

Keeping it FREE!

Disclaimer:
I know I'm probably wrong, I just like taking part ;o)
Travis Pupkin <tp*****@dorrk.com> wrote in message
news:MP************************@news.individual.ne t...
Hi,

I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them use word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the word html and
just leave the basic formatting intact?

Thanks.

Jul 19 '05 #2
"Travis Pupkin" <tp*****@dorrk.com> wrote in message
news:MP************************@news.individual.ne t...
Hi,

I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them use word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the word html and
just leave the basic formatting intact?


I don't know of an ASP method for doing this (though I wouldn't be surprised
if one existed). Whenever I need to copy something from Word into an
HTML/ASP document, I will first paste it into Notepad, then select
everything in Notepad, copy it, and paste that into the HTML/ASP document.
It's an extra step, but seems to get rid of the formatting characters.

HTH
Peter Foti
Jul 19 '05 #3
Personally I just detest Word....... period (prefer to use my own custom
built editor and before that, have used Notepad since I first started in web
dev).

--
Regards

Steven Burn
Ur I.T. Mate Group
www.it-mate.co.uk

Keeping it FREE!

Disclaimer:
I know I'm probably wrong, I just like taking part ;o)
Peter Foti <pe****@systolicNOSPAMnetworks.com> wrote in message
news:vv************@corp.supernews.com...
"Travis Pupkin" <tp*****@dorrk.com> wrote in message
news:MP************************@news.individual.ne t...
Hi,

I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them use word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the word html and
just leave the basic formatting intact?
I don't know of an ASP method for doing this (though I wouldn't be

surprised if one existed). Whenever I need to copy something from Word into an
HTML/ASP document, I will first paste it into Notepad, then select
everything in Notepad, copy it, and paste that into the HTML/ASP document.
It's an extra step, but seems to get rid of the formatting characters.

HTH
Peter Foti

Jul 19 '05 #4
"Steven Burn" <nobody@PVT_it-mate.co.uk> wrote in message
news:uR*************@TK2MSFTNGP11.phx.gbl...
Personally I just detest Word....... period (prefer to use my own custom
built editor and before that, have used Notepad since I first started in web dev).


Oh, I agree 100%. I would NEVER use Word as a web editor. But
occassionally, I might have some content that needs to be put into a web
page that comes from a Word document. :)

Peter Foti
Jul 19 '05 #5
On Tue, 30 Dec 2003 11:52:35 -0800, Travis Pupkin <tp*****@dorrk.com>
wrote:
I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them use word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the word html and
just leave the basic formatting intact?


That gets tough, because a lot of the Word is XML, and a lot is nested
tables to get formatting, and it's hard to tell what you may or may
not need.

Jeff
Jul 19 '05 #6
Jon
> Hi,

I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them use word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the word html and
just leave the basic formatting intact?

Thanks.


Not what you want, however Dreamweaver can 'tidy up' a lot of Words HTML
(and has a menu option to do so)

Jon
Jul 19 '05 #7
Obviously Word as an HTML editor is a bad idea. What I'm talking about
is when a client has a web-based WYSIWYG content editor and frequently
pastes word data into it -- the HTML created is bulky and atrocious.

In trying out some PHP the other day, I found a PHP script which is a
complex series of find-and-replace commands aimed at cleaning up
unnecessary tags and Word html code fromn a string, but leaving the
essentials like bold italics in place.

Just wondering if anyone knew of an ASP equivalent.
Jul 19 '05 #8
The commercial Advanced Content Editor (ACE) has a special button that will
clean up word XML... if the user remembers to press it on paste.

I agree a custom function would be nice. This problem limits the utility of
roll-your-own web-based CMS systems.

"Travis Pupkin" <tp*****@dorrk.com> wrote in message
news:MP************************@news.individual.ne t...
Hi,

I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them use word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the word html and
just leave the basic formatting intact?

Thanks.

Jul 19 '05 #9

This discussion thread is closed

Replies have been disabled for this discussion.

Similar topics

20 posts views Thread by Al Moritz | last post: by
232 posts views Thread by robert maas, see http://tinyurl.com/uh3t | last post: by
9 posts views Thread by jd | last post: by
10 posts views Thread by Greg Lovern | last post: by
reply views Thread by leo001 | last post: by

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.