Bytes IT Community

Clean up Word HTML

Hi,

I use a WYSIWYG rich text editor on a few web sites so clients can
manage their content, but a lot of them paste from Word into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the Word HTML and
leave just the basic formatting intact?

Thanks.
Jul 19 '05 #1
8 Replies


Not to my knowledge.... perhaps one of the more "experienced" NG users can
help......?

--
Regards

Steven Burn
Ur I.T. Mate Group
www.it-mate.co.uk

Keeping it FREE!

Disclaimer:
I know I'm probably wrong, I just like taking part ;o)
Jul 19 '05 #2

I don't know of an ASP method for doing this (though I wouldn't be surprised
if one existed). Whenever I need to copy something from Word into an
HTML/ASP document, I will first paste it into Notepad, then select
everything in Notepad, copy it, and paste that into the HTML/ASP document.
It's an extra step, but seems to get rid of the formatting characters.

HTH
Peter Foti
Jul 19 '05 #3

Personally I just detest Word....... period (prefer to use my own custom
built editor and before that, have used Notepad since I first started in web
dev).

--
Regards

Steven Burn
Ur I.T. Mate Group
www.it-mate.co.uk

Keeping it FREE!

Disclaimer:
I know I'm probably wrong, I just like taking part ;o)
Jul 19 '05 #4

"Steven Burn" <nobody@PVT_it-mate.co.uk> wrote in message
news:uR*************@TK2MSFTNGP11.phx.gbl...
> Personally I just detest Word....... period (prefer to use my own custom
> built editor and before that, have used Notepad since I first started in
> web dev).


Oh, I agree 100%. I would NEVER use Word as a web editor. But
occasionally, I might have some content from a Word document that needs
to be put into a web page. :)

Peter Foti
Jul 19 '05 #5

On Tue, 30 Dec 2003 11:52:35 -0800, Travis Pupkin <tp*****@dorrk.com>
wrote:
> I use a WYSIWYG rich text editor on a few web sites for clients to
> manage their content, but a lot of them use word and paste into the box
> and end up with a lot of crappy code as a result.
>
> Has anyone created an ASP include that will clean up the word html and
> just leave the basic formatting intact?


That gets tough, because a lot of the Word output is XML, and a lot of
it is nested tables used for formatting, so it's hard to tell what you
may or may not need.

Jeff
Jul 19 '05 #6

Jon
Not what you want, but Dreamweaver can 'tidy up' a lot of Word's HTML
(and has a menu option to do so).

Jon
Jul 19 '05 #7

Obviously Word as an HTML editor is a bad idea. What I'm talking about
is when a client has a web-based WYSIWYG content editor and frequently
pastes Word content into it: the resulting HTML is bulky and atrocious.

While trying out some PHP the other day, I found a script that runs a
long series of find-and-replace commands to strip unnecessary tags and
Word HTML code from a string, while leaving the essentials like bold
and italics in place.

Just wondering if anyone knew of an ASP equivalent.
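
As a rough illustration of the find-and-replace approach described above, here is a minimal sketch. It is written in Python rather than ASP, and it is not the PHP script mentioned; the function name clean_word_html, the tag whitelist, and the exact patterns are all assumptions made for the example. Regex-based cleaning like this is fragile (for instance, it will misread attribute values that contain ">"), so treat it as a starting point only.

```python
import re

# Tags treated as "basic formatting" here. This whitelist is an assumption;
# adjust it to whatever the editor is supposed to allow.
ALLOWED_TAGS = {"p", "br", "b", "strong", "i", "em", "u", "ul", "ol", "li"}


def clean_word_html(html: str) -> str:
    """Strip Word-specific markup from pasted HTML, keeping basic formatting."""
    # Drop HTML comments, which includes Word's conditional comments
    # such as <!--[if gte mso 9]> ... <![endif]-->.
    html = re.sub(r"<!--.*?-->", "", html, flags=re.DOTALL)
    # Drop stray conditional markers such as <![if !supportLists]> and <![endif]>.
    html = re.sub(r"<!\[[^>]*\]>", "", html)
    # Drop blocks whose contents should disappear along with the tags.
    html = re.sub(r"<(style|script|xml)\b.*?</\1\s*>", "", html,
                  flags=re.DOTALL | re.IGNORECASE)
    # Drop namespaced Office tags like <o:p> and </o:p>, keeping inner text.
    html = re.sub(r"</?\w+:\w+\b[^>]*>", "", html)

    def rewrite_tag(match):
        closing, name = match.group(1), match.group(2).lower()
        if name not in ALLOWED_TAGS:
            return ""  # drop the tag itself but keep the text around it
        return f"<{closing}{name}>"  # keep the tag, discard all attributes

    html = re.sub(r"<(/?)([a-zA-Z][a-zA-Z0-9]*)\b[^>]*>", rewrite_tag, html)

    # Tidy up: non-breaking spaces, empty paragraphs, runs of spaces.
    html = html.replace("&nbsp;", " ")
    html = re.sub(r"<p>\s*</p>", "", html)
    return re.sub(r"[ \t]+", " ", html).strip()


sample = (
    '<p class=MsoNormal style="margin:0in">'
    '<b style="mso-bidi-font-weight:normal">'
    '<span style="font-family:Arial">Hello</span></b> <i>world</i><o:p></o:p></p>'
    '<!--[if gte mso 9]><xml><w:WordDocument></w:WordDocument></xml><![endif]-->'
    '<p class=MsoNormal><o:p>&nbsp;</o:p></p>'
)
print(clean_word_html(sample))  # <p><b>Hello</b> <i>world</i></p>
```

The whitelist is the main design choice: anything not listed loses its tag but keeps its text, and every surviving tag loses its attributes, which is what removes the class=MsoNormal and inline style noise.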
Jul 19 '05 #8

The commercial Advanced Content Editor (ACE) has a special button that will
clean up Word XML... if the user remembers to press it on paste.

I agree a custom function would be nice. This problem limits the utility of
roll-your-own web-based CMSs.
Jul 19 '05 #9

This discussion thread is closed