Clean up Word HTML

Hi,

I use a WYSIWYG rich text editor on a few web sites for clients to
manage their content, but a lot of them write in Word and paste into the box
and end up with a lot of crappy code as a result.

Has anyone created an ASP include that will clean up the Word HTML and
just leave the basic formatting intact?

Thanks.
Jul 19 '05 #1
Not to my knowledge.... perhaps one of the more "experienced" NG users can
help......?

--
Regards

Steven Burn
Ur I.T. Mate Group
www.it-mate.co.uk

Keeping it FREE!

Disclaimer:
I know I'm probably wrong, I just like taking part ;o)
Travis Pupkin <tp*****@dorrk.com> wrote in message
news:MP************************@news.individual.net...
> Hi,
>
> I use a WYSIWYG rich text editor on a few web sites for clients to
> manage their content, but a lot of them use Word and paste into the box
> and end up with a lot of crappy code as a result.
>
> Has anyone created an ASP include that will clean up the Word HTML and
> just leave the basic formatting intact?
>
> Thanks.

Jul 19 '05 #2
"Travis Pupkin" <tp*****@dorrk.com> wrote in message
news:MP************************@news.individual.net...
> Hi,
>
> I use a WYSIWYG rich text editor on a few web sites for clients to
> manage their content, but a lot of them use Word and paste into the box
> and end up with a lot of crappy code as a result.
>
> Has anyone created an ASP include that will clean up the Word HTML and
> just leave the basic formatting intact?


I don't know of an ASP method for doing this (though I wouldn't be surprised
if one existed). Whenever I need to copy something from Word into an
HTML/ASP document, I will first paste it into Notepad, then select
everything in Notepad, copy it, and paste that into the HTML/ASP document.
It's an extra step, but seems to get rid of the formatting characters.
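
The Notepad round trip can be approximated in code by throwing away every tag and keeping only the text. Here is a minimal sketch of that idea, written in Python rather than ASP (a substitution, since nobody in the thread posted code). The names `TextOnly` and `to_plain_text` are made up for illustration, and the set of skipped elements is an assumption about what Word tends to emit. Like Notepad, it loses bold and italics along with everything else.

```python
from html.parser import HTMLParser


class TextOnly(HTMLParser):
    """Keeps character data and ignores every tag (hypothetical helper)."""

    # Assumed list of elements whose contents are not visible text.
    SKIP = {"style", "script", "xml"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Only collect text that is outside a skipped element.
        if self.skip_depth == 0:
            self.parts.append(data)


def to_plain_text(html):
    parser = TextOnly()
    parser.feed(html)
    parser.close()
    # Collapse the runs of whitespace left behind by the removed markup.
    return " ".join("".join(parser.parts).split())


if __name__ == "__main__":
    pasted = '<p class=MsoNormal><b>Hello</b> <span style="color:red">world</span><o:p></o:p></p>'
    print(to_plain_text(pasted))
```

This is fine when only the words matter; it is not an answer to the original question, which wants the basic formatting kept.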

HTH
Peter Foti
Jul 19 '05 #3
Personally I just detest Word....... period (prefer to use my own custom
built editor and before that, have used Notepad since I first started in web
dev).

--
Regards

Steven Burn
Ur I.T. Mate Group
www.it-mate.co.uk

Keeping it FREE!

Disclaimer:
I know I'm probably wrong, I just like taking part ;o)
Peter Foti <pe****@systolicNOSPAMnetworks.com> wrote in message
news:vv************@corp.supernews.com...
> "Travis Pupkin" <tp*****@dorrk.com> wrote in message
> news:MP************************@news.individual.net...
> > Hi,
> >
> > I use a WYSIWYG rich text editor on a few web sites for clients to
> > manage their content, but a lot of them use Word and paste into the box
> > and end up with a lot of crappy code as a result.
> >
> > Has anyone created an ASP include that will clean up the Word HTML and
> > just leave the basic formatting intact?
>
> I don't know of an ASP method for doing this (though I wouldn't be
> surprised if one existed). Whenever I need to copy something from Word into an
> HTML/ASP document, I will first paste it into Notepad, then select
> everything in Notepad, copy it, and paste that into the HTML/ASP document.
> It's an extra step, but seems to get rid of the formatting characters.
>
> HTH
> Peter Foti

Jul 19 '05 #4
"Steven Burn" <nobody@PVT_it-mate.co.uk> wrote in message
news:uR*************@TK2MSFTNGP11.phx.gbl...
> Personally I just detest Word....... period (prefer to use my own custom
> built editor and before that, have used Notepad since I first started in web dev).


Oh, I agree 100%. I would NEVER use Word as a web editor. But
occasionally, I might have some content that needs to be put into a web
page that comes from a Word document. :)

Peter Foti
Jul 19 '05 #5
On Tue, 30 Dec 2003 11:52:35 -0800, Travis Pupkin <tp*****@dorrk.com>
wrote:
> I use a WYSIWYG rich text editor on a few web sites for clients to
> manage their content, but a lot of them use Word and paste into the box
> and end up with a lot of crappy code as a result.
>
> Has anyone created an ASP include that will clean up the Word HTML and
> just leave the basic formatting intact?


That gets tough, because a lot of what Word produces is XML, and a lot is
nested tables used for formatting, and it's hard to tell what you may or
may not need.
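
One way around the "hard to tell what you need" problem is to list what to keep instead of what to remove. The sketch below is Python, not ASP (a substitution, so treat it as an outline to port). The `KEEP` set is an assumed definition of "basic formatting", and `AllowlistCleaner` and `clean_with_allowlist` are hypothetical names. Tags outside the list are dropped but their text is kept, and every attribute is discarded. Tables are not in the list, so nested layout tables would be flattened to their text.

```python
from html import escape
from html.parser import HTMLParser

# Assumed definition of "basic formatting"; adjust to taste.
KEEP = {"p", "br", "b", "strong", "i", "em", "u", "ul", "ol", "li"}
VOID = {"br"}                             # kept tags that take no closing tag
DROP_CONTENT = {"style", "script", "xml"}  # elements removed together with their contents


class AllowlistCleaner(HTMLParser):
    """Re-emits only allowlisted tags, without attributes (hypothetical helper)."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in DROP_CONTENT:
            self.skip_depth += 1
        elif tag in KEEP and self.skip_depth == 0:
            self.out.append("<%s>" % tag)  # attributes such as class/style are dropped

    def handle_endtag(self, tag):
        if tag in DROP_CONTENT:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif tag in KEEP and tag not in VOID and self.skip_depth == 0:
            self.out.append("</%s>" % tag)

    def handle_data(self, data):
        if self.skip_depth == 0:
            # Character references were decoded by the parser, so re-escape the text.
            self.out.append(escape(data, quote=False))


def clean_with_allowlist(html):
    cleaner = AllowlistCleaner()
    cleaner.feed(html)
    cleaner.close()
    return "".join(cleaner.out)


if __name__ == "__main__":
    pasted = '<p class=MsoNormal style="margin:0"><b>Bold</b> and <i>italic</i><o:p></o:p></p>'
    print(clean_with_allowlist(pasted))
```

Comments, including Word's conditional comments, disappear because the parser's default comment handler does nothing with them.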

Jeff
Jul 19 '05 #6
Jon
> Hi,
>
> I use a WYSIWYG rich text editor on a few web sites for clients to
> manage their content, but a lot of them use Word and paste into the box
> and end up with a lot of crappy code as a result.
>
> Has anyone created an ASP include that will clean up the Word HTML and
> just leave the basic formatting intact?
>
> Thanks.


Not what you want; however, Dreamweaver can 'tidy up' a lot of Word's HTML
(and has a menu option to do so).

Jon
Jul 19 '05 #7
Obviously Word as an HTML editor is a bad idea. What I'm talking about
is when a client has a web-based WYSIWYG content editor and frequently
pastes Word data into it -- the HTML created is bulky and atrocious.

In trying out some PHP the other day, I found a PHP script which is a
complex series of find-and-replace commands aimed at cleaning up
unnecessary tags and Word HTML code from a string, but leaving the
essentials like bold and italics in place.

Just wondering if anyone knew of an ASP equivalent.
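
For reference, a find-and-replace cleaner of that kind might look roughly like the sketch below. It is Python, not ASP or the PHP script mentioned above (which is not reproduced here), and every pattern is an illustrative guess at common Word residue rather than a complete list. `RULES`, `ATTR` and `clean_with_rules` are made-up names.

```python
import re

FLAGS = re.IGNORECASE | re.DOTALL

# (pattern, replacement) pairs applied in order; all of them are assumptions.
RULES = [
    (r"<!--.*?-->", ""),                        # comments, including conditional ones
    (r"<(style|script|xml)\b.*?</\1\s*>", ""),   # blocks whose contents are not text
    (r"</?\w+:[^>]*>", ""),                     # namespaced tags such as <o:p>
    (r"</?(span|font|div)\b[^>]*>", ""),        # wrappers: drop the tag, keep the text
]

# Attributes to remove from whatever tags survive the rules above.
ATTR = re.compile(
    r'''\s+(?:class|style|lang)\s*=\s*(?:"[^"]*"|'[^']*'|[^\s>]+)''',
    re.IGNORECASE,
)


def _strip_attrs(match):
    # Runs only on the text of a single tag, so body text is never touched.
    return ATTR.sub("", match.group(0))


def clean_with_rules(html):
    for pattern, replacement in RULES:
        html = re.sub(pattern, replacement, html, flags=FLAGS)
    # Second pass: remove presentational attributes inside the remaining tags.
    return re.sub(r"<[a-z][^>]*>", _strip_attrs, html, flags=re.IGNORECASE)


if __name__ == "__main__":
    pasted = ('<p class=MsoNormal style="margin:0in"><b><span lang=EN-US>Hi</span></b>'
              '<o:p></o:p></p><!--[if gte mso 9]><xml>x</xml><![endif]-->')
    print(clean_with_rules(pasted))
```

The same rule list should port to classic ASP through VBScript's RegExp object (Pattern, Global, IgnoreCase, Replace), though that port is untested here. VBScript has no dot-matches-newline option as far as I know, so `.` would probably need to become `[\s\S]` in the first two patterns. Regex cleaning is also brittle: an attribute value containing `>` would defeat the tag patterns.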
Jul 19 '05 #8
The commercial Advanced Content Editor (ACE) has a special button that will
clean up Word XML... if the user remembers to press it on paste.

I agree a custom function would be nice. This problem limits the utility of
roll-your-own web-based CMS systems.

"Travis Pupkin" <tp*****@dorrk.com> wrote in message
news:MP************************@news.individual.net...
> Hi,
>
> I use a WYSIWYG rich text editor on a few web sites for clients to
> manage their content, but a lot of them use Word and paste into the box
> and end up with a lot of crappy code as a result.
>
> Has anyone created an ASP include that will clean up the Word HTML and
> just leave the basic formatting intact?
>
> Thanks.

Jul 19 '05 #9
