473,573 Members | 2,925 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Converting Word formatted contents to HTML

Hi,
I have a large table in Word 2003 that has formatted text in the cells and
wish to read and convert a cells formatted contents to html output via
vb.net code. The formatting contains the following: bold text, italic text,
superscript & subscript text, bullet points. The output html is then saved
in an XML file as a CDATA section for later use in the application when
formatted output is required.

I have looked up many sites on the web and also searched in user groups but
have not yet seen a way to code the above in vb.net 2003. In my VB.net app I
currently open and read the word table and output all cell contents into an
XML structure OK - I just need to know how to read and convert the formatted
text inside a cell to html format.
Can anyone suggest tools/components or code or examples that would help me
do this.

I am using VB.net 2003 and Office Professional Edition 2003.
cheers,
Craig, New Zealand

Nov 20 '05 #1
3 3359
Hi Graig,

An XML file is not always an dataset.
A dataset can always be made as a XML file to disk or streaming to another
place.

I hope I make this a little bit clear with saying that because maybe it is
cryptic.

To use an XML file you can read it as a XML doc using the loadXML or access
it as a streaming file using the XML reader.

If you want to know more, feel free to ask?

Cor
Nov 20 '05 #2
* "Craig Petrie" <pe****@paradis e.net.nz> scripsit:
I have a large table in Word 2003 that has formatted text in the cells and
wish to read and convert a cells formatted contents to html output via
vb.net code.


Maybe you can save the document to HTML using Word automation. I would
ask this question in one of the Word VBA/programming groups.

--
Herfried K. Wagner [MVP]
<URL:http://dotnet.mvps.org/>
Nov 20 '05 #3
Hi Craig,

Simply saving the document in html format will allow the table to be
displayed in a web browser (with a few exceptions)

If you need assistance let me know.

"Craig Petrie" <pe****@paradis e.net.nz> wrote in message
news:es******** ******@TK2MSFTN GP11.phx.gbl...
Hi,
I have a large table in Word 2003 that has formatted text in the cells and
wish to read and convert a cells formatted contents to html output via
vb.net code. The formatting contains the following: bold text, italic text, superscript & subscript text, bullet points. The output html is then saved
in an XML file as a CDATA section for later use in the application when
formatted output is required.

I have looked up many sites on the web and also searched in user groups but have not yet seen a way to code the above in vb.net 2003. In my VB.net app I currently open and read the word table and output all cell contents into an XML structure OK - I just need to know how to read and convert the formatted text inside a cell to html format.
Can anyone suggest tools/components or code or examples that would help me
do this.

I am using VB.net 2003 and Office Professional Edition 2003.
cheers,
Craig, New Zealand

Nov 20 '05 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

20
7327
by: Al Moritz | last post by:
Hi all, I was always told that the conversion of Word files to HTML as done by Word itself sucks - you get a lot of unnecessary code that can influence the design on web browsers other than Internet Explorer. Our computer expert in my company had told me already a while ago that I should learn HTML and encode myself. I was never inclined to...
8
5725
by: prabha | last post by:
Hello Everybody, I have to conert the word doc to multiple html files,according to the templates in the word doc. I had converted the word to xml.Also through Exsl ,had finished the multiple output html files. The problem is while reading through the worddoc paragraph,the special characters are not identified. So in the xml file,it's...
3
1913
by: Mike Turco | last post by:
My phone directory project is moving along. Its a series of reports and documents that all get imported into Word, via RTF files or whatever. I need to create a couple of indexes. Is there a way to insert the index marks within a database report, such that when I export the file to RTF, word will recognize those indexes? Thanks, Mike
2
579
by: CM | last post by:
Hi, Could anyone please help me? I am completing my Master's Degree and need to reproduce a Webpage in Word. Aspects of the page are lost and some of the text goes. I would really appreciate it. The link to the document is http://www.surveymonkey.com/s.asp?u=689952259313 I have spent 15 hours trying to sort this but to no avail.
0
1794
by: robwahl | last post by:
Hi, I have a members only area of a site (using ASP and MS Access) where I need users to be able to either view or download reports (PDF or MS Word doc - doesn't matter which). I want to store the reports in an OLE object field in the MS Access database. Using ASP, I can get the report to dump its contents in binary into an html table in the...
3
7901
by: =?Utf-8?B?U1MgbWFkaHU=?= | last post by:
Hi, I have two word files. listprice.doc,product.doc I converted each of the documents into bytes byte bytedata - lisprice.doc byte bytedata1-product.doc I create a new array and copy both the contents in it and upload it to a file in d: I am getting only the lisprice.doc document in D:\File.doc,I am not getting
5
9096
by: Frederik Van Bogaert | last post by:
Hi! I've taken my first steps into the world of c++ by trying to write a text adventure game. Things are proceeding fine, but there's some code in there that isn't very well coded. More specifically, I use the following code: ... string word ; size_t pos = action.find(" "); word = action.substr (0,pos); if (pos < action.size() - 2)
0
1072
by: Joeyeti | last post by:
Hi fellow VB knowers (I am but a learner still). I have a question for you which I struggle with. I need to convert nested Lists in MS WORD (whether numbered or bulleted or mixed) from their original format to a tag-formatted text (as used for instance for Wiki articles or phpBB Forums and such). In my particular case I need the text to have...
2
7589
by: Artie | last post by:
Hi, I've searched the web but can't find a solution to an apparently really simple problem. My app contains an HTML string and I need to be able to invoke the Print Dialog to print the HTML correctly formatted (i.e. not as raw HTML) to a printer that the User chooses. So, I need something like:
0
7771
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main...
1
7771
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For...
0
8060
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the...
0
6406
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then...
1
5580
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes...
0
5289
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert...
0
3730
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in...
1
1296
muto222
by: muto222 | last post by:
How can i add a mobile payment intergratation into php mysql website.
0
1036
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.