converting documents to HTML

Can anyone recommend a good tool to convert documents to HTML on the
fly? I need to integrate this tool with a VB app, so it must have an
API.

Thanks in advance
Davinder
da******@gujral .co.uk
Jul 20 '05 #1
In article <de**************************@posting.google.com>,
da******@gujral.co.uk says...
> can anyone recommend a good tool to convert documents to HTML on the
> fly. I need to integrate this tool with a VB app so it must have an
> API.

What kind of document?
Jul 20 '05 #2
90% of all docs will be Office documents. The other 10% are PDFs, GIFs, JPEGs and BMPs.

Jul 20 '05 #3
On Wed, 09 Jul 2003 10:14:57 +0200, "Davinder" <da******@gujral.co.uk> wrote:
> 90% of all docs will be office documents. The other 10% are pdf's, gif,
> jpeg and bmp

Which office documents?

PDFs can be run through any of a number of filters. Try googling for
them. GIFs and JPEGs can already be displayed inline by most browsers
that are capable of displaying images. BMPs need to be converted to JPEGs,
PNGs or GIFs.
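
A rough sketch of how that routing by file type might look in code. The category names and extension lists below are hypothetical and chosen only for illustration; Python is used for brevity, and the same dispatch should translate directly to VB.

```python
from pathlib import Path

# Illustrative extension lists (assumptions, not exhaustive).
OFFICE_EXTS = {".doc", ".xls", ".ppt"}
INLINE_IMAGE_EXTS = {".gif", ".jpg", ".jpeg"}


def plan_conversion(filename: str) -> str:
    """Return a hypothetical label naming the handling a file needs."""
    ext = Path(filename).suffix.lower()
    if ext in OFFICE_EXTS:
        return "office-to-html"   # hand off to an Office converter
    if ext == ".pdf":
        return "pdf-filter"       # run through a PDF-to-HTML filter
    if ext in INLINE_IMAGE_EXTS:
        return "inline-image"     # browsers can show these directly
    if ext == ".bmp":
        return "convert-image"    # convert to JPEG, PNG or GIF first
    return "unsupported"
```

Each label would then map to whichever converter is chosen for that category.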

Ciao

Zak

--
========================================================================
http://www.carfolio.com/ Searchable database of 10 000+ car specs
========================================================================
Jul 20 '05 #4
Have you not investigated the object models of each Office app? Word, for
example, gives you access to a Document object, which has a SaveAs method,
one of whose parameters is FileFormat, which can take a value of
wdFormatHTML (this is Word XP). You may also be able to save as Compact
HTML by experimenting with the FileConverters object.

Have a look at this, to get you started.
http://msdn.microsoft.com/library/de...ordObjects.asp
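
For what it's worth, a minimal sketch of the SaveAs route driven over COM automation. It is written in Python rather than VB, and it assumes a Windows machine with Word installed plus the third-party pywin32 package; the helper names are made up for illustration. The value 8 is the numeric value of Word's wdFormatHTML constant.

```python
import os
from pathlib import Path

WD_FORMAT_HTML = 8  # numeric value of Word's wdFormatHTML constant


def html_path_for(doc_path: str) -> str:
    """Hypothetical helper: same file name with an .html extension."""
    return str(Path(doc_path).with_suffix(".html"))


def word_to_html(doc_path: str) -> str:
    """Open a document in Word and save it as HTML via Document.SaveAs."""
    # Imported lazily because pywin32 only exists on Windows (assumption:
    # it is installed alongside Word on the conversion machine).
    import win32com.client

    doc_path = os.path.abspath(doc_path)  # Word expects absolute paths
    out_path = html_path_for(doc_path)
    word = win32com.client.Dispatch("Word.Application")
    try:
        doc = word.Documents.Open(doc_path)
        try:
            doc.SaveAs(out_path, FileFormat=WD_FORMAT_HTML)
        finally:
            doc.Close(False)  # close without saving changes to the source
    finally:
        word.Quit()
    return out_path
```

From VB the same calls apply to a Word.Application object, with wdFormatHTML available by name once the Word object library is referenced.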

--
######################
## PH, London ##
######################

Jul 20 '05 #5
In article <be**********@titan.btinternet.com>, fo******@REMOVEherlihy.eu.com
says...
> Have you not investigated the object models of each Office app? Word, for
> example, gives you access to a Document object, which has a SaveAs method,
> one of whose parameters is FileFormat, which can take a value of
> wdFormatHTML (this is Word XP). You may also be able to save as Compact
> HTML by experimenting with the FileConverters object.
>
> Have a look at this, to get you started.
> http://msdn.microsoft.com/library/de...ordObjects.asp

Yes, but the HTML that Word produces is absolute GARBAGE!
Jul 20 '05 #6
Philip,
I have tried the Word object model... it worked well, although I was
looking for something a little more sophisticated. For example,
converting a Word doc with 40+ pages would give me one large HTML file
rather than linked pages.
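
One way the single large file could be post-processed into linked pages is sketched below. It assumes the converted HTML marks each section with an <h1> heading, which may not hold for every document; the page file names and the function name are hypothetical.

```python
import re


def split_into_pages(html_body: str) -> dict:
    """Split HTML before each <h1> and add Previous/Next links."""
    # Zero-width split keeps every heading together with its own section.
    parts = re.split(r"(?=<h1\b)", html_body, flags=re.IGNORECASE)
    parts = [p for p in parts if p.strip()]
    pages = {}
    for i, part in enumerate(parts, start=1):
        links = []
        if i > 1:
            links.append(f'<a href="page{i - 1}.html">Previous</a>')
        if i < len(parts):
            links.append(f'<a href="page{i + 1}.html">Next</a>')
        nav = "<p>" + " | ".join(links) + "</p>"
        pages[f"page{i}.html"] = part + nav
    return pages
```

Each entry in the returned mapping would be written out as its own file. Splitting on a regular expression is fragile compared with a real HTML parser, so this is only a starting point.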

Currently I am using net-it-central... this works great, but it's TOO
expensive for us to buy another license.

Davinder Gujral
da******@gujral .co.uk
Jul 20 '05 #7
Of course. But who cares?

--
######################
## PH, London ##
######################

"Mr. Clean" <mr*****@protctorandgamble.com> wrote in message
news:MP************************@news-server.austin.rr.com...
> Yes, but the HTML that word produces is absolute GARBAGE!

Jul 20 '05 #8
On Thu, 10 Jul 2003 00:03:37 +0000 (UTC), "Philip Herlihy"
<fo******@REMOVEherlihy.eu.com> wrote:
> Of course. But who cares?

(Further context vanished because the quoted text was part of the sig.
Please have a read of http://www.xs4all.nl/~sbpoley/toppost.htm).

Maybe your readers might just care? I tried using Word-generated HTML
just once. It was horrible. My hand-coded version took 2 seconds to load
from my local hard disk. The Word-generated version took 30 seconds.
(That's not a typo - it took about fifteen times as long!!) By the time
it had come from a server over a modem link, you can be pretty sure that
most of my visitors would have gone elsewhere.

--
Stephen Poley

http://www.xs4all.nl/~sbpoley/webmatters/
Jul 20 '05 #9
On top-posting:

Thanks for the link to that intelligent article, which did make me think
about it again, despite an initial hostile prejudice. In general I ignore
off-topic complaints about posting style as they mostly come from fuss-pots
and are mainly noise - I do acknowledge that your comments here are informed
and useful. However, I'm not going to stop top-posting, because I strongly
prefer it, and I'm voting with my postings, as it were. I also rather like
OE, which happens to make bottom-posting awkward. Even if there was an
option to reverse OE's top-posting into bottom-posting I wouldn't use it.
Some folk will rail against violation of "standards", and always against
Microsoft, but there are more important issues in my own life. I've taken
on board those points the article made about quoting, though.

Amusingly, one of my hobby horses is postings which take you off-topic
without changing the subject line. Tut, tut... :-)

I'll get back to the HTML thread in a reply to Stephen...
--
######################
## PH, London ##
######################

"Darin McGrew" <mc****@stanfordalumni.org> wrote in message
news:be**********@blue.rahul.net...
> A: It's backwards and makes discussions harder to follow:
> http://www.cs.tut.fi/~jkorpela/usenet/brox.html
Jul 20 '05 #10
