473,569 Members | 2,799 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Reading Microsoft Word files

I need to read some (about 15,000) word documents and add some wrappers
around them for importing into a database. What is the best way to read a
..doc file in VB? I would like to save the formatting as .RTF so it can be
included.

I can do this in Word VBA, but I would prefer to do it in straight VB
(either VB6 or .Net).

Thanks for any help on this.

David
Jul 17 '05 #1
4 15264
Word Automation would work for you..

Basic connection to the Word object for automation:
http://www.veign.com/vrc_codeview.asp?type=app&id=62

--
Chris Hanscom - Microsoft MVP (VB)
Veign's Resource Center
http://www.veign.com/vrc_main.asp
--
Read. Decide. Sign the petition to Microsoft.
http://classicvb.org/petition/
"David Erickson" <er*******@eart hlink.net> wrote in message
news:6i******** ********@newsre ad2.news.atl.ea rthlink.net...
I need to read some (about 15,000) word documents and add some wrappers
around them for importing into a database. What is the best way to read a
.doc file in VB? I would like to save the formatting as .RTF so it can be
included.

I can do this in Word VBA, but I would prefer to do it in straight VB
(either VB6 or .Net).

Thanks for any help on this.

David

Jul 17 '05 #2
Veign,

Thank you. I did not think of Word automation. How is the performance for
this? If I am changing 1000 files each day, how much system overhead will
this cause?

David

"Veign" <me@home.com> wrote in message
news:8t******** ********@newsre ad3.news.atl.ea rthlink.net...
Word Automation would work for you..

Basic connection to the Word object for automation:
http://www.veign.com/vrc_codeview.asp?type=app&id=62

--
Chris Hanscom - Microsoft MVP (VB)
Veign's Resource Center
http://www.veign.com/vrc_main.asp
--
Read. Decide. Sign the petition to Microsoft.
http://classicvb.org/petition/
"David Erickson" <er*******@eart hlink.net> wrote in message
news:6i******** ********@newsre ad2.news.atl.ea rthlink.net...
I need to read some (about 15,000) word documents and add some wrappers
around them for importing into a database. What is the best way to read
a
.doc file in VB? I would like to save the formatting as .RTF so it can
be
included.

I can do this in Word VBA, but I would prefer to do it in straight VB
(either VB6 or .Net).

Thanks for any help on this.

David


Jul 17 '05 #3
I guess it really depends on your system and the size of the Word Documents.
The easiest way is to create a simple test...

--
Chris Hanscom - Microsoft MVP (VB)
Veign's Resource Center
http://www.veign.com/vrc_main.asp
--
Read. Decide. Sign the petition to Microsoft.
http://classicvb.org/petition/
"David Erickson" <er*******@eart hlink.net> wrote in message
news:yA******** *********@newsr ead2.news.atl.e arthlink.net...
Veign,

Thank you. I did not think of Word automation. How is the performance for this? If I am changing 1000 files each day, how much system overhead will
this cause?

David

"Veign" <me@home.com> wrote in message
news:8t******** ********@newsre ad3.news.atl.ea rthlink.net...
Word Automation would work for you..

Basic connection to the Word object for automation:
http://www.veign.com/vrc_codeview.asp?type=app&id=62

--
Chris Hanscom - Microsoft MVP (VB)
Veign's Resource Center
http://www.veign.com/vrc_main.asp
--
Read. Decide. Sign the petition to Microsoft.
http://classicvb.org/petition/
"David Erickson" <er*******@eart hlink.net> wrote in message
news:6i******** ********@newsre ad2.news.atl.ea rthlink.net...
I need to read some (about 15,000) word documents and add some wrappers
around them for importing into a database. What is the best way to read a
.doc file in VB? I would like to save the formatting as .RTF so it can
be
included.

I can do this in Word VBA, but I would prefer to do it in straight VB
(either VB6 or .Net).

Thanks for any help on this.

David



Jul 17 '05 #4
Thanks for your help. I have tried this and it works okay, but very slowly.
I thought that was going to be okay, until I talked to the user again and
found out there are actually more than 100,000 documents to process.

What I need to do is:
1. Read the doc file
2. extract account number, name, and date
3. convert to RTF
4. Add an HL7 wrapper around the RTF file
5. save into the application's HL7 inbox.

Is there a package that will allow me to open a .doc file more quickly than
word automation?

Thanks again!
Jul 17 '05 #5

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
5157
by: hokiegal99 | last post by:
This is not really a Python-centric question, however, I am using Python to solve this problem (as of now) so I thought it appropiate to pose the question here. I have some functions that search for files that contain certian strings and if the files found to have these string do not already have a filename extension (such as '.doc' or...
8
18215
by: Phil Slater | last post by:
I'm trying to process a collection of text files, reading word by word. The program run hangs whenever it encounters a word with an accented letter (like rôle or passé) - ie something that's not a "char" with an ASCII code in 0..127 I've searched the ANSI C++ standard, the internet and various text books, but can't see how to workaround...
4
12793
by: Erpman | last post by:
I am trying to access the data with in a wav file. I am testing with very small files in order to keep the code simple to start with. Basically, im writing the entire wav file to a byte using a fileStream. The problem is that when reading back the data word by word and printing the results on a general form, once the data chunk header has...
10
1685
by: T Cordon | last post by:
I am using a StreamReader to read text from an HTML file and display it as part of a page in a Label Control. Buy it is not displaying characters as: ñ, ó, ú, etc. Please Help. Thanks
2
1064
by: Stu | last post by:
Hi, I have a web app that needs to receive freight booking in MS Word format. There are a number of tables in the docs that contain the relevant data (origin/destination etc.). Can someone please point me in the right direction for the most reliable way of doing this. Also, do I need Word installed on the server or can I get away with...
2
11638
by: Eshban Bahadur | last post by:
Hello, I want to read (RTF) files in my VB.NET programme. How can i do it. I apply the same method of reading text files, but it does not save the formatting of text like (bold, italics, underline) etc. So , how can i make programme which allows me to read (RTF) Rich Text Format files or Microsoft WORD files in VB.NET through FILE HANDLING....
3
4647
by: tigrrgrr42 | last post by:
I am working(vb.net03and05) with word documents stored in a sql db and I am currently bringing them from a byte array into a temp file to pop into word and make word do its thing as a com object. Is it possible to go straight from a byte array to document in word instead of a temp file? Also is there a way to change the save behaviour to...
3
3925
by: ahammad | last post by:
Well, the title pretty much describes what I want to do. I want to be able to read the contents of a Word document (*.doc). I also want to be able to read it to a CString object, and then search that CString object for specific substrings. The problem I am having is reading the Word file itself. I have tried numerous things but no luck. Also,...
1
2427
navanova
by: navanova | last post by:
Greetings, I have a problem of opening ms word and excel files on my computer. The files are there for a long time. I use to open and modify them. Suddenly, when i try to open the word files, a dialog box appears that says "There was an error opening the file". when i try to open the excel files, a dialog box appears that says "File format not...
0
7697
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main...
0
7924
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. ...
0
8120
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that...
0
7968
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each protocol has its own unique characteristics and advantages, but as a user who is planning to build a smart home system, I am a bit confused by the...
0
6283
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then...
1
5512
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes...
0
3653
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in...
0
3640
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
1
2113
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.