By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
429,044 Members | 1,287 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 429,044 IT Pros & Developers. It's quick & easy.

How to read data from a PDF file?

P: n/a
Hi.

Now I'm working on a project one of the requirements of which is
reading data from a PDF file into a table. I have googled quite a long,
but couldn't find any appropriate method to do this. It's unbelievable
that there is no COM component for these purpose with automation
support which could be easily utilized in VBA. Could anyone help me
with this? What would you do?

Thanks in advance

Martin

Aug 13 '06 #1
Share this Question
Share on Google+
4 Replies


P: n/a
"martin-g" <ma******@mail.ruwrote in message
news:11**********************@i42g2000cwa.googlegr oups.com...
Hi.

Now I'm working on a project one of the requirements of which is
reading data from a PDF file into a table. I have googled quite a long,
but couldn't find any appropriate method to do this. It's unbelievable
that there is no COM component for these purpose with automation
support which could be easily utilized in VBA. Could anyone help me
with this? What would you do?
Well, I would first go to the company that made the pdf format creator, and
see what they offer. Further, I would check out their newsgroups, and
support for their products...

This is not really a ms-access question, but a problem that any developer
would have using a PDF file. I would think that going to the company and
using their newsgroups, support and community of people would be the first
choice.

However, you stopped by in this friendly newsgroup and appealed to ms-access
users. And, there is LOTS of help, and support for ms-access uses here!!

I would grab one of the many pdf to text creators. Google for this...you
find lots and lots.

Some converters have batch ability, and command line ability. That should
suffice here. I suppose a com object would be the best choice as you ask.

Anyway, after it is converted to a text file, you either import the data (if
it is in nice columnar format already), or, you write some code to parse out
the data.

There is lots of choices out there...I don't know which one to choose.

A good starting point would be to use ghostscript, and ghostview.
I would perhaps start out with ghostview (Google this), as it is a nice PDF
and post-script viewer, and does include text extraction abilities...

--
Albert D. Kallal (Access MVP)
Edmonton, Alberta Canada
pl*****************@msn.com
Aug 13 '06 #2

P: n/a
I was hoping to edit PDF files manually to add data and notes (notes
that would print with the PDF) to previously-created contracts. It was
astounding how difficult to accomplish this was, using the software of
the creator. I gave up and hoped to simply de-construct the PDF files
into a format which I could cut-and-paste or "drop as an object" into
my Access report. Again, no joy.

A little research turned-up PDF Converter by Nuance. They currently
own the rights to what was once Scansoft Omniform, Dragonspeak and a
few other clever programs. I bought PDF Converter (standard version)
and in less time than it takes to Google them, the PDF was converted
to a Microsoft Word document. It needed tidied-up, to be sure, but
without a single typo. Every line of text was correct and the majority
of the logos as well. If you could work-up a few macros (either in
Access or Word, or both) maybe this will help. At the least, I hope it
gives you ideas.

Regards,
The Woo
On 13 Aug 2006 01:32:52 -0700, "martin-g" <ma******@mail.ruwrote:
>Hi.

Now I'm working on a project one of the requirements of which is
reading data from a PDF file into a table. I have googled quite a long,
but couldn't find any appropriate method to do this. It's unbelievable
that there is no COM component for these purpose with automation
support which could be easily utilized in VBA. Could anyone help me
with this? What would you do?

Thanks in advance

Martin
Aug 13 '06 #3

P: n/a

Albert D. Kallal wrote:
However, you stopped by in this friendly newsgroup and appealed to ms-access
users. And, there is LOTS of help, and support for ms-access uses here!!
Yeah, thank you very much. I found some material at www.adobe.com, but
those SDK-s are too large for my project. As I suspected, I must fix
upon converters.

Martin

Aug 13 '06 #4

P: n/a
There are lots of PDF Libraries, that can be called from VB, allowing you to
work directly with PDF documents. I've looked at dozens over the years. A
quick Web search yielded multiple pages of hits.

One such Library that I use regularly is here:
www.dynaforms.com
--

HTH
Stephen Lebans
http://www.lebans.com
Access Code, Tips and Tricks
Please respond only to the newsgroups so everyone can benefit.
"martin-g" <ma******@mail.ruwrote in message
news:11**********************@i42g2000cwa.googlegr oups.com...
Hi.

Now I'm working on a project one of the requirements of which is
reading data from a PDF file into a table. I have googled quite a long,
but couldn't find any appropriate method to do this. It's unbelievable
that there is no COM component for these purpose with automation
support which could be easily utilized in VBA. Could anyone help me
with this? What would you do?

Thanks in advance

Martin

Aug 13 '06 #5

This discussion thread is closed

Replies have been disabled for this discussion.