Connecting Tech Pros Worldwide Forums | Help | Site Map

Read / retrieve data from PDF files

Member
 
Join Date: Aug 2006
Posts: 110
#1: Apr 12 '07
Dear Friends,

I posted this query 4 days ago and I am still waiting for the reply from you all. I really need your support.......plz

I have some pdf files on my database/FTP. Is there any process so that I can retrieve data from those pdf files with some search keywords?

I need the same result as we get on google. Is this possible in PHP?

Please advice.........

Thanks
Deepak

Moderator
 
Join Date: Nov 2006
Location: Boston, USA
Posts: 505
#2: Apr 13 '07

re: Read / retrieve data from PDF files


You may need to employ some third-party tools. This site lists quite a few, you may also look at PowerGrep, and run a Google query on something like "text search pdf" - it will give you some tools to consider.
Motoma's Avatar
Moderator
 
Join Date: Jan 2007
Location: Maine, USA
Posts: 2,904
#3: Apr 14 '07

re: Read / retrieve data from PDF files


Essentially, a PDF is gzipped XML. Using this knowledge you could write a tool to correctly uncompress and analyze the data.
Or you could use free tools like FPDF and FPDI.
Reply