469,287 Members | 2,770 Online
Bytes | Developer Community
New Post

Home Posts Topics Members FAQ

Post your question to a community of 469,287 developers. It's quick & easy.

PDF Parser

i am trying to make a PDF parser in PHP which will be able to extract
data from PDF files. i want to basically convert pdf files to XML
data. any idea from where i could start?
Jul 17 '05 #1
4 16907
Farhan wrote:

i am trying to make a PDF parser in PHP which will be able to extract
data from PDF files. i want to basically convert pdf files to XML
data. any idea from where i could start?


http://ca2.php.net/manual/en/ref.pdf.php
You may find the links in the comments helpful.
I've never done anything with PDFs/PHP, but some of the tutorials looked
promising.

Regards,
Shawn
--
Shawn Wilson
sh***@glassgiant.com
http://www.glassgiant.com

I have a spam filter. Please include "PHP" in the
subject line to ensure I'll get your message.
Jul 17 '05 #2
thanks, shawn, for your reply. but PDFLib is not really what i am
looking for. i need to extract data from PDF files. someone at #php in
freenode told me that PDFLib with PID will be able to do that. but i
don't think PID comes along with PHP, we need to by it.

farhan
Jul 17 '05 #3
See my PDF highlighting code:

http://www.conradish.net/pdfhi.php.txt

Pay attention to line 451 to 462.

Uzytkownik "Farhan" <go**********@hotmail.com> napisal w wiadomosci
news:b6*************************@posting.google.co m...
thanks, shawn, for your reply. but PDFLib is not really what i am
looking for. i need to extract data from PDF files. someone at #php in
freenode told me that PDFLib with PID will be able to do that. but i
don't think PID comes along with PHP, we need to by it.

farhan

Jul 17 '05 #4
> See my PDF highlighting code:

http://www.conradish.net/pdfhi.php.txt

Pay attention to line 451 to 462.


thanks chung. i will try looking at the code some other time, because
the server you are ointing me to seems to be down right now. but just
a question - do you just search the binary data or do you follow the
Adobe PDF Specification? if you do, is it too complicated? thanks
again.

farhan
Jul 17 '05 #5

This discussion thread is closed

Replies have been disabled for this discussion.

Similar topics

13 posts views Thread by Paulo Pinto | last post: by
11 posts views Thread by Jean de Largentaye | last post: by
3 posts views Thread by Himanshu Garg | last post: by
5 posts views Thread by thewarden | last post: by
28 posts views Thread by Marc Gravell | last post: by
18 posts views Thread by Just Another Victim of the Ambient Morality | last post: by
reply views Thread by zhoujie | last post: by
reply views Thread by suresh191 | last post: by
By using this site, you agree to our Privacy Policy and Terms of Use.