Searching and extracting text from PDF

I'm a researcher and a Python beginner. I need to find keywords (say, earnings, sales, expenditure, etc.) in PDF documents and extract the paragraphs surrounding those keywords from the whole document.
I have tried pyPDF and PDFtools, but neither of them worked.
Can anybody suggest a module or an example?
Thanks a lot.

Nov 20 '07 #1
1 Reply

I would recommend Text Mining Tool.

Its features:

- No payment or license restrictions. Tool is absolutely free.
- Works as converter of PDF, DOC, RTF, CHM, HTML files to text.
- User-friendly interface with hotkeys available.
- Includes a console tool, minetext, for automating text conversion.
- .NET 2.0 framework based.
- No installation is needed. Just unpack the program and use it.
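
Once a PDF has been converted to plain text (for example with a converter like the one above), the keyword search itself needs only the Python standard library. Below is a minimal sketch, not a definitive solution. It assumes the converted text separates paragraphs with blank lines, which depends on the converter and the document. The function name `find_paragraphs` and its `context` parameter are made up for this illustration.

```python
import re


def find_paragraphs(text, keywords, context=0):
    """Return paragraphs that mention any keyword.

    `context` is the number of neighbouring paragraphs to include on
    each side of a match (hypothetical parameter for this sketch).
    """
    # Assumption: paragraphs are separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]

    # Case-insensitive, whole-word match on any of the keywords.
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(k) for k in keywords) + r")\b",
        re.IGNORECASE,
    )

    selected = set()
    for i, paragraph in enumerate(paragraphs):
        if pattern.search(paragraph):
            first = max(0, i - context)
            last = min(len(paragraphs), i + context + 1)
            selected.update(range(first, last))

    # Keep the original document order.
    return [paragraphs[i] for i in sorted(selected)]


if __name__ == "__main__":
    sample = "Intro text.\n\nEarnings rose this year.\n\nUnrelated.\n\nTotal sales fell."
    for paragraph in find_paragraphs(sample, ["earnings", "sales", "expenditure"]):
        print(paragraph)
        print()
```

For a real document, read the converted text file into a string and pass it as `text`. If the converter does not preserve blank lines between paragraphs, the splitting pattern would need to be adapted.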

Dec 13 '07 #2
