By using this site, you agree to our updated Privacy Policy and our Terms of Use. Manage your Cookies Settings.
431,883 Members | 1,952 Online
Bytes IT Community
+ Ask a Question
Need help? Post your question and get tips & solutions from a community of 431,883 IT Pros & Developers. It's quick & easy.

Need parser for PDF and DOC files

P: n/a
Hello All
I do implementation of search engine. And I need indexing PDF & DOC files.
Somebody known solutions for parse this files formats?

Jul 17 '05 #1
Share this Question
Share on Google+
1 Reply


P: n/a
Sergey Sedzyalo wrote:
Hello All
I do implementation of search engine. And I need indexing PDF & DOC files.
Somebody known solutions for parse this files formats?

Sergey,

What you require is a full text search engine. There are many such
implementations available. Apache Lucene is the only free one I am aware
of, but I do not think they have (native) MS Word or PDF support (i.e.,
you have to write a parser for it and it will do the indexing). But
there are commercial alternatives as well.

Ray
Jul 17 '05 #2

This discussion thread is closed

Replies have been disabled for this discussion.