Sergey Sedzyalo wrote:
Hello All
I do implementation of search engine. And I need indexing PDF & DOC files.
Somebody known solutions for parse this files formats?
Sergey,
What you require is a full text search engine. There are many such
implementations available. Apache Lucene is the only free one I am aware
of, but I do not think they have (native) MS Word or PDF support (i.e.,
you have to write a parser for it and it will do the indexing). But
there are commercial alternatives as well.
Ray