Connecting Tech Pros Worldwide Help | Site Map

Zend Lucene too slow to index question

sliverdigger
Guest
 
Posts: n/a
#1: Aug 24 '08
I have been experimenting with Zend/Php/Lucene.
I have used Lucene extensively as Java in Tomcat Servlets.

Zend's php Lucene works the same way, but it is much slower
to index for some reason--way too slow to ever finish,
when working a moderately sized site--on a shared host.

I could try to figure out how to break the site up into chunks
and then index once and then update many times.

Are there any zippier alternatives to Zend/Lucene?
(google Custom Search give me poor match results).

--
cut the board three times and it's still too short
sliverdigger
Guest
 
Posts: n/a
#2: Aug 24 '08

re: Zend Lucene too slow to index question


sliverdigger wrote:
Quote:
I have been experimenting with Zend/Php/Lucene.
I have used Lucene extensively as Java in Tomcat Servlets.
>
Zend's php Lucene works the same way, but it is much slower
to index for some reason--way too slow to ever finish,
when working a moderately sized site--on a shared host.
>
I could try to figure out how to break the site up into chunks
and then index once and then update many times.
>
Are there any zippier alternatives to Zend/Lucene?
(google Custom Search give me poor match results).
>

....hhmmmmmmmm.
Might have to try making the index with java, and then use
php only for queries. Not sure it's all that compatible.

--
cut the board three times and it's still too short
C. (http://symcbean.blogspot.com/)
Guest
 
Posts: n/a
#3: Aug 24 '08

re: Zend Lucene too slow to index question


On 24 Aug, 17:02, sliverdigger <sliverdig...@closenuf.netwrote:
Quote:
I have been experimenting with Zend/Php/Lucene.
I have used Lucene extensively as Java in Tomcat Servlets.
>
Zend's php Lucene works the same way, but it is much slower
to index for some reason--way too slow to ever finish,
when working a moderately sized site--on a shared host.
>
I could try to figure out how to break the site up into chunks
and then index once and then update many times.
>
Are there any zippier alternatives to Zend/Lucene?
(google Custom Search give me poor match results).
>
I've never used Lucene to have it to compare to but mnogo is very fast
in my experience. But its mostly written in C - which can be tricky to
install on a shared host (but if they do java....?)

C.
sliverdigger
Guest
 
Posts: n/a
#4: Aug 25 '08

re: Zend Lucene too slow to index question


C. (http://symcbean.blogspot.com/) wrote:
Quote:
On 24 Aug, 17:02, sliverdigger <sliverdig...@closenuf.netwrote:
Quote:
>I have been experimenting with Zend/Php/Lucene.
>I have used Lucene extensively as Java in Tomcat Servlets.
>>
>Zend's php Lucene works the same way, but it is much slower
>to index for some reason--way too slow to ever finish,
>when working a moderately sized site--on a shared host.
>>
>I could try to figure out how to break the site up into chunks
>and then index once and then update many times.
>>
>Are there any zippier alternatives to Zend/Lucene?
>(google Custom Search give me poor match results).
>>
>
I've never used Lucene to have it to compare to but mnogo is very fast
in my experience. But its mostly written in C - which can be tricky to
install on a shared host (but if they do java....?)
>
C.
Thanks, I'll look at that. Someone did tell me: yes, you
can create the Lucene index with a java application (no worries
about execution time) and then scp it up to the website, and
then query it with Php/Lucene from Zend. So I'll probably do that.



--
cut the board three times and it's still too short
Hans-Werner Hilse
Guest
 
Posts: n/a
#5: Aug 26 '08

re: Zend Lucene too slow to index question


Hi,

sliverdigger wrote:
Quote:
Are there any zippier alternatives to Zend/Lucene?
(google Custom Search give me poor match results).
Depends. A little bit different approach would be to use YAZ extension
for PHP as communication means to a (independent) Indexdata Zebra
server (which is a Z39.50 based index server, which is really good at
indexing many structured formats, especially XML). But it's certainly
more work than a quickly set up Lucene. Both YAZ and Zebra are GPLed,
but you can get commercial support from Indexdata.

This is e.g. used by the Koha library management software (Free
Software).

-hwh
Closed Thread