Hi there,
i have some troubles with my TSearch2 Installation. I have done this
installation as described in http://www.sai.msu.su/~megera/oddmus...compound_words <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words>
I used the german myspell dictionary from http://lingucomponent.openoffice.org/spell_dic.html and converted it with
my2ispell
Nearly everything is working fine so far, except two problems:
1.) The stopword-file seems to be ignored: If i try it with SELECT
to_tsvector("de fault_german", "ein Haus") i get
"ein":1 "haus":2
ein should be a Stopword for german (and is defined the german.stop file as
well)
2.) The compound words feature doesn"t work too. I have tried a lot of words,
i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", "Fehlermeldung" )
i only get
"fehlermeldung" :1 but i would expect "fehler" and "meldung" as seperated
entries. Is there anything wrong with the dictonary or my configuration?
My current configuration:
pg_ts_cfg:
default default C
default_russian default ru_RU.KOI8-R
simple default NULL
default_german default de_DE.ISO8859-1
pg_ts_cfgmap:
default_german host {simple}
default_german hword {simple}
default_german int {simple}
default_german nlhword {simple}
default_german nlpart_hword {simple}
default_german nlword {simple}
default_german part_hword {simple}
default_german sfloat {simple}
default_german uint {simple}
default_german uri {simple}
default_german url {simple}
default_german version {simple}
default_german word {simple}
default_german lpart_hword {de_ispell,germ an_snowball}
default_german lword {de_ispell,germ an_snowball}
default_german lhword {de_ispell,germ an_snowball}
pg_ts_dict:
de_ispell | 17166 |
DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict",
AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff",
StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | 17167 | NULL
german_snowball | 17357 | NULL | 17162 | Snowball stemmer for german
Can anyone help me?
regards
Timo
---------------------------(end of broadcast)---------------------------
TIP 4: Don't 'kill -9' the postmaster 7 5460
Timo,
I forward your message to openfts mailing list.
Also, could you specify if locale settings are correct for your
database and what dictionary you have downloaded.
Oleg
On Fri, 5 Nov 2004, Timo Haberkern wrote: Hi there,
i have some troubles with my TSearch2 Installation. I have done this installation as described in http://www.sai.msu.su/~megera/oddmus...compound_words <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words>
I used the german myspell dictionary from http://lingucomponent.openoffice.org/spell_dic.html and converted it with my2ispell
Nearly everything is working fine so far, except two problems:
1.) The stopword-file seems to be ignored: If i try it with SELECT to_tsvector("de fault_german", "ein Haus") i get "ein":1 "haus":2
ein should be a Stopword for german (and is defined the german.stop file as well)
2.) The compound words feature doesn"t work too. I have tried a lot of words, i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", "Fehlermeldung" ) i only get "fehlermeldung" :1 but i would expect "fehler" and "meldung" as seperated entries. Is there anything wrong with the dictonary or my configuration?
My current configuration:
pg_ts_cfg:
default default C default_russian default ru_RU.KOI8-R simple default NULL default_german default de_DE.ISO8859-1 pg_ts_cfgmap:
default_german host {simple} default_german hword {simple} default_german int {simple} default_german nlhword {simple} default_german nlpart_hword {simple} default_german nlword {simple} default_german part_hword {simple} default_german sfloat {simple} default_german uint {simple} default_german uri {simple} default_german url {simple} default_german version {simple} default_german word {simple} default_german lpart_hword {de_ispell,germ an_snowball} default_german lword {de_ispell,germ an_snowball} default_german lhword {de_ispell,germ an_snowball}
pg_ts_dict:
de_ispell | 17166 | DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict", AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | 17167 | NULL german_snowball | 17357 | NULL | 17162 | Snowball stemmer for german Can anyone help me?
regards
Timo
---------------------------(end of broadcast)--------------------------- TIP 4: Don't 'kill -9' the postmaster
Regards,
Oleg
_______________ _______________ _______________ _______________ _
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)---------------------------
TIP 7: don't forget to increase your free space map settings
Timo,
please, check you apply patch for compound word support.
What is version of postgresql ?
Does ispell dict works for non-compound words ?
Oleg
On Fri, 5 Nov 2004, Timo Haberkern wrote: Hi there,
i have some troubles with my TSearch2 Installation. I have done this installation as described in http://www.sai.msu.su/~megera/oddmus...compound_words <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words>
I used the german myspell dictionary from http://lingucomponent.openoffice.org/spell_dic.html and converted it with my2ispell
Nearly everything is working fine so far, except two problems:
1.) The stopword-file seems to be ignored: If i try it with SELECT to_tsvector("de fault_german", "ein Haus") i get "ein":1 "haus":2
ein should be a Stopword for german (and is defined the german.stop file as well)
2.) The compound words feature doesn"t work too. I have tried a lot of words, i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", "Fehlermeldung" ) i only get "fehlermeldung" :1 but i would expect "fehler" and "meldung" as seperated entries. Is there anything wrong with the dictonary or my configuration?
My current configuration:
pg_ts_cfg:
default default C default_russian default ru_RU.KOI8-R simple default NULL default_german default de_DE.ISO8859-1 pg_ts_cfgmap:
default_german host {simple} default_german hword {simple} default_german int {simple} default_german nlhword {simple} default_german nlpart_hword {simple} default_german nlword {simple} default_german part_hword {simple} default_german sfloat {simple} default_german uint {simple} default_german uri {simple} default_german url {simple} default_german version {simple} default_german word {simple} default_german lpart_hword {de_ispell,germ an_snowball} default_german lword {de_ispell,germ an_snowball} default_german lhword {de_ispell,germ an_snowball}
pg_ts_dict:
de_ispell | 17166 | DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict", AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | 17167 | NULL german_snowball | 17357 | NULL | 17162 | Snowball stemmer for german Can anyone help me?
regards
Timo
---------------------------(end of broadcast)--------------------------- TIP 4: Don't 'kill -9' the postmaster
Regards,
Oleg
_______________ _______________ _______________ _______________ _
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)---------------------------
TIP 2: you can get off all lists at once with the unregister command
(send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Oleg,
i use TSearch2 with PostgreSQL 7.4.6 and i applied the compoundword
patch yesterday. The configuration changed a little bit but the result
is the same. I get no compound words. I'm using the locale de_DE with
encoding ISO8859-1 for the database.
I think i spell is working correctly except the compound words. If i try
SELECT lexize('de_ispe ll', 'springt')
i get
lexize
{springen,sprin gen}
which seems correct.
But a SELECT lexize('de_ispe ll', 'Autobahn')
results in
lexize
{autobahn}
i would expect {auto,bahn, autobahn}
The new configuration after the compound word patch:
Actions dict_name
<http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=2&sor tdir=asc&string s=expanded&page =1>
dict_init
<http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=3&sor tdir=asc&string s=expanded&page =1>
dict_initoption
<http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=4&sor tdir=asc&string s=expanded&page =1>
dict_lexize
<http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=5&sor tdir=asc&string s=expanded&page =1>
dict_comment
<http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=6&sor tdir=asc&string s=expanded&page =1>
Edit
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s imple&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=>
Delete
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=si mple&database=s elina_rotex&sch ema=public&tabl e=pg_ts_dict&re turn_url=tblpro perties.php%3Fd atabase%3Dselin a_rotex%26amp%3 Bschema%3Dpubli c%26table%3Dpg_ ts_dict&return_ desc=Back&sortk ey=&sortdir=>
simple dex_init(text) /NULL/ dex_lexize(inte rnal,internal,i nteger)
Simple example of dictionary.
Edit
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=e n_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=>
Delete
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=en _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=>
en_stem snb_en_init(tex t)
/usr/local/pgsql/share/contrib/english.stop
snb_lexize(inte rnal,internal,i nteger) English Stemmer. Snowball.
Edit
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=r u_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=>
Delete
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=ru _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=>
ru_stem snb_ru_init(tex t)
/usr/local/pgsql/share/contrib/russian.stop
snb_lexize(inte rnal,internal,i nteger) Russian Stemmer. Snowball.
Edit
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=i spell_template& database=selina _rotex&schema=p ublic&table=pg_ ts_dict&return_ url=tblproperti es.php%3Fdataba se%3Dselina_rot ex%26amp%3Bsche ma%3Dpublic%26t able%3Dpg_ts_di ct&return_desc= Back&sortkey=&s ortdir=>
Delete
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=is pell_template&d atabase=selina_ rotex&schema=pu blic&table=pg_t s_dict&return_u rl=tblpropertie s.php%3Fdatabas e%3Dselina_rote x%26amp%3Bschem a%3Dpublic%26ta ble%3Dpg_ts_dic t&return_desc=B ack&sortkey=&so rtdir=>
ispell_template spell_init(text ) /NULL/
spell_lexize(in ternal,internal ,integer) ISpell interface. Must have
..dict and .aff files
Edit
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s ynonym&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=>
Delete
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=sy nonym&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=>
synonym syn_init(text) /NULL/
syn_lexize(inte rnal,internal,i nteger) Example of synonym dictionary
Edit
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=d e_ispell&databa se=selina_rotex &schema=public& table=pg_ts_dic t&return_url=tb lproperties.php %3Fdatabase%3Ds elina_rotex%26a mp%3Bschema%3Dp ublic%26table%3 Dpg_ts_dict&ret urn_desc=Back&s ortkey=&sortdir =>
Delete
<http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=de _ispell&databas e=selina_rotex& schema=public&t able=pg_ts_dict &return_url=tbl properties.php% 3Fdatabase%3Dse lina_rotex%26am p%3Bschema%3Dpu blic%26table%3D pg_ts_dict&retu rn_desc=Back&so rtkey=&sortdir= >
de_ispell spell_init(text )
DictFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.dic t",
AffFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.aff ",
StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop"
spell_lexize(in ternal,internal ,integer) /NULL/
Timo
Oleg Bartunov wrote: Timo,
please, check you apply patch for compound word support. What is version of postgresql ? Does ispell dict works for non-compound words ?
Oleg
On Fri, 5 Nov 2004, Timo Haberkern wrote:
Hi there,
i have some troubles with my TSearch2 Installation. I have done this installation as described in http://www.sai.msu.su/~megera/oddmus...compound_words <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words>
I used the german myspell dictionary from http://lingucomponent.openoffice.org/spell_dic.html and converted it with my2ispell
Nearly everything is working fine so far, except two problems:
1.) The stopword-file seems to be ignored: If i try it with SELECT to_tsvector("de fault_german", "ein Haus") i get "ein":1 "haus":2
ein should be a Stopword for german (and is defined the german.stop file as well)
2.) The compound words feature doesn"t work too. I have tried a lot of words, i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", "Fehlermeldung" ) i only get "fehlermeldung" :1 but i would expect "fehler" and "meldung" as seperated entries. Is there anything wrong with the dictonary or my configuration?
My current configuration:
pg_ts_cfg:
default default C default_russian default ru_RU.KOI8-R simple default NULL default_german default de_DE.ISO8859-1 pg_ts_cfgmap:
default_german host {simple} default_german hword {simple} default_german int {simple} default_german nlhword {simple} default_german nlpart_hword {simple} default_german nlword {simple} default_german part_hword {simple} default_german sfloat {simple} default_german uint {simple} default_german uri {simple} default_german url {simple} default_german version {simple} default_german word {simple} default_german lpart_hword {de_ispell,germ an_snowball} default_german lword {de_ispell,germ an_snowball} default_german lhword {de_ispell,germ an_snowball}
pg_ts_dict:
de_ispell | 17166 | DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict", AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | 17167 | NULL german_snowball | 17357 | NULL | 17162 | Snowball stemmer for german Can anyone help me?
regards
Timo
---------------------------(end of broadcast)--------------------------- TIP 4: Don't 'kill -9' the postmaster
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
---------------------------(end of broadcast)---------------------------
TIP 3: if posting/reading through Usenet, please send an appropriate
subscribe-nomail command to ma*******@postg resql.org so that your
message can get through to the mailing list cleanly
On Fri, 5 Nov 2004, Timo Haberkern wrote: Oleg,
i use TSearch2 with PostgreSQL 7.4.6 and i applied the compoundword patch yesterday. The configuration changed a little bit but the result is the same. I get no compound words. I'm using the locale de_DE with encoding ISO8859-1 for the database.
I think i spell is working correctly except the compound words. If i try
SELECT lexize('de_ispe ll', 'springt')
i get
lexize {springen,sprin gen}
which seems correct.
But a SELECT lexize('de_ispe ll', 'Autobahn')
results in
lexize {autobahn}
i would expect {auto,bahn, autobahn}
Hmm, have you checked 'Autobahn' in ispell dictionary ? Does dictionary
you used supports 'Z' flag for compound words ? The new configuration after the compound word patch:
Seems you overestimate my capabilities :) Actions dict_name <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=2&sor tdir=asc&string s=expanded&page =1> dict_init <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=3&sor tdir=asc&string s=expanded&page =1> dict_initoption <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=4&sor tdir=asc&string s=expanded&page =1> dict_lexize <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=5&sor tdir=asc&string s=expanded&page =1> dict_comment <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=6&sor tdir=asc&string s=expanded&page =1> Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s imple&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=si mple&database=s elina_rotex&sch ema=public&tabl e=pg_ts_dict&re turn_url=tblpro perties.php%3Fd atabase%3Dselin a_rotex%26amp%3 Bschema%3Dpubli c%26table%3Dpg_ ts_dict&return_ desc=Back&sortk ey=&sortdir=> simple dex_init(text) /NULL/ dex_lexize(inte rnal,internal,i nteger) Simple example of dictionary. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=e n_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=en _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> en_stem snb_en_init(tex t) /usr/local/pgsql/share/contrib/english.stop snb_lexize(inte rnal,internal,i nteger) English Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=r u_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=ru _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> ru_stem snb_ru_init(tex t) /usr/local/pgsql/share/contrib/russian.stop snb_lexize(inte rnal,internal,i nteger) Russian Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=i spell_template& database=selina _rotex&schema=p ublic&table=pg_ ts_dict&return_ url=tblproperti es.php%3Fdataba se%3Dselina_rot ex%26amp%3Bsche ma%3Dpublic%26t able%3Dpg_ts_di ct&return_desc= Back&sortkey=&s ortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=is pell_template&d atabase=selina_ rotex&schema=pu blic&table=pg_t s_dict&return_u rl=tblpropertie s.php%3Fdatabas e%3Dselina_rote x%26amp%3Bschem a%3Dpublic%26ta ble%3Dpg_ts_dic t&return_desc=B ack&sortkey=&so rtdir=> ispell_template spell_init(text ) /NULL/ spell_lexize(in ternal,internal ,integer) ISpell interface. Must have .dict and .aff files Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s ynonym&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=sy nonym&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> synonym syn_init(text) /NULL/ syn_lexize(inte rnal,internal,i nteger) Example of synonym dictionary Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=d e_ispell&databa se=selina_rotex &schema=public& table=pg_ts_dic t&return_url=tb lproperties.php %3Fdatabase%3Ds elina_rotex%26a mp%3Bschema%3Dp ublic%26table%3 Dpg_ts_dict&ret urn_desc=Back&s ortkey=&sortdir => Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=de _ispell&databas e=selina_rotex& schema=public&t able=pg_ts_dict &return_url=tbl properties.php% 3Fdatabase%3Dse lina_rotex%26am p%3Bschema%3Dpu blic%26table%3D pg_ts_dict&retu rn_desc=Back&so rtkey=&sortdir= > de_ispell spell_init(text ) DictFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.dic t", AffFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.aff ", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" spell_lexize(in ternal,internal ,integer) /NULL/ Timo
Oleg Bartunov wrote:
Timo,
please, check you apply patch for compound word support. What is version of postgresql ? Does ispell dict works for non-compound words ?
Oleg
On Fri, 5 Nov 2004, Timo Haberkern wrote:
Hi there,
i have some troubles with my TSearch2 Installation. I have done this installation as described in http://www.sai.msu.su/~megera/oddmus...compound_words <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words>
I used the german myspell dictionary from http://lingucomponent.openoffice.org/spell_dic.html and converted it with my2ispell
Nearly everything is working fine so far, except two problems:
1.) The stopword-file seems to be ignored: If i try it with SELECT to_tsvector("de fault_german", "ein Haus") i get "ein":1 "haus":2
ein should be a Stopword for german (and is defined the german.stop file as well)
2.) The compound words feature doesn"t work too. I have tried a lot of words, i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", "Fehlermeldung" ) i only get "fehlermeldung" :1 but i would expect "fehler" and "meldung" as seperated entries. Is there anything wrong with the dictonary or my configuration?
My current configuration:
pg_ts_cfg:
default default C default_russian default ru_RU.KOI8-R simple default NULL default_german default de_DE.ISO8859-1 pg_ts_cfgmap:
default_german host {simple} default_german hword {simple} default_german int {simple} default_german nlhword {simple} default_german nlpart_hword {simple} default_german nlword {simple} default_german part_hword {simple} default_german sfloat {simple} default_german uint {simple} default_german uri {simple} default_german url {simple} default_german version {simple} default_german word {simple} default_german lpart_hword {de_ispell,germ an_snowball} default_german lword {de_ispell,germ an_snowball} default_german lhword {de_ispell,germ an_snowball}
pg_ts_dict:
de_ispell | 17166 | DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict", AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | 17167 | NULL german_snowball | 17357 | NULL | 17162 | Snowball stemmer for german Can anyone help me?
regards
Timo
---------------------------(end of broadcast)--------------------------- TIP 4: Don't 'kill -9' the postmaster
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Regards,
Oleg
_______________ _______________ _______________ _______________ _
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)---------------------------
TIP 2: you can get off all lists at once with the unregister command
(send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
sorry for the late answer, i was on holyday,
see my remarks below
Oleg Bartunov wrote: On Fri, 5 Nov 2004, Timo Haberkern wrote:
Oleg,
i use TSearch2 with PostgreSQL 7.4.6 and i applied the compoundword patch yesterday. The configuration changed a little bit but the result is the same. I get no compound words. I'm using the locale de_DE with encoding ISO8859-1 for the database.
I think i spell is working correctly except the compound words. If i try
SELECT lexize('de_ispe ll', 'springt')
i get
lexize {springen,sprin gen}
which seems correct.
But a SELECT lexize('de_ispe ll', 'Autobahn')
results in
lexize {autobahn}
i would expect {auto,bahn, autobahn}
Hmm, have you checked 'Autobahn' in ispell dictionary ? Does dictionary you used supports 'Z' flag for compound words ?
Autobahn is in the ispell dictionary. What does a ispell dictionary
need to support the Z flag???
Timo
The new configuration after the compound word patch:
Seems you overestimate my capabilities :)
Actions dict_name <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=2&sor tdir=asc&string s=expanded&page =1> dict_init <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=3&sor tdir=asc&string s=expanded&page =1> dict_initoption <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=4&sor tdir=asc&string s=expanded&page =1> dict_lexize <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=5&sor tdir=asc&string s=expanded&page =1> dict_comment <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=6&sor tdir=asc&string s=expanded&page =1> Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s imple&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=si mple&database=s elina_rotex&sch ema=public&tabl e=pg_ts_dict&re turn_url=tblpro perties.php%3Fd atabase%3Dselin a_rotex%26amp%3 Bschema%3Dpubli c%26table%3Dpg_ ts_dict&return_ desc=Back&sortk ey=&sortdir=> simple dex_init(text) /NULL/ dex_lexize(inte rnal,internal,i nteger) Simple example of dictionary. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=e n_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=en _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> en_stem snb_en_init(tex t) /usr/local/pgsql/share/contrib/english.stop snb_lexize(inte rnal,internal,i nteger) English Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=r u_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=ru _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> ru_stem snb_ru_init(tex t) /usr/local/pgsql/share/contrib/russian.stop snb_lexize(inte rnal,internal,i nteger) Russian Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=i spell_template& database=selina _rotex&schema=p ublic&table=pg_ ts_dict&return_ url=tblproperti es.php%3Fdataba se%3Dselina_rot ex%26amp%3Bsche ma%3Dpublic%26t able%3Dpg_ts_di ct&return_desc= Back&sortkey=&s ortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=is pell_template&d atabase=selina_ rotex&schema=pu blic&table=pg_t s_dict&return_u rl=tblpropertie s.php%3Fdatabas e%3Dselina_rote x%26amp%3Bschem a%3Dpublic%26ta ble%3Dpg_ts_dic t&return_desc=B ack&sortkey=&so rtdir=> ispell_template spell_init(text ) /NULL/ spell_lexize(in ternal,internal ,integer) ISpell interface. Must have .dict and .aff files Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s ynonym&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=sy nonym&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> synonym syn_init(text) /NULL/ syn_lexize(inte rnal,internal,i nteger) Example of synonym dictionary Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=d e_ispell&databa se=selina_rotex &schema=public& table=pg_ts_dic t&return_url=tb lproperties.php %3Fdatabase%3Ds elina_rotex%26a mp%3Bschema%3Dp ublic%26table%3 Dpg_ts_dict&ret urn_desc=Back&s ortkey=&sortdir => Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=de _ispell&databas e=selina_rotex& schema=public&t able=pg_ts_dict &return_url=tbl properties.php% 3Fdatabase%3Dse lina_rotex%26am p%3Bschema%3Dpu blic%26table%3D pg_ts_dict&retu rn_desc=Back&so rtkey=&sortdir= > de_ispell spell_init(text ) DictFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.dic t", AffFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.aff ", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" spell_lexize(in ternal,internal ,integer) /NULL/ Timo
Oleg Bartunov wrote:
Timo,
please, check you apply patch for compound word support. What is version of postgresql ? Does ispell dict works for non-compound words ?
Oleg
On Fri, 5 Nov 2004, Timo Haberkern wrote:
Hi there,
i have some troubles with my TSearch2 Installation. I have done this installation as described in http://www.sai.msu.su/~megera/oddmus...compound_words <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words>
I used the german myspell dictionary from http://lingucomponent.openoffice.org/spell_dic.html and converted it with my2ispell
Nearly everything is working fine so far, except two problems:
1.) The stopword-file seems to be ignored: If i try it with SELECT to_tsvector("de fault_german", "ein Haus") i get "ein":1 "haus":2
ein should be a Stopword for german (and is defined the german.stop file as well)
2.) The compound words feature doesn"t work too. I have tried a lot of words, i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", "Fehlermeldung" ) i only get "fehlermeldung" :1 but i would expect "fehler" and "meldung" as seperated entries. Is there anything wrong with the dictonary or my configuration?
My current configuration:
pg_ts_cfg:
default default C default_russian default ru_RU.KOI8-R simple default NULL default_german default de_DE.ISO8859-1 pg_ts_cfgmap:
default_german host {simple} default_german hword {simple} default_german int {simple} default_german nlhword {simple} default_german nlpart_hword {simple} default_german nlword {simple} default_german part_hword {simple} default_german sfloat {simple} default_german uint {simple} default_german uri {simple} default_german url {simple} default_german version {simple} default_german word {simple} default_german lpart_hword {de_ispell,germ an_snowball} default_german lword {de_ispell,germ an_snowball} default_german lhword {de_ispell,germ an_snowball}
pg_ts_dict:
de_ispell | 17166 | DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict", AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | 17167 | NULL german_snowball | 17357 | NULL | 17162 | Snowball stemmer for german Can anyone help me?
regards
Timo
---------------------------(end of broadcast)--------------------------- TIP 4: Don't 'kill -9' the postmaster
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
---------------------------(end of broadcast)---------------------------
TIP 6: Have you searched our list archives? http://archives.postgresql.org
On Wed, 17 Nov 2004, Timo Haberkern wrote: sorry for the late answer, i was on holyday,
see my remarks below
Oleg Bartunov wrote:
On Fri, 5 Nov 2004, Timo Haberkern wrote:
Oleg,
i use TSearch2 with PostgreSQL 7.4.6 and i applied the compoundword patch yesterday. The configuration changed a little bit but the result is the same. I get no compound words. I'm using the locale de_DE with encoding ISO8859-1 for the database.
I think i spell is working correctly except the compound words. If i try
SELECT lexize('de_ispe ll', 'springt')
i get
lexize {springen,sprin gen}
which seems correct.
But a SELECT lexize('de_ispe ll', 'Autobahn')
results in
lexize {autobahn}
i would expect {auto,bahn, autobahn}
Hmm, have you checked 'Autobahn' in ispell dictionary ? Does dictionary you used supports 'Z' flag for compound words ?
Autobahn is in the ispell dictionary. What does a ispell dictionary need to support the Z flag???
Try ispell -C Autobahn
search 'compound' in 'man ispell' for details.
the problem exists only if ispell *does* splits word correctly while tsearch2
doesn't. You should find correct ispell dictionary for german or create it
yourself. You may consult monzilla.net http://staff.science.uva.nl/~christo...roject-dr.html Timo
The new configuration after the compound word patch:
Seems you overestimate my capabilities :)
Actions dict_name <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=2&sor tdir=asc&string s=expanded&page =1> dict_init <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=3&sor tdir=asc&string s=expanded&page =1> dict_initoption <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=4&sor tdir=asc&string s=expanded&page =1> dict_lexize <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=5&sor tdir=asc&string s=expanded&page =1> dict_comment <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=6&sor tdir=asc&string s=expanded&page =1> Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s imple&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=si mple&database=s elina_rotex&sch ema=public&tabl e=pg_ts_dict&re turn_url=tblpro perties.php%3Fd atabase%3Dselin a_rotex%26amp%3 Bschema%3Dpubli c%26table%3Dpg_ ts_dict&return_ desc=Back&sortk ey=&sortdir=> simple dex_init(text) /NULL/ dex_lexize(inte rnal,internal,i nteger) Simple example of dictionary. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=e n_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=en _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> en_stem snb_en_init(tex t) /usr/local/pgsql/share/contrib/english.stop snb_lexize(inte rnal,internal,i nteger) English Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=r u_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=ru _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> ru_stem snb_ru_init(tex t) /usr/local/pgsql/share/contrib/russian.stop snb_lexize(inte rnal,internal,i nteger) Russian Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=i spell_template& database=selina _rotex&schema=p ublic&table=pg_ ts_dict&return_ url=tblproperti es.php%3Fdataba se%3Dselina_rot ex%26amp%3Bsche ma%3Dpublic%26t able%3Dpg_ts_di ct&return_desc= Back&sortkey=&s ortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=is pell_template&d atabase=selina_ rotex&schema=pu blic&table=pg_t s_dict&return_u rl=tblpropertie s.php%3Fdatabas e%3Dselina_rote x%26amp%3Bschem a%3Dpublic%26ta ble%3Dpg_ts_dic t&return_desc=B ack&sortkey=&so rtdir=> ispell_template spell_init(text ) /NULL/ spell_lexize(in ternal,internal ,integer) ISpell interface. Must have .dict and .aff files Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s ynonym&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=sy nonym&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> synonym syn_init(text) /NULL/ syn_lexize(inte rnal,internal,i nteger) Example of synonym dictionary Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=d e_ispell&databa se=selina_rotex &schema=public& table=pg_ts_dic t&return_url=tb lproperties.php %3Fdatabase%3Ds elina_rotex%26a mp%3Bschema%3Dp ublic%26table%3 Dpg_ts_dict&ret urn_desc=Back&s ortkey=&sortdir => Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=de _ispell&databas e=selina_rotex& schema=public&t able=pg_ts_dict &return_url=tbl properties.php% 3Fdatabase%3Dse lina_rotex%26am p%3Bschema%3Dpu blic%26table%3D pg_ts_dict&retu rn_desc=Back&so rtkey=&sortdir= > de_ispell spell_init(text ) DictFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.dic t", AffFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.aff ", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" spell_lexize(in ternal,internal ,integer) /NULL/ Timo
Oleg Bartunov wrote:
Timo,
please, check you apply patch for compound word support. What is version of postgresql ? Does ispell dict works for non-compound words ?
Oleg
On Fri, 5 Nov 2004, Timo Haberkern wrote:
> Hi there, > > i have some troubles with my TSearch2 Installation. I have done this > installation as described in > http://www.sai.msu.su/~megera/oddmus...compound_words > <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words> > I used the german myspell dictionary from > http://lingucomponent.openoffice.org/spell_dic.html and converted it > with > my2ispell > > Nearly everything is working fine so far, except two problems: > > 1.) The stopword-file seems to be ignored: If i try it with SELECT > to_tsvector("de fault_german", "ein Haus") i get "ein":1 "haus":2 > > ein should be a Stopword for german (and is defined the german.stop file > as > well) > > 2.) The compound words feature doesn"t work too. I have tried a lot of > words, > i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", > "Fehlermeldung" ) > i only get > "fehlermeldung" :1 but i would expect "fehler" and "meldung" as seperated > entries. Is there anything wrong with the dictonary or my configuration? > > > My current configuration: > > pg_ts_cfg: > > default default C > default_russian default ru_RU.KOI8-R > simple default NULL > default_german default de_DE.ISO8859-1 > pg_ts_cfgmap: > > default_german host {simple} > default_german hword {simple} > default_german int {simple} > default_german nlhword {simple} > default_german nlpart_hword {simple} > default_german nlword {simple} > default_german part_hword {simple} > default_german sfloat {simple} > default_german uint {simple} > default_german uri {simple} > default_german url {simple} > default_german version {simple} > default_german word {simple} > default_german lpart_hword {de_ispell,germ an_snowball} > default_german lword {de_ispell,germ an_snowball} > default_german lhword {de_ispell,germ an_snowball} > > > pg_ts_dict: > > de_ispell | 17166 | > DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict", > AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff", > StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | > 17167 | NULL > german_snowball | 17357 | NULL | 17162 | Snowball stemmer for > german > > > > Can anyone help me? > > regards > > Timo > > > ---------------------------(end of broadcast)--------------------------- > TIP 4: Don't 'kill -9' the postmaster >
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Regards,
Oleg
_______________ _______________ _______________ _______________ _
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)---------------------------
TIP 2: you can get off all lists at once with the unregister command
(send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Timo,
take a look into .aff file and search 'compoundwords' .
german ispell file I got from http://j3e.de/ispell/igerman98/ has no
support for compound words: 'compoundwords off'
Norwegian, for example, has:
compoundwords controlled z
compoundmin 4
Oleg
On Wed, 17 Nov 2004, Oleg Bartunov wrote: On Wed, 17 Nov 2004, Timo Haberkern wrote:
sorry for the late answer, i was on holyday,
see my remarks below
Oleg Bartunov wrote:
On Fri, 5 Nov 2004, Timo Haberkern wrote:
Oleg,
i use TSearch2 with PostgreSQL 7.4.6 and i applied the compoundword patch yesterday. The configuration changed a little bit but the result is the same. I get no compound words. I'm using the locale de_DE with encoding ISO8859-1 for the database.
I think i spell is working correctly except the compound words. If i try
SELECT lexize('de_ispe ll', 'springt')
i get
lexize {springen,sprin gen}
which seems correct.
But a SELECT lexize('de_ispe ll', 'Autobahn')
results in
lexize {autobahn}
i would expect {auto,bahn, autobahn}
Hmm, have you checked 'Autobahn' in ispell dictionary ? Does dictionary you used supports 'Z' flag for compound words ?
Autobahn is in the ispell dictionary. What does a ispell dictionary need to support the Z flag???
Try ispell -C Autobahn search 'compound' in 'man ispell' for details. the problem exists only if ispell *does* splits word correctly while tsearch2 doesn't. You should find correct ispell dictionary for german or create it yourself. You may consult monzilla.net http://staff.science.uva.nl/~christo...roject-dr.html
Timo
The new configuration after the compound word patch:
Seems you overestimate my capabilities :) Actions dict_name <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=2&sor tdir=asc&string s=expanded&page =1> dict_init <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=3&sor tdir=asc&string s=expanded&page =1> dict_initoption <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=4&sor tdir=asc&string s=expanded&page =1> dict_lexize <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=5&sor tdir=asc&string s=expanded&page =1> dict_comment <http://www.rotex-service.com/phppgadmin/display.php?dat abase=selina_ro tex&schema=publ ic&table=pg_ts_ dict&return_url =tblproperties. php%3Fdatabase% 3Dselina_rotex% 26amp%3Bschema% 3Dpublic%26tabl e%3Dpg_ts_dict& return_desc=Bac k&sortkey=6&sor tdir=asc&string s=expanded&page =1> Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s imple&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=si mple&database=s elina_rotex&sch ema=public&tabl e=pg_ts_dict&re turn_url=tblpro perties.php%3Fd atabase%3Dselin a_rotex%26amp%3 Bschema%3Dpubli c%26table%3Dpg_ ts_dict&return_ desc=Back&sortk ey=&sortdir=> simple dex_init(text) /NULL/ dex_lexize(inte rnal,internal,i nteger) Simple example of dictionary. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=e n_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=en _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> en_stem snb_en_init(tex t) /usr/local/pgsql/share/contrib/english.stop snb_lexize(inte rnal,internal,i nteger) English Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=r u_stem&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=ru _stem&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> ru_stem snb_ru_init(tex t) /usr/local/pgsql/share/contrib/russian.stop snb_lexize(inte rnal,internal,i nteger) Russian Stemmer. Snowball. Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=i spell_template& database=selina _rotex&schema=p ublic&table=pg_ ts_dict&return_ url=tblproperti es.php%3Fdataba se%3Dselina_rot ex%26amp%3Bsche ma%3Dpublic%26t able%3Dpg_ts_di ct&return_desc= Back&sortkey=&s ortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=is pell_template&d atabase=selina_ rotex&schema=pu blic&table=pg_t s_dict&return_u rl=tblpropertie s.php%3Fdatabas e%3Dselina_rote x%26amp%3Bschem a%3Dpublic%26ta ble%3Dpg_ts_dic t&return_desc=B ack&sortkey=&so rtdir=> ispell_template spell_init(text ) /NULL/ spell_lexize(in ternal,internal ,integer) ISpell interface. Must have .dict and .aff files Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=s ynonym&database =selina_rotex&s chema=public&ta ble=pg_ts_dict& return_url=tblp roperties.php%3 Fdatabase%3Dsel ina_rotex%26amp %3Bschema%3Dpub lic%26table%3Dp g_ts_dict&retur n_desc=Back&sor tkey=&sortdir=> Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=sy nonym&database= selina_rotex&sc hema=public&tab le=pg_ts_dict&r eturn_url=tblpr operties.php%3F database%3Dseli na_rotex%26amp% 3Bschema%3Dpubl ic%26table%3Dpg _ts_dict&return _desc=Back&sort key=&sortdir=> synonym syn_init(text) /NULL/ syn_lexize(inte rnal,internal,i nteger) Example of synonym dictionary Edit <http://www.rotex-service.com/phppgadmin/display.php?act ion=confeditrow &strings=expand ed&page=1&key%5 Bdict_name%5D=d e_ispell&databa se=selina_rotex &schema=public& table=pg_ts_dic t&return_url=tb lproperties.php %3Fdatabase%3Ds elina_rotex%26a mp%3Bschema%3Dp ublic%26table%3 Dpg_ts_dict&ret urn_desc=Back&s ortkey=&sortdir => Delete <http://www.rotex-service.com/phppgadmin/display.php?act ion=confdelrow& strings=expande d&page=1&key%5B dict_name%5D=de _ispell&databas e=selina_rotex& schema=public&t able=pg_ts_dict &return_url=tbl properties.php% 3Fdatabase%3Dse lina_rotex%26am p%3Bschema%3Dpu blic%26table%3D pg_ts_dict&retu rn_desc=Back&so rtkey=&sortdir= > de_ispell spell_init(text ) DictFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.dic t", AffFile="/usr/local/pgsql/share/contrib/dictonary/german_comb.aff ", StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" spell_lexize(in ternal,internal ,integer) /NULL/ Timo
Oleg Bartunov wrote:
> Timo, > > please, check you apply patch for compound word support. > What is version of postgresql ? > Does ispell dict works for non-compound words ? > > Oleg > > On Fri, 5 Nov 2004, Timo Haberkern wrote: > >> Hi there, >> >> i have some troubles with my TSearch2 Installation. I have done this >> installation as described in >> http://www.sai.msu.su/~megera/oddmus...compound_words >> <http://www.sai.msu.su/%7Emegera/oddmuse/index.cgi/Tsearch_V2_comp ound_words> >> I used the german myspell dictionary from >> http://lingucomponent.openoffice.org/spell_dic.html and converted it >> with >> my2ispell >> >> Nearly everything is working fine so far, except two problems: >> >> 1.) The stopword-file seems to be ignored: If i try it with SELECT >> to_tsvector("de fault_german", "ein Haus") i get "ein":1 "haus":2 >> >> ein should be a Stopword for german (and is defined the german.stop >> file as >> well) >> >> 2.) The compound words feature doesn"t work too. I have tried a lot of >> words, >> i.e. "Fehlermeld ung" with SELECT to_tsvector("de fault_german", >> "Fehlermeldung" ) >> i only get >> "fehlermeldung" :1 but i would expect "fehler" and "meldung" as >> seperated >> entries. Is there anything wrong with the dictonary or my >> configuration? >> >> >> My current configuration: >> >> pg_ts_cfg: >> >> default default C >> default_russian default ru_RU.KOI8-R >> simple default NULL >> default_german default de_DE.ISO8859-1 >> pg_ts_cfgmap: >> >> default_german host {simple} >> default_german hword {simple} >> default_german int {simple} >> default_german nlhword {simple} >> default_german nlpart_hword {simple} >> default_german nlword {simple} >> default_german part_hword {simple} >> default_german sfloat {simple} >> default_german uint {simple} >> default_german uri {simple} >> default_german url {simple} >> default_german version {simple} >> default_german word {simple} >> default_german lpart_hword {de_ispell,germ an_snowball} >> default_german lword {de_ispell,germ an_snowball} >> default_german lhword {de_ispell,germ an_snowball} >> >> >> pg_ts_dict: >> >> de_ispell | 17166 | >> DictFile="/usr/local/pgsql/share/contrib/dictonary/german.dict", >> AffFile="/usr/local/pgsql/share/contrib/dictonary/german.aff", >> StopFile="/usr/local/pgsql/share/contrib/dictonary/german.stop" | >> 17167 | NULL >> german_snowball | 17357 | NULL | 17162 | Snowball stemmer for >> german >> >> >> >> Can anyone help me? >> >> regards >> >> Timo >> >> >> ---------------------------(end of >> broadcast)--------------------------- >> TIP 4: Don't 'kill -9' the postmaster >> > > Regards, > Oleg > _______________ _______________ _______________ _______________ _ > Oleg Bartunov, sci.researcher, hostmaster of AstroNet, > Sternberg Astronomical Institute, Moscow University (Russia) > Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ > phone: +007(095)939-16-83, +007(095)939-23-83 > > ---------------------------(end of broadcast)--------------------------- > TIP 2: you can get off all lists at once with the unregister command > (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org) > >
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Regards, Oleg _______________ _______________ _______________ _______________ _ Oleg Bartunov, sci.researcher, hostmaster of AstroNet, Sternberg Astronomical Institute, Moscow University (Russia) Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/ phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)--------------------------- TIP 2: you can get off all lists at once with the unregister command (send "unregister YourEmailAddres sHere" to ma*******@postg resql.org)
Regards,
Oleg
_______________ _______________ _______________ _______________ _
Oleg Bartunov, sci.researcher, hostmaster of AstroNet,
Sternberg Astronomical Institute, Moscow University (Russia)
Internet: ol**@sai.msu.su, http://www.sai.msu.su/~megera/
phone: +007(095)939-16-83, +007(095)939-23-83
---------------------------(end of broadcast)---------------------------
TIP 7: don't forget to increase your free space map settings This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics |
by: Alexander Rüegg |
last post by:
Hi,
Is it possible to get all the positions of a lexeme in a result-set of a
query? For example, we have the table
TEXT TEXT_IDX
'TSearch2 is very cool' ...
'It would be much cooler with lexeme positions'
|
by: George Essig |
last post by:
I have installed tsearch2 and have noticed that the gist index used to do searches grows and grows
as I update rows, delete rows, or run VACUUM FULL ANALYZE. Below are some details:
PostgreSQL 7.4RC1
Red Hat 9
Table "public.series"
Column | Type | Modifiers
---------------+-------------------+--------------------------------------------------------
id | integer | not null...
|
by: Markus Wollny |
last post by:
Hello!
I'm currently testing deployment of tsearch2 on our forum table. The
table is huge in itself - some 2GB of data without the indexes. I have
got PostgreSQL 7.4RC2 running on a test machine, installed tsearch2 to
my database, added the new column to the table and tried to update it in
the recommended fashion:
UPDATE ct_com_board_message
SET ftindex=to_tsvector('default',coalesce(user_login,'')
|
by: Pavel Stehule |
last post by:
Hello
I try tsearch2 within czech environment. It is works fine, but I have two
questions.
1. I have words "se", "ve" in my czech stop words. But I get this words in
result. Why? Have I problem with my configuration?
tsearch2=# select * from ts_debug('jmenuji se Pavel Stìhule a bydlím ve
Skalici.');
|
by: Ben |
last post by:
I'm trying to make myself a dictionary for tsearch2 that converts
numbers to their english word equivalents. This seems to be working
great, except that I can't figure out how to make my lexize function
return multiple lexemes. For instance, I'd like "100" to get converted
to {one,hundred}, not {"one hundred"} as is currently happening.
How do I specify the output of the lexize function so that this will
happen?
| |
by: Ben |
last post by:
I just made myself an ispell dictionary for tsearch2, thinking (very
incorrectly, it turns out) that looking up a misspelled word with the
ispell dictionary would return possible words that I really meant to
type instead.
So that obviously isn't what the ispell dictionary is for in tsearch2
land. Is there such a dictionary? If I were to make one using the aspell
libraries, can a C-language stored proc in postgres link against shared...
|
by: Markus Wollny |
last post by:
Hi!
Sorry to bother you, but I just don't know how to get tsearch2 configured correctly for my setup. I've got a 7.4.3 database-cluster initdb'ed with de_DE@euro as locale, the database is with Unicode encoding.
I made and installed contrib/tsearch2 after installing the dump/reload-patch http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/regprocedure_7.4.patch.gz as advised by the docs. So far everything is looking good, I have...
|
by: Marcel Boscher |
last post by:
Hello everybody,
i tried to "J.U.S.T" install the FullTextSearchTool tsearch2 under the
guidiance of :
http://www.sai.msu.su/~megera/postgres/gist/tsearch/V2/
<http://www.sai.msu.su/%7Emegera/postgres/gist/tsearch/V2/>
and i'm almost losing my mind...
<runningagainstwallheadsfirst>
after i installed postgre which works fine i change to the folder
/contrib/tsearch2/
|
by: Dawid Kuroczko |
last post by:
I'm trying to use tsearch2 with database which is in 'UNICODE' encoding.
It works fine for English text, but as I intend to search Polish texts I did:
insert into pg_ts_cfg('default_polish', 'default', 'pl_PL.UTF-8');
(and I updated other pg_ts_* tables as written in manual).
However, Polish-specific chars are being eaten alive, it seems.
I.e. doing select to_tsvector('default_polish', body) from messages;
results in list of words but...
|
by: marktang |
last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However, people are often confused as to whether an ONU can Work As a Router. In this blog post, we’ll explore What is ONU, What Is Router, ONU & Router’s main usage, and What is the difference between ONU and Router. Let’s take a closer look !
Part I. Meaning of...
|
by: Oralloy |
last post by:
Hello folks,
I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>".
The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed.
This is as boiled down as I can make it.
Here is my compilation command:
g++-12 -std=c++20 -Wnarrowing bit_field.cpp
Here is the code in...
| |
by: Hystou |
last post by:
Overview:
Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
|
by: agi2029 |
last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own....
Now, this would greatly impact the work of software developers. The idea...
|
by: conductexam |
last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one.
At the time of converting from word file to html my equations which are in the word document file was convert into image.
Globals.ThisAddIn.Application.ActiveDocument.Select();...
|
by: adsilva |
last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
|
by: 6302768590 |
last post by:
Hai team
i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated we have to send another system
|
by: muto222 |
last post by:
How can i add a mobile payment intergratation into php mysql website.
| |
by: bsmnconsultancy |
last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...
| |