Bytes | Software Development & Data Engineering Community

dotlucene question: tighten search results.

If anyone has experience with DotLucene, then this question might be right
up your alley.

I have two lists of music titles, each from a different source. I am
trying to determine possible matches so I can associate them. I know that
any association on text will not be perfect, but I am interested in the
probability of two titles being the same.

Using dotLucene, I have created an index from one set and am enumerating the
second, performing a search for each title. I get back a set of hits, but
the results are not very precise. Maybe it is because titles do not give much
content to search on. I am looking for a way to make the results more
strict, or to take word position and proximity into account when calculating
the score.

For example, the title "Give In To Me" is matching 100% with the title
"Heaven Give Me World". Likewise, "You Rock My World (Dance Mix)" is 100% a
match for "How's My World Treatin' You".

Can I tighten the search system so that it is more strict?
Jan 25 '06 #1
You can add proximity to the words you are searching for: put the phrase
in quotes and append ~x, where x is the maximum distance between words.

Also remember to check your stop word list as those will be excluded
from the index and so may skew the results.
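
To see why the stop word list matters here, a rough sketch in plain Python may help. It is not DotLucene code and does not reproduce Lucene's scoring. The stop list, the tokenize helper and the within_proximity helper are all assumptions made for illustration. The idea is that if words like "in" and "to" are dropped, "Give In To Me" is reduced to "give me", which sits word for word inside "Heaven Give Me World".

```python
import re

# Illustrative stop list (an assumption; check the analyzer you actually use).
STOP_WORDS = frozenset({"a", "an", "and", "in", "of", "the", "to"})


def tokenize(title, stop_words=frozenset()):
    """Lowercase the title, split it into word tokens, drop stop words."""
    words = re.findall(r"[a-z0-9']+", title.lower())
    return [w for w in words if w not in stop_words]


def within_proximity(query_terms, doc_terms, max_gap):
    """True if the query terms occur in order in doc_terms with at most
    max_gap other tokens between consecutive matched terms.
    A simplified stand-in for a proximity search, not Lucene's algorithm."""
    if not query_terms:
        return False

    def search(qi, prev):
        if qi == len(query_terms):
            return True
        if prev is None:
            start, end = 0, len(doc_terms)
        else:
            start = prev + 1
            end = min(len(doc_terms), prev + 2 + max_gap)
        for pos in range(start, end):
            if doc_terms[pos] == query_terms[qi] and search(qi + 1, pos):
                return True
        return False

    return search(0, None)


query, candidate = "Give In To Me", "Heaven Give Me World"

# With stop words removed, the query shrinks to two adjacent words.
stripped = tokenize(query, STOP_WORDS)
hit_with_stops = within_proximity(stripped, tokenize(candidate, STOP_WORDS), 0)

# With every word kept, "in" and "to" have nothing to match.
hit_without_stops = within_proximity(tokenize(query), tokenize(candidate), 0)
```

Under these assumptions hit_with_stops is True and hit_without_stops is False, so for short titles it may be worth indexing and searching with an analyzer that keeps every word.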

Jan 26 '06 #2
Also check that the query is using AND rather than OR for the search.
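
As a rough illustration of the difference (plain Python, not DotLucene code, with terms, matches_or, matches_and and required_query all being hypothetical helpers), the sketch below compares the two titles from the question under OR and AND semantics. It also builds a query string in which every term is prefixed with +, the query syntax for a required term.

```python
import re


def terms(title):
    """Lowercased word tokens; a stand-in for a real analyzer."""
    return re.findall(r"[a-z0-9]+", title.lower())


def matches_or(query, candidate):
    """OR semantics: at least one query term appears in the candidate."""
    return bool(set(terms(query)) & set(terms(candidate)))


def matches_and(query, candidate):
    """AND semantics: every query term appears in the candidate."""
    return set(terms(query)) <= set(terms(candidate))


def required_query(title):
    """Prefix each term with '+' so that every term is required."""
    return " ".join("+" + t for t in terms(title))


query, candidate = "Give In To Me", "Heaven Give Me World"

or_hit = matches_or(query, candidate)    # shares "give" and "me"
and_hit = matches_and(query, candidate)  # "in" and "to" are missing
strict = required_query(query)           # "+give +in +to +me"
```

With OR, sharing "give" and "me" is enough for a hit. With AND, the missing "in" and "to" rule the candidate out, provided the analyzer does not strip those words as stop words first. Titles containing query syntax characters such as parentheses would need escaping before being passed to a real query parser; the terms helper here simply drops them.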

Jan 26 '06 #3
