473,394 Members | 1,766 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,394 software developers and data experts.

dotlucene question: tighten search results.

If anyone has experience with DotLucene, then this question might be right
up your alley.

I have two lists of music titles. Each from a different source. I am
trying to match determine possible matches to associate them. I know that
any association on text will not be perfect, but I am interested in the
probability of two titles being the same.

Using dotLucene, I have created an index with one set and enumerating the
second while performing a search. I get back a set of hits but it does not
give me very precise results. Maybe it is because titles do not give much
content to search on. I am looking for a way to make the results more
strict or take into account work position and proximity when calculating the
score.

For example, the title "Give In To Me" is matching 100% with the title
"Heaven Give Me World". Likewise, "You Rock My World (Dance Mix)" is 100% a
match for "How's My World Treatin' You".

Can I tighten the search system so that it is more strict?
Jan 25 '06 #1
2 1062
You can use add proximity to the words you are searching for (~x where
x is the maximum distance between words).

Also remember to check your stop word list as those will be excluded
from the index and so may skew the results.

Jan 26 '06 #2
Also check that the query is using AND rather than OR for the search.

Jan 26 '06 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

7
by: Richard Gutery | last post by:
I have written a COM component who's purpose is to allow a user to change his Windows (NT/AD) password via a web page. Now the component works fine when I run it in VB 6, but not via the web page....
1
by: David Miller | last post by:
Dear Sir or Madam: I am writing an application in VB.Net, and I have an interesting threading situation that I cannot seem to figure out. I have a search class (CSearch) that is responsible for...
0
by: Peter Rilling | last post by:
I am working on trying to compare "titles" from two different lists and trying to determine the most likely match. When a similar entry appears in both, the result works well, but what I need to...
0
by: | last post by:
I have a question about spawning and displaying subordinate list controls within a list control. I'm also interested in feedback about the design of my search application. Lots of code is at the...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.