c/c++ text searching engine library

I am looking for a c/c++ text search engine library that supports:

- free text searching - not only beginning of words but substrings as
well
- wildcard searching - I want strings such as *test* to be supported
- regular expressions

I know about CLucene, but, unless I am mistaken, Lucene doesn't
support, for instance, a leading * in the search text, and it doesn't
seem to support substring searches.

Is there anything else out there? I came across a library developed at
some university; it was named after a deity, but I can't recall the name.

Thanks
Nov 21 '07 #1
I just want to add that I searched Google for a library but
could not find anything 100% satisfactory.
Nov 21 '07 #2
Costa wrote:
> I just want to add that I used google to search for a library but I
> could not find anything 100% satisfactory.

http://en.wikipedia.org/wiki/Compari...ession_engines
Nov 21 '07 #3
I am actually looking for an embeddable C/C++ text search engine such
as CLucene, not for a regular expression library. If the engine
doesn't support regular expressions, that's fine, but I want it to
support wildcard and substring searches.
Nov 21 '07 #4
Costa wrote:
> I am looking for a c/c++ text search engine library that supports:
> - free text searching - not only beginning of words but substrings as
> well
> - wildcard searching - I want strings such as *test* to be supported
> - regular expressions
> I know about clucene, but, unless I am mistaken, lucene doesn't
> support, for instance, having the * at the beginning of the searched
> text, and it doesn't seem to support searching substrings.
> Is there anything else out there? I came across a library
> developed at some university, the library had a deity name but
> I can't recall it.

Boost has a regex component that looks pretty good. For
wildcard matching, of course, the conventions are somewhat
system-dependent. In my own code, I map the wildcard pattern to
a regular expression and use that.
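
As a rough illustration of mapping a wildcard pattern to a regular expression, here is a minimal sketch. It is not the code referred to above: the function names wildcard_to_regex and wildcard_match are hypothetical, it uses std::regex from the C++11 standard library instead of Boost.Regex to stay self-contained, and it assumes the common convention that * matches any run of characters and ? matches exactly one.

```cpp
#include <regex>
#include <string>

// Translate a wildcard pattern into an ECMAScript regex string.
// Assumed conventions (they vary between systems):
//   '*' matches any run of characters, including none
//   '?' matches exactly one character
// Everything else is matched literally. Note that '.' in the ECMAScript
// grammar does not match line terminators, so neither wildcard crosses
// a newline in this sketch.
std::string wildcard_to_regex(const std::string& pattern)
{
    // Characters that carry a special meaning in a regex and need escaping.
    static const std::string special = R"(\^$.|+()[]{})";

    std::string result;
    for (char c : pattern) {
        if (c == '*') {
            result += ".*";
        } else if (c == '?') {
            result += '.';
        } else {
            if (special.find(c) != std::string::npos) {
                result += '\\';
            }
            result += c;
        }
    }
    return result;
}

// Return true if the whole of `text` matches the wildcard pattern.
bool wildcard_match(const std::string& pattern, const std::string& text)
{
    const std::regex re(wildcard_to_regex(pattern));
    return std::regex_match(text, re);
}
```

With these assumptions, wildcard_match("*test*", "my latest results") is true, since the pattern becomes .*test.* and has to match the entire string. Note that this scans each string linearly; it does not provide the indexing that a search engine such as CLucene does.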

--
James Kanze (GABI Software) email:ja*********@gmail.com
Conseils en informatique orientée objet/
Beratung in objektorientierter Datenverarbeitung
9 place Sémard, 78210 St.-Cyr-l'École, France, +33 (0)1 30 23 00 34
Nov 22 '07 #5
