Have to develop an app similar to Google Desktop, but clueless about how to start!

eggman89
2 New Member
Hi

I'm doing my fourth-year B.E. project as part of an internship at MindTree.
They have only told us what to do, but haven't given us much guidance.

We have been told to develop a search application similar to Google Desktop, only a lighter version of it.

This is how it should work:
For all the Office 2007 formats (docx, pptx, ...), which are XML-based rather than binary, the user should enter one or more search terms and our program should return all the files containing them. To make matters a bit easier, it should search in a given folder rather than the whole computer. It should also be extensible to new formats: if we later want the application to support .pdf, there should be provision for that.
So, as our guide told us, it'll need XML parsing, maybe a database, maybe indexing...

This is all we have been told, nothing else. So now our team is kind of clueless about how to start! We have about 3 months to finish the project.

Any help will be appreciated! We plan to develop this in Java, although that isn't compulsory. Does anyone have any idea how Google Desktop works?
Jan 7 '10 #1
Frinavale
9,735 Recognized Expert Moderator Expert
Your question is way too big.
Check out this thread on how to develop software

The experts on this site are more than happy to help you with your problems but they cannot do your assignment/program for you. Attempt the assignment/program yourself first and post questions regarding any difficulties you have or about a particular function of the code that you don't know how to achieve.

Please take the time to research the problem before you post your question. We are more than willing to help you with a specific problem but you have to do your part to learn the basics and also formulate a specific question we can help with.
Jan 7 '10 #2
chaarmann
785 Recognized Expert Contributor
Just divide this complex problem into parts:
1.) how to read in a file as a String
2.) how to parse the String for the search term
3.) how to show the matching results

This can all be done with a plain Java application; no database or web application is needed. Java supports reading local files and directories and parsing XML. For the easiest start, the output can just go to the command line; later you could add a GUI, for example with Swing.

Now break the problem into smaller parts:
1.)
- get a list of all files (use class File)
- filter out the files that don't match the extension (use a regular expression)
- for each file: read the file content into a String (.docx and .pptx files are ZIP archives, so the XML is in the entries inside them)

This step can be done in roughly 20 lines of code: 10 minutes (professional) or 1 day (beginner).
Tip: try it on your own to learn, but if you get stuck, browse this forum; you can already find complete code (by me) that does exactly that.
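
A minimal sketch of step 1, assuming Java 9 or later. One thing to be aware of: a .docx or .pptx file is a ZIP package with the XML stored in entries inside it (the main body of a Word document is usually word/document.xml), so the sketch reads the .xml entries rather than the raw file. The class name OfficeFileReader, its method names and the extension list in the pattern are made up for illustration, and the XML parts are assumed to be UTF-8 encoded.

```java
import java.io.File;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

final class OfficeFileReader {

    // Hypothetical default: names ending in .docx, .pptx or .xlsx, ignoring case.
    static final Pattern OFFICE_EXTENSION =
            Pattern.compile(".*\\.(docx|pptx|xlsx)$", Pattern.CASE_INSENSITIVE);

    /** Lists the regular files directly inside the folder whose names match the pattern. */
    static List<File> listMatchingFiles(File folder, Pattern namePattern) {
        List<File> matches = new ArrayList<>();
        File[] children = folder.listFiles(); // null if folder is not a readable directory
        if (children == null) {
            return matches;
        }
        for (File child : children) {
            if (child.isFile() && namePattern.matcher(child.getName()).matches()) {
                matches.add(child);
            }
        }
        return matches;
    }

    /** Reads every .xml entry of the ZIP package into a String, keyed by entry name. */
    static Map<String, String> readXmlParts(File officeFile) throws IOException {
        Map<String, String> parts = new LinkedHashMap<>();
        try (ZipFile zip = new ZipFile(officeFile)) {
            Enumeration<? extends ZipEntry> entries = zip.entries();
            while (entries.hasMoreElements()) {
                ZipEntry entry = entries.nextElement();
                if (!entry.isDirectory() && entry.getName().endsWith(".xml")) {
                    try (InputStream in = zip.getInputStream(entry)) {
                        // Assumption: the XML parts are UTF-8 encoded.
                        parts.put(entry.getName(),
                                new String(in.readAllBytes(), StandardCharsets.UTF_8));
                    }
                }
            }
        }
        return parts;
    }

    public static void main(String[] args) throws IOException {
        // Folder to scan: first command-line argument, or the current directory.
        File folder = new File(args.length > 0 ? args[0] : ".");
        for (File file : listMatchingFiles(folder, OFFICE_EXTENSION)) {
            System.out.println(file.getName() + ": " + readXmlParts(file).keySet());
        }
    }
}
```

Passing the pattern in as a parameter keeps the door open for other extensions later. Files that match the pattern but are not valid ZIP archives will make readXmlParts throw, so a real program would probably catch that per file and carry on.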

2.)
- ask the user to enter the search word (on the command line)
- use a DOM parser to parse the String from step 1 into an internal DOM structure of the XML document (a SAX parser would also work, but it reports parse events instead of building a tree)
- recursively traverse all nodes of the DOM structure to find the search word. For each occurrence found, print a message (on the command line)

This step can be done in roughly 50 lines of code: 30 minutes (professional) or 2 days (beginner).
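
The DOM variant of step 2 could look roughly like this, using only the JDK's built-in javax.xml.parsers and org.w3c.dom packages. XmlTermSearch and its methods are illustrative names, and the case-insensitive matching is my own assumption about what a user would want. The XML string in main is a toy example, not real Office markup.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

final class XmlTermSearch {

    /** Parses the XML string into a DOM tree and returns every text node containing the term. */
    static List<String> findMatches(String xml, String term) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true);
        DocumentBuilder builder = factory.newDocumentBuilder();
        Document document = builder.parse(new InputSource(new StringReader(xml)));

        List<String> matches = new ArrayList<>();
        collect(document.getDocumentElement(), term.toLowerCase(Locale.ROOT), matches);
        return matches;
    }

    /** Depth-first walk: checks text nodes, recurses into everything else. */
    private static void collect(Node node, String lowerTerm, List<String> matches) {
        short type = node.getNodeType();
        if (type == Node.TEXT_NODE || type == Node.CDATA_SECTION_NODE) {
            String text = node.getNodeValue();
            if (text.toLowerCase(Locale.ROOT).contains(lowerTerm)) {
                matches.add(text.trim());
            }
            return;
        }
        NodeList children = node.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            collect(children.item(i), lowerTerm, matches);
        }
    }

    public static void main(String[] args) throws Exception {
        // Toy document standing in for the XML read in step 1.
        String xml = "<doc><p>Quarterly report</p><p>Budget <b>REPORT</b> draft</p></doc>";
        String term = args.length > 0 ? args[0] : "report";
        for (String hit : findMatches(xml, term)) {
            System.out.println("Found in: " + hit);
        }
    }
}
```

One limitation to keep in mind: this checks each text node on its own, and Word can split a single visible word across several text runs, so a more robust version would concatenate the text of a paragraph before searching.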

3.) You are already finished, but to make your application nicer, instead of just reading the search word from the command line, you could use Swing to open a window with a form whose input fields let you define the search word, search directory and file extensions.
The output could go into a text box with a search progress bar.

This step can be done in roughly 300 lines of code: 3 hours (professional) or 5 days (beginner).
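
A bare-bones Swing form along these lines might be sketched as follows. SearchForm is a made-up class, and the actual search is passed in as a function (search word, folder) -> list of result lines, so the sketch does not depend on how steps 1 and 2 are implemented. The file-extension field is left out to keep it short, and the progress bar only jumps from 0 to 100 here.

```java
import java.awt.BorderLayout;
import java.awt.GridLayout;
import java.util.List;
import java.util.function.BiFunction;
import javax.swing.JButton;
import javax.swing.JFrame;
import javax.swing.JLabel;
import javax.swing.JPanel;
import javax.swing.JProgressBar;
import javax.swing.JScrollPane;
import javax.swing.JTextArea;
import javax.swing.JTextField;
import javax.swing.SwingUtilities;

final class SearchForm {

    final JTextField termField = new JTextField(20);
    final JTextField folderField = new JTextField(".", 20);
    final JTextArea resultArea = new JTextArea(10, 40);
    final JProgressBar progressBar = new JProgressBar(0, 100);
    final JButton searchButton = new JButton("Search");
    final JPanel panel = new JPanel(new BorderLayout(5, 5));

    /** Builds the form; the search function is a placeholder for steps 1 and 2. */
    SearchForm(BiFunction<String, String, List<String>> search) {
        JPanel inputs = new JPanel(new GridLayout(0, 2, 5, 5));
        inputs.add(new JLabel("Search word:"));
        inputs.add(termField);
        inputs.add(new JLabel("Folder:"));
        inputs.add(folderField);
        inputs.add(searchButton);
        inputs.add(progressBar);

        resultArea.setEditable(false);
        panel.add(inputs, BorderLayout.NORTH);
        panel.add(new JScrollPane(resultArea), BorderLayout.CENTER);

        // Runs the search on button click and shows one result per line.
        searchButton.addActionListener(event -> {
            progressBar.setValue(0);
            List<String> hits = search.apply(termField.getText(), folderField.getText());
            resultArea.setText(String.join("\n", hits));
            progressBar.setValue(100);
        });
    }

    public static void main(String[] args) {
        SwingUtilities.invokeLater(() -> {
            // Dummy search function; replace with the real file search.
            SearchForm form = new SearchForm((term, folder) ->
                    List.of("(no search wired in yet) " + term + " in " + folder));
            JFrame frame = new JFrame("Document search");
            frame.setDefaultCloseOperation(JFrame.EXIT_ON_CLOSE);
            frame.add(form.panel);
            frame.pack();
            frame.setVisible(true);
        });
    }
}
```

Because the listener runs the search directly on the event thread, the window will freeze during a long search. For a real progress bar you would probably move the search into a SwingWorker and publish progress from there.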

P.S.: If you want to read not only XML-based Word documents but any MS Office format, there are also Java APIs that do that for you (Apache POI).
Jan 8 '10 #3
eggman89
2 New Member
Hey chaarmann! This is exactly the kind of help we needed! Thanks!!!
:) We are total noobs and lost!
Jan 9 '10 #4
