Search through office files

I have a Microsoft (R) Visio (TM) document, which contains a link
inside (eg. to "http://blabla/"). When I do a full-text search from
within Windows Explorer, searching for "blabla", it finds that
document. But when I view the document as a plain text file, viewing it
in Notepad or Wordpad, or even programmatically searching for the text
"blabla" from a C programme, then it can't be found.

I would like to make a C programme that would be able to search through
a Visio file so that I can automate some processes. But it is as if the
text is not inside. How do I solve the issue? Is it possible that it's
a matter of charset, or some cypher stuff, or what? Is there some
library that would make it possible?

thanks,

Darko

May 5 '06 #1
In article <11**********************@j33g2000cwa.googlegroups.com>,
Darko <da**************@gmail.com> wrote:
> I have a Microsoft (R) Visio (TM) document, which contains a link
> inside (eg. to "http://blabla/"). When I do a full-text search from
> within Windows Explorer, searching for "blabla", it finds that
> document. But when I view the document as a plain text file, viewing it
> in Notepad or Wordpad, or even programmatically searching for the text
> "blabla" from a C programme, then it can't be found.

That doesn't sound like a C-specific problem.

> I would like to make a C programme that would be able to search through
> a Visio file so that I can automate some processes.

The Visio file format is not defined (or even mentioned) by the
ANSI/ISO C standards, so this really isn't the right newsgroup.

> But it is as if the
> text is not inside. How do I solve the issue? Is it possible that it's
> a matter of charset, or some cypher stuff, or what? Is there some
> library that would make it possible?


[OT]
UTF-16. Each character is probably occupying two bytes. For regular
printable ASCII characters, alternate bytes are probably binary 0's.
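
The two-bytes-per-character layout described above (each ASCII character
followed by a zero byte) is what little-endian UTF-16 looks like on disk.
A minimal sketch of a search for that pattern in C follows. It assumes
the text is stored uncompressed as UTF-16LE, which may not hold for every
Visio file, and that the search string is plain ASCII. The function names
find_utf16le and search_file_utf16le are made up for this example and do
not come from any library.

```c
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Search buf[0..len) for the ASCII string `needle` as it would appear in
 * UTF-16LE: every character followed by a 0x00 byte.
 * Returns the byte offset of the first match, or -1 if there is none. */
long find_utf16le(const unsigned char *buf, size_t len, const char *needle)
{
    assert(buf != NULL && needle != NULL);

    size_t n = strlen(needle);
    size_t wide = 2 * n;            /* length of the needle in UTF-16LE bytes */

    if (n == 0 || wide > len)
        return -1;

    for (size_t i = 0; i + wide <= len; i++) {
        size_t j = 0;
        while (j < n
               && buf[i + 2 * j] == (unsigned char)needle[j]
               && buf[i + 2 * j + 1] == 0)
            j++;
        if (j == n)
            return (long)i;
    }
    return -1;
}

/* Read the whole file into memory and search it.
 * Returns the byte offset of the first match, -1 if not found,
 * or -2 on an I/O or allocation error. */
long search_file_utf16le(const char *path, const char *needle)
{
    FILE *fp = fopen(path, "rb");   /* binary mode: no newline translation */
    if (fp == NULL)
        return -2;

    long pos = -2;
    unsigned char *buf = NULL;

    if (fseek(fp, 0, SEEK_END) == 0) {
        long size = ftell(fp);
        if (size == 0) {
            pos = -1;               /* empty file: nothing to find */
        } else if (size > 0 && fseek(fp, 0, SEEK_SET) == 0) {
            buf = malloc((size_t)size);
            if (buf != NULL
                && fread(buf, 1, (size_t)size, fp) == (size_t)size)
                pos = find_utf16le(buf, (size_t)size, needle);
        }
    }

    free(buf);
    fclose(fp);
    return pos;
}
```

Calling search_file_utf16le("drawing.vsd", "blabla") (the file name is
only a placeholder) would report where the wide-character form of
"blabla" first occurs, if it occurs at all. A miss does not prove the
text is absent: the file may compress its contents or split them across
internal blocks, in which case a format-aware library is needed.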
--
All is vanity. -- Ecclesiastes
May 5 '06 #2
Darko said the following, on 05/05/06 14:11:
> I have a Microsoft (R) Visio (TM) document, which contains a link
> inside (eg. to "http://blabla/"). When I do a full-text search from
> within Windows Explorer, searching for "blabla", it finds that
> document. But when I view the document as a plain text file, viewing it
> in Notepad or Wordpad, or even programmatically searching for the text
> "blabla" from a C programme, then it can't be found.
>
> I would like to make a C programme that would be able to search through
> a Visio file so that I can automate some processes. But it is as if the
> text is not inside. How do I solve the issue? Is it possible that it's
> a matter of charset, or some cypher stuff, or what? Is there some
> library that would make it possible?


Your basic problem appears to be how to interpret the contents of MS
Visio files. This question is off-topic for this group, which discusses
the standard C language and its use. You are more likely to get useful
responses by asking in a Windows-related newsgroup.

[OT]
I suspect that your difficulty may have to do with text being stored in
some non-obvious character set, which Windows Explorer knows about, but
you, at the moment, don't. The following site has some information on
different file formats, and you might find it useful:
<http://www.wotsit.org/>
[/OT]

--
Rich Gibbs
ri*****@gmail.com
"You can observe a lot by watching." -- Yogi Berra

May 5 '06 #3
