473,396 Members | 1,734 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,396 software developers and data experts.

Document identification

Hello to you all

In my .Net application, I want to import document images (Tiff images),
having same format, in a batch,

For e.g. I have a document folder containing mixed images like 'Participant
Registration forms', 'Team Registration Forms', 'Event Details', etc.

I want to process each type of documents in separate batches

i.e. The Batch of importing 'Participant Registration forms' should import
only Participant Registration forms by identifying them from the source folder

The identification would be based on the identification mark or specific
word printed on the documents. This Identification reference will be
pre-defined for each document type before any processing.

Is there any DLL/ActiveX component available to achieve this?

The Identification mark may have shifted little from its pre-defined
position due to scan problem, document size changes, tilted scans, etc.

Also the Identification mark may be distorted due to distorted or bad
scanned documents
So the component should resolve these issues while identifying the document.

Thanks
Jeetendra

Dec 29 '05 #1
2 1628
What you are looking for are basically OCR (Optical Character Recognition)
libraries. Do you need it to be free?

"suresh" <su****@discussions.microsoft.com> wrote in message
news:06**********************************@microsof t.com...
Hello to you all

In my .Net application, I want to import document images (Tiff images),
having same format, in a batch,

For e.g. I have a document folder containing mixed images like
'Participant
Registration forms', 'Team Registration Forms', 'Event Details', etc.

I want to process each type of documents in separate batches

i.e. The Batch of importing 'Participant Registration forms' should import
only Participant Registration forms by identifying them from the source
folder

The identification would be based on the identification mark or specific
word printed on the documents. This Identification reference will be
pre-defined for each document type before any processing.

Is there any DLL/ActiveX component available to achieve this?

The Identification mark may have shifted little from its pre-defined
position due to scan problem, document size changes, tilted scans, etc.

Also the Identification mark may be distorted due to distorted or bad
scanned documents
So the component should resolve these issues while identifying the
document.

Thanks
Jeetendra

Dec 29 '05 #2
Hello Peter,

Thanks for the reply
Actually, I gone through specifications of so many third pary OCR components
but none of it giving the document identification feature.
The component library should be saticfying my requirements then no matter
whether it is freeware or I have to buy it.

Regards
Jeetendra
"Peter Rilling" ने लिखा:
What you are looking for are basically OCR (Optical Character Recognition)
libraries. Do you need it to be free?

"suresh" <su****@discussions.microsoft.com> wrote in message
news:06**********************************@microsof t.com...
Hello to you all

In my .Net application, I want to import document images (Tiff images),
having same format, in a batch,

For e.g. I have a document folder containing mixed images like
'Participant
Registration forms', 'Team Registration Forms', 'Event Details', etc.

I want to process each type of documents in separate batches

i.e. The Batch of importing 'Participant Registration forms' should import
only Participant Registration forms by identifying them from the source
folder

The identification would be based on the identification mark or specific
word printed on the documents. This Identification reference will be
pre-defined for each document type before any processing.

Is there any DLL/ActiveX component available to achieve this?

The Identification mark may have shifted little from its pre-defined
position due to scan problem, document size changes, tilted scans, etc.

Also the Identification mark may be distorted due to distorted or bad
scanned documents
So the component should resolve these issues while identifying the
document.

Thanks
Jeetendra


Dec 30 '05 #3

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

1
by: Brad H McCollum | last post by:
I've looked through many suggestions and partial examples all over this newsgroup and still am not coming up with anything that does specifically what I'm wanting to accomplish. I'm writing a VB...
0
by: Philippe Poulard | last post by:
A sane document management approach: I manage my documents with a key, composed with some fields of my XML documents. When I have to refer to documents, I use a canonical form of the key so that...
6
by: David List | last post by:
I'm having a problem using different properties of the document object in the example javascripts in my textbook with browsers that identify themselves as using the Mozilla engine. One example of...
5
by: Carl Ribbegaardh | last post by:
Is there any known list of compiler identification macros? I'm using VS 2003, g++ on windows, sun's cc and g++ on solaris. Is it possible to identify the compiler using macros? I'm aware of WIN32...
3
by: Martin Mrazek | last post by:
Hi, how can one HTML document create more than one cookie? I have bloody long html form, to save all its values in 4KB of one cookie is impossible... MM
136
by: Matt Kruse | last post by:
http://www.JavascriptToolbox.com/bestpractices/ I started writing this up as a guide for some people who were looking for general tips on how to do things the 'right way' with Javascript. Their...
23
by: Roel Melchers | last post by:
My ACCESS-database contains all members of my association. When the members attend to a meeting I want to record their presence. When they enter they identify themselves by putting their finger...
10
by: Henrik Dahl | last post by:
Hello! I have an xml schema which has a date typed attribute. I have used xsd.exe to create a class library for XmlSerializer. The result of XmlSerializer.Serialize(...) should be passed as the...
3
by: =?iso-8859-1?q?Marcos_Jos=E9_Setim?= | last post by:
Hi, I would like to know if is better to use document.forms to detect forms or getElementById. Thanks
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.