Looking for people who have built document scanning/retrieval into an Access database

I am looking for others who have built systems to scan documents, index
them and then make them accessible from an Access database. My
environment is a nonprofit with about 20-25 case workers who use
laptops. They have Access databases on their laptops and the data is
replicated.

The idea is that each case worker would scan their own documents,
either remotely or back at the office.

And NO, I am not planning to store the scanned images in the Access
database. I already know not to do that. The Access database would
only have a record with an index of the document and its file name.

Conceptual Approach
--------------------
Use a document scanner that can put the documents in a directory with a
sequential number affixed. Something like c:\ScannedDocs

Then I would plan to have a program - probably Access/VBA - that goes
through all documents (in sequential file number order) in the directory
and brings up the scanned image. At this point the case worker will
identify the document by client/consumer and type of document.

Then I propose to copy the document to another location, something like
c:\IndexedDocs,
and rename the doc to include the client/consumer #, document type,
scan date and a sequential number in the document name:
xxxxxxx-TTTTTTTTTT-yyyy-mm-dd-ssss

I would delete the source document from the scanned-in documents folder.

At the same time I would add a record to the Access database that links
to the client/consumer, identifies the document type and scan date, and
has the file name of the indexed document.
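
A minimal sketch of the indexing step described above, written in Python rather than Access/VBA so it can be run standalone. Everything named here is an assumption for illustration: `sqlite3` stands in for the Access database, the `ScannedDoc` table and its columns are hypothetical, and the trailing `ssss` is taken to be a counter that keeps file names unique.

```python
import shutil
import sqlite3
from datetime import date
from pathlib import Path


def indexed_name(client_no: str, doc_type: str, scan_date: date, seq: int, ext: str) -> str:
    """Build a name in the xxxxxxx-TTTTTTTTTT-yyyy-mm-dd-ssss pattern."""
    return f"{client_no}-{doc_type}-{scan_date:%Y-%m-%d}-{seq:04d}{ext}"


def index_document(src: Path, indexed_dir: Path, db: sqlite3.Connection,
                   client_no: str, doc_type: str, scan_date: date) -> Path:
    """Move one scanned file into the indexed folder and record it."""
    indexed_dir.mkdir(parents=True, exist_ok=True)
    # Hypothetical table; a real Access schema would link to the client table.
    db.execute(
        "CREATE TABLE IF NOT EXISTS ScannedDoc ("
        "ClientNo TEXT, DocType TEXT, ScanDate TEXT, FileName TEXT)"
    )
    # Pick the first sequence number whose file name is still free.
    seq = 1
    while (indexed_dir / indexed_name(client_no, doc_type, scan_date, seq, src.suffix)).exists():
        seq += 1
    dest = indexed_dir / indexed_name(client_no, doc_type, scan_date, seq, src.suffix)
    shutil.move(str(src), str(dest))  # removes the file from the scanned-in folder
    with db:  # commits the insert, or rolls back on error
        db.execute(
            "INSERT INTO ScannedDoc VALUES (?, ?, ?, ?)",
            (client_no, doc_type, scan_date.isoformat(), dest.name),
        )
    return dest
```

Only the file name is stored, not the full path, so the same record stays valid whether the file sits on the laptop or in a central folder.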

When viewing the document within Access, I plan to retrieve it and
insert it into a blob within an Access form, for display only.
I would NOT store the image in the Access database.

The Access database is already planned to be replicated, so this
approach allows the information on scanned documents to be available to
central office personnel as well. I am planning to have a central file
of scanned images, so each time the user would come into the office and
the Access database would be replicated, all new scanned and indexed
documents would be uploaded to a central repository.
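
The upload step could be sketched roughly as follows, again in Python for illustration. The function name and both folder arguments are hypothetical, and the sketch assumes indexed file names are unique across laptops, so a name that already exists centrally can be skipped.

```python
import shutil
from pathlib import Path


def upload_new_documents(local_dir: Path, central_dir: Path) -> list[str]:
    """Copy files the central folder does not have yet; return their names."""
    central_dir.mkdir(parents=True, exist_ok=True)
    uploaded = []
    for src in sorted(local_dir.iterdir()):
        dest = central_dir / src.name
        if src.is_file() and not dest.exists():
            shutil.copy2(src, dest)  # copy2 also preserves file timestamps
            uploaded.append(src.name)
    return uploaded
```

Running this right after the database replication would keep the central records and the central image folder in step.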

The laptop users would only have scanned documents on their own
clients/consumers. Generally about 100-150 clients/consumers at a time.
I am guessing that initially the system would record 10-20 scanned
images per consumer. However over time, I would anticipate that more
and more documents would be scanned. The central database would have a
copy of all scanned documents.
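
As a rough check on those figures, the initial image counts can be worked out directly. This assumes each client/consumer belongs to exactly one case worker, which is not stated above.

```python
def image_count(workers: int, clients_per_worker: int, images_per_client: int) -> int:
    """Total scanned images for the given staffing and caseload figures."""
    return workers * clients_per_worker * images_per_client


# Per laptop: one case worker's own clients only.
laptop_low = image_count(1, 100, 10)     # 1,000 images
laptop_high = image_count(1, 150, 20)    # 3,000 images

# Central repository: every case worker's documents.
central_low = image_count(20, 100, 10)   # 20,000 images
central_high = image_count(25, 150, 20)  # 75,000 images
```

So even at the initial rate the central store would hold tens of thousands of files, which is worth keeping in mind when deciding on a folder layout.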

ISSUES

Has anyone done this on a distributed, laptop-oriented basis before?
If so, guidance would be appreciated.

Any suggestions on scanners to use?

Anyone have experience in having multiple users scan in their own images
and run indexing processes vs. having a central scanning and indexing
function?

Should I try combining multiple images together? Most documents are
single page but a few are 2-3 pages and one is a whopping 18 pages.
Paperport software says it has features to combine multiple scanned images.
Should I combine them or keep them separate and just use page numbers?

What document format and resolution is needed? I am assuming that I
would use JPG but need suggestions on resolution.

Anyone have examples of doing this they would be willing to share?

Any comments on or suggestions for improving the overall approach?

Thanks

Bob

bobalston9 AT yahoo DOT com
Aug 13 '06 #1
> Any comments on or suggestions for improving the overall approach.

As I understand your post you will store the Image rather than have
Optical Character Recognition read the text?

I haven't used OCR much but it's never failed me, although the
formatting and location of text can often be disappointing, and TTBOMK
it's included in most scanning software.

***
Have you considered an Indexing Service Application? Properly set up,
it will maintain catalogs of whatever folders and documents you
instruct it to, and its search capabilities (for documents) are many
times more powerful than one is likely even to dream about for
Access/Jet. Indexing Service is not really so well-documented, seems to
be infrequently used, and may appear arcane and difficult. Once you
become slightly familiar with it, well ... you can fall in love with
it!
It's fully accessible through ADO. Theoretically at least, by using an
unconnected (as opposed to disconnected - never connected rather than
once connected) ADP which will have zip to do with SQL-Server, one
could create ADO recordsets and use them for both forms and reports.

This would be my way ... but for whatever reason it does not seem to be
the way of many others.

Aug 13 '06 #2
Lyle Fairfield wrote:
> Have you considered an Indexing Service Application?
> [...]

Thanks for the suggestions. I will look into that.

And yes, it is the image I want to capture rather than OCR text, because
they need the image of the client signature.

Bob
Aug 13 '06 #3
Download smart-it accounting from www.smartit.co.za
Go to Customers Maintenance and then click on the scan tab. Scan in
something. If that is what you want to do then I will send the code or you
can use the program.
Alfred -- email to ad*******@gmail.com
Aug 13 '06 #4
alfred wrote:
> Download smart-it accounting from www.smartit.co.za
> [...]

Thanks, downloading it now.

Is your app built in Access? What language is the code in?

Right now I don't have a scanner hooked up but hopefully I will get the
gist from the user interface.

Bob
Aug 13 '06 #5
alfred wrote:
> Download smart-it accounting from www.smartit.co.za
> [...]

Yes, that looks right on. I would appreciate a copy of the code.

Thank you.

Bob

bobalston9 AT yahoo DOT com
Aug 13 '06 #6
Bob Alston wrote:
> I am looking for others who have built systems to scan documents, index
> them and then make them accessible from an Access database.
> [...]

I know that my client uses a scanner. It's a good one. It scans both
sides, and somehow you can stick in a stack of documents and it knows
when one stack completes a file and another begins.

One thing they do is have a stamp. They stamp the document and enter
the "id" information on it. Then when they scan the document they know
the type and the identifier for the database association.

I too used the ScannedDocuments folder. When they scan, since I have
various types of documents I want to catalog, I have subfolders under
scanned docs...like AR, Project, Customer, etc.

Because there can be thousands of documents, I check the current date. I
then check to see if a folder for that year exists. Ex:
\ScannedDocuments\Ar\2006.

I use ScannedDocuments\Ar as my holding folder for AR files. Once the
file's been tagged to an Access record, I move it to 2006. This
compartmentalizes the files and keeps the number of files in each folder down.
There's nothing like going to a folder that has 20000+ documents in it
and waiting for Explorer to present the list. It's a yawner.
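
A minimal sketch of that filing step, in Python for illustration; the function name is hypothetical, and the year is taken from the current date unless one is passed in. It does not deal with a file of the same name already sitting in the year folder.

```python
import shutil
from datetime import date
from pathlib import Path
from typing import Optional


def file_into_year_folder(doc: Path, today: Optional[date] = None) -> Path:
    """Move a tagged document from its holding folder into a per-year subfolder."""
    year = (today or date.today()).year
    year_dir = doc.parent / str(year)  # e.g. ScannedDocuments/Ar/2006
    year_dir.mkdir(exist_ok=True)      # created the first time it is needed
    dest = year_dir / doc.name
    shutil.move(str(doc), str(dest))
    return dest
```

For example, `file_into_year_folder(Path("ScannedDocuments/Ar/inv001.tif"), date(2006, 8, 13))` would leave the file at `ScannedDocuments/Ar/2006/inv001.tif`.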

If memory serves me correctly, their scanning software creates the
filename. So for the most part I can assume the filename is unique.
However, a user might have a file on his hard drive he wants accessible
to all, and I allow him to select the file via a File/Open. In this
case, he might have a filename that already exists in the 2006 folder.
If he copied it to 2006, it'd overwrite it unless I made an adjustment.
So I append a counter to it. The original might be Test.Txt. The
next will be Test1.Txt and so on.
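
The counter idea might look something like this in Python. This is a sketch with a hypothetical function name, not the code from the program described.

```python
from pathlib import Path


def unique_destination(folder: Path, filename: str) -> Path:
    """Return a path in `folder` that does not collide with an existing file."""
    candidate = folder / filename
    stem, suffix = candidate.stem, candidate.suffix
    counter = 0
    while candidate.exists():
        counter += 1
        # Test.Txt -> Test1.Txt -> Test2.Txt ...
        candidate = folder / f"{stem}{counter}{suffix}"
    return candidate
```

The caller would copy the user's file to the returned path and store that final name in the database record.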

If your scanner allows you to script filenames, that can make life
simple...simply parse the filename to figure out which records to
associate it with. If not, I allow the user to view the doc and with
the stamped info on it they can tag it to the record quickly.
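
As an illustration of parsing a scripted filename, here is a Python sketch. The layout it expects (id, type, date and sequence number separated by hyphens) is an assumption borrowed from the naming scheme proposed earlier in this thread; a real scanner's scripting options may produce something different, and `DocKey` and `parse_scan_name` are hypothetical names.

```python
import re
from datetime import date
from typing import NamedTuple, Optional


class DocKey(NamedTuple):
    client_no: str
    doc_type: str
    scan_date: date
    seq: int


# Assumed layout: <client>-<type>-<yyyy>-<mm>-<dd>-<seq>.<ext>
_NAME = re.compile(
    r"^(?P<client>[^-]+)-(?P<type>[^-]+)-"
    r"(?P<y>\d{4})-(?P<m>\d{2})-(?P<d>\d{2})-(?P<seq>\d+)\.\w+$"
)


def parse_scan_name(filename: str) -> Optional[DocKey]:
    """Return the key fields encoded in a file name, or None if it does not fit."""
    m = _NAME.match(filename)
    if m is None:
        return None
    try:
        scanned = date(int(m["y"]), int(m["m"]), int(m["d"]))
    except ValueError:  # digits in the right places but not a real date
        return None
    return DocKey(m["client"], m["type"], scanned, int(m["seq"]))
```

Files that return `None` would fall back to the manual view-and-tag route.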

It's funny, I wrote the program but haven't been involved in the process.
I think most documents that are scanned come in as TIFs. I know the
documents that are multi-page remain multi-page...as in 1 file. I
wish I knew the type of scanner they use...it's a good one.
Aug 13 '06 #7
