473,466 Members | 1,343 Online
Bytes | Software Development & Data Engineering Community
Create Post

Home Posts Topics Members FAQ

Parsing data from printer file

Hi there,

We are thinking of developing a product that needs to be able to parse
data printed via some printer driver. The data will then be used to
fill a PDF form. We believe the printer solution will be best as there
are multiple applications that we need to retrieve data from.

This is a brand new area for us, so I'm basically wondering if there is
any out-of-the-box printer drivers that create files with a structure
that we can then use to extract data from? I guess the printer files
need to have a fixed position structure so that we can tell our
software to "get X characters starting at position X".

I would appreciate any recommendations, ideas and experience regarding
this type of development. Am I missing something that makes this type
of software more or less impossible?

We plan to develop it using VB.NET.

Thank you in advance!

Peter

Dec 25 '05 #1
1 1613

Well, there are a number of printer drivers that will send the
output to a PDF file, and I am sure it should be easy to find
a component that will let you extract the text from those files.

Or install a standard PCL printer and configure it to print to
a file rather than a local port. You could then parse the pcl
output directly.

But scraping the data? Even if you could get the programs
to output the text at specific positions, I am not sure that
information might survive intact to the printer file. There are
often minor translations going on due to attempts to reconcile
differences between the printer capabilities and the requirements
of the programs (I think). Font substitution should be expected,
but perhaps the positions are intact? Needs testing.

Question is: If you can actually make the programs output
the text at specific positions, would it not be easier to make
them output something you could actually use directly?
Or are you saying that you do not have such control over
the programs, but want to specify those text positions to
the *reader*?

I do not think it is impossible, but screen scraping (in this case
print scraping?) is normally not the way to do things (not the
way to do anything). Even if you get it to work in a controlled
environment, it is a sure road to Support Hell.

Or am I just being pessimistic?

ok, let me put it another way: Screen/print scraping is not
the way to do anything *if* it can be avoided.

Then there is the whole deal of keeping track of what the
printer driver is doing and how it is configured. I guess something
like this could be useful (haven't tried it - just a quick Google
result):
http://www.blackice.com/Printer%20Dr...erProducts.htm
http://www.blackice.com/Printer%20Dr...Tool%20Kit.htm

/JB

On 25 Dec 2005 07:46:38 -0800, pi*****@yahoo.com wrote:
Hi there,

We are thinking of developing a product that needs to be able to parse
data printed via some printer driver. The data will then be used to
fill a PDF form. We believe the printer solution will be best as there
are multiple applications that we need to retrieve data from.

This is a brand new area for us, so I'm basically wondering if there is
any out-of-the-box printer drivers that create files with a structure
that we can then use to extract data from? I guess the printer files
need to have a fixed position structure so that we can tell our
software to "get X characters starting at position X".

I would appreciate any recommendations, ideas and experience regarding
this type of development. Am I missing something that makes this type
of software more or less impossible?

We plan to develop it using VB.NET.

Thank you in advance!

Peter


Dec 25 '05 #2

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

5
by: MouseHart | last post by:
I've written a simple program in VB 6.0 to list all my MP3 files. To show them on the screen I used an MSFlexGrid named TextGrid (which is not associated with any table or text file) in the...
7
by: MouseHart | last post by:
I have written a small VB program which lists my music files on my computer, but I can't get it to print exactly the way I want it to. The data is viewed on screen in an MSFlexGrid named...
2
by: Peter Sprenger | last post by:
Hello, I hope somebody can help me with my problem. I am writing Zope python scripts that will do parsing on text for dynamic webpages: I am getting a text from an oracle database that contains...
9
by: gov | last post by:
Hi, I've just started to learn programming and was told this was a good place to ask questions :) Where I work, we receive large quantities of data which is currently all printed on large,...
18
by: Jeff Rodriguez | last post by:
If main is prototyped as: int main(int argc, char *argv); You will end up with a bunch of arguments in *argv, and the number in argc. Now what I want to do is emulate that same action on a...
6
by: notregister | last post by:
hi, i have been trying to send a printer-ready data to an unknown printer to print. the reason why the printer is unknown is that i do no wish installed any printer driver. so at the end my...
4
by: Schwarty | last post by:
I hope I posted this to the correct group. If not, please let me know and I will get it posted in the correct section. I have a web application developed in ASP.NET using C# for the code behind....
3
by: toton | last post by:
Hi, I have some ascii files, which are having some formatted text. I want to read some section only from the total file. For that what I am doing is indexing the sections (denoted by .START in...
7
by: RB0135 | last post by:
Hi All, I have some Windows BMP, 1BPP, monochrome files that I need to get the raw data from to load a graphics buffer on a Roll Printer (which I know can be done). Lets forget about the Roll...
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
1
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...
0
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The...
0
by: 6302768590 | last post by:
Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated ...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.