473,326 Members | 2,081 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,326 software developers and data experts.

[Half-off] How to get textboxes (text blocks) from ps/pdf files?

Hi!

I need to get textboxes/textblocks from pdf files. I can convert them
into ps.
Is anyone knows about method, trick, routine to I can get the textboxes
from ps or pdf?
(Pythonic, COM, or command line solutions needed.)

I need to redraw them into my application, and user can reorder them,
and next I concat. every text to process it.

I need these infos:
x, y, w, h, text

Example:
page1
textbox1{x:100,y:100;w:600;h:27;text:"TextBox1 /xfc /xfa"}
textbox2{x:100,y:180;w:600;h:27;text:"TextBox2"}
page2
textbox1{x:100,y:100;w:600;h:27;text:"TextBox1"}
textbox2{x:100,y:180;w:600;h:27;text:"TextBox2"}
....

Any solution?

Thanks for it!
dd

ps1:
I tried every pdf2text and pdf2html application. All failed in the
test.
Only one provide good informations, the pdftohtml, because it is
makes divs with abs. position and size and the texts.
But this program is not handle the iso-8859-2 chars, so I lost them.

ps2:
The program must run under Windows XP. So the solution is os specific.
Jan 3 '07 #1
0 817

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

2
by: André Hänsel | last post by:
Hi! I compared the printed results of two of my pages. One is a very old page, "styled" almost without any CSS. The other one is a modern page, styled completely in CSS and styled for printing,...
2
by: Astra | last post by:
Hi All This is really bizarre prob that I don't know how I'm going to get round it. In essence I have created a site that allows the user to select their preferred shoe size, this then gets...
27
by: Steven T. Hatton | last post by:
I've finally gotten around to reading Accelerated C++ by Andrew Koenig and Barbara Moo. There's a lot of good stuff in what I've read so far. Even though it is _very_ basic, they present some...
3
by: mitchellpal | last post by:
guys.... help me out here... my code is running halfway... how do i complete the other right half....... pp.. the user should input an odd number btw 0 and 20 then the program displays th shape as...
2
by: spoonerstreet | last post by:
I am trying to take standard MS Time: 2/22/2006 8:56:37 AM (stored in MS Decimal Format) and convert it to the half hour of the day it occurred in: 2/22/2006 8:56:37 AM would be in the 18th...
0
by: T | last post by:
When I call graphics.DrawRectangle(Pens.Black, 100, 100, 10, 10) the top left pixel is not painted if PixelOffsetMode is set to PixelOffstMode.Half. Why? Looks like a serious bug to me....
4
by: trint | last post by:
I am developing a new website for our company with c# dotNet. There is no problem displaying this website with ie6. Last week I downloaded ie7 because a friend of mine said he could only see half...
8
by: zionist.news | last post by:
The world have been after Bill Gates for no reason. The richest group was and remains the Zionist jew Rothschilds family who own HALF the worlds total wealth through numerous frontmen zionists. ...
2
by: Sef | last post by:
On another forum I am working on getting a mod working for Half Life 2. It involves adding the gun from Portal to Half Life 2 however there is many differences between the two games. We have...
4
by: mathieu | last post by:
Hello, Has anyone heard of the 'half' floating point type. That would be a 16bits floating point, see for example: http://oss.sgi.com/projects/ogl-sample/registry/ARB/half_float_pixel.txt ...
0
by: DolphinDB | last post by:
Tired of spending countless mintues downsampling your data? Look no further! In this article, you’ll learn how to efficiently downsample 6.48 billion high-frequency records to 61 million...
0
by: ryjfgjl | last post by:
ExcelToDatabase: batch import excel into database automatically...
0
by: Vimpel783 | last post by:
Hello! Guys, I found this code on the Internet, but I need to modify it a little. It works well, the problem is this: Data is sent from only one cell, in this case B5, but it is necessary that data...
0
by: jfyes | last post by:
As a hardware engineer, after seeing that CEIWEI recently released a new tool for Modbus RTU Over TCP/UDP filtering and monitoring, I actively went to its official website to take a look. It turned...
0
by: ArrayDB | last post by:
The error message I've encountered is; ERROR:root:Error generating model response: exception: access violation writing 0x0000000000005140, which seems to be indicative of an access violation...
1
by: PapaRatzi | last post by:
Hello, I am teaching myself MS Access forms design and Visual Basic. I've created a table to capture a list of Top 30 singles and forms to capture new entries. The final step is a form (unbound)...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
0
by: af34tf | last post by:
Hi Guys, I have a domain whose name is BytesLimited.com, and I want to sell it. Does anyone know about platforms that allow me to list my domain in auction for free. Thank you
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.