473,395 Members | 1,613 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,395 software developers and data experts.

PDF library?

I have a big PDF file that I'd like to crunch, i.e. I want to select a
certain rectangular area from each page and make a new PDF combining
the selected areas from adjacent pages. I guess that means I need a
Python wrapper for GhostScript, or something similar. Anyone know if
that exists? Thanks.
Jul 18 '05 #1
5 1510
On Tue, 20 Apr 2004 12:14:03 -0700, Paul Rubin wrote:
I have a big PDF file that I'd like to crunch, i.e. I want to select a
certain rectangular area from each page and make a new PDF combining the
selected areas from adjacent pages. I guess that means I need a Python
wrapper for GhostScript, or something similar. Anyone know if that
exists? Thanks.

http://www.reportlab.org/

handles pdf files.

Simon.

Jul 18 '05 #2
Simon Burton <si****@NOTTHISBIT.webone.com.au> writes:
http://www.reportlab.org/

handles pdf files.


Reportlab generates reports in pdf format, but I want to do the
opposite, namely read in pdf files that have already been generated by
a different program, and crunch on them. Any more ideas? Thanks.
Jul 18 '05 #3
Aloha,

Paul Rubin schrieb:
Simon Burton <si****@NOTTHISBIT.webone.com.au> writes:
http://www.reportlab.org/
handles pdf files.

Reportlab generates reports in pdf format, but I want to do the
opposite, namely read in pdf files that have already been generated by
a different program, and crunch on them. Any more ideas? Thanks.


The commercial version (reportlab.com) mentions a tool named
PageCatcher, that seems to be able to extract pages and page descriptions
out of .pdf documents. There is not that many information on the web-page.

If you read comp.text.tex you will find various solutions for composing
and a few for extracting data/content from .pdf documents. Afaik there
is at the moment (read as: i'm working on it) no free-self-contained-
python solution. But as python is very interface-friendly you can use
general tools like gs easily.

For your problem i would suggest to use gs als a .pdf to .ps filter
in the first place, work on the .ps and distill back with gs.

Wishing a happy day
LOBI
Jul 18 '05 #4
Andreas Lobinger schrieb:
If you read comp.text.pdf you will find various solutions for composing

Jul 18 '05 #5
Paul Rubin <http://ph****@NOSPAM.invalid> wrote in
news:7x************@ruckus.brouhaha.com:
Simon Burton <si****@NOTTHISBIT.webone.com.au> writes:
http://www.reportlab.org/

handles pdf files.


Reportlab generates reports in pdf format, but I want to do the
opposite, namely read in pdf files that have already been generated by
a different program, and crunch on them. Any more ideas? Thanks.


Reportlab does that as well, but you either have to pay them money or live
with a Reportlab watermark added to each page you process. So, if you are
doing this for fun it may not be a useful answer, but if its commercial you
can investigate it for free and pay later to remove the watermark.

Jul 18 '05 #6

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

2
by: pieter.breed | last post by:
Hi All, The company I work for has traditionally used COM/ActiveX for the solutions that it provides. We are in the process of moving to .NET and a few applications have been written in VB.NET...
3
by: K.S.Liang | last post by:
Hi all, 1> If there are more than one dynamic linking libraries in the file system, how do I know which one is loaded into system? Any C library or system call can tell me which *.so or *.sl is...
1
by: Jim | last post by:
Have fully operational software package developed on VB.NET that worked until Jan 1 2003, with early stage deployments on Oct 10, Oct 23, Nov 11, Dec 12 and Dec 30. When attempted final...
10
by: mwt | last post by:
So in a further attempt to learn some Python, I've taken the little Library program (http://groups.google.com/group/comp.lang.python/browse_thread/thread/f6a9ccf1bc136f84) I wrote and added...
10
by: Julian | last post by:
I get the following error when i try to link a fortran library to a c++ code in .NET 2005. LINK : fatal error LNK1104: cannot open file 'libc.lib' the code was working fine when built using...
20
by: Frank-O | last post by:
Hi , Recently I have been commited to the task of "translating" some complex statistical algorithms from Matlab to C++. The goal is to be three times as fast as matlab ( the latest) . I've...
6
by: =?Utf-8?B?WW9naSBXYXRjaGVy?= | last post by:
Hello, I am using Visual Studio-2003. I created a project to build my library. Since I am using third party libraries as well, I have specified those additional library dependencies in project...
0
by: JosAH | last post by:
Greetings, the last two article parts described the design and implementation of the text Processor which spoonfeeds paragraphs of text to the LibraryBuilder. The latter object organizes, cleans...
0
by: JosAH | last post by:
Greetings, welcome back; above we discussed the peripherals of the Library class: loading and saving such an instantiation of it, the BookMark interface and then some. This part of the article...
16
by: Xiaoxiao | last post by:
Hi, I got a C library, is there a way to view the public function names in this library so that I can use in my C program? Thanks.
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.