PDF library? - Python

Paul Rubin

I have a big PDF file that I'd like to crunch, i.e. I want to select a
certain rectangular area from each page and make a new PDF combining
the selected areas from adjacent pages. I guess that means I need a
Python wrapper for GhostScript, or something similar. Anyone know if
that exists? Thanks.

Jul 18 '05 #1

Subscribe Post Reply

1510

Simon Burton

On Tue, 20 Apr 2004 12:14:03 -0700, Paul Rubin wrote:

I have a big PDF file that I'd like to crunch, i.e. I want to select a
certain rectangular area from each page and make a new PDF combining the
selected areas from adjacent pages. I guess that means I need a Python
wrapper for GhostScript, or something similar. Anyone know if that
exists? Thanks.

http://www.reportlab.org/

handles pdf files.

Simon.

Jul 18 '05 #2

Paul Rubin

Simon Burton <si****@NOTTHISBIT.webone.com.au> writes:

http://www.reportlab.org/

handles pdf files.

Reportlab generates reports in pdf format, but I want to do the
opposite, namely read in pdf files that have already been generated by
a different program, and crunch on them. Any more ideas? Thanks.

Jul 18 '05 #3

Andreas Lobinger

Aloha,

Paul Rubin schrieb:

Simon Burton <si****@NOTTHISBIT.webone.com.au> writes:
http://www.reportlab.org/
handles pdf files.

Reportlab generates reports in pdf format, but I want to do the
opposite, namely read in pdf files that have already been generated by
a different program, and crunch on them. Any more ideas? Thanks.

The commercial version (reportlab.com) mentions a tool named
PageCatcher, that seems to be able to extract pages and page descriptions
out of .pdf documents. There is not that many information on the web-page.

If you read comp.text.tex you will find various solutions for composing
and a few for extracting data/content from .pdf documents. Afaik there
is at the moment (read as: i'm working on it) no free-self-contained-
python solution. But as python is very interface-friendly you can use
general tools like gs easily.

For your problem i would suggest to use gs als a .pdf to .ps filter
in the first place, work on the .ps and distill back with gs.

Wishing a happy day
LOBI

Jul 18 '05 #4

Andreas Lobinger

Andreas Lobinger schrieb:

If you read comp.text.pdf you will find various solutions for composing

Jul 18 '05 #5

Duncan Booth

Paul Rubin <http://ph****@NOSPAM.invalid> wrote in
news:7x************@ruckus.brouhaha.com:

Simon Burton <si****@NOTTHISBIT.webone.com.au> writes:
http://www.reportlab.org/

handles pdf files.

Reportlab generates reports in pdf format, but I want to do the
opposite, namely read in pdf files that have already been generated by
a different program, and crunch on them. Any more ideas? Thanks.

Reportlab does that as well, but you either have to pay them money or live
with a Reportlab watermark added to each page you process. So, if you are
doing this for fun it may not be a useful answer, but if its commercial you
can investigate it for free and pay later to remove the watermark.

Jul 18 '05 #6

Similar topics

Deploying .NET COM class library

by: pieter.breed | last post by:

Hi All, The company I work for has traditionally used COM/ActiveX for the solutions that it provides. We are in the process of moving to .NET and a few applications have been written in VB.NET...

.NET Framework

How to get the information of the dynamic linkink library in UNIX?

by: K.S.Liang | last post by:

Hi all, 1> If there are more than one dynamic linking libraries in the file system, how do I know which one is loaded into system? Any C library or system call can tell me which *.so or *.sl is...

C / C++

Deployemnt- class or library not registered

by: Jim | last post by:

Have fully operational software package developed on VB.NET that worked until Jan 1 2003, with early stage deployments on Oct 10, Oct 23, Nov 11, Dec 12 and Dec 30. When attempted final...

Visual Basic .NET

450 Pound Library Program

by: mwt | last post by:

So in a further attempt to learn some Python, I've taken the little Library program (http://groups.google.com/group/comp.lang.python/browse_thread/thread/f6a9ccf1bc136f84) I wrote and added...

Python

error when linking a Fortran library to c++ code in VC8

by: Julian | last post by:

I get the following error when i try to link a fortran library to a c++ code in .NET 2005. LINK : fatal error LNK1104: cannot open file 'libc.lib' the code was working fine when built using...

.NET Framework

Matrix library where's the speed ?

by: Frank-O | last post by:

Hi , Recently I have been commited to the task of "translating" some complex statistical algorithms from Matlab to C++. The goal is to be three times as fast as matlab ( the latest) . I've...

C / C++

Using Additional Library Directories & Additional Dependencies

by: =?Utf-8?B?WW9naSBXYXRjaGVy?= | last post by:

Hello, I am using Visual Studio-2003. I created a project to build my library. Since I am using third party libraries as well, I have specified those additional library dependencies in project...

.NET Framework

Text retrieval systems - 4A: the Library

by: JosAH | last post by:

Greetings, the last two article parts described the design and implementation of the text Processor which spoonfeeds paragraphs of text to the LibraryBuilder. The latter object organizes, cleans...

Java

Text retrieval systems - 4B: the Library

by: JosAH | last post by:

Greetings, welcome back; above we discussed the peripherals of the Library class: loading and saving such an instantiation of it, the BookMark interface and then some. This part of the article...

Java

Way to view public function names in a library

by: Xiaoxiao | last post by:

Hi, I got a C library, is there a way to view the public function names in this library so that I can use in my C program? Thanks.

C / C++

Merging data from multiple Excel files

by: ryjfgjl | last post by:

In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...

Data Management

Migrating Website to Cloud - Emmanuel Katto

by: emmanuelkatto | last post by:

Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel

General

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Is that possible of reading the .csv file in column wise and the column have different lengths ?

by: Sonnysonu | last post by:

This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

C / C++

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

Windows Server

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

The easy way to turn off automatic updates for Windows 10/11

by: Hystou | last post by:

Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

Windows Server