Pdf -> Doc

Is there any python module which will convert Acrobat
PDF files to MS Word documents?

I googled for it, but could not find many results apart
from a pdf2doc plugin which needs the complete Acrobat
installation to work.

Thanks for any hints.

-Anand Pillai
Jul 18 '05 #1
| Is there any python module which will convert Acrobat PDF files
| to MS Word documents?
| ...

I recently found a command-line tool
that will convert PDF files to HTML ....

http://sourceforge.net/projects/pdftohtml/

I've only tried it with a few files,
but the only failure I had was with
a PDF file that I also couldn't copy/paste
from using the Adobe Acrobat reader ....

Perhaps a Python wrapper for this program
could suffice along with subsequent HTML to Doc
conversion if needed ....
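A minimal sketch of such a wrapper, assuming the pdftohtml executable is installed, is on the PATH, and accepts a PDF path followed by an output base name. The function names here are hypothetical and not part of any existing module, and no command-line options are passed because the available flags may differ between builds.

```python
import shutil
import subprocess


def build_pdftohtml_command(pdf_path, out_base, exe="pdftohtml"):
    """Return the argument list for one pdftohtml run (nothing is executed)."""
    # Assumed invocation: pdftohtml <PDF-file> <output-base>
    return [exe, pdf_path, out_base]


def pdf_to_html(pdf_path, out_base, exe="pdftohtml"):
    """Run pdftohtml on pdf_path; raise if the tool is missing or fails."""
    if shutil.which(exe) is None:
        raise FileNotFoundError(f"{exe} not found on PATH")
    # check=True raises CalledProcessError on a non-zero exit status
    subprocess.run(build_pdftohtml_command(pdf_path, out_base, exe), check=True)
```

Calling `pdf_to_html("report.pdf", "report")` would then leave the HTML output next to the given base name, ready for a separate HTML to Doc step.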

--
Cousin Stanley
Human Being
Phoenix, Arizona

Jul 18 '05 #2
Anand Pillai wrote:
| Is there any python module which will convert Acrobat
| PDF files to MS Word documents?
|
| I googled for it, but could not find many results apart
| from a pdf2doc plugin which needs the complete Acrobat
| installation to work.
|
| Thanks for any hints.
|
| -Anand Pillai

Could you use Jython (the Java port of Python) [http://www.jython.org]?
If that was an option, you could use Cocoon
[http://cocoon.apache.org/2.1/], which I *think* can convert to/from
PDF. I also *think* it can convert to DOC, and it can definitely
convert to RTF.

Sorry to be so vague. I have not used Cocoon in anger before, but I looked
at it in some depth a while back.

Hope this is some use,
Best wishes

John

--
John Niven
[Reply via newsgroup]

Jul 18 '05 #3
jb
You can convert the PDF to another format such as HTML or RTF with a
third-party tool. Then, if you have Word on the machine, you can use win32com
from Python to run a macro that opens Word, opens your exported file, and
saves it as a .doc file. After that you can batch-process your files.

you can even write the macro string in python, and execute it on the fly
through word
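A rough sketch of the win32com approach, assuming Windows, an installed copy of Word, and the pywin32 package. It drives Word directly through COM instead of building a macro string. The helper names `doc_path_for` and `save_as_doc` are hypothetical; `0` is the value of Word's `wdFormatDocument` constant.

```python
import os

WD_FORMAT_DOCUMENT = 0  # Word's wdFormatDocument constant (.doc format)


def doc_path_for(src_path):
    """Return the absolute .doc path that sits next to src_path."""
    base, _ext = os.path.splitext(os.path.abspath(src_path))
    return base + ".doc"


def save_as_doc(src_path):
    """Open src_path (e.g. an exported .rtf or .html) in Word and save it as .doc."""
    import win32com.client  # from pywin32; only importable on Windows

    dst_path = doc_path_for(src_path)
    word = win32com.client.Dispatch("Word.Application")
    word.Visible = False
    try:
        doc = word.Documents.Open(os.path.abspath(src_path))
        doc.SaveAs(dst_path, FileFormat=WD_FORMAT_DOCUMENT)
        doc.Close()
    finally:
        word.Quit()  # always shut Word down, even if the conversion fails
    return dst_path
```

To batch a folder, call `save_as_doc` once per exported file; starting Word once and reusing it for all files would be faster, at the cost of slightly more bookkeeping.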

good luck

jb
"Anand Pillai" <py*******@Hotpop.com> wrote in message
news:84**************************@posting.google.com...
| Is there any python module which will convert Acrobat
| PDF files to MS Word documents?
|
| I googled for it, but could not find many results apart
| from a pdf2doc plugin which needs the complete Acrobat
| installation to work.
|
| Thanks for any hints.
|
| -Anand Pillai

Jul 18 '05 #4
