473,385 Members | 1,342 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,385 software developers and data experts.

Reading PDF file (as a whole or in parts) ASP .NET

Hi mates

I have learnt that PDF file/s can be read using API programming. I need a s/w to read parts of PDF files and then later store them in the database. The PDFs contain pictures as well as text. I tried using the professional converter s/w, but the tend to disturb the format of the text/pictures.

Does anyone has the code? Any help otherwise?

Thanks
Jul 31 '07 #1
1 1233
May be this tool will help you: http://text-mining-tool.com

Text Mining Tool is a freeware program for extraction of text from files of the next types: pdf, doc, rtf, chm, html without need to have installed any other programs like Word, Arcrobat, etc.

The beauty of the program is that it works, extremely simply, on almost all common forms of documents. That includes HTML web pages, both DOC and RTF document formats from Microsoft Word and others like Open Office, Windows Help files ending in CHM, and portable documents using PDF format.

Console tool minetext for automation of text converting is included. It may help you!
Dec 13 '07 #2

Sign in to post your reply or Sign up for a free account.

Similar topics

3
by: markspace | last post by:
Hi all, Here's a question I haven't been able to answer on my own. I want to read in the entire contents of a file into some structure in memory then process it. Like for example read an image...
19
by: Lionel B | last post by:
Greetings, I need to read (unformatted text) from stdin up to EOF into a char buffer; of course I cannot allocate my buffer until I know how much text is available, and I do not know how much...
2
by: jimmyfishbean | last post by:
Hi, I am using VB6, SAX (implementing IVBSAXContentHandler). I need to extract binary encoded data (images) from large XML files and decode this data and generate the appropriate images onto...
11
by: Matt DeFoor | last post by:
I have some log files that I'm working with that look like this: 1000000000 3456 1234 1000000001 3456 1235 1000020002 3456 1223 1000203044 3456 986 etc. I'm trying to read the file...
5
by: ericunfuk | last post by:
I'm wondering if the following is fesible. I copy a whole file into memory, then I traverse forwards and backwards in the part of the memory contains the file, to get the chunk of the file I...
8
by: Sharkolomew | last post by:
Hi all. I have a problem I want my program to do the following. Create a file of student ID #s. Then, Read the file. Search for the line of student ID # say 10002. At that line, input test...
3
by: miss time | last post by:
Hi all, my java friends ^-^ I have next week quiz in reading file text ,and understand the topic very well. can any one give some question related to this topic .this help me more to...
3
by: jain236 | last post by:
Hi , i have file of 32kb , i want to read the whole file into string , i tried this by doing the below code, but i dint got the whole content of the file in the string , i guess the variable is not...
1
Coldfire
by: Coldfire | last post by:
Hi, The strange problem i am having is, the input element of type='file' not reading file names after 20 file elements. It simple returns null on reading the 'name' of file. The code is...
1
by: bjoarn | last post by:
I have an Application C# handling file reading, building index on this file, using dll wrapped with SWIG. The dll is originaly programmed in C++. Dll reports back to the the C# programm throug...
1
by: CloudSolutions | last post by:
Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...
0
by: ryjfgjl | last post by:
In our work, we often need to import Excel data into databases (such as MySQL, SQL Server, Oracle) for data analysis and processing. Usually, we use database tools like Navicat or the Excel import...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: ryjfgjl | last post by:
If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.