473,815 Members | 4,001 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

how to convert a html file into DOM and saving it to a file using python

3 New Member
i want to convert a html file to dom using python
Jul 8 '10 #1
1 1908
bvdet
2,851 Recognized Expert Moderator Specialist
Try Beautiful Soup to parse your HTML document. You can create a customized parser as a subclass of Beautiful Soup to access the parse tree using the Document Object Model.
Jul 8 '10 #2

Sign in to post your reply or Sign up for a free account.

Similar topics

2
1711
by: CB Hamlyn | last post by:
Hello, I'm currently working on a Modification for the PC game Battlefield 1942. I'm a pretty good VBA Programmer with some Foxpro under belt as well. After reading a ton of stuff on-line I'm simply floored by everything Python has to offer. Anyway, I want to create an installation program that takes a folder and all it's sub-folders and copies them from a compressed form into the proper directory. Basically a simplified version...
3
323
by: Andy Smith | last post by:
Hi, I need to convert HTML files to PDF using a .net component. I've seen components that do this but they all are either com wrappers on activex controls, or print to a 'Convert to PDF' printer. We can't use either of these options - we need very quick conversion and will be producing high volumes of html/pdf files. Does anyone know of a .net component that will convert to PDF - ideally made by a fairly large vendor so we can get...
0
907
by: chalres | last post by:
i'd like to convert html page into word file .. do you have any reference of this problem? give me some advice.
8
4218
by: John Salerno | last post by:
I figured my first step is to install the win32 extension, which I did, but I can't seem to find any documentation for it. A couple of the links on Mark Hammond's site don't seem to work. Anyway, all I need to do is search in the Word document for certain strings and either delete them or replace them. Easy enough, if only I knew which function, etc. to use. Hope someone can push me in the right direction.
5
3962
by: Damiano | last post by:
Hello, How could i convert Html page in a PDF file? Is there a free library? Thanks
0
1321
vinoj
by: vinoj | last post by:
How to convert a html file in pdf format using python. I have searched in google and i have got to know about a module named zopyx.convert, i am completely new to this module. please help me if any one of you know other techniques to do this or any tutorial as such of the above mentioned module. I am using Python 2.3.4 on LINUX Platform. Thank you,
3
2096
by: krishindevnet | last post by:
Hi , I am working on a project which has editor which generates the document templates in HTML .I am required to convert these HTML templates to dot file extension . If anybody knows about that please help me out and guide me overcoming these problem. thanks and regards, Krish
6
1765
by: LizzyLiz | last post by:
Hi I need to convert a .csv file to .xls file using python 2.1.3 which means I can't use pyExcelerator! Does anyone know how I can do this? Many thanks LizzyLiz
3
1551
by: zaggi | last post by:
Hi, I need to convert HTML file to any image file. I need to get it done in c#.NET only. Could you please suggest me some ways of doing it. Thanks, Zaggi
0
10670
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers, it seems that the internal comparison operator "<=>" tries to promote arguments from unsigned to signed. This is as boiled down as I can make it. Here is my compilation command: g++-12 -std=c++20 -Wnarrowing bit_field.cpp Here is the code in...
0
10408
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven tapestry of website design and digital marketing. It's not merely about having a website; it's about crafting an immersive digital experience that captivates audiences and drives business growth. The Art of Business Website Design Your website is...
1
10426
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows Update option using the Control Panel or Settings app; it automatically checks for updates and installs any it finds, whether you like it or not. For most users, this new feature is actually very convenient. If you want to control the update process,...
0
9225
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing, and deployment—without human intervention. Imagine an AI that can take a project description, break it down, write the code, debug it, and then launch it, all on its own.... Now, this would greatly impact the work of software developers. The idea...
1
7686
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new presenter, Adolph Dupré who will be discussing some powerful techniques for using class modules. He will explain when you may want to use classes instead of User Defined Types (UDT). For example, to manage the data in unbound forms. Adolph will...
0
6897
by: conductexam | last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and then checking html paragraph one by one. At the time of converting from word file to html my equations which are in the word document file was convert into image. Globals.ThisAddIn.Application.ActiveDocument.Select();...
0
5570
by: TSSRALBI | last post by:
Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The last exercise I practiced was to create a LAN-to-LAN VPN between two Pfsense firewalls, by using IPSEC protocols. I succeeded, with both firewalls in the same network. But I'm wondering if it's possible to do the same thing, with 2 Pfsense firewalls...
0
5708
by: adsilva | last post by:
A Windows Forms form does not have the event Unload, like VB6. What one acts like?
3
3030
bsmnconsultancy
by: bsmnconsultancy | last post by:
In today's digital era, a well-designed website is crucial for businesses looking to succeed. Whether you're a small business owner or a large corporation in Toronto, having a strong online presence can significantly impact your brand's success. BSMN Consultancy, a leader in Website Development in Toronto offers valuable insights into creating effective websites that not only look great but also perform exceptionally well. In this comprehensive...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.