  #1  
December 17th, 2006, 07:25 PM
vertigo
url filtering

Hello

I want to do some text analysis on HTML documents grabbed from the
internet.
Is there a library that makes it easy to extract the text from HTML
documents
(stripping JavaScript, HTML tags and other unnecessary data)?

Thanks
  #2  
December 17th, 2006, 07:25 PM
Dennis Benzinger
Re: url filtering

On Sun, 17 Dec 2006 20:14:32 +0100,
vertigo <spam@spam.pl> wrote:
Quote:
> Hello
>
> I want to do some text analysis on HTML documents grabbed from the
> internet.
> Is there a library that makes it easy to extract the text from
> HTML documents
> (stripping JavaScript, HTML tags and other unnecessary data)?
>
> Thanks

Try Beautiful Soup: http://www.crummy.com/software/BeautifulSoup/
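
For illustration, here is a minimal sketch of how the text could be pulled out of a page with Beautiful Soup. It assumes the current `beautifulsoup4` package (imported as `bs4`), which is newer than the release available when this thread was written. The function name `html_to_text` and the sample markup are hypothetical, and the fallback to the standard library's `html.parser` is only there so the sketch still runs where Beautiful Soup is not installed.

```python
from html.parser import HTMLParser

try:
    # Third-party package: pip install beautifulsoup4
    from bs4 import BeautifulSoup
except ImportError:
    BeautifulSoup = None

# Elements whose contents are code or styling rather than readable text.
SKIPPED_TAGS = ("script", "style")


class _TextExtractor(HTMLParser):
    """Standard-library fallback: collects text outside script/style."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def html_to_text(html):
    """Return the visible text of an HTML string, whitespace-normalized."""
    if BeautifulSoup is not None:
        soup = BeautifulSoup(html, "html.parser")
        # Remove script and style elements together with their contents.
        for tag in soup(list(SKIPPED_TAGS)):
            tag.decompose()
        text = soup.get_text(separator=" ")
    else:
        parser = _TextExtractor()
        parser.feed(html)
        parser.close()
        text = " ".join(parser.chunks)
    # Collapse runs of spaces and newlines into single spaces.
    return " ".join(text.split())


if __name__ == "__main__":
    sample = (
        "<html><head><title>Demo</title>"
        "<script>var x = 1;</script></head>"
        "<body><h1>Hello</h1><p>Some <b>bold</b> text.</p></body></html>"
    )
    print(html_to_text(sample))
```

Because a space is inserted between every text fragment, words that are split by inline tags (for example `b<b>old</b>`) would come out as two tokens. For text analysis that may need a different separator.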


Dennis
 
