How to search text on an html page

Hi,
I want to search for a particular text on an html page ---> Build Complete
Further execution should be only if the build is successful which is denoted by the text 'Build Complete' on the webpage.

Expand|Select|Wrap|Line Numbers

 
URL = "http://11.12.13.27:8080/cruisecontrol"
 
from urllib2 import urlopen

from HTMLParser import HTMLParser
 
import re
 
# Fetching links using HTMLParser

def get_links(url):

    parser = MyHTMLParser()

    parser.feed(urlopen(url).read())

    parser.close()

    return parser.links
 
# Build url for Deploy page

def get_deploy_url():

    url = URL + "/buildresults/Poker-TTM_%s_nightly_build" % branch

    print url

    check_re = re.compile(r"/Build Complete/")

    print check_re

    if check_re.search(url):

        print "hello"

        for link in get_links(url):

            if link["href"].startswith("Deploy"):

                return "%s/%s" % (URL, link["href"])

        print link["href"]
 
# Build url for Destination page

def get_destination_url():

    url = get_deploy_url()

    print url

    destination_re = re.compile(r"%s" % destination)

    for link in get_links(url):

        if destination_re.search(link["href"]):

            return "http://11.12.13.27:8080/cruisecontrol/" + link["href"]
 
# Parsing HTML pages 

class MyHTMLParser(HTMLParser):

    def __init__(self, *args, **kwd):

        HTMLParser.__init__(self, *args, **kwd)

        self.links = []
 
    def handle_starttag(self, tag, attrs):

        if tag == "a":

            attrs = dict(attrs)

            if "href" in attrs:

                self.links.append(dict(attrs))
 
    def handle_endtag(self, tag):

        pass
 
if __name__ == "__main__":

    # Read the branch name and the test destination to deploy on

    lines = [x.split(':') for x in open("branch_dest.txt")]

    print lines

    branch = "%s" % lines[0][1].strip()

    print branch

    destination = "%s" % lines[1][1].strip()

    print destination
 
    final_url = get_destination_url()

    if final_url is None:

        print "Could not find a destination to deploy"

    else:

        print final_url

I am getting the below error

Expand|Select|Wrap|Line Numbers

 
Traceback (most recent call last):

  File "C:\deploy_input.py", line 61, in <module>

    final_url = get_destination_url()

  File "C:\deploy_input.py", line 33, in get_destination_url

    for link in get_links(url):

  File "C:\deploy_input.py", line 11, in get_links

    parser.feed(urlopen(url).read())

  File "C:\Python26\lib\urllib2.py", line 126, in urlopen

    return _opener.open(url, data, timeout)

  File "C:\Python26\lib\urllib2.py", line 382, in open

    req.timeout = timeout

AttributeError: 'NoneType' object has no attribute 'timeout'

Help!

May 13 '10 #1

Subscribe Post Reply

1085

Similar topics

improve string catching of html page

by: Sheela | last post by:

Hi all gurus in tha club, I scripted a prog that extract a string from an html page excluding all the tags. The problem is that it works quite slowly and I wanted to know if somebody of us as an...

PHP

MySQL and searching TEXT fields

by: Michi | last post by:

I was wondering what the best solution is for making large numbers of TEXT (or BLOB?) fields searchable. For example, if I have a forum, what is the best way to be able to search for specific...

MySQL Database

Reading in an HTML page from CGI Scripts

by: Brent V | last post by:

Hopefully someone has had to handle this type of situation in .NET before. I have an ASP.NET (VB.NET) that has an interface to an API CGI script program. I send a credit card number, amount, etc to...

ASP.NET

where to place the javascript in html page

by: acord | last post by:

Hi, I m getting annoying display problem when placing javascript tags in a html page. Should the javasscript tags placed at the beginning of a html page before anything start? or placed between...

Javascript

Taking data from a text file to parse html page

by: DH | last post by:

Hi, I'm trying to strip the html and other useless junk from a html page.. Id like to create something like an automated text editor, where it takes the keywords from a txt file and removes them...

Python

Client found response content type of 'text/html; charset=Windows-

by: =?Utf-8?B?S2VzdGZpZWxk?= | last post by:

Hi Our company has a .Net web service that, when called via asp.net web pages across our network works 100%! The problem is that when we try and call the web service from a remote machine, one...

.NET Framework

HTML-page clicked in Google don't get me to the website

by: =?Utf-8?B?Y2F0aGFyaW51cyB2YW4gZGVyIHdlcmY=?= | last post by:

Hello, I have build a website with approximately 30 html-pages. When I search this website in Google, I see the index.html or home.html on this website, but also other html-pages on this...

.NET Framework

Ajax does not display a DIV which contains a html page with javascript

by: paulie | last post by:

Hi, I have been experiencing an issue when trying to use AJAX to reload a DIV area using a timer of 2000ms, which contains a html page with another DIV and javascript. Scenario -------------...

Javascript

write a variable to a html page

by: jpollack | last post by:

I don't know JavaScript but have been tasked to write a script that will change the value of a Boolean variable to the word "Yes" on a table row. I have been trying to achieve this based on my...

Javascript

Java Script Error in HTML Page

by: imtmub | last post by:

I have a page, Head tag Contains many Scripts and style sheet for Menu and Page. This code working fine and displaying menus and page as i wanted. Check this page for reference....

Javascript

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

Is that possible of reading the .csv file in column wise and the column have different lengths ?

by: Sonnysonu | last post by:

This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

C / C++

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware

What is ONU?

by: marktang | last post by:

ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

General

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

Windows Server

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

The easy way to turn off automatic updates for Windows 10/11

by: Hystou | last post by:

Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

Windows Server