Screen Scraping for Modern Applications?

Hello,

I'm very new to Python and still familiarizing myself with the
language, so sorry if this post seems moronic or simple.

For a side project I'm working on I need to be able to scrape a modern
computer desktop. Is there any basic material already available to do
this? I'd rather not need to write my own to interact with hardware.

Thanks!

Jun 12 '06 #1
I just learned about a Python library called BeautifulSoup in a thread
responding to the Southwest Airlines script that I wrote and posted on
the Python Cookbook web site. I wrote my script using the standard
library's HTMLParser class; BeautifulSoup could perhaps have saved me
some time. Take a look. Feel free to browse my code for an example of
how to use HTMLParser for a moderately involved project.
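
For a rough idea of what HTMLParser-based scraping looks like, here is a minimal sketch. It is not taken from the boarding pass script; the names LinkCollector and extract_links are made up for illustration, and it assumes Python 3, where the class lives in html.parser rather than the old HTMLParser module.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect the href value of every <a> tag fed to the parser."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names in lower case.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def extract_links(html):
    """Return the list of link targets found in an HTML string (hypothetical helper)."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return parser.links
```

Calling extract_links on a page's HTML text returns the link targets in document order. The same subclass-and-override pattern extends to other tags and to text content via handle_data.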

BeautifulSoup link:
http://www.crummy.com/software/BeautifulSoup/

My Southwest Boarding pass script link:
http://aspn.activestate.com/ASPN/Coo.../Recipe/496790

Enjoy.
- Ken

Jun 12 '06 #2
In article <11*********************@h76g2000cwa.googlegroups.com>,
<lj*****@gmail.com> wrote:
Jun 12 '06 #3

Cameron Laird wrote:
In article <11*********************@h76g2000cwa.googlegroups.com>,
<lj*****@gmail.com> wrote:
.
.
.
For a side project I'm working on I need to be able to scrape a modern
computer desktop. Is there any basic material already available to do
this? I'd rather not need to write my own to interact with hardware.

.
.
.
Oh, no.

Very hard. Very, very hard.

At least as you've expressed it here, this is an exceptionally
difficult task. What does "scrape" mean to you? Do you want
to reproduce the display remotely, or extract text, or some other
possibility? Does "modern computer" mean Windows to you?


Scrape means simply scraping pixel colors from locations on the screen.
I'll worry about assembling it into meaningful information.

Previously, I used Java, which has a very handy built-in class called
Robot that can, among other things, report the color of on-screen
pixels. Thus, I assumed the task could be accomplished reasonably
easily.
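
For comparison with Robot.getPixelColor, here is a rough Python sketch that reads one screen pixel on Windows using only the standard library's ctypes module to call the Win32 GetDC, GetPixel and ReleaseDC functions. The helper names colorref_to_rgb and get_screen_pixel are hypothetical, and the screen-reading part is assumed to run on Windows only.

```python
import ctypes


def colorref_to_rgb(colorref):
    """Split a Win32 COLORREF value (0x00BBGGRR) into an (R, G, B) tuple."""
    return (colorref & 0xFF, (colorref >> 8) & 0xFF, (colorref >> 16) & 0xFF)


def get_screen_pixel(x, y):
    """Return the (R, G, B) color of the screen pixel at (x, y). Windows only."""
    # Imported here so this module can still be loaded on other platforms.
    from ctypes import wintypes

    user32 = ctypes.windll.user32
    gdi32 = ctypes.windll.gdi32

    # Declare signatures so handles are not truncated on 64-bit Windows.
    user32.GetDC.argtypes = [wintypes.HWND]
    user32.GetDC.restype = wintypes.HDC
    user32.ReleaseDC.argtypes = [wintypes.HWND, wintypes.HDC]
    gdi32.GetPixel.argtypes = [wintypes.HDC, ctypes.c_int, ctypes.c_int]
    gdi32.GetPixel.restype = wintypes.COLORREF

    hdc = user32.GetDC(None)  # device context covering the whole screen
    try:
        colorref = gdi32.GetPixel(hdc, x, y)
    finally:
        user32.ReleaseDC(None, hdc)

    if colorref == 0xFFFFFFFF:  # CLR_INVALID: the point is outside the screen
        raise ValueError("no pixel at (%d, %d)" % (x, y))
    return colorref_to_rgb(colorref)
```

On a Windows machine, get_screen_pixel(10, 10) would give the color at that screen location. Calling GetPixel once per point is slow for many pixels, so grabbing a whole image and indexing into it is usually the better approach when sampling more than a handful of locations.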

Jun 12 '06 #4
<lj*****@gmail.com> wrote in message
news:11**********************@c74g2000cwc.googlegroups.com...


Scrape means simply scraping pixel colors from locations on the screen.
I'll worry about assembling it into meaningful information.


I've used pywinauto to interact with a Flash animation panel, running within
an Internet Explorer browser. By "interact" I mean moving and clicking the
mouse, and capturing images, which can then be processed to access
individual pixels.

-- Paul
Jun 12 '06 #5
lj*****@gmail.com wrote:
Scrape means simply scraping pixel colors from locations on the screen.
I'll worry about assembling it into meaningful information.


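from PIL import ImageGrab  # Pillow; the 2006-era PIL spelled this "import ImageGrab"
x, y = 0, 0                # screen coordinates of the pixel to read
im = ImageGrab.grab()      # snapshot of the whole screen as a PIL image
v = im.getpixel((x, y))    # color at (x, y), typically an (R, G, B) tuple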

requires:

http://www.pythonware.com/products/pil/
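
A slightly fuller sketch of the same idea, for sampling several screen locations from one grab. It assumes a current Pillow install (the maintained fork of PIL); sample_pixels and grab_and_sample are names made up for illustration and are not part of the library.

```python
def sample_pixels(image, points):
    """Return a dict mapping each (x, y) point to image.getpixel((x, y)).

    `image` can be any object with a PIL-style getpixel method.
    """
    return {point: image.getpixel(point) for point in points}


def grab_and_sample(points):
    """Grab the screen once, then read the color at each requested point."""
    # Imported lazily so sample_pixels stays usable without Pillow installed.
    from PIL import ImageGrab

    screenshot = ImageGrab.grab()  # one full-screen capture
    return sample_pixels(screenshot, points)
```

For example, grab_and_sample([(0, 0), (100, 200)]) would return the colors at those two screen positions. Grabbing once and reading many pixels from the resulting image avoids repeating the comparatively expensive screen capture for every point.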

</F>

Jun 12 '06 #6
> > Scrape means simply scraping pixel colors from locations on the screen.
I'll worry about assembling it into meaningful information.


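from PIL import ImageGrab  # Pillow; the 2006-era PIL spelled this "import ImageGrab"
x, y = 0, 0                # screen coordinates of the pixel to read
im = ImageGrab.grab()      # snapshot of the whole screen as a PIL image
v = im.getpixel((x, y))    # color at (x, y), typically an (R, G, B) tuple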

requires:

http://www.pythonware.com/products/pil/

</F>


##
# (New in 1.1.3) The <b>ImageGrab</b> module can be used to copy
# the contents of the screen to a PIL image memory.
# <p>
# The current version works on Windows only.</p>
#

How about doing the same on Linux?
Jun 12 '06 #7
In article <11**********************@c74g2000cwc.googlegroups.com>,
<lj*****@gmail.com> wrote:
Jun 12 '06 #8
