473,385 Members | 1,712 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,385 software developers and data experts.

traversing yahoogroups group messages


i want to traverse a set of messages in a Yahoogroups group from a Python
program.

to get to the messages of the group, one must log in.

this presents, i think, two problems,
1) handling the form element for the login, which has a javascript submit
routine,
2) keeping login state with cookies.

to someone who knows something about the issues here, my questions are:

1) is it possible to do this in Python?
2) if so, how do i handle the form and the javascript?
3) does Java Python have a javascript engine and do i need Java Python here?
4) if i need to use cookies, how do i know what to name and what to set into
a cookie?

for context, i include the form element below.
the submit routine hash() is javascript.



<form method=post
action="https://login.yahoo.com/config/login?ej1cd3h7oogel" autocomplete=off
name=login_form onsubmit="return
hash(this,'http://login.yahoo.com/config/login')">
<table bgcolor=#6996e0 border="0" cellpadding="2" cellspacing="0"
width="100%">
<tr><td>
<table bgcolor="#eeeeee" border="0" cellpadding="2" cellspacing="0"
width="100%">
<tr><td bgcolor="#ffffff" align="center">

<table border="0" cellspacing="6" cellpadding="6" bgcolor="ffffff"
width="100%">
<tr bgcolor="eeeeee">
<td align="center">
<font face="arial"><b>Existing Yahoo! users</b></font><br>
<font face="arial" size="-1"><nobr>&nbsp;Enter your ID and password to sign
in&nbsp; </nobr></font>
<table border="0" cellpadding="4" cellspacing="0">
<tr> <td align="right">
<input type=hidden name=".tries" value="1" >
<input type=hidden name=".src" value="ygrp" >
<input type=hidden name=".md5" value="" >
<input type=hidden name=".hash" value="" >
<input type=hidden name=".js" value="" >
<input type=hidden name=".last" value="" >
<input type=hidden name="promo" value="" >
<input type=hidden name=".intl" value="us" >
<input type=hidden name=".bypass" value="" >
<input type=hidden name=".partner" value="" >
<input type=hidden name=".u" value="a4o4r550k8vss" >
<input type=hidden name=".v" value="0" >
<input type=hidden name=".challenge" value="RoxqFKs548c9Abju6nBMrQ3J1uly" >
<input type=hidden name=".yplus" value="" >
<input type=hidden name=".emailCode" value="" >
<input type=hidden name="pkg" value="" >
<input type=hidden name="stepid" value="" >
<input type=hidden name=".ev" value="" >
<input type=hidden name="hasMsgr" value=0>
<input type=hidden name=".chkP" value="Y">
<input type=hidden name=".done"
value="http://groups.yahoo.com/group/legality-of-drivers-license/messages/35
" >
<script language=javascript>
<!--
browser_string = navigator.appVersion + " " + navigator.userAgent;
if ( browser_string.indexOf("MSIE") < 0 ) {
if (navigator.mimeTypes) {
for (i = 0 ; i < navigator.mimeTypes.length ; i++) {
if (navigator.mimeTypes[i].suffixes.indexOf("yps") > -1) {
doGotIt();
}
}
} else {
dontGotIt();
}
} else {
if (browser_string.indexOf("Windows")>=0) {
doGotIt();
document.write('<object
classid="clsid:41695A8E-6414-11D4-8FB3-00D0B7730277"
CODEBASE="javascript:dontGotIt();" ID="Ymsgr" width="1" height="1">');
document.write('</object>');
}
}
//-->
</script>
<table border="0" cellpadding="2" cellspacing="0">
<tr> <td align="right" nowrap><font face="arial" size="-1">
Yahoo! ID:
</font></td>
<td><input name="login" size="17" value=""></td>
</tr>
<tr> <td align="right" nowrap><font face="arial"
size="-1">Password:</font></td>
<td><input name="passwd" type="password" size="17" maxlength="32"></td></tr>
<tr> <td colspan="2" nowrap align="center"><font face="arial" size="-1">
<input type="checkbox" name=".persistent" value="y">Remember my ID on this
computer</font></td>
</tr><tr>
<td>&nbsp;</td>

<td><input name=".save" type="submit" value="Sign In"></td>
</tr>
</table>
</td></tr>
<tr>
<td nowrap bgcolor="eeeeee" align="center">
<font face="arial" size="-1">Mode:
Standard | <a
href="https://login.yahoo.com/config/login?.src=ygrp&.v=0&.u=a4o4r550k8vss&.
last=&promo=&.intl=us&.bypass=&.partner=&pkg=&step id=&.done=http%3a//groups.
yahoo.com/group/legality-of-drivers-license/messages/35"> Secure</a>
</font>
</td>
</tr>
</table>
</td></tr>
<tr bgcolor="eeeeee">
<td valign="top" align="center"> <font face="arial" size="-1">
<a
href="http://us.rd.yahoo.com/reg/sihflib/*http://login.yahoo.com/config/logi
n?.src=ygrp&.intl=us&.help=1&.v=0&.u=a4o4r550k8vss &.last=&.last=&promo=&.byp
ass=&.partner=&pkg=&stepid=&.done=http%3a//groups.yahoo.com/group/legality-o
f-drivers-license/messages/35">Sign-in help</a>&nbsp;&nbsp;&nbsp;<a
href="http://us.rd.yahoo.com/reg/fpflib/*http://edit.yahoo.com/config/eval_f
orgot_pw?new=1&.done=http://groups.yahoo.com/group/legality-of-drivers-licen
se/messages/35&.src=ygrp&partner=&.partner=&.intl=us&pkg=&step id=&.last=">Fo
rgot your password?</a>
</font></td></tr>
</table>
</td></tr></table>
</td></tr></table>
</form>


Jul 18 '05 #1
1 2910
lothar wrote:
i want to traverse a set of messages in a Yahoogroups group from a Python
program.

to get to the messages of the group, one must log in.

this presents, i think, two problems,
1) handling the form element for the login, which has a javascript submit
routine,
2) keeping login state with cookies.

to someone who knows something about the issues here, my questions are:

1) is it possible to do this in Python?
Yes.
2) if so, how do i handle the form and the javascript?
There are a variety of approaches, including ones which depend on which
platform you are using (e.g. Win32, Linux, other?) and which depend
on how sophisticated and flexible you want the result to be.
3) does Java Python have a javascript engine and do i need Java Python here?
Do you realize that Java has absolutely nothing to do with Javascript
except forming part of its name? And no, you don't need it here.
4) if i need to use cookies, how do i know what to name and what to set into
a cookie?
By asking the server, and watching the cookies that come back from
it. The ClientCookie module would presumably help. You could also
just turn off cookies in your browser and access the site, and see
if it still works... maybe you don't need them at all.
for context, i include the form element below.
the submit routine hash() is javascript.


There have been similar questions and many responses on this subject
in the past. I suggest using Google Groups to check the newsgroup
archives, using search words such as "web scraping", possibly paying
close attention to any threads with responses by Cameron Laird or
John J Lee. ;-)

-Peter
Jul 18 '05 #2

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

2
by: Mike | last post by:
I am sure that I am making a simple boneheaded mistake and I would appreciate your help in spotting in. I have just installed apache_2.0.53-win32-x86-no_ssl.exe php-5.0.3-Win32.zip...
1
by: w.p. | last post by:
Hello! I want change default tab traversing in my app. But i don't know how to do it :( Belowe i include simple example - i want change default tab order: radiobutton "mode11" -> radiobutton...
0
by: Savas Ates | last post by:
im sending email with these codes i call my sub with two forms call mailgonder ("info@isbuluyorum.com","isa_rizeli@yahoo.com","ok","ok") call mailgonder...
1
by: Tom Olson | last post by:
I've been having this problem for the past week or so. When I download messages from this group OE always says there are between 3 and 74 more messages to download, but I cannot download them. I...
2
by: thomson | last post by:
Hi all, can any one tell me which is fast traversing a XML file or a hash file is fast, i got few few field names and values in XML which i will use to retrieve. I can use Hash File also to do the...
4
by: plmanikandan | last post by:
Hi, I am new to link list programming.I need to traverse from the end of link list.Is there any way to find the end of link list without traversing from start(i.e traversing from first to find the...
1
by: Thomas Qi | last post by:
There is a basic sql below: SELECT CTEnr AS ID, TimeStamp, DatagramSize, Source AS LocalIP, Destination AS RemoteIP, Protocol, Messages, SourcePort AS LocalPort, DestPort AS RemotePort FROM...
30
by: asit | last post by:
We kno that data can be pushed onto the stack or popped 4m it. Can stack be traversed ??
1
by: somcool | last post by:
I am facing an error while traversing a query in MS Access Details - When I click a button, a form which has the query opens up. There are certain fields which are in the form of combo box in the...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.