473,399 Members | 3,038 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,399 software developers and data experts.

running same script on same data on two different machines -->different result

I am running the same script on the same data on two different
machines (the folder is synchronised with Dropbox).
I get two different results. All the script does is count words in
different files and perform a simple set operation on the word lists.
The laptop is a Macbook Pro (2 1/2 years old) running OS X 10.5.5 with
Python 2.5.1
The desktop is an iMac (brand new) running OS X 10.5.5 also with
Python 2.5.1

I have tried running the script on an ubuntu server with Python 2.5.2
and the results corresponded with my laptop's output.
How can I find out the cause of this anomaly? What tests can I
perform?

Thank you,

Christopher Brewster
Aston University
Nov 14 '08 #1
5 2527
Christopher Brewster wrote:
I am running the same script on the same data on two different
machines (the folder is synchronised with Dropbox).
I get two different results. All the script does is count words in
different files and perform a simple set operation on the word lists.
The laptop is a Macbook Pro (2 1/2 years old) running OS X 10.5.5 with
Python 2.5.1
The desktop is an iMac (brand new) running OS X 10.5.5 also with
Python 2.5.1

I have tried running the script on an ubuntu server with Python 2.5.2
and the results corresponded with my laptop's output.
How can I find out the cause of this anomaly? What tests can I
perform?
OK, as a university denizen you are presumably a smart type. Do you
*really* think this is an adequate problem description for debugging?

You might drop lucky, but more information couldn't possibly hurt. We
*try* to be mindreaders, but it would help to know whether you are
talking about string handling or floating-point computations, for example.

If the latter then it's probably because one machine is based on PowerPC
architecture and the other is a more recent Intel-architecture Mac.

regards
Steve
--
Steve Holden +1 571 484 6266 +1 800 494 3119
Holden Web LLC http://www.holdenweb.com/

Nov 14 '08 #2
On Nov 14, 3:22*pm, Steve Holden <st...@holdenweb.comwrote:
Christopher Brewster wrote:
I am running the same script on the same data on two different
machines (the folder is synchronised with Dropbox).
I get two different results. All the script does is count words in
different files and perform a simple set operation on the word lists.
The laptop is a Macbook Pro (2 1/2 years old) running OS X 10.5.5 with
Python 2.5.1
The desktop is an iMac (brand new) running OS X 10.5.5 also with
Python 2.5.1
I have tried running the script on an ubuntu server with Python 2.5.2
and the results corresponded with my laptop's output.
How can I find out the cause of this anomaly? What tests can I
perform?

OK, as a university denizen you are presumably a smart type. Do you
*really* think this is an adequate problem description for debugging?

You might drop lucky, but more information couldn't possibly hurt. We
*try* to be mindreaders, but it would help to know whether you are
talking about string handling or floating-point computations, for example..

If the latter then it's probably because one machine is based on PowerPC
architecture and the other is a more recent Intel-architecture Mac.

regards
*Steve
--
Steve Holden * * * *+1 571 484 6266 * +1 800 494 3119
Holden Web LLC * * * * * * *http://www.holdenweb.com/
Thanks for the suggestion but they are both Intel machines.
There is no floating point just simple additions.

No matter how smart you are, if you do not do this sort of thing
often,
you do not know exactly what sort of information to provide or what
questions to ask.
So that is exactly my question - what are the right questions?
What information do I need to provide to try to solve this?

Christopher
Nov 14 '08 #3

On Nov 14, 2008, at 10:14 AM, Christopher Brewster wrote:
I am running the same script on the same data on two different
machines (the folder is synchronised with Dropbox).
I get two different results. All the script does is count words in
different files and perform a simple set operation on the word lists.
The laptop is a Macbook Pro (2 1/2 years old) running OS X 10.5.5 with
Python 2.5.1
The desktop is an iMac (brand new) running OS X 10.5.5 also with
Python 2.5.1

I have tried running the script on an ubuntu server with Python 2.5.2
and the results corresponded with my laptop's output.
How can I find out the cause of this anomaly? What tests can I
perform?
No idea what Dropbox is, but it is a potential point of failure.
Ensure it is doing its job. Programmatically ensure that the source
files are exactly the same before you start your Python program.

Then try your program on different source files. If the problem shows
up on some source files and not on others, try to figure out the
pattern that relates the files.

Or take your "problem" data file and chop it in half by deleting the
lines from the first half of the file. See if the problem still
occurs. If not, try using the latter half of the file. By using a
binary search like this, maybe you can isolate the problem data to a
very small portion making visual detection of the problem easier.

Until you get more info, this is just generic debugging and isn't
specific to Python.

Good luck
Philip
Nov 14 '08 #4
On Fri, 14 Nov 2008 07:14:20 -0800, Christopher Brewster wrote:
I am running the same script on the same data on two different machines
(the folder is synchronised with Dropbox). I get two different results.
All the script does is count words in different files and perform a
simple set operation on the word lists. The laptop is a Macbook Pro (2
1/2 years old) running OS X 10.5.5 with Python 2.5.1
The desktop is an iMac (brand new) running OS X 10.5.5 also with Python
2.5.1

I have tried running the script on an ubuntu server with Python 2.5.2
and the results corresponded with my laptop's output. How can I find out
the cause of this anomaly? What tests can I perform?
Try eliminating files and see if you can narrow the problem down to a
single file.

Make sure the files really are synchronized. Try comparing their md5
checksums.

Create a batch of test files, copy them from one machine to the other,
and then confirm that the script calculates the same result.

Lastly, make sure that both machines really are using the same script!

And if you do find the result, please let us know... I'm intrigued.

--
Steven
Nov 14 '08 #5
On Nov 15, 2:14*am, Christopher Brewster <cbrews...@gmail.comwrote:
I am running the same script on the same data on two different
machines (the folder is synchronised with Dropbox).
I get two different results. All the script does is count words in
different files and perform a simple set operation on the word lists.
1. "same data" versus "different files": are you using "different" in
the same sense as in "different machines" and "different results"? How
do you know the data is the same?

2. Either show us your script, or tell us (with a reasonable degree of
precision):
* how do you define a "word"
* what is a "word list"
* what is "a simple set operation on the word lists"
* does the script use any of: random module, current date/time,
iteration over dictionaries while updating them, etc

3. (a) Which of the two sets of results is correct? (b) What is your
basis for answering (a)?

Nov 14 '08 #6

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

6
by: Pedro Alves | last post by:
Hi I'm having serious problems with a mission critical app that runs on postgres (and has been running for the past 3 years). It's rather large, and lately things are not going well. The...
3
by: Mal Reeve | last post by:
Hello, I have a fairly simple database. NOT a FE/BE situation. Just 1 mdb residing on a server that is accessed by different machines. On 1 machine a query (and the report based on it) work...
7
by: muttu2244 | last post by:
Hi Everyone I want to run a python script in all the machines that are connected through local network and collect the information about that machine such as HDD size, RAM capacity(with number...
7
by: gmax2006 | last post by:
Hi, I use RedHat linux. How can I find where exactly the current python script is running? I use this code: #test.py import os,sys
2
by: KayC | last post by:
Hi, I run Access 2002 SP3. I have rolled out a FE database to different machines. On one particular machine the date format settings are different in Access. Does anyone know how to change this? I...
19
by: Zytan | last post by:
I want multiple instances of the same .exe to run and share the same data. I know they all can access the same file at the same time, no problem, but I'd like to have this data in RAM, which they...
7
by: Salad | last post by:
I have an app on three different machines; front end on the PC, backend on the network. These are my results on loading the application when I press OK from the Logon form (username/pw) to...
3
by: pjsrikanth | last post by:
Hi, My web application is running on 5000 different machines. For example, if i have to know who are all the clients which are using my application, what would be my appoach? Correct me if iam...
0
by: dustonheaven | last post by:
I need to calculate running balance on data containing Null, which is sorted by few columns. As example below, Clr is sorted by Null Desc, then by date, then just by ID. So far, I just come out...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
by: Hystou | last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
by: Hystou | last post by:
Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
0
agi2029
by: agi2029 | last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.