How safe is a set of floats?

Thomas Nelson

I want to generate all the fractions between 1 and limit (with
limit>1) in an orderly fashion, without duplicates.

def all_ratios(limit):
s = set()
hi = 1.0
lo = 1.0
while True:
if hi/lo not in s:
s.add(hi/lo)
yield (hi,lo)
hi += 1
if hi/lo limit:
lo += 1
hi = lo

I use a set to keep from giving duplicates; but is this safe? In C
they always tell you not to trust floating point equality comparisons,
since they may not work as you expect. My code seems fine for the
limited amount I've tested, but I'm curious: is there a gaurantee
about sets of floats? Or a warning?

Thanks,

Tom

May 4 '07 #1

Subscribe Post Reply

1736

Alex Martelli

Thomas Nelson <th*@mail.utexas.eduwrote:

I want to generate all the fractions between 1 and limit (with
limit>1) in an orderly fashion, without duplicates.

def all_ratios(limit):
s = set()
hi = 1.0
lo = 1.0
while True:
if hi/lo not in s:
s.add(hi/lo)
yield (hi,lo)
hi += 1
if hi/lo limit:
lo += 1
hi = lo

I use a set to keep from giving duplicates; but is this safe? In C
they always tell you not to trust floating point equality comparisons,
since they may not work as you expect. My code seems fine for the
limited amount I've tested, but I'm curious: is there a gaurantee
about sets of floats? Or a warning?

sets of floats work exactly like sets of anything else and thus in
particular they DO intrinsically rely on == comparisons, i.e., exact
equality checks (just like dicts whose keys are floats, etc).

In your code, some "fractions" that actually differ from others you're
previously seen will in fact be skipped because they don't differ _by
enough_ -- i.e. they do compare == to within the limited precision of
floating-point computations. But if you do want to be yielding floats,
and never want to yield the (num, denom) tuples for two items that *as
float* compare ==, there's nothing you can do about that issue.

My main suggestion to you actually would be to compute hi/lo ONCE per
iteration rather than 3 times -- I detest repetition in principle and
here it may be costing you a few nanoseconds' speed:-)

[[If you don't truly care about whether the fractions you yield do
compare as == "as floats", you might e.g. use gmpy.mpq rather than
division to perform your checks]]
Alex

May 4 '07 #2

Paul McGuire

On May 4, 9:50 am, a...@mac.com (Alex Martelli) wrote:

Thomas Nelson <t...@mail.utexas.eduwrote:
I want to generate all the fractions between 1 and limit (with
limit>1) in an orderly fashion, without duplicates.

def all_ratios(limit):
s = set()
hi = 1.0
lo = 1.0
while True:
if hi/lo not in s:
s.add(hi/lo)
yield (hi,lo)
hi += 1
if hi/lo limit:
lo += 1
hi = lo

I use a set to keep from giving duplicates; but is this safe? In C
they always tell you not to trust floating point equality comparisons,
since they may not work as you expect. My code seems fine for the
limited amount I've tested, but I'm curious: is there a gaurantee
about sets of floats? Or a warning?

sets of floats work exactly like sets of anything else and thus in
particular they DO intrinsically rely on == comparisons, i.e., exact
equality checks (just like dicts whose keys are floats, etc).

In your code, some "fractions" that actually differ from others you're
previously seen will in fact be skipped because they don't differ _by
enough_ -- i.e. they do compare == to within the limited precision of
floating-point computations. But if you do want to be yielding floats,
and never want to yield the (num, denom) tuples for two items that *as
float* compare ==, there's nothing you can do about that issue.

My main suggestion to you actually would be to compute hi/lo ONCE per
iteration rather than 3 times -- I detest repetition in principle and
here it may be costing you a few nanoseconds' speed:-)

[[If you don't truly care about whether the fractions you yield do
compare as == "as floats", you might e.g. use gmpy.mpq rather than
division to perform your checks]]

Alex- Hide quoted text -

- Show quoted text -

Does set membership test for equality ("==") or identity ("is")? I
just did some simple class tests, and it looks like sets test for
identity. So if I were to create a Rational class in which
Rational(1,2) and Rational(2,4) both evaluate to 0.5, such that
Rational(1,2) == Rational(2,4) evaluates to True, a set of such
Rationals would still hold both instances.

-- Paul

May 4 '07 #3

Arnaud Delobelle

On May 4, 3:21 pm, Thomas Nelson <t...@mail.utexas.eduwrote:

I want to generate all the fractions between 1 and limit (with
limit>1) in an orderly fashion, without duplicates.

def all_ratios(limit):
s = set()
hi = 1.0
lo = 1.0
while True:
if hi/lo not in s:
s.add(hi/lo)
yield (hi,lo)
hi += 1
if hi/lo limit:
lo += 1
hi = lo

I use a set to keep from giving duplicates; but is this safe? In C
they always tell you not to trust floating point equality comparisons,
since they may not work as you expect. My code seems fine for the
limited amount I've tested, but I'm curious: is there a gaurantee
about sets of floats? Or a warning?

There won't be either, but you actually don't need to store the
previous fractions. All you need to verify is that the denominator
and numerator are relatively prime (i.e. their gcd is 1). That could
be implemented as:

------------------------------------
from itertools import count

def gcd(x, y):
while x:
x, y = y % x, x
return y

def all_ratios(limit):
for d in count(1):
for n in xrange(d, int(limit*d) + 1):
if gcd(d, n) == 1:
yield n, d
------------------------------------

HTH

--
Arnaud

May 4 '07 #4

Peter Otten

Paul McGuire wrote:

Does set membership test for equality ("==") or identity ("is")?

As Alex said, equality:

>>a = 0.0
b = -0.0
a is b

False

>>a == b

True

>>set([a, b])

set([0.0])

Peter

May 4 '07 #5

Arnaud Delobelle

On May 4, 5:04 pm, Paul McGuire <p...@austin.rr.comwrote:

Does set membership test for equality ("==") or identity ("is")? I
just did some simple class tests, and it looks like sets test for
identity.

Sets are like dictionaries, they test for equality:

>>a=1,2
b=1,2
a is b

False

>>a in set([b])

True

--
Arnaud

May 4 '07 #6

Paul McGuire

On May 4, 11:50 am, Arnaud Delobelle <arno...@googlemail.comwrote:

On May 4, 5:04 pm, Paul McGuire <p...@austin.rr.comwrote:

Does set membership test for equality ("==") or identity ("is")? I
just did some simple class tests, and it looks like sets test for
identity.

Sets are like dictionaries, they test for equality:

>a=1,2
b=1,2
a is b

False

>a in set([b])

True

--
Arnaud

Just to beat this into the ground, "test for equality" appears to be
implemented as "test for equality of hashes". So if you want to
implement a class for the purposes of set membership, you must
implement a suitable __hash__ method. It is not sufficient to
implement __cmp__ or __eq__, which I assumed "test for equality" would
make use of. Not having a __hash__ method in my original class caused
my initial confusion.

So would you suggest that any class implemented in a general-purpose
class library should implement __hash__, since one cannot anticipate
when a user might want to insert class instances into a set? (It
certainly is not on my current checklist of methods to add to well-
behaved classes.)
-- Paul

May 4 '07 #7

Peter Otten

Paul McGuire wrote:

Just to beat this into the ground, "test for equality" appears to be
implemented as "test for equality of hashes". So if you want to
implement a class for the purposes of set membership, you must
implement a suitable __hash__ method. It is not sufficient to
implement __cmp__ or __eq__, which I assumed "test for equality" would
make use of. Not having a __hash__ method in my original class caused
my initial confusion.

As with dictionaries, only items with the same hash are considered for
equality testing.

So would you suggest that any class implemented in a general-purpose
class library should implement __hash__, since one cannot anticipate
when a user might want to insert class instances into a set? (It
certainly is not on my current checklist of methods to add to well-
behaved classes.)

A meaningful implementation would also have to make sure that the attributes
used to calculate hash and equality don't change over time.

No, I wouldn't bother because YAGNI.

Peter

May 4 '07 #8

Klaas

On May 4, 10:15 am, Paul McGuire <p...@austin.rr.comwrote:

Just to beat this into the ground, "test for equality" appears to be
implemented as "test for equality of hashes". So if you want to
implement a class for the purposes of set membership, you must
implement a suitable __hash__ method. It is not sufficient to
implement __cmp__ or __eq__, which I assumed "test for equality" would
make use of. Not having a __hash__ method in my original class caused
my initial confusion.

overriding __hash__ (even to raise NotImplementedError) is always wise
if you have override __eq__. And of course __hash__ is necessary for
using hashtable-based structures (how else could it determine whether
objects are equal? compare against every existing element?)

Finally, two objects which return the same __hash__ but return False
for __eq__ are, of course, unequal. sets/dicts do not simply "test
for equality of hashes"

So would you suggest that any class implemented in a general-purpose
class library should implement __hash__, since one cannot anticipate
when a user might want to insert class instances into a set? (It
certainly is not on my current checklist of methods to add to well-
behaved classes.)

a class should be only inserted into a set if it is immutable, and
thus designed to such. User's might also execute 'del x.attr', so
perhaps you should start each method with a series of hasattr()
checks...

-Mike

May 8 '07 #9

Dave Borne

On 4 May 2007 07:21:49 -0700, Thomas Nelson <th*@mail.utexas.eduwrote:

I want to generate all the fractions between 1 and limit (with
limit>1) in an orderly fashion, without duplicates.

Might I suggest the Stern-Brocot tree
(http://en.wikipedia.org/wiki/Stern-Brocot_tree)
It will eliminate the need for sets as the algorithm gurantees: "Every
positive rational number can be found in this tree exactly once and in
lowest terms". The order will be different than your algorithm,
though.

#An overly simplified fraction class for this example:
class Fraction:
def __init__ (self, num, den):
self.num = num
self.den = den
def __repr__ (self):
return '%(num)d/%(den)d' % self.__dict__

def all_ratios(limit):
seq = [Fraction(1,1), Fraction(limit,1)]
while True:
newseq = seq[:1]
pairs = [seq[x:x+2] for x in range(len(seq)-1)]
for pair in pairs:
#find the mediant value between each pair in the series
newval = Fraction(pair[0].num+pair[1].num, pair[0].den+pair[1].den)
yield newval
newseq.append(newval)
newseq.append(pair[1])
seq = newseq
-Dave

May 9 '07 #10

Similar topics

IComparer for floats? Heavy Math

by: Tom | last post by:

Has anyone ever seen a IComparer for floats the returns magnitude. i.e. instead of returning -1, it would return -5. To let you know HOW different the two numbers are. obviously for int it is a -...

.NET Framework

List of strings to list of floats ?

by: Madhusudan Singh | last post by:

Is it possible to convert a very long list of strings to a list of floats in a single statement ? I have tried float(x) and float(x) but neither work. I guess I would have to write a loop if...

Python

is it safe to zero float array with memset?

by: 69dbb24b2db3daad932c457cccfd6 | last post by:

Hello, I have to initialize all elements of a very big float point array to zero. It seems memset(a, 0, len) is faster than a simple loop. I just want to know whether it is safe to do so, since I...

C / C++

container with floats appears to be too long

by: freelanceinaz | last post by:

My problem page is at http://girlschorus.org/test.html I have a container with a relatively positioned graphic at the top, then two floats which are relatively positioned (for a a two-column...

HTML / CSS

filesystemwatcher and determining when it is safe to work on the directory

by: topher | last post by:

When using filesystemwatcher to keep an eye on a directory to see if there are any files, how will I know when it is safe to work on the files in a directory? In other words, how will I know that...

C# / C Sharp

Is this safe?

by: Mark P | last post by:

Is the following safe: double min_value = -numeric_limits<double>::max(); Basically, I'm trying to get a negative number with a very large magnitude. Thanks, Mark

C / C++

clearing floats to content edge

by: yb | last post by:

Hi, Is there a CSS method to clear a float such that it aligns with the left content edge. For example: X X X X X X X X

HTML / CSS

strings (dollar.cents) into floats

by: luca bertini | last post by:

Hi, i have strings which look like money values (ie 34.45) is there a way to convert them into float variables? everytime i try I get this error: "numb = float(my_line) ValueError: empty string...

Python

Re: [Python-Dev] Why don't range and xrange threat floats as floats?

by: Matthieu Brucher | last post by:

2008/11/5 L V <somelauw@yahoo.com>: Hi, I don't think the Python developers list is th best list to post this kind of question. What version of Python did you use for this test? Matthieu

Python

Cloud Servers without Credit Card and Email Registration: A Simpler Way to Get on the Cloud

by: CloudSolutions | last post by:

Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...

General

Easy Steps to Fix "Canon Printer Won't Connect to WiFi Network"

by: taylorcarr | last post by:

A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...

General

How to turn on java script in a villaon keypad mobile phone

by: Charles Arthur | last post by:

How do i turn on java script on a villaon, callus and itel keypad mobile phone

Java

Basic Javascript concepts

by: aa123db | last post by:

Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...

Javascript

Batch import of multiple excel files into the database

by: ryjfgjl | last post by:

If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...

Data Management

Migrating Website to Cloud - Emmanuel Katto

by: emmanuelkatto | last post by:

Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel

General

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

Is that possible of reading the .csv file in column wise and the column have different lengths ?

by: Sonnysonu | last post by:

This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

C / C++