Bytes IT Community

Iterating over test data in unit tests

Howdy all,

Summary: I'm looking for idioms in unit tests for factoring out
repetitive iteration over test data. I explain my current practice,
and why it's unsatisfactory.

When following test-driven development, writing tests and then coding
to satisfy them, I'll start with some of the simple tests for a class.

import unittest

import bowling # Module to be tested

class Test_Frame(unittest.TestCase):

    def test_instantiate(self):
        """ Frame instance should be created """
        instance = bowling.Frame()
        self.failUnless(instance)

class Test_Game(unittest.TestCase):

    def test_instantiate(self):
        """ Game instance should be created """
        instance = bowling.Game()
        self.failUnless(instance)

As I add tests for more interesting functionality, they become more
data dependent.

class Test_Game(unittest.TestCase):

    # ...

    def test_one_throw(self):
        """ Single throw should result in expected score """
        game = bowling.Game()
        throw = 5
        game.add_throw(throw)
        self.failUnlessEqual(throw, game.get_score())

    def test_three_throws(self):
        """ Three throws should result in expected score """
        game = bowling.Game()
        throws = (5, 7, 4)
        game.add_throw(throws[0])
        game.add_throw(throws[1])
        game.add_throw(throws[2])
        self.failUnlessEqual(sum(throws), game.get_score())

This cries out, of course, for a test fixture to set up instances.

class Test_Game(unittest.TestCase):

    def setUp(self):
        """ Set up test fixtures """
        self.game = bowling.Game()

    def test_one_throw(self):
        """ Single throw should result in expected score """
        throw = 5
        score = 5
        self.game.add_throw(throw)
        self.failUnlessEqual(score, self.game.get_score())

    def test_three_throws(self):
        """ Three throws should result in expected score """
        throws = [5, 7, 4]
        score = sum(throws)
        for throw in throws:
            self.game.add_throw(throw)
        self.failUnlessEqual(score, self.game.get_score())

    def test_strike(self):
        """ Strike should add the following two throws """
        throws = [10, 7, 4, 7]
        score = 39
        for throw in throws:
            self.game.add_throw(throw)
        self.failUnlessEqual(score, self.game.get_score())

So far, this is just following what I see to be common practice for
setting up *instances* to test.

But the repetition of the test *inputs* also cries out to me to be
refactored. I see less commonality in doing this.

My initial instinct is just to put it in the fixtures.

class Test_Game(unittest.TestCase):

    def setUp(self):
        """ Set up test fixtures """
        self.game = bowling.Game()

        self.game_data = {
            'one': dict(score=5, throws=[5]),
            'three': dict(score=17, throws=[5, 7, 5]),
            'strike': dict(score=39, throws=[10, 7, 4, 7]),
        }

    def test_one_throw(self):
        """ Single throw should result in expected score """
        throws = self.game_data['one']['throws']
        score = self.game_data['one']['score']
        for throw in throws:
            self.game.add_throw(throw)
        self.failUnlessEqual(score, self.game.get_score())

    def test_three_throws(self):
        """ Three throws should result in expected score """
        throws = self.game_data['three']['throws']
        score = self.game_data['three']['score']
        for throw in throws:
            self.game.add_throw(throw)
        self.failUnlessEqual(score, self.game.get_score())

    def test_strike(self):
        """ Strike should add the following two throws """
        throws = self.game_data['strike']['throws']
        score = self.game_data['strike']['score']
        for throw in throws:
            self.game.add_throw(throw)
        self.failUnlessEqual(score, self.game.get_score())

But this now means that the test functions are almost identical,
except for choosing one data set or another. Maybe that means I need
to have a single test:

def test_score_throws(self):
    """ Game score should be calculated from throws """
    for dataset in self.game_data.values():
        score = dataset['score']
        for throw in dataset['throws']:
            self.game.add_throw(throw)
        self.failUnlessEqual(score, self.game.get_score())

Whoops, now I'm re-using a fixture instance. Maybe I need an instance
of the class for each test case.

def setUp(self):
    """ Set up test fixtures """
    self.game_data = {
        'one': dict(score=5, throws=[5]),
        'three': dict(score=17, throws=[5, 7, 5]),
        'strike': dict(score=39, throws=[10, 7, 4, 7]),
    }

    self.game_params = {}
    for key, dataset in self.game_data.items():
        params = {}
        instance = bowling.Game()
        params['instance'] = instance
        params['dataset'] = dataset
        self.game_params[key] = params

def test_score_throws(self):
    """ Game score should be calculated from throws """
    for params in self.game_params.values():
        score = params['dataset']['score']
        instance = params['instance']
        for throw in params['dataset']['throws']:
            instance.add_throw(throw)
        self.failUnlessEqual(score, instance.get_score())

Good, now the tests for different sets of throws are in a dictionary
that's easy to add to. Of course, now I need to actually know which
one is failing.

def test_score_throws(self):
    """ Game score should be calculated from throws """
    for key, params in self.game_params.items():
        score = params['dataset']['score']
        instance = params['instance']
        for throw in params['dataset']['throws']:
            instance.add_throw(throw)
        self.failUnlessEqual(
            score, instance.get_score(),
            msg="Score mismatch for set '%s'" % key)

It works. It's rather confusing though, since the actual test --
iterate over the throws and check the score -- is in the midst of the
iteration over data sets.

Also, that's just *one* type of test I might need to do. Must I then
repeat all that iteration code for other tests I want to do on the
same data?

Maybe I need to factor out the iteration into a generic iteration
function, taking the actual test as a function object. That way, the
dataset iterator doesn't need to know about the test function, and
vice versa.

def iterate_test(self, test_func, test_params=None):
    """ Iterate a test function for all the sets """
    if not test_params:
        test_params = self.game_params
    for key, params in test_params.items():
        dataset = params['dataset']
        instance = params['instance']
        test_func(key, dataset, instance)

def test_score_throws(self):
    """ Game score should be calculated from throws """
    def test_func(key, dataset, instance):
        score = dataset['score']
        for throw in dataset['throws']:
            instance.add_throw(throw)
        self.failUnlessEqual(score, instance.get_score())

    self.iterate_test(test_func)

That's somewhat clearer; the test function actually focuses on what
it's testing. Those layers of indirection are annoying, but they allow
the data sets to grow without writing more code to handle them.
Testing a rules-based system involves lots of data sets, and each data
set represents a separate test case; but the code for each of those
test cases is mindlessly repetitive. Factoring them out seems like it
needs a lot of indirection, and seems to make each test harder to
read. Different *types* of tests would need multiple iterators, more
complex test parameter dicts, or some more indirection. Those all
sound ugly, but so does repetitively coding every test function
whenever some new data needs to be tested.
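(For readers coming to this thread later: unittest eventually grew a subTest context manager, in Python 3.4, that addresses exactly this pattern. A minimal sketch, using a plain-summing stand-in for bowling.Game since the real module isn't shown here:)

```python
import unittest

class Game:
    """ Stand-in for bowling.Game: plain-sum scoring, no bonuses
    (an assumption for this sketch, not the real module). """
    def __init__(self):
        self.throws = []

    def add_throw(self, pins):
        self.throws.append(pins)

    def get_score(self):
        return sum(self.throws)

class Test_Game(unittest.TestCase):

    game_data = {
        'one': dict(score=5, throws=[5]),
        'three': dict(score=16, throws=[5, 7, 4]),
    }

    def test_score_throws(self):
        """ Game score should be calculated from throws """
        for key, dataset in self.game_data.items():
            # Each dataset is reported separately on failure, and a
            # failing dataset does not stop the remaining iterations.
            with self.subTest(key=key):
                game = Game()  # fresh fixture per dataset
                for throw in dataset['throws']:
                    game.add_throw(throw)
                self.assertEqual(dataset['score'], game.get_score())
```

The body of the test reads exactly like the single-dataset version; the dataset loop and the per-dataset failure reporting are both handled by the one `with self.subTest(...)` line.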

How should this be resolved?

--
\ "I never forget a face, but in your case I'll be glad to make |
`\ an exception." -- Groucho Marx |
_o__) |
Ben Finney
Dec 6 '05 #1


Ben Finney wrote:
Summary: I'm looking for idioms in unit tests for factoring out
repetitive iteration over test data....
How about something like:

import unittest, bowling

class Test_Game(unittest.TestCase):

    def setUp(self):
        """ Set up test fixtures """
        self.game = bowling.Game()

    def runs(self, throws):
        """Run a series of scores and return the result"""
        for throw in throws:
            self.game.add_throw(throw)
        return self.game.get_score()

    def test_one_throw(self):
        """ Single throw should result in expected score """
        self.assertEqual(5, self.runs([5]))

    def test_three_throws(self):
        """ Three throws should result in expected score """
        self.assertEqual(5 + 7 + 4, self.runs([5, 7, 4]))

    def test_strike(self):
        """ Strike should add the following two throws """
        self.assertEqual(39, self.runs([10, 7, 4, 7]))
There is no reason you cannot write support functions.
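(The support-function approach above runs end to end with a minimal stand-in for bowling.Game; the stand-in and its simplified strike rule are assumptions of this sketch, not the real module:)

```python
import unittest

class Game:
    """ Stand-in for bowling.Game (an assumption for this sketch):
    a strike (10 pins) counts the next two throws again as a bonus. """
    def __init__(self):
        self.throws = []

    def add_throw(self, pins):
        self.throws.append(pins)

    def get_score(self):
        score = 0
        for i, pins in enumerate(self.throws):
            score += pins
            if pins == 10:
                # Strike bonus: the next two throws count twice.
                score += sum(self.throws[i + 1:i + 3])
        return score

class Test_Game(unittest.TestCase):

    def setUp(self):
        """ Set up test fixtures """
        self.game = Game()

    def runs(self, throws):
        """ Run a series of throws and return the resulting score """
        for throw in throws:
            self.game.add_throw(throw)
        return self.game.get_score()

    def test_three_throws(self):
        """ Three throws should result in expected score """
        self.assertEqual(5 + 7 + 4, self.runs([5, 7, 4]))

    def test_strike(self):
        """ Strike should add the following two throws """
        self.assertEqual(39, self.runs([10, 7, 4, 7]))
```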

--
-Scott David Daniels
sc***********@acm.org
Dec 6 '05 #2

Scott David Daniels <sc***********@acm.org> wrote:
Ben Finney wrote:
Summary: I'm looking for idioms in unit tests for factoring out
repetitive iteration over test data....
How about something like:
class Test_Game(unittest.TestCase):
    [...]
    def runs(self, throws):
        """Run a series of scores and return the result"""
    [...]
    def test_one_throw(self):
        """ Single throw should result in expected score """
        self.assertEqual(5, self.runs([5]))

    def test_three_throws(self):
        """ Three throws should result in expected score """
        self.assertEqual(5 + 7 + 4, self.runs([5, 7, 4]))

    def test_strike(self):
        """ Strike should add the following two throws """
        self.assertEqual(39, self.runs([10, 7, 4, 7]))


Yes, I'm quite happy that I can factor out iteration *within* a single
data set. That leaves a whole lot of test cases identical except for
the data they use.

The question remains: how can I factor out iteration of *separate test
cases*, where the test cases are differentiated only by the data they
use? I know at least one way: I wrote about it in my (long) original
post. How else can I do it, with less ugliness?
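(One further answer, a sketch rather than anything from the thread: generate one named test method per dataset at import time, so each dataset runs, fails, and is reported as a separate test case. The plain-summing Game stand-in is an assumption:)

```python
import unittest

class Game:
    """ Stand-in for bowling.Game: plain-sum scoring (an assumption). """
    def __init__(self):
        self.throws = []

    def add_throw(self, pins):
        self.throws.append(pins)

    def get_score(self):
        return sum(self.throws)

game_data = {
    'one': dict(score=5, throws=[5]),
    'three': dict(score=16, throws=[5, 7, 4]),
}

class Test_Game(unittest.TestCase):
    """ Test methods are attached below, one per dataset """

def _make_test(dataset):
    # The closure captures one dataset; each generated method builds
    # a fresh Game instance, so no fixture is ever shared.
    def test(self):
        game = Game()
        for throw in dataset['throws']:
            game.add_throw(throw)
        self.assertEqual(dataset['score'], game.get_score())
    return test

# Attach a distinctly named test method for each dataset, so the
# test runner's output names the failing set by itself.
for _key, _dataset in game_data.items():
    setattr(Test_Game, 'test_score_%s' % _key, _make_test(_dataset))
```

Adding a dataset means adding one dictionary entry, as before, but the runner now sees `test_score_one`, `test_score_three`, and so on as independent test cases.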

--
\ "I went to a garage sale. 'How much for the garage?' 'It's not |
`\ for sale.'" -- Steven Wright |
_o__) |
Ben Finney
Dec 6 '05 #3

On Tue, 6 Dec 2005 12:19:40 +1100 (EST), Ben Finney <bi****************@benfinney.id.au> wrote:
Howdy all,

Summary: I'm looking for idioms in unit tests for factoring out
repetitive iteration over test data. I explain my current practice,
and why it's unsatisfactory.

Does this do what you want?
http://codespeak.net/py/current/doc/...ing-more-tests

Regards,
Bengt Richter
Dec 6 '05 #4

Ben Finney wrote:
Maybe I need to factor out the iteration into a generic iteration
function, taking the actual test as a function object. That way, the
dataset iterator doesn't need to know about the test function, and
vice versa.

def iterate_test(self, test_func, test_params=None):
    """ Iterate a test function for all the sets """
    if not test_params:
        test_params = self.game_params
    for key, params in test_params.items():
        dataset = params['dataset']
        instance = params['instance']
        test_func(key, dataset, instance)

def test_score_throws(self):
    """ Game score should be calculated from throws """
    def test_func(key, dataset, instance):
        score = dataset['score']
        for throw in dataset['throws']:
            instance.add_throw(throw)
        self.failUnlessEqual(score, instance.get_score())

    self.iterate_test(test_func)

That's somewhat clearer; the test function actually focuses on what
it's testing. Those layers of indirection are annoying, but they allow
the data sets to grow without writing more code to handle them.


Don't know if this helps, but I'd be more likely to write this as
something like (untested)::

def get_tests(self, test_params=None):
    """ Yield the test information for all the sets """
    if not test_params:
        test_params = self.game_params
    for key, params in test_params.items():
        dataset = params['dataset']
        instance = params['instance']
        yield key, dataset, instance

def test_score_throws(self):
    """ Game score should be calculated from throws """
    for key, dataset, instance in self.get_tests():
        score = dataset['score']
        for throw in dataset['throws']:
            instance.add_throw(throw)
        self.failUnlessEqual(score, instance.get_score())

That is, make an iterator over the various test information, and just
put your "test_func" code inside a for-loop.

STeVe
Dec 6 '05 #5

Ben Finney <bi****************@benfinney.id.au> wrote:
Summary: I'm looking for idioms in unit tests for factoring out
repetitive iteration over test data.


Thanks to those who've offered suggestions, especially those who
suggested I look at generator functions. This leads to::

import unittest

import bowling # Module to be tested

class Test_Game(unittest.TestCase):
    """ Test case for the Game class """

    def setUp(self):
        """ Set up test fixtures """
        self.game_data = {
            'none': dict(score=0, throws=[], frame=1),
            'one': dict(score=5, throws=[5], frame=1),
            'two': dict(score=9, throws=[5, 4], frame=2),
            'three': dict(score=14, throws=[5, 4, 5], frame=2),
            'strike': dict(score=26, throws=[10, 4, 5, 7], frame=3),
        }

        self.game_params = {}
        for key, dataset in self.game_data.items():
            params = {}
            instance = bowling.Game()
            params['instance'] = instance
            params['dataset'] = dataset
            self.game_params[key] = params

    def iterate_params(self, test_params=None):
        """ Yield the test parameters """
        if not test_params:
            test_params = self.game_params
        for key, params in test_params.items():
            dataset = params['dataset']
            instance = params['instance']
            yield key, dataset, instance

    def test_score_throws(self):
        """ Game score should be calculated from throws """
        for key, dataset, instance in self.iterate_params():
            score = dataset['score']
            for throw in dataset['throws']:
                instance.add_throw(throw)
            self.failUnlessEqual(score, instance.get_score())

    def test_current_frame(self):
        """ Current frame should be as expected """
        for key, dataset, instance in self.iterate_params():
            frame = dataset['frame']
            for throw in dataset['throws']:
                instance.add_throw(throw)
            self.failUnlessEqual(frame, instance.current_frame)

That's much better. Each test is now clearly about looping through the
datasets, but the infrastructure to do so is factored out. Adding a
test case modelled on the existing cases just means adding a new entry
to the game_data dictionary. Setting up a different kind of test --
e.g. for invalid game data -- just means setting up a new params
dictionary and feeding that to the same generator function.

I like it. Can it be improved? Are there readability problems that can
be fixed? Is the test fixture setup too complex? Should the iterator
become even more general, and be refactored out to a test framework
for the project?
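(On that last question, one shape the project-level refactoring could take is a mixin that holds the iterator. The mixin name and the plain-summing Game stand-in are both assumptions of this sketch:)

```python
import unittest

class Game:
    """ Stand-in for bowling.Game: plain-sum scoring (an assumption). """
    def __init__(self):
        self.throws = []

    def add_throw(self, pins):
        self.throws.append(pins)

    def get_score(self):
        return sum(self.throws)

class ParamsIterationMixin:
    """ Project-level helper: iterate any params dictionary shaped as
    {key: {'dataset': ..., 'instance': ...}}, defaulting to
    self.test_params on the host test case. """
    def iterate_params(self, test_params=None):
        if test_params is None:
            test_params = self.test_params
        for key, params in test_params.items():
            yield key, params['dataset'], params['instance']

class Test_Game(ParamsIterationMixin, unittest.TestCase):

    def setUp(self):
        """ Set up test fixtures """
        game_data = {
            'one': dict(score=5, throws=[5]),
            'three': dict(score=14, throws=[5, 4, 5]),
        }
        self.test_params = dict(
            (key, dict(dataset=dataset, instance=Game()))
            for key, dataset in game_data.items())

    def test_score_throws(self):
        """ Game score should be calculated from throws """
        for key, dataset, instance in self.iterate_params():
            for throw in dataset['throws']:
                instance.add_throw(throw)
            self.assertEqual(dataset['score'], instance.get_score())
```

Every test case class in the project can then mix in the same iterator; only the shape of the params dictionary is shared, not any knowledge of what each test does.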

--
\ "Those who can make you believe absurdities can make you commit |
`\ atrocities." -- Voltaire |
_o__) |
Ben Finney
Dec 6 '05 #6

Ben Finney wrote:
Ben Finney <bi****************@benfinney.id.au> wrote:
Summary: I'm looking for idioms in unit tests for factoring out
repetitive iteration over test data.


Thanks to those who've offered suggestions, especially those who
suggested I look at generator functions. This leads to::


Here's another way (each test should independently test one feature):

class Test_Game(unittest.TestCase):
    """ Test case for the Game class """
    score = 0
    throws = []
    frame = 1

    def setUp(self):
        """ Set up test fixtures """
        self.game = bowling.Game()

    def test_score_throws(self):
        """ Game score should be calculated from throws """
        for throw in self.throws:
            self.game.add_throw(throw)
        self.assertEqual(self.score, self.game.get_score())

    def test_current_frame(self):
        """ Current frame should be as expected """
        for throw in self.throws:
            self.game.add_throw(throw)
        self.assertEqual(self.frame, self.game.current_frame)

class Test_one(Test_Game):
    score = 5
    throws = [5]
    frame = 1

class Test_two(Test_Game):
    score = 9
    throws = [5, 4]
    frame = 2

class Test_three(Test_Game):
    score = 14
    throws = [5, 4, 5]
    frame = 2

class Test_strike(Test_Game):
    score = 26
    throws = [10, 4, 5, 7]
    frame = 3
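(One design note on this subclass-per-dataset shape: the base Test_Game is also collected and run with its class-level defaults, an empty score-0 game, which here happens to pass. If that is unwanted, a guard in setUp can skip the data-less base class. A sketch, with a plain-summing Game stand-in; both the guard convention and the stand-in are assumptions, not from the post:)

```python
import unittest

class Game:
    """ Stand-in for bowling.Game: plain-sum scoring (an assumption). """
    def __init__(self):
        self.throws = []

    def add_throw(self, pins):
        self.throws.append(pins)

    def get_score(self):
        return sum(self.throws)

class Test_Game(unittest.TestCase):
    throws = None  # subclasses supply real data

    def setUp(self):
        """ Skip the data-less base class; set up fixtures otherwise """
        if self.throws is None:
            self.skipTest("base class carries no dataset")
        self.game = Game()

    def test_score_throws(self):
        """ Game score should be calculated from throws """
        for throw in self.throws:
            self.game.add_throw(throw)
        self.assertEqual(self.score, self.game.get_score())

class Test_three(Test_Game):
    score = 14
    throws = [5, 4, 5]
```

The base class's tests then show up as skipped rather than as spuriously passing, which keeps the reported test count honest.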

--Scott David Daniels
sc***********@acm.org
Dec 6 '05 #7
