How smart is the Python interpreter?

ssecorp

def str_sort(string):
s = ""
for a in sorted(string):
s+=a
return s
if i instead do:

def str_sort(string):
s = ""
so = sorted(string)
for a in so:
s+=a
return s
will that be faster or the interpreter can figure out that it only has
to do sorted(string) once? or that kind of cleverness is usually
reserved for compilers and not interpreters?

Jul 31 '08 #1

Subscribe Post Reply

1090

Ulrich Eckhardt

ssecorp wrote:

def str_sort(string):
s = ""
for a in sorted(string):
s+=a
return s
if i instead do:

def str_sort(string):
s = ""
so = sorted(string)
for a in so:
s+=a
return s

will that be faster or the interpreter can figure out that it only has
to do sorted(string) once?

Actually, by replacing sorted() with a function that outputs when it is
called you could have seen that this is only called once in both cases. It
must not be called more than once in fact, consider e.g. the case that it
introduces side effects (like e.g. reading a file).

Uli

--
Sator Laser GmbH
GeschÃ¤ftsfÃ¼hrer: Thorsten FÃ¶cking, Amtsgericht Hamburg HR B62 932

Jul 31 '08 #2

Heiko Wundram

Am Donnerstag, 31. Juli 2008 13:09:57 schrieb ssecorp:

def str_sort(string):
s = ""
for a in sorted(string):
s+=a
return s
if i instead do:

def str_sort(string):
s = ""
so = sorted(string)
for a in so:
s+=a
return s
will that be faster or the interpreter can figure out that it only has
to do sorted(string) once? or that kind of cleverness is usually
reserved for compilers and not interpreters?

In a statement of the form

for <namein <iterable>:

the expression <iterablewill only be evaluated once (to retrieve an
iterator), so basically, both ways of stating it are equivalent and make
negligible difference in runtime (the second version will be slower, because
you have additional code to assign/fetch a local).

Anyway, if you care about speed, probably:

def str_sort(string):
return "".join(sorted(string))

will be the fastest way of stating this.

--
Heiko Wundram

Jul 31 '08 #3

Diez B. Roggisch

ssecorp wrote:

def str_sort(string):
s = ""
for a in sorted(string):
s+=a
return s
if i instead do:

def str_sort(string):
s = ""
so = sorted(string)
for a in so:
s+=a
return s
will that be faster or the interpreter can figure out that it only has
to do sorted(string) once? or that kind of cleverness is usually
reserved for compilers and not interpreters?

There isn't much cleverness involved here - why on earth should one execute
the sorted(string) several times?

The

for <namein <iterable_yielding_expression>

construct will evaluate the <iterable_yielding_expressionof course only
once.

Diez

Jul 31 '08 #4

Gary Herron

ssecorp wrote:

def str_sort(string):
s = ""
for a in sorted(string):
s+=a
return s
if i instead do:

def str_sort(string):
s = ""
so = sorted(string)
for a in so:
s+=a
return s
will that be faster or the interpreter can figure out that it only has
to do sorted(string) once? or that kind of cleverness is usually
reserved for compilers and not interpreters?
--
http://mail.python.org/mailman/listinfo/python-list

The 'for' statement is only executed once of course. It's the body of
the 'for' which is executed multiple times. So in both pieces of code,
the 'sorted' is only executed once, and the returned string is bound to
a name in the second but not the first.

However, you are worrying about optimizing the wrong things here. The
's+=a' line has terrible (quadratic) performance. Instead use the
string method 'join' which has linear performance.

def str_sort(string):
return "".join(sorted(string))

No for loop, no inefficient accumulation.

Gary Herron

Jul 31 '08 #5

Steven D'Aprano

On Thu, 31 Jul 2008 04:09:57 -0700, ssecorp wrote:

def str_sort(string):
s = ""
for a in sorted(string):
s+=a
return s
if i instead do:

def str_sort(string):
s = ""
so = sorted(string)
for a in so:
s+=a
return s
will that be faster

Oh dear. Premature optimization, the root of all (programming) evil.

You can test which is faster by using the timeit module. In the
interactive interpreter, define the two functions above with different
names, and a string to supply as argument. Then call:

from timeit import Timer
t1 = Timer('str_sort1(s)', 'from __main__ import str_sort1, s')
t2 = Timer('str_sort2(s)', 'from __main__ import str_sort2, s')
t1.repeat(number=1000)
t2.repeat(number=1000)

I'll be hugely surprised if there was any meaningful difference.

or the interpreter can figure out that it only has
to do sorted(string) once? or that kind of cleverness is usually
reserved for compilers and not interpreters?

Python uses a compiler. It doesn't do a lot of clever optimization, but
it does some. In this case, no, it doesn't optimize your function, so
technically the first may be a tiny bit faster. But, frankly, your
function is so painfully inefficient, doing a lot of useless work, that
you probably won't notice any real difference.

The correct way to do what you have done above is ''.join(sorted(s)).
Anything else is much slower.

>>def sort_str(s):

.... ss = ""
.... for c in sorted(s):
.... ss += c
.... return ss
....

>>s = "abcdefghijklmnopqrstuvwxyz"*100
from timeit import Timer
t1 = Timer('"".join(sorted(s))', 'from __main__ import s')
t2 = Timer('sort_str(s)', 'from __main__ import sort_str, s')
t1.repeat(number=1000)

[1.6792540550231934, 1.6882510185241699, 1.660383939743042]

>>t2.repeat(number=1000)

[2.5500221252441406, 2.4761130809783936, 2.5888760089874268]
--
Steven

Jul 31 '08 #6

Terry Reedy

ssecorp wrote:

def str_sort(string):
s = ""
for a in sorted(string):
s+=a
return s
if i instead do:

def str_sort(string):
s = ""
so = sorted(string)
for a in so:
s+=a
return s
will that be faster or the interpreter can figure out that it only has
to do sorted(string) once? or that kind of cleverness is usually
reserved for compilers and not interpreters?

The optimizations performed by a Python interpreter and where they are
performed depend on the implementation and version. CPython is
conservative about optimizations. Not only do the developers want to be
sure they are 100% correct (unlike too many optimizing compilers), but
Guido also rejects some that are too tricky and too fragile (easily
broken by new maintainers). In Python 3.0, here are two compiler
optimizations

>>from dis import dis
def f(): return 1+2

>>dis(f)

1 0 LOAD_CONST 3 (3)
3 RETURN_VALUE

# constant arithmetic (folding); done with floats also

>>def f():

a,b = 1,2
return a+b

>>dis(f)

2 0 LOAD_CONST 3 ((1, 2))
3 UNPACK_SEQUENCE 2
6 STORE_FAST 0 (a)
9 STORE_FAST 1 (b)

3 12 LOAD_FAST 0 (a)
15 LOAD_FAST 1 (b)
18 BINARY_ADD
19 RETURN_VALUE

# tuples with constant members are pre-built by the compiler and stored
in the code object. What you don't see (if it is still there) is an
optimization in the interpreter loop for BINARY_ADD that takes a
shortcut if both operands are ints.

tjr

Aug 1 '08 #7

Similar topics

[(J)Python] embedding python

by: vincent Salaun | last post by:

hi all, here's my problem : I've embedded a python interpreter in our java application (based on the NetBeans palteforrm) using the Jython API : http://www.jython.org/docs/javadoc/index.html...

Python

Python Interpreter question.

by: Anon | last post by:

-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hello all I am a beginner teaching myself python, and I am enjoying it immensely :) As a language it is great, I real treat to study, I actually...

Python

Which kid's beginners programming - Python or Forth?

by: BORT | last post by:

Please forgive me if this is TOO newbie-ish. I am toying with the idea of teaching my ten year old a little about programming. I started my search with something like "best FREE programming...

Python

118

Python vs. Lisp -- please explain

by: 63q2o4i02 | last post by:

Hi, I've been thinking about Python vs. Lisp. I've been learning Python the past few months and like it very much. A few years ago I had an AI class where we had to use Lisp, and I absolutely...

Python

Python language extension mechanism for Python 3000... Worth for PEP?

by: Petr Prikryl | last post by:

Do you think that the following could became PEP (pre PEP). Please, read it, comment it, reformulate it,... Abstract Introduction of the mechanism for language extensions via modules...

Python

113

Python does not play well with others

by: John Nagle | last post by:

The major complaint I have about Python is that the packages which connect it to other software components all seem to have serious problems. As long as you don't need to talk to anything outside...

Python

Python C extension providing... Python's own API?

by: Adam Atlas | last post by:

Does anyone know if it would be possible to create a CPython extension -- or use the ctypes module -- to access Python's own embedding API (http://docs.python.org/api/initialization.html &c.)?...

Python

Multiple python interpreters within the same process

by: Marcin Kalicinski | last post by:

How do I use multiple Python interpreters within the same process? I know there's a function Py_NewInterpreter. However, how do I use functions like Py_RunString etc. with it? They don't take any...

Python

Cloud Servers without Credit Card and Email Registration: A Simpler Way to Get on the Cloud

by: CloudSolutions | last post by:

Introduction: For many beginners and individual users, requiring a credit card and email registration may pose a barrier when starting to use cloud servers. However, some cloud server providers now...

General

Wordpress or something else?

by: Faith0G | last post by:

I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...

Content Management Systems

Access Europe: Command bars, the Access Shortcut Tool and a simple Audit Log - Wed 3 April

by: isladogs | last post by:

The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...

General

One-click Importing Excel Data into a*Database

by: ryjfgjl | last post by:

In our work, we often need to import Excel data into databases (such as MySQL, SQL Server, Oracle) for data analysis and processing. Usually, we use database tools like Navicat or the Excel import...

Microsoft Excel

Batch import of multiple excel files into the database

by: ryjfgjl | last post by:

If we have dozens or hundreds of excel to import into the database, if we use the excel import function provided by database editors such as navicat, it will be extremely tedious and time-consuming...

Data Management

Migrating Website to Cloud - Emmanuel Katto

by: emmanuelkatto | last post by:

Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel

General

Navigating the Data Structures and Algorithms (DSA)

by: BarryA | last post by:

What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

Algorithms / Advanced Math

Looking to do Android software development, any suggestions? Is flutter better?

by: nemocccc | last post by:

hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

General

How to build RAID in BIOS?

by: Hystou | last post by:

There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...

Computer Hardware