472,345 Members | 1,621 Online
Bytes | Software Development & Data Engineering Community
+ Post

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 472,345 software developers and data experts.

How increase a C/C++ Program speed ?

I Know C is the fastest progrmming language. However, by using some
bitwise operation you can get faster the your program.
For instance, we talk about swap function. For a integer swapping we
use this generally.

void swap(int* a,int* b){
int c;
c=*a;
*b=*a;
*b=c;
}

we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that:

void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
}

Yeah, this function swap the integers, too. But it is faster than the
first one. Also do not need a local variable.

Jul 23 '05 #1
15 4718
RAYYILDIZ wrote:
I Know C is the fastest progrmming language.
You know incorrectly. But never mind...
However, by using some
bitwise operation you can get faster the your program.
Whatever that means...
For instance, we talk about swap function. For a integer swapping we
use this generally.

void swap(int* a,int* b){
int c;
c=*a;
*b=*a;
*b=c;
Actually, you have a serious error there. It should be

int c = *a;
*a = *b;
*b = c;
}

we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that:

void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
That's a bad idea, generally.
}

Yeah, this function swap the integers, too. But it is faster than the
first one. Also do not need a local variable.


It's not faster because it doesn't work in a hypothetical case

int a = 42;
swap(&a, &a);

Unless it's proven that your original (with my corrections) swap is too
slow and affects your overall program too much, there is no need to do
that kind of micro-optimizations. Remember, "Premature optimization is
the root of all evil".

V
Jul 23 '05 #2
RAYYILDIZ wrote:
I Know C is the fastest progrmming language. I won't bite this flamebait, but:
we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that:
void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
}

Yeah, this function swap the integers, too. But it is faster than the
first one. On any CPU I've ever worked with, it won't be. It might work better for
extremely old architectures or some embedded systems that I've never
worked with, though.
Also do not need a local variable.

You are mistaken. Most architectures can't operate on two memory
operands at once, so the data must be loaded into registers. In this
case, the CPU registers can essentially be regarded as temporary variables.

Unless I'm missing something, the code produced will most likely consist
of 6 + 2 (for loading the pointers a and b into registers) memory reads,
3 xor instructions, and 3 memory writes, unless the compiler has
knowledge about the function parameters. (i.e. whether a can equal b or not)

The usual swapping approach with a temporary temporary variable will
easily be optimised by the compiler to load the data at a and b into
registers, and write them back into the reversed memory locations. The
temporary variable will most likely never even be put into memory. (2 +
2 reads, 2 writes) Even if it is put on the stack, for example because
you've disabled compiler optimisations, that only adds one read and
write each, that is still considerably less than the XOR method.

I'm probably off by a few instructions here or there, depending on the
CPU architecture, and how memory offsets are calculated, but I think my
point still stands.

The technique was used on very old register- and memory-starved systems,
as far as I know. It's no longer useful today.

~phil
Jul 23 '05 #3
On 2005-03-01 09:53:05 -0500, "RAYYILDIZ" <ra*******@gmail.com> said:
I Know C is the fastest progrmming language.
Says who?
However, by using some
bitwise operation you can get faster the your program.
For instance, we talk about swap function. For a integer swapping we
use this generally.

void swap(int* a,int* b){
int c;
c=*a;
*b=*a;
*b=c;
}
First, you've written that incorrectly. I hope you meant:

void swap(int*a,int*b)
{
int c = *a;
*a = *b;
*b = c;
}

Second, we already have std::swap in the Standard Library:

#include <algorithm>

....
{
int i = 25;
int j = 12;
swap(i,j);
//Now i == 12 and j == 25
}
....

std::swap has the advantage that it will work with any assignable type.

we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that:

void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
}
This is a very bad idea, for several reasons:
1) It's premature optimization.
2) It may actually be slower than the more obvious swapping algorithm
on some platforms
3) It is less readable
4) It is not always correct:

int i = 25;
int j = 25;
swap(&i,&j);
Yeah, this function swap the integers, too. But it is faster than the
first one.
Says who?
Also do not need a local variable.


Who cares?

--
Clark S. Cox, III
cl*******@gmail.com

Jul 23 '05 #4

"RAYYILDIZ" <ra*******@gmail.com> wrote in message
news:11**********************@o13g2000cwo.googlegr oups.com...
I Know C is the fastest progrmming language. However, by using some
bitwise operation you can get faster the your program.
For instance, we talk about swap function. For a integer swapping we
use this generally. void swap(int* a,int* b){
int c;
c=*a;
*b=*a;
*b=c;
} we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that: void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
} Yeah, this function swap the integers, too. But it is faster than the
first one. Also do not need a local variable.


Have you actually measured it? On my machine, there's no significant
difference in execution time. Moreover, calling std::swap is significantly
faster than either version.

Using exclusive-or for swapping is a cute trick, but rarely useful in
practice.


Jul 23 '05 #5
RAYYILDIZ wrote:
I Know C is the fastest progrmming language. However, by using some
bitwise operation you can get faster the your program.
For instance, we talk about swap function. For a integer swapping we
use this generally.

void swap(int* a,int* b){
int c;
c=*a;
*b=*a;
*b=c;
}
we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that:

void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
}


NNNNNNNNOOOOOOOOOO!!!!!!!!!!!

The XORing version, on C++ compiler I've ever used is SLOWER once you
enable even basic optimisation.

SSSSLLLOOOWWWEEERR!!

(sorry, but this comes up so often). Any modern compiler can easily
remove unused variables, or just keep them in a register. They aren't
stupid!

Lets see what the average compiler will do (I check g++ 3.3 at
optimisation -O1):

For your first swap function, any compiler on any optimisation level
will produce the code (sudo-assembler)

read *a into register 1
read *b into register 2
write register 1 into *b
write register 2 into *a

Your code will produce:

read *a into register 1
read *b into register 2
register 1 = register 1 XOR register 2
register 2 = register 2 XOR register 1
register 1 = register 1 XOR register 2
put register 1 into *a
put register 2 into *b

As you can see, your code is clearly taking longer :)

Chris
Jul 23 '05 #6
> we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that:

void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
}

Yeah, this function swap the integers, too. But it is faster than the
first one. Also do not need a local variable.


Depends on your compiler, processor architechture, how the turing complete
(?) system executing the code is configured and other factors someone might
kindly contribute. A neat trick but the possible speedup in a typical case
doesn't come from fact that this tiny fragment of code itself when compiled
to something is faster than what the other function would produce but rather
the fact that it might require one less register (assuming this turing
complete computing system has registers or some level of hierarchy as far as
accessing the variables is concerned speedwise) possibly reducing or
completely avoiding spilling (assuming that your computer and/or
microarchitechture and/or/maybe turing complete computing system and the
compiler implementation are related to the concept of spilling in any shape,
form, method, way or fashion).

Furthermore (insert previous disclaimers enmasse here for security reasons),
your compiler might implement these functions without linking time code
generation, leading to observation that the implementation uses some
fashion, form or equivalent of call/return instructions, or close
resemblance thereof concepts making the issue of avoiding spilling a moot
one from any practical point of view in terms of performance or size of the
compiled code. <- this paragraph makes some rather bold assumptions about
the state of the system you are querying the possible differences of
performance for.

However, you should not concern yourself with this level of optimization
very much as it is highly platform and compiler dependent. The fastest way
to do something is not to do it at all, if you can avoid computing
something: don't compute it. It may be faster overall to do relatively slow
operation only a few times rather than optimized operation many times. Use
std::swap and when it becomes apparent that it is too slow and actual
bottleneck in your application you might find out that no matter how fast
swap you have won't help either.. at that time you will be optimizing
something that actually matters.

I'm not saying it's not all good and beneficial, even sexy if you have the
world's fastest swap.. it's just that either of these two can be faster than
the other one..
Jul 23 '05 #7

"RAYYILDIZ" <ra*******@gmail.com> schrieb im Newsbeitrag news:11**********************@o13g2000cwo.googlegr oups.com...
I Know C is the fastest progrmming language. However, by using some
bitwise operation you can get faster the your program.
For instance, we talk about swap function. For a integer swapping we
use this generally.

void swap(int* a,int* b){
int c;
c=*a;
*b=*a;
*b=c;
}

we use a local variable and swapping varibales with each other, it may
be expensive. We do this swap function like that:

void swap(int*a ,int *b){
*a ^= *b;
*b ^= *a;
*a ^= *b;
}

Yeah, this function swap the integers, too. But it is faster than the
first one. Also do not need a local variable.


There is at least one compile which proves you wrong. On my machine, the simple solution using a temporary takes about 19 seconds for 1000 millions of swaps. The obfuscated version takes about 26 seconds for the same number of swaps.

Heinz
Jul 23 '05 #8
Hey, folks... I saw in Doom3 SDK code that Mr. Carmack uses this swap
trick....

So it's safe :).

Jul 23 '05 #9
Il 2005-03-01, RAYYILDIZ <ra*******@gmail.com> ha scritto:
I Know C is the fastest progrmming language. However, by using some
bitwise operation you can get faster the your program.
For instance, we talk about swap function. For a integer swapping we
use this generally.


Does know your i396 compiler tool the instruction XCHG ?
Jul 23 '05 #10
dumitru wrote:
Hey, folks... I saw in Doom3 SDK code that Mr. Carmack uses this swap
trick....

So it's safe :).


Did he use it in C, or in inline assembler?

One thing I think many people miss is that using this swap trick in
assembler can be a useful thing to do. If you have values in two
registers and need to swap them around (perhaps because one of the
registers is a special one which can be used for some special operation)
then doing the "XORing swap trick" lets you swap the two registers
without using a third.

There is however no reason to do this swap trick unless you are playing
some special register tricks, so in plain C++ code with even basic
optimisation it just isn't useful :)

Chris
Jul 23 '05 #11
Chris Jefferson wrote:

dumitru wrote:
Hey, folks... I saw in Doom3 SDK code that Mr. Carmack uses this swap
trick....

So it's safe :).

Did he use it in C, or in inline assembler?

One thing I think many people miss is that using this swap trick in
assembler can be a useful thing to do. If you have values in two
registers and need to swap them around (perhaps because one of the
registers is a special one which can be used for some special operation)
then doing the "XORing swap trick" lets you swap the two registers
without using a third.


Reminds me of a 'trick' we used on IBM/360 back at university.
It was faster (and required less opcode memory) to XOR a register
with itself then to set it to 0.
There is however no reason to do this swap trick unless you are playing
some special register tricks, so in plain C++ code with even basic
optimisation it just isn't useful :)


Exactly.

--
Karl Heinz Buchegger
kb******@gascad.at
Jul 23 '05 #12
Clark S. Cox III wrote:
On 2005-03-01 09:53:05 -0500, "RAYYILDIZ" <ra*******@gmail.com> said:
This is a very bad idea, for several reasons:
1) It's premature optimization.
agree
2) It may actually be slower than the more obvious swapping algorithm on
some platforms
agree
3) It is less readable
agree
4) It is not always correct:

int i = 25;
int j = 25;
swap(&i,&j);


huh? explain please.

i = 25;
j = 25; ( i = 25, j = 25 )
i ^= j; ( i = 0, j = 25 )
j ^= i; ( i = 0, j = 25 )
i ^= j; ( i = 25, j = 25 )

I'm not saying you should use it, I'm just saying that it works.
-michael
Jul 23 '05 #13
Clark S. Cox III wrote:
On 2005-03-01 09:53:05 -0500, "RAYYILDIZ" <ra*******@gmail.com> said:
This is a very bad idea, for several reasons:
1) It's premature optimization.
agree
2) It may actually be slower than the more obvious swapping algorithm on
some platforms
agree
3) It is less readable
agree
4) It is not always correct:

int i = 25;
int j = 25;
swap(&i,&j);


huh? explain please.

i = 25;
j = 25; ( i = 25, j = 25 )
i ^= j; ( i = 0, j = 25 )
j ^= i; ( i = 0, j = 25 )
i ^= j; ( i = 25, j = 25 )

I'm not saying you should use it, I'm just saying that it works.
-michael

Jul 23 '05 #14
"Michael Bishop" <lo******@yahoo.com> wrote...
Clark S. Cox III wrote:
4) It is not always correct:

int i = 25;
int j = 25;
swap(&i,&j);


huh? explain please.


It would have a problem with swap(&i, &i), not with two different
lvalues. Not sure whether Clark *meant* that when he posted his
objection.

V
Jul 23 '05 #15
Karl Heinz Buchegger schrieb:
Chris Jefferson wrote:
Did he use it in C, or in inline assembler?

One thing I think many people miss is that using this swap trick in
assembler can be a useful thing to do. If you have values in two
registers and need to swap them around (perhaps because one of the
registers is a special one which can be used for some special operation)
then doing the "XORing swap trick" lets you swap the two registers
without using a third.

Reminds me of a 'trick' we used on IBM/360 back at university.
It was faster (and required less opcode memory) to XOR a register
with itself then to set it to 0.


That's the standard way most if not all x86 compilers set registers to
0, regardless of optimisation level. It's even common to xor a register
with itself and then increment or decrement it to set it to 1 or -1/max,
which still takes less bytes of opcodes than loading an immediate
value. It's also commonly used in buffer overflow exploits to inject
the code via null-terminated strings as it doesn't contains '\0'. And
all of this is off-topic here, so a quick leap back: In general you
better trust the compiler to optimise such trivial things these days.
With today's CPUs and all the pipelining it isn't all that trivial to
write faster assembly than the one generated by a compiler. And often
it's even worse to try and give "useful" hints to the compiler like the
(otherwise really cute) XOR swap.

Cheers,
Malte
Jul 23 '05 #16

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

5
by: velthuijsen | last post by:
I have a function that before I modified it took around 13.75 seconds to complete after the modification it took .325 seconds to complete. the...
14
by: Sameer | last post by:
Hello, i wish to read a file of int and store into an array dynamically... the size of memory allocated finally, should just be sufficeient to...
18
by: HYRY | last post by:
I want to join two mono wave file to a stereo wave file by only using the default python module. Here is my program, but it is much slower than the...
3
by: Jakob Petersen | last post by:
Hi, I need to increase the speed when retrieving data from a hosted SQL Server into VBA. I'm using simple SELECT statements. How important is...
1
by: Kelie | last post by:
hello, would there be any speed increase in code execution after python code being compiled into exe file with py2exe? thanks, kelie
1
by: AliRezaGoogle | last post by:
Dear members I am working with a 2000 GH P4 Intel, and 512GB RAM. I have a long list matrix 3000 * 15,000 of type double. I have a calculation...
3
by: Starbuck01 | last post by:
I have to write a program for my AP Computer Science Class. Here is the instructions. The Police Department is asking for help in catching those...
0
by: Charles | last post by:
3000 rows is not a big quantity. You can load it into VC program memory, a linked list for example, and "asynchronously" load into Oracle. The...
10
by: Devang | last post by:
Hello, I am using php script to upload file. some times if file size is too big(1GB) it takes too much time to upload. Can someone suggest me...
0
better678
by: better678 | last post by:
Question: Discuss your understanding of the Java platform. Is the statement "Java is interpreted" correct? Answer: Java is an object-oriented...
0
by: teenabhardwaj | last post by:
How would one discover a valid source for learning news, comfort, and help for engineering designs? Covering through piles of books takes a lot of...
0
by: Naresh1 | last post by:
What is WebLogic Admin Training? WebLogic Admin Training is a specialized program designed to equip individuals with the skills and knowledge...
0
jalbright99669
by: jalbright99669 | last post by:
Am having a bit of a time with URL Rewrite. I need to incorporate http to https redirect with a reverse proxy. I have the URL Rewrite rules made...
0
by: antdb | last post by:
Ⅰ. Advantage of AntDB: hyper-convergence + streaming processing engine In the overall architecture, a new "hyper-convergence" concept was...
0
by: Matthew3360 | last post by:
Hi there. I have been struggling to find out how to use a variable as my location in my header redirect function. Here is my code. ...
0
hi
by: WisdomUfot | last post by:
It's an interesting question you've got about how Gmail hides the HTTP referrer when a link in an email is clicked. While I don't have the specific...
0
by: Matthew3360 | last post by:
Hi, I have been trying to connect to a local host using php curl. But I am finding it hard to do this. I am doing the curl get request from my web...
0
Oralloy
by: Oralloy | last post by:
Hello Folks, I am trying to hook up a CPU which I designed using SystemC to I/O pins on an FPGA. My problem (spelled failure) is with the...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.