Is anybody out there using VSIPL and can confirm my findings please?
I'm experiencing a '20-fold' increase in processing time for a simple
vsip_cvmag_f function when passed a view with stride length of 2.
EG..
viewNonInterlaced = vsip_cvbind_f( block, 0, 2, N );
viewInterlaced = vsip_cvbind_f( block, 0, 1, N );
Times =
vsip_cvmag_f( viewNonInterlaced, viewMag ) = ~480 micro secs
vsip_cvmag_f( viewInterlaced, viewMag ) = ~8000 micro secs, or 8
mSecs!!
Data size = ~40000 complex numbers.
Processor, PPC7410.
All I can presume is that any stride length not equal to '1' is using
non-optimised code, probably written in 'c', in the VSIP library. At
this rate it will be quicker to de-stride the input data before
working.
Thanks
Chris (UK) 5 2021
On 9 Feb 2004 09:52:37 -0800, in comp.lang.c , ch***********@hotmail.com
(Chris) wrote: Is anybody out there using VSIPL and can confirm my findings please?
I'm experiencing a '20-fold' increase in processing time for a simple vsip_cvmag_f function when passed a view with stride length of 2.
Chris
unfortunately this is most definitely not a C question, but something
specific about the VSIPL library.
You need to find a group that specialises in vector signal processing, or
perhaps visit the VSIPL fora at www.vsipl.org Its wildly offtopic here.
--
Mark McIntyre
CLC FAQ <http://www.eskimo.com/~scs/C-faq/top.html>
CLC readme: <http://www.angelfire.com/ms3/bchambless0/welcome_to_clc.html>
----== Posted via Newsfeed.Com - Unlimited-Uncensored-Secure Usenet News==---- http://www.newsfeed.com The #1 Newsgroup Service in the World! >100,000 Newsgroups
---= 19 East/West-Coast Specialized Servers - Total Privacy via Encryption =---
Chris <ch***********@hotmail.com> scribbled the following: Is anybody out there using VSIPL and can confirm my findings please?
If there are, here's not the proper place to discuss it. VSIPL,
whatever that is, is not a feature of the C language, but instead a
non-standard extension. Please ask in another newsgroup.
(Snip VSIPL question)
--
/-- Joona Palaste (pa*****@cc.helsinki.fi) ------------- Finland --------\
\-- http://www.helsinki.fi/~palaste --------------------- rules! --------/
"We sorcerers don't like to eat our words, so to say."
- Sparrowhawk
Chris wrote: Is anybody out there using VSIPL and can confirm my findings please?
I'm experiencing a '20-fold' increase in processing time for a simple vsip_cvmag_f function when passed a view with stride length of 2.
EG.. viewNonInterlaced = vsip_cvbind_f( block, 0, 2, N ); viewInterlaced = vsip_cvbind_f( block, 0, 1, N );
Times = vsip_cvmag_f( viewNonInterlaced, viewMag ) = ~480 micro secs vsip_cvmag_f( viewInterlaced, viewMag ) = ~8000 micro secs, or 8 mSecs!!
Data size = ~40000 complex numbers. Processor, PPC7410.
All I can presume is that any stride length not equal to '1' is using non-optimised code, probably written in 'C', in the VSIP library. At this rate, it will be quicker to de-stride the input data before working.
It might be.
My guess is that you are using Randy Judd's "Reference Library"
that you downloaded from the Vector Signal Image Processing Library
(VSIPL) web site: http://www.vsipl.org/
The reference library is hardly optimized at all.
It serves only to help VSIP library developers
verify their own optimized implementations of the VSIPL API.
You have two options:
1. Fork up the bucks for an optimized VSIPL from one of the vendors or
2. write your own optimized version.
"E. Robert Tisdale" <E.**************@jpl.nasa.gov> wrote in message news:<40**************@jpl.nasa.gov>... Chris wrote:
Is anybody out there using VSIPL and can confirm my findings please?
I'm experiencing a '20-fold' increase in processing time for a simple vsip_cvmag_f function when passed a view with stride length of 2.
EG.. viewNonInterlaced = vsip_cvbind_f( block, 0, 2, N ); viewInterlaced = vsip_cvbind_f( block, 0, 1, N );
Times = vsip_cvmag_f( viewNonInterlaced, viewMag ) = ~480 micro secs vsip_cvmag_f( viewInterlaced, viewMag ) = ~8000 micro secs, or 8 mSecs!!
Data size = ~40000 complex numbers. Processor, PPC7410.
All I can presume is that any stride length not equal to '1' is using non-optimised code, probably written in 'C', in the VSIP library. At this rate, it will be quicker to de-stride the input data before working.
It might be.
My guess is that you are using Randy Judd's "Reference Library" that you downloaded from the Vector Signal Image Processing Library (VSIPL) web site:
http://www.vsipl.org/
The reference library is hardly optimized at all. It serves only to help VSIP library developers verify their own optimized implementations of the VSIPL API.
You have two options:
1. Fork up the bucks for an optimized VSIPL from one of the vendors or 2. write your own optimized version.
Actually using the MPI, but thanks.
Sorry for posting off-topic. Was getting kind of desperate and
finding little help elsewhere. This VSIPL is interesting stuff!
Chris.
Chris wrote: Sorry for posting off-topic. Was getting kind of desperate and finding little help elsewhere. This VSIPL is interesting stuff!
I'm glad that you like it.
And I hope that you can make money writing applications
that use the VSIPL API. This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics
by: Glen Low |
last post by:
I have written a new implemention of the std::valarray library that is
optimized to use Altivec (Apple's "Velocity Engine", part of the
PowerPC G4's in most Macintoshes and the announced IBM PPC...
|
by: not telling |
last post by:
Any one have a pointer to qr source?
Orthogonal triangular decomposition?
(Compact model)
|
by: Jorn Attermann |
last post by:
I perform numerical simulation which requires linear algebra programming
that is optimized for speed. However, to my surprise, there seems to be no
"standard" linear algebra library in C++ which is...
|
by: Fred |
last post by:
Has anyone a link or any information comparing c and c++ as far as
execution speed is concerned?
Signal Processing algorithms would be welcome...
Thanks
Fred
|
by: Steven T. Hatton |
last post by:
I bought Josuttis's book on the repeated recommendations of people in this
newsgroup.
http://www.josuttis.com/libbook/
One of the first things I looked up was the std::valarray<>. And what I...
|
by: Dan Elliott |
last post by:
Hello all,
I am writing a program which needs to run as quickly as possible, but holds
a lot of data in memory (around 1GB for a usual run). Is this too much
memory to even consider putting...
|
by: robix |
last post by:
Hi all. i'm developing a simple program for matrix add/multiplication but i
don't know if i'm going the right path.
I'm supposed to receive doubles in the output of my program but instead i...
|
by: ark |
last post by:
Risking to invoke flames from one Tom St Denis of Ottawa :)
Is there any guarantee that, say,
sizeof(int) == sizeof(unsigned int)
sizeof(long) > sizeof(char) ?
Thanks,
Ark
|
by: nemocccc |
last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
|
by: Sonnysonu |
last post by:
This is the data of csv file
1 2 3
1 2 3
1 2 3
1 2 3
2 3
2 3
3
the lengths should be different i have to store the data by column-wise with in the specific length.
suppose the i have to...
|
by: Hystou |
last post by:
There are some requirements for setting up RAID:
1. The motherboard and BIOS support RAID configuration.
2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
|
by: marktang |
last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
|
by: Hystou |
last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...
|
by: jinu1996 |
last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
|
by: tracyyun |
last post by:
Dear forum friends,
With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...
|
by: agi2029 |
last post by:
Let's talk about the concept of autonomous AI software engineers and no-code agents. These AIs are designed to manage the entire lifecycle of a software development project—planning, coding, testing,...
|
by: conductexam |
last post by:
I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...
| |