Greetings Pythoners!
I hope you'll indulge an ignorant outsider. I work at a financial software
firm, and the tool I currently use for my research is R, a software
environment for statistical computing and graphics. R is designed with
matrix manipulation in mind, and it's very easy to do regression and time
series modeling, and to plot the results and test hypotheses. The kinds of
functionality we rely on the most are standard and robust versions of
regression and principal component / factor analysis, bayesian methods such
as Gibbs sampling and shrinkage, and optimization by linear, quadratic,
newtonian / nonlinear, and genetic programming; frequently used graphics
include QQ plots and histograms. In R, these procedures are all available
as functions (some of them are in auxiliary libraries that don't come with
the standard distribution, but are easily downloaded from a central
repository).
For a variety of reasons, the research group is considering adopting Python.
Naturally, I am curious about the mathematical, statistical, and graphical
functionality available in Python. Do any of you out there use Python in
financial research, or other intense mathematical/statistical computation?
Can you compare working in Python with working in a package like R or SPlus
or Matlab, etc.? Which of the procedures I mentioned above are available in
Python? I appreciate any insight you can provide. Thanks!
 TMK 
2124605430 home
9176565351 cell 4 3056
On Jun 14, 4:02 pm, "Talbot Katz" <topk...@msn.comwrote:
Greetings Pythoners!
I hope you'll indulge an ignorant outsider. I work at a financial software
firm, and the tool I currently use for my research is R, a software
environment for statistical computing and graphics. R is designed with
matrix manipulation in mind, and it's very easy to do regression and time
series modeling, and to plot the results and test hypotheses. The kinds of
functionality we rely on the most are standard and robust versions of
regression and principal component / factor analysis, bayesian methods such
as Gibbs sampling and shrinkage, and optimization by linear, quadratic,
newtonian / nonlinear, and genetic programming; frequently used graphics
include QQ plots and histograms. In R, these procedures are all available
as functions (some of them are in auxiliary libraries that don't come with
the standard distribution, but are easily downloaded from a central
repository).
For a variety of reasons, the research group is considering adopting Python.
Naturally, I am curious about the mathematical, statistical, and graphical
functionality available in Python. Do any of you out there use Python in
financial research, or other intense mathematical/statistical computation?
Can you compare working in Python with working in a package like R or SPlus
or Matlab, etc.? Which of the procedures I mentioned above are available in
Python? I appreciate any insight you can provide. Thanks!
 TMK 
2124605430 home
9176565351 cell
I'd look at following modules:
matplotlib  http://matplotlib.sourceforge.net/
numpy  http://numpy.scipy.org/
Finally, this website lists other resources: http://www.astro.cornell.edu/staff/loredo/statpy/
Mike
Talbot Katz wrote:
I hope you'll indulge an ignorant outsider. I work at a financial
software firm, and the tool I currently use for my research is R, a
software environment for statistical computing and graphics. R is
designed with matrix manipulation in mind, and it's very easy to do
regression and time series modeling, and to plot the results and test
hypotheses. The kinds of functionality we rely on the most are standard
and robust versions of regression and principal component / factor
analysis, bayesian methods such as Gibbs sampling and shrinkage, and
optimization by linear, quadratic, newtonian / nonlinear, and genetic
programming; frequently used graphics include QQ plots and histograms.
In R, these procedures are all available as functions (some of them are
in auxiliary libraries that don't come with the standard distribution,
but are easily downloaded from a central repository).
I use both R and Python for my work. I think R is probably better for
most of the stuff you are mentioning. I do any sort of heavy
liftingdatabase queries/tabulation/aggregation in Python and load the
resulting data frames into R for analysis and graphics.

Michael Hoffman
Michael Hoffman wrote:
Talbot Katz wrote:
>I hope you'll indulge an ignorant outsider. I work at a financial software firm, and the tool I currently use for my research is R, a software environment for statistical computing and graphics. R is designed with matrix manipulation in mind, and it's very easy to do regression and time series modeling, and to plot the results and test hypotheses. The kinds of functionality we rely on the most are standard and robust versions of regression and principal component / factor analysis, bayesian methods such as Gibbs sampling and shrinkage, and optimization by linear, quadratic, newtonian / nonlinear, and genetic programming; frequently used graphics include QQ plots and histograms. In R, these procedures are all available as functions (some of them are in auxiliary libraries that don't come with the standard distribution, but are easily downloaded from a central repository).
I use both R and Python for my work. I think R is probably better for
most of the stuff you are mentioning. I do any sort of heavy
liftingdatabase queries/tabulation/aggregation in Python and load the
resulting data frames into R for analysis and graphics.
I would second that. It is not either/or. Use Python, including Numpy
and matplotlib and packages from SciPy, for some things, and R for
others. And you can even embed R in Python using RPy  see http://rpy.sourceforge.net/
We use the combination of Python, Numpy (actually, the older Numeric
Python package, but soon to be converted to Numpy), RPy and R in our
NetEpi Analysis project  exploratory epidemiological analysis of large
data sets  see http://sourceforge.net/projects/netepi  and it is a
good combination  Python for the Web interface, data manipulation and
data heavylifting, and for some of the more elementary statistics, and
R for more involved statistical analysis and graphics (with teh option
of using matplotlib or other Pythonbased graphics packages for some
tasks if we wish). The main thing to remember, though, is that indexing
is zerobased in Python and 1based in R...
Tim C
On Thursday 14 June 2007 5:54 pm, Tim Churches wrote:
Michael Hoffman wrote:
Talbot Katz wrote:
I hope you'll indulge an ignorant outsider. I work at a financial
software firm, and the tool I currently use for my research is R, a
software environment for statistical computing and graphics. R is
designed with matrix manipulation in mind, and it's very easy to do
regression and time series modeling, and to plot the results and test
hypotheses. The kinds of functionality we rely on the most are standard
and robust versions of regression and principal component / factor
analysis, bayesian methods such as Gibbs sampling and shrinkage, and
optimization by linear, quadratic, newtonian / nonlinear, and genetic
programming; frequently used graphics include QQ plots and histograms.
In R, these procedures are all available as functions (some of them are
in auxiliary libraries that don't come with the standard distribution,
but are easily downloaded from a central repository).
I use both R and Python for my work. I think R is probably better for
most of the stuff you are mentioning. I do any sort of heavy
liftingdatabase queries/tabulation/aggregation in Python and load the
resulting data frames into R for analysis and graphics.
I would second that. It is not either/or. Use Python, including Numpy
and matplotlib and packages from SciPy, for some things, and R for
others. And you can even embed R in Python using RPy  see http://rpy.sourceforge.net/
We use the combination of Python, Numpy (actually, the older Numeric
Python package, but soon to be converted to Numpy), RPy and R in our
NetEpi Analysis project  exploratory epidemiological analysis of large
data sets  see http://sourceforge.net/projects/netepi  and it is a
good combination  Python for the Web interface, data manipulation and
data heavylifting, and for some of the more elementary statistics, and
R for more involved statistical analysis and graphics (with teh option
of using matplotlib or other Pythonbased graphics packages for some
tasks if we wish). The main thing to remember, though, is that indexing
is zerobased in Python and 1based in R...
Tim C
Thirded. I use R, Python, Matlab along with other languages (I hate pipeline
pilot) in my work and from what I've seen nothing can compare with R when it
comes to stats. I love R, from its brilliant CRAN system (PyPI needs serious
work to be considered in the same class as CPAN et al) to its delicious Emacs
integration.
I just wish there was a way to distribute R packages without requiring the
user to separately install R.
In a similar vein, I wish there was a reasonable Free Software equivalent to
Spotfire. The closest I've found (and they're nowhere near as good) are
Orange ( http://www.ailab.si/orange) and WEKA
( http://www.cs.waikato.ac.nz/ml/weka/). Orange is written in Python, but its
tied to QT 2.x as the 3.x series was not available on Windows under the GPL.
Josh Gilbert This thread has been closed and replies have been disabled. Please start a new discussion. Similar topics
by: dont bother 
last post by:
Hey,
I was just wondering if python has any support modules
for statistical techniques as Bayesian, Markovian
Style Models etc.
Any hints?
Thanks
Dont
__________________________________

by: administrata 
last post by:
Hi! I'm programming maths programs.
And I got some questions about mathematical signs.
1. Inputing suqare like a * a, It's too long when I do timeconsuming
things. Can it be simplified?
2....

by: techievasant 
last post by:
hello everyone,
Iam vasant from India..
I have a test+interview on C /C++ in the coming month so plz help me
by giving some resources of FAQS, interview questions, tracky
questions, multiple...

by: Matt 
last post by:
I have 2 questions:
1. strlen returns an unsigned (size_t) quantity. Why is an unsigned
value more approprate than a signed value? Why is unsighned value less
appropriate?
2. Would there...

by: Thomas Nelson 
last post by:
Sorry if this is a FAQ, but I couldn't find a good summary through
google. What kinds of statistical analysis tools exist in python? I
really just need ttests, chisquared test, and other such...

by: jeremito 
last post by:
I am extending python with C++ and need some help. I would like to
convert a string to a mathematical function and then make this a C++
function. My C++ code would then refer to this function to...

by: Qun Cao 
last post by:
Hi Everyone,
I am a beginner on cross language development. My problem at hand is to
build a python interface for a C++ application built on top of a 3D
game engine. The purpose of this python...

by: Krypto 
last post by:
Hi,
I have used Python for a couple of projects last year and I found it
extremely useful. I could write two middle size projects in 23 months
(part time). Right now I am a bit rusty and trying...

by: mc 
last post by:
I'm looking for a library which can do mathematical stuff like
solving equations. Or calculation the nulls of a function and so on.
Does anyone know one?
Thanks in advance!

by: taylorcarr 
last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...

by: emmanuelkatto 
last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud.
Please let me know.
Thanks!
Emmanuel

by: BarryA 
last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...

by: nemocccc 
last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?

by: Sonnysonu 
last post by:
This is the data of csv file
1 2 3
1 2 3
1 2 3
1 2 3
2 3
2 3
3
the lengths should be different i have to store the data by columnwise with in the specific length.
suppose the i have to...

by: marktang 
last post by:
ONU (Optical Network Unit) is one of the key components for providing highspeed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...

by: Hystou 
last post by:
Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

by: jinu1996 
last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

by: Hystou 
last post by:
Overview:
Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...
 