Import .csv to numpy instead of list to numpy

Currently when I import a data file, I create a list and then use numpy.array(lst) to convert the list into a numpy array.

However, ideally I would like to avoid the intermediate data type and import the .csv file directly into a numpy array.

I have tried the numpy.genfromtxt() method; however, that does not keep the matrix structure I am trying to preserve (it creates a single vector instead of maintaining the distinct rows and columns from the .csv).

My import method basically looks like this:

import csv

f = open('fileName.csv','rb')
rdr = csv.reader(f, delimiter=',')
lst = []
for row in rdr:
    lst.append(row)

I then tried something like:

numArray = numpy.array([])
for row in rdr:
    numpy.append(numArray,row)

However, the result is just an empty numpy array at the completion of the loop.
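A likely reason the array stays empty, assuming the loop runs as posted: numpy.append returns a new array rather than modifying numArray in place, so the return value has to be assigned back. Even then, numpy.append without an axis argument flattens its inputs, which gives a single vector. (Separately, if rdr was already consumed by the earlier loop, a second loop over it has no rows left to read.) A minimal sketch, using made-up in-memory rows in place of what csv.reader would yield:

```python
import numpy

# Hypothetical rows standing in for the string rows from csv.reader.
rows = [["1.0", "2.0", "3.0"], ["4.0", "5.0", "6.0"]]

# As posted: the return value is discarded, so numArray never changes.
numArray = numpy.array([])
for row in rows:
    numpy.append(numArray, [float(x) for x in row])
print(numArray.size)  # still empty

# Assigning the result back works, but the rows are flattened into one vector.
flat = numpy.array([])
for row in rows:
    flat = numpy.append(flat, [float(x) for x in row])
print(flat.shape)

# Converting all rows at once keeps the rows and columns,
# without needing to know the column count in advance.
stacked = numpy.array([[float(x) for x in row] for row in rows])
print(stacked.shape)
```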

I have also tried numpy.row_stack; however, it seems that the number of columns must be known a priori to use it (and I don't know how to use csv.reader() to determine the number of columns).

Any ideas or assistance you could provide would be greatly appreciated!

Jun 5 '11 #1

bvdet

2,851

Expert Mod 2GB

The following bypasses the intermediate assignment to lst and typecasts each str element to float.

import csv
import numpy

f = open("array.txt")
# list(map(...)) also works where map returns an iterator (Python 3)
numArray = numpy.array([list(map(float, item)) for item in csv.reader(f)])
f.close()
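If the file is purely numeric, NumPy can also parse it directly. The following is a minimal sketch, assuming comma-separated floats with no header row; the in-memory sample is made up and stands in for the real file. numpy.genfromtxt accepts the same delimiter argument.

```python
import io
import numpy

# Made-up sample standing in for an open CSV file.
sample = io.StringIO("1.0,2.0,3.0\n4.0,5.0,6.0\n")

# delimiter="," splits each line on commas (the default is whitespace);
# ndmin=2 keeps the result two-dimensional even for a single-row file.
numArray = numpy.loadtxt(sample, delimiter=",", ndmin=2)
print(numArray.shape)
```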

Jun 6 '11 #2

Benjamin Gross

Thanks bvdet!

If I have different types in each column (say, str and float), is there an easy way to use the map functionality in the list comprehension you provided to account for the different data types?

Thanks for your help!

Jun 6 '11 #3

bvdet

2,851

Expert Mod 2GB

Encapsulate the creation of the array in a function with the data type as an argument.

def create_array(fileObj, dataType):
    return numpy.array([list(map(dataType, item)) for item in csv.reader(fileObj)])

Jun 6 '11 #4

bvdet

2,851

Expert Mod 2GB

Oops, I misread your previous message. I am no expert on Numpy, but a Numpy array can only have one data type. Why not create a list of lists or your own container object?
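One option that stays within NumPy, offered as a sketch rather than a recommendation: a structured array still has a single dtype, but that dtype can be a record with differently typed, named fields. The field names, string width, and sample data below are assumptions for illustration. Note that the result is one-dimensional (one record per row), with columns accessed by field name.

```python
import io
import numpy

# Made-up sample: a date column and a float column.
sample = io.StringIO("2011-06-01,1.5\n2011-06-02,2.5\n2011-06-03,3.5\n")

# One record dtype: a 10-character string field and a float field.
record_type = [("date", "U10"), ("value", "f8")]
records = numpy.genfromtxt(sample, delimiter=",", dtype=record_type,
                           encoding="utf-8")

dates = records["date"]    # all values of the string column
values = records["value"]  # all values of the float column
print(records.shape)
```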

Jun 7 '11 #5

Benjamin Gross

I started off using a list of lists. However, I need to pass pairs of vectors (a date column along with a float column) into another function for processing.

In a numpy array, this consists of:

numpyArray[:,0:2]

It doesn't seem like there's as clean a way to do this with lists when I'm reading in an entire matrix of data. That's why I was thinking that a numpy array was probably the right data type for me to use.

Jun 7 '11 #6

bvdet

2,851

Expert Mod 2GB

You can create a class to contain your list of lists. If you have __getitem__ and __iter__ overloads, the information you want to pass could be accessed like this:

[item[0:2] for item in arrayObj]

It should work the same for a list of lists.
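A minimal sketch of such a container, assuming a class name (Table) and sample rows that are purely illustrative:

```python
class Table:
    """Hypothetical thin wrapper around a list of rows."""

    def __init__(self, rows):
        self._rows = [list(row) for row in rows]

    def __getitem__(self, index):
        # Delegates indexing and slicing to the underlying list of rows.
        return self._rows[index]

    def __iter__(self):
        # Lets the object be used directly in loops and comprehensions.
        return iter(self._rows)


# Made-up rows: a date, a float, and an extra column.
arrayObj = Table([["2011-06-01", 1.5, 10], ["2011-06-02", 2.5, 20]])

# First two columns of every row, as in the comprehension above.
pairs = [item[0:2] for item in arrayObj]
print(pairs)
```

The same comprehension gives the same result when arrayObj is a plain list of lists, so the class is optional.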

Jun 7 '11 #7

Benjamin Gross

I'm fairly new to Python (programming) so I don't feel all that comfortable creating classes right now. Thanks so much for your input and suggestions.

Jun 7 '11 #8
