Python csv calculate percentage by group

6 New Member

I have a table (csv file) with three columns:

Wood [m2] Polygon Area [m2]
15 A 50
10 A 50
12 B 30
10 C 30
05 D 50
10 D 50

My aim is to calculate the percentage of wood for each Polygon. I want to print this result into a new csv table:

Polygon Percentage of Wood (%)
A 0.5 (=25/50)
B
C
D

I usually use Python through ArcGIS (arcpy module) but the modules are very slow for certain things. This is why I want to try to solve the question without this module. But I cannot figure out how to do this. Any help is greatly appreciated.

Jan 30 '15 #1

Subscribe Reply

✓ answered by bvdet

I don't think you want 15+(20/50)(operator precedence). I think you want (15+20)/50.

Here's where a dictionary comes in handy:

Expand|Select|Wrap|Line Numbers

 data = """Wood [m2],Polygon,Area [m2]

15,A,50

10,A,50

12,B,30

10,C,30

05,D,50

10,D,50"""
 
dataLines = data.split("\n")

dd = {}

for line in dataLines[1:]:

    items = line.split(",")

    dd.setdefault(items[1], []).append((float(items[0]), float(items[2])))
 
keys = sorted(dd.keys())

for key in keys:

    print ("Polygon %s: \nPercentage: %0.0f%%" %

           (key, sum((item[0] for item in dd[key]))/dd[key][0][1]*100))

    print "========================"

3075

bvdet

2,851

Recognized Expert Moderator Specialist

You would start by opening the file, reading the file, breaking up the file contents to individual parts and saving in a container object such as a list or dictionary, iterate on the container and perform your calculations, print the output or save to disk. Would not you have to do those steps in ArcGIS?

Jan 30 '15 #2

larafaelivrin

New Member

no, there are arcpy tools which you can call and as I understand they simplify the steps. But the problem is that some of them take very long to run. This website shows me how to read a csv file (https://docs.python.org/2/library/csv.html) and I managed to do that but how can I group the variables? Is there a function?

Jan 30 '15 #3

bvdet

2,851

Recognized Expert Moderator Specialist

Here's an example of manipulating the data after the file is read:

Expand|Select|Wrap|Line Numbers

 data = """Wood [m2],Polygon,Area [m2]

15,A,50

10,A,50

12,B,30

10,C,30

05,D,50

10,D,50"""
 
dataLines = data.split("\n")

for line in dataLines[1:]:

    items = line.split(",")

    print ("Polygon %s: \nPercentage: %0.0f%%" %

           (items[1], float(items[0])/float(items[2])*100))

    print "========================"

And the output:

Expand|Select|Wrap|Line Numbers

 >>> Polygon A: 

Percentage: 30%

========================

Polygon A: 

Percentage: 20%

========================

Polygon B: 

Percentage: 40%

========================

Polygon C: 

Percentage: 33%

========================

Polygon D: 

Percentage: 10%

========================

Polygon D: 

Percentage: 20%

========================

>>>

Jan 30 '15 #4

larafaelivrin

New Member

ok but with this solution I get several output for Polygon A and D. I am interested in summarizing the wooden Areas for each Polygon which has the same name. For Polygon A for example this would be 15+20/50. Is the quickest way to sum up the outputs or to do this step beforehand? Thanks a lot!!

Jan 30 '15 #5

bvdet

2,851

Recognized Expert Moderator Specialist

I don't think you want 15+(20/50)(operator precedence). I think you want (15+20)/50.

Here's where a dictionary comes in handy:

Expand|Select|Wrap|Line Numbers

 data = """Wood [m2],Polygon,Area [m2]

15,A,50

10,A,50

12,B,30

10,C,30

05,D,50

10,D,50"""
 
dataLines = data.split("\n")

dd = {}

for line in dataLines[1:]:

    items = line.split(",")

    dd.setdefault(items[1], []).append((float(items[0]), float(items[2])))
 
keys = sorted(dd.keys())

for key in keys:

    print ("Polygon %s: \nPercentage: %0.0f%%" %

           (key, sum((item[0] for item in dd[key]))/dd[key][0][1]*100))

    print "========================"

Jan 30 '15 #6

larafaelivrin

New Member

I just copied your code and it works perfectly! Thank you so much!! I will try to understand what you did and maybe I can get back to you in case I do not understand something. Thanks!:)

Jan 30 '15 #7

larafaelivrin

New Member

Another question (sry...): If I import my csv file I get the fallowing structure:

['15', 'A', '50']
['10', 'A', '50']
['12', 'B', '30']
['10', 'C', '30']
['5', 'D', '50']
['10', 'D', '50']

How do you import your csv file without listing each row separately? I don´t seem to be able to figure out what I am doing wrong...

Jan 30 '15 #8

larafaelivrin

New Member

Aha, maybe I figured out how to do it:

data = open("Test.csv", "r")
print data.read()

but now I get this error:
Traceback (most recent call last):
File "/home/katharina/Desktop/Test.py", line 14, in <module>
dataLines = data.split("\n")
AttributeError: 'file' object has no attribute 'split'

and if I uncomment the dataLines line the fallowing error appears: Traceback (most recent call last):
File "/home/katharina/Desktop/Test.py", line 16, in <module>
for line in data[1:]:
TypeError: 'file' object has no attribute '__getitem__'

Any clue what I am doing wrong?

Jan 30 '15 #9

bvdet

2,851

Recognized Expert Moderator Specialist

There are several ways of doing this. You don't have to create a file object.

Expand|Select|Wrap|Line Numbers

data = open("Test.csv", "r").read()

Expand|Select|Wrap|Line Numbers

dataLines = [item.strip() for item in open("Test.csv", "r").readlines()

Feb 2 '15 #10

Similar topics

calculate percentage

by: toluj | last post by:

Hi, pls could anyone help me with the script to calculate the percentage btwn two fields in a table.

General

Calculate Percentage Error

by: tulikapuri | last post by:

Dear Friends, I am using the method to cal. percentage in report but no sucess it gives #Num! instead of a number. I am following the steps as given in help to calculate percentage value on a...

Microsoft Access / VBA

Calculate Percentage

by: ngweixiong | last post by:

Hi, I have a Ms Access query which i used to calculate how many times the leadtime is a) less than 7 days b) 7-14 days c) more than 14 days With the query results, i will like to convert...

Microsoft Access / VBA

Calculate percentage for each row

by: smileyangeluv | last post by:

Hi, Would like to get percentage for generated column. Any idea on how to do that?? Following SQL statement SELECT s.Selection_Desc, count(u.user_id) from tbl_system_selection s, tbl_user...

MySQL Database

Calculate percentage on report from TEXT fields?

by: ollyb303 | last post by:

Hello, Trying to help a friend/colleague out with a database and we've both drawn a blank. Not even sure if this is possible. The database has a table (Table1) with a several columns: ID,...

Microsoft Access / VBA

Iowa Python User's Group

by: Mike Driscoll | last post by:

Hi, I am organizing a Python User's Group for Iowa and am hoping there are some Iowans that frequent this list. if you are one and are interested in getting together with other Python-people,...

Python

calculate percentage of scheduled hours

by: NareshN | last post by:

Hi, I have weekly wise scheduled hours of each employee and no of days scheduled for each employee,now i need to calculate no of employees scheduled less than 24 hours,no of emp's scheduled b/w...

ASP.NET

How to calculate percentage in SQL DB2 query

by: Prashant Gadeka | last post by:

Hi, I am having following table structure and sample values: ---------------------------------------------------------- APP_ID DATE HOUR STATUS_CODE HITS ...

DB2 Database

MOSS 2007 / Sharepoint Count tottals & Calculate percentage

by: lnh6513 | last post by:

In short, I'm trying to create a simple dashboard using MOSS2007, but don't have a lot of the additional plugins or webparts installed\available that would make this easy. I have a massive list...

.NET Framework

Changing the language in Windows 10

by: Hystou | last post by:

Most computers default to English, but sometimes we require a different language, especially when relocating. Forgot to request a specific language before your computer shipped? No problem! You can...

Windows Server

Problem With Comparison Operator <=> in G++

by: Oralloy | last post by:

Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...

C / C++

Maximizing Business Potential: The Nexus of Website Design and Digital Marketing

by: jinu1996 | last post by:

In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

Online Marketing

The easy way to turn off automatic updates for Windows 10/11

by: Hystou | last post by:

Overview: Windows 11 and 10 have less user interface control over operating system update behaviour than previous versions of Windows. In Windows 11 and 10, there is no way to turn off the Windows...

Windows Server

Access Europe - Using VBA to create a class based on a table - Wed 1 May

by: isladogs | last post by:

The next Access Europe User Group meeting will be on Wednesday 1 May 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome a new...

Microsoft Access / VBA

Couldn’t get equations in html when convert word .docx file to html file in C#.

by: conductexam | last post by:

I have .net C# application in which I am extracting data from word file and save it in database particularly. To store word all data as it is I am converting the whole word file firstly in HTML and...

C# / C Sharp

Trying to create a lan-to-lan vpn between two differents networks

by: TSSRALBI | last post by:

Hello I'm a network technician in training and I need your help. I am currently learning how to create and manage the different types of VPNs and I have a question about LAN-to-LAN VPNs. The...

Networking - Hardware / Configuration

Windows Forms - .Net 8.0

by: adsilva | last post by:

A Windows Forms form does not have the event Unload, like VB6. What one acts like?

Visual Basic .NET

transfer the data from one system to another through ip address

by: 6302768590 | last post by:

Hai team i want code for transfer the data from one system to another through IP address by using C# our system has to for every 5mins then we have to update the data what the data is updated ...

C# / C Sharp