473,385 Members | 1,922 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,385 software developers and data experts.

addendum Re: working with images (PIL ?)

I've put together some code to demonstrate what my goal is though looping
pixel by pixel it's rather slow.

import Image

def check_whitespace():
im = Image.open("\\\\server\\vol\\temp\\image.jpg")

size = im.size

i = 0
whitePixCount = 0
while i in range(size[1]):
j = 0
while j in range(size[0]):
p1 = im.getpixel((j,i))
if p1 == (255, 255, 255):
whitePixCount = whitePixCount + 1
if whitePixCount >= 492804: ## ((image dimensions 1404 x
1404) / 4) 25%
return "image no good"
j = j + 1
i = i + 1

print whitePixCount

return "image is good"

print check_whitespace()
"Poppy" <zn****************@yahoo.comwrote in message news:...
>I need to write a program to examine images (JPG) and determine how much
area is whitespace. We need to throw a returned image out if too much of it
is whitespace from the dataset we're working with. I've been examining the
Python Image Library and can not determine if it offers the needed
functionality. Does anyone have suggestions of other image libraries I
should be looking at it, or if PIL can do what I need?

Jun 27 '08 #1
7 1696
On Wed, 14 May 2008 14:35:25 -0400, Poppy wrote:
I've put together some code to demonstrate what my goal is though
looping pixel by pixel it's rather slow.

import Image

def check_whitespace():
im = Image.open("\\\\server\\vol\\temp\\image.jpg")

size = im.size

i = 0
whitePixCount = 0
while i in range(size[1]):
j = 0
while j in range(size[0]):
p1 = im.getpixel((j,i))
if p1 == (255, 255, 255):
whitePixCount = whitePixCount + 1
if whitePixCount >= 492804: ## ((image dimensions 1404
x
1404) / 4) 25%
return "image no good"
j = j + 1
i = i + 1

print whitePixCount

return "image is good"

print check_whitespace()
"Poppy" <zn****************@yahoo.comwrote in message news:...
>>I need to write a program to examine images (JPG) and determine how much
area is whitespace. We need to throw a returned image out if too much of
it is whitespace from the dataset we're working with. I've been
examining the Python Image Library and can not determine if it offers
the needed functionality. Does anyone have suggestions of other image
libraries I should be looking at it, or if PIL can do what I need?
PIL will do this, use histogram() method of Image objects.

-- Ivan
Jun 27 '08 #2
As others have said, PIL has the 'histogram' method to do most of the
work. However, as histogram works on each band separately, you have
a bit of preliminary programming first to combine them.

The ImageChops darker method is one easy-to-understand way (done twice),
but there are lots of alternatives, I am sure.
# ------------------------------------

import Image
import ImageChops

Im = Image.open("\\\\server\\vol\\temp\\image.jpg")
R,G,B = Im.split()

Result=ImageChops.darker(R,G)
Result=ImageChops.darker(Result,B)

WhiteArea=Result.histogram()[0]
TotalArea=Im.size[0] * Im.size[1]
PercentageWhite = (WhiteArea * 100.0)/TotalArea

Poppy wrote:
I've put together some code to demonstrate what my goal is though looping
pixel by pixel it's rather slow.

import Image

def check_whitespace():
im = Image.open("\\\\server\\vol\\temp\\image.jpg")

size = im.size

i = 0
whitePixCount = 0
while i in range(size[1]):
j = 0
while j in range(size[0]):
p1 = im.getpixel((j,i))
if p1 == (255, 255, 255):
whitePixCount = whitePixCount + 1
if whitePixCount >= 492804: ## ((image dimensions 1404 x
1404) / 4) 25%
return "image no good"
j = j + 1
i = i + 1

print whitePixCount

return "image is good"

print check_whitespace()
"Poppy" <zn****************@yahoo.comwrote in message news:...
>I need to write a program to examine images (JPG) and determine how much
area is whitespace. We need to throw a returned image out if too much of it
is whitespace from the dataset we're working with. I've been examining the
Python Image Library and can not determine if it offers the needed
functionality. Does anyone have suggestions of other image libraries I
should be looking at it, or if PIL can do what I need?

Jun 27 '08 #3
Oops. I meant:

WhiteArea=Result.histogram()[255]

of course, not

WhiteArea=Result.histogram()[0]

Ken Starks wrote:
As others have said, PIL has the 'histogram' method to do most of the
work. However, as histogram works on each band separately, you have
a bit of preliminary programming first to combine them.

The ImageChops darker method is one easy-to-understand way (done twice),
but there are lots of alternatives, I am sure.
# ------------------------------------

import Image
import ImageChops

Im = Image.open("\\\\server\\vol\\temp\\image.jpg")
R,G,B = Im.split()

Result=ImageChops.darker(R,G)
Result=ImageChops.darker(Result,B)
#### Mistake here:
WhiteArea=Result.histogram()[0]
TotalArea=Im.size[0] * Im.size[1]
PercentageWhite = (WhiteArea * 100.0)/TotalArea

Poppy wrote:
>I've put together some code to demonstrate what my goal is though
looping pixel by pixel it's rather slow.

import Image

def check_whitespace():
im = Image.open("\\\\server\\vol\\temp\\image.jpg")

size = im.size

i = 0
whitePixCount = 0
while i in range(size[1]):
j = 0
while j in range(size[0]):
p1 = im.getpixel((j,i))
if p1 == (255, 255, 255):
whitePixCount = whitePixCount + 1
if whitePixCount >= 492804: ## ((image dimensions
1404 x 1404) / 4) 25%
return "image no good"
j = j + 1
i = i + 1

print whitePixCount

return "image is good"

print check_whitespace()
"Poppy" <zn****************@yahoo.comwrote in message news:...
>>I need to write a program to examine images (JPG) and determine how
much area is whitespace. We need to throw a returned image out if too
much of it is whitespace from the dataset we're working with. I've
been examining the Python Image Library and can not determine if it
offers the needed functionality. Does anyone have suggestions of
other image libraries I should be looking at it, or if PIL can do
what I need?

Jun 27 '08 #4
Thanks, since posting I figured out how to interpret the histogram results,
which seems to be the consensus in responses. I wrote a check image program
and have been periodically calling it against a folder where I make a copy
of our images used for production. My method right now is to check what we
send for errors, but is not preventive.

Also I determined whitespace is not the only issue, any color that
dominates. I'm considering rewriting this code below to setup bins, so if
combined neighboring colors exceeds the threshold then reject the image. I
have examples where half the image appears black, but actually varies
throughout.

Since my image is RGB I'm looping through a 768 element list.

Zach-

import Image, os
def check_image(file):

try:
im = Image.open(file)
except:
return "Can't open file %s " % file

imData = im.histogram()
i = 0
for ea in imData:
if ea ((im.size[0] * im.size[1]) / 4): ## 25% of image size
return "bad image %s - %s element num is %s " % (file, ea,
str(i))
i = i + 1

return "good image %s, image size is %s" % (file, im.size)
def main(dir):
data = ""
try:
files = os.listdir(dir)
for ea in files:
data = data + str(check_image(os.path.join(dir,ea))) + "\n"
except:
return "Can't get files in %s" % dir
return data

print main("\\\\host\\path\\to\\image_folder\\")
"Ken Starks" <st*****@lampsacos.demon.co.ukwrote in message
news:g0*******************@news.demon.co.uk...
As others have said, PIL has the 'histogram' method to do most of the
work. However, as histogram works on each band separately, you have
a bit of preliminary programming first to combine them.

The ImageChops darker method is one easy-to-understand way (done twice),
but there are lots of alternatives, I am sure.
# ------------------------------------

import Image
import ImageChops

Im = Image.open("\\\\server\\vol\\temp\\image.jpg")
R,G,B = Im.split()

Result=ImageChops.darker(R,G)
Result=ImageChops.darker(Result,B)

WhiteArea=Result.histogram()[0]
TotalArea=Im.size[0] * Im.size[1]
PercentageWhite = (WhiteArea * 100.0)/TotalArea

Poppy wrote:
>I've put together some code to demonstrate what my goal is though looping
pixel by pixel it's rather slow.

import Image

def check_whitespace():
im = Image.open("\\\\server\\vol\\temp\\image.jpg")

size = im.size

i = 0
whitePixCount = 0
while i in range(size[1]):
j = 0
while j in range(size[0]):
p1 = im.getpixel((j,i))
if p1 == (255, 255, 255):
whitePixCount = whitePixCount + 1
if whitePixCount >= 492804: ## ((image dimensions 1404 x
1404) / 4) 25%
return "image no good"
j = j + 1
i = i + 1

print whitePixCount

return "image is good"

print check_whitespace()
"Poppy" <zn****************@yahoo.comwrote in message news:...
>>I need to write a program to examine images (JPG) and determine how much
area is whitespace. We need to throw a returned image out if too much of
it is whitespace from the dataset we're working with. I've been
examining the Python Image Library and can not determine if it offers
the needed functionality. Does anyone have suggestions of other image
libraries I should be looking at it, or if PIL can do what I need?
Jun 27 '08 #5
I would still be concerned that you are checking against the percentage
of the 768 bins returned by the histogram method. Two pixels of
widely different colour end up in the same bin, so long as just ONE
of the Red, Green, or Blue components is equal.

So, for example, colours (2, 27, 200) and (200, 27, 2) are both in
the bin for G=27. But they are very different colours.

There are actualy 256 * 256 * 256 colours, but I don't suppose
you want that many bins!

What you want is a much smaller number of bins, with pixels
of 'close' colours (whatever that means) put into the same bin.

What 'close' means for colours, is quite a difficult thing, and
the consensus is that using the three RGB coordinates is not
as good as certain other colour spaces.

You could use the ImageOps.posterize method to reduce the number of
colours in the image, but whether 'close' colours end up together,
I don't know.

You might try the PIL special interest group (SIG) 'image-sig'

http://mail.python.org/mailman/listinfo/image-sig

(If you want to know exactly how many unique colours an image actually
has, load the image into the 'GIMP' assuming you have it,
and go to :

Menubar --Filters --Colours --Colourcube analysis...

)




Poppy wrote:
Thanks, since posting I figured out how to interpret the histogram results,
which seems to be the consensus in responses. I wrote a check image program
and have been periodically calling it against a folder where I make a copy
of our images used for production. My method right now is to check what we
send for errors, but is not preventive.

Also I determined whitespace is not the only issue, any color that
dominates. I'm considering rewriting this code below to setup bins, so if
combined neighboring colors exceeds the threshold then reject the image. I
have examples where half the image appears black, but actually varies
throughout.

Since my image is RGB I'm looping through a 768 element list.

Zach-

import Image, os
def check_image(file):

try:
im = Image.open(file)
except:
return "Can't open file %s " % file

imData = im.histogram()
i = 0
for ea in imData:
if ea ((im.size[0] * im.size[1]) / 4): ## 25% of image size
return "bad image %s - %s element num is %s " % (file, ea,
str(i))
i = i + 1

return "good image %s, image size is %s" % (file, im.size)
def main(dir):
data = ""
try:
files = os.listdir(dir)
for ea in files:
data = data + str(check_image(os.path.join(dir,ea))) + "\n"
except:
return "Can't get files in %s" % dir
return data

print main("\\\\host\\path\\to\\image_folder\\")

Jun 27 '08 #6
On Mon, 19 May 2008 10:18:00 -0400, Poppy wrote:
Thanks, since posting I figured out how to interpret the histogram
results, which seems to be the consensus in responses. I wrote a check
image program and have been periodically calling it against a folder
where I make a copy of our images used for production. My method right
now is to check what we send for errors, but is not preventive.

Also I determined whitespace is not the only issue, any color that
dominates. I'm considering rewriting this code below to setup bins, so
if combined neighboring colors exceeds the threshold then reject the
image. I have examples where half the image appears black, but actually
varies throughout.

Since my image is RGB I'm looping through a 768 element list.
I suggest:
1. convert to greyscale
2. posterize
3. check the max(im.histogram())

-- Ivan
Jun 27 '08 #7
Thank you and the other responders have given me something to consider, I
understand the concept of the posterize idea and will be experimenting with
that.

I wanted to respond to this statement below which is true, however I believe
the histogram sums the values so both colors would be in bin 229. I say
that because all white is in histogram element 767, while black is in
element 0. Anyone on this list know how to interpret the histogram list?
Your point is still valid regardless of my interpretation.
So, for example, colours (2, 27, 200) and (200, 27, 2) are both in
the bin for G=27. But they are very different colours.
I will be checking out the SIG for PIL thanks for that pointer.

"Ken Starks" <st*****@lampsacos.demon.co.ukwrote in message
news:g0*******************@news.demon.co.uk...
>I would still be concerned that you are checking against the percentage
of the 768 bins returned by the histogram method. Two pixels of
widely different colour end up in the same bin, so long as just ONE
of the Red, Green, or Blue components is equal.

So, for example, colours (2, 27, 200) and (200, 27, 2) are both in
the bin for G=27. But they are very different colours.

There are actualy 256 * 256 * 256 colours, but I don't suppose
you want that many bins!

What you want is a much smaller number of bins, with pixels
of 'close' colours (whatever that means) put into the same bin.

What 'close' means for colours, is quite a difficult thing, and
the consensus is that using the three RGB coordinates is not
as good as certain other colour spaces.

You could use the ImageOps.posterize method to reduce the number of
colours in the image, but whether 'close' colours end up together,
I don't know.

You might try the PIL special interest group (SIG) 'image-sig'

http://mail.python.org/mailman/listinfo/image-sig

(If you want to know exactly how many unique colours an image actually
has, load the image into the 'GIMP' assuming you have it,
and go to :

Menubar --Filters --Colours --Colourcube analysis...

)




Poppy wrote:
>Thanks, since posting I figured out how to interpret the histogram
results, which seems to be the consensus in responses. I wrote a check
image program and have been periodically calling it against a folder
where I make a copy of our images used for production. My method right
now is to check what we send for errors, but is not preventive.

Also I determined whitespace is not the only issue, any color that
dominates. I'm considering rewriting this code below to setup bins, so if
combined neighboring colors exceeds the threshold then reject the image.
I have examples where half the image appears black, but actually varies
throughout.

Since my image is RGB I'm looping through a 768 element list.

Zach-

import Image, os
def check_image(file):

try:
im = Image.open(file)
except:
return "Can't open file %s " % file

imData = im.histogram()
i = 0
for ea in imData:
if ea ((im.size[0] * im.size[1]) / 4): ## 25% of image size
return "bad image %s - %s element num is %s " % (file, ea,
str(i))
i = i + 1

return "good image %s, image size is %s" % (file, im.size)
def main(dir):
data = ""
try:
files = os.listdir(dir)
for ea in files:
data = data + str(check_image(os.path.join(dir,ea))) + "\n"
except:
return "Can't get files in %s" % dir
return data

print main("\\\\host\\path\\to\\image_folder\\")
Jun 27 '08 #8

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

6
by: rzed | last post by:
I'm using PIL to generate some images which may be rotated at the user's option. When they are rotated, the original image is cropped in the new image (which is fine), and the corners are black...
0
by: erkidevries | last post by:
Problem compiling Tkinter program with bmp images (using py2exe) I have a Tkinter gui program that uses bmp as backgrounds. The program itself works when I run from the source. I placed the...
1
by: Poppy | last post by:
I need to write a program to examine images (JPG) and determine how much area is whitespace. We need to throw a returned image out if too much of it is whitespace from the dataset we're working...
1
by: Lawrence D'Oliveiro | last post by:
I'm trying to create PNG files to use in menus for authoring DVDs. As you may know, these menus are only allowed to have limited numbers of colours. Ideally I'd like to create a PNG file with...
0
by: Faith0G | last post by:
I am starting a new it consulting business and it's been a while since I setup a new website. Is wordpress still the best web based software for hosting a 5 page website? The webpages will be...
0
isladogs
by: isladogs | last post by:
The next Access Europe User Group meeting will be on Wednesday 3 Apr 2024 starting at 18:00 UK time (6PM UTC+1) and finishing by 19:30 (7.30PM). In this session, we are pleased to welcome former...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
by: ryjfgjl | last post by:
In our work, we often receive Excel tables with data in the same format. If we want to analyze these data, it can be difficult to analyze them because the data is spread across multiple Excel files...
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.