473,395 Members | 1,341 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,395 software developers and data experts.

A BILLION pictures...how do they do it?

I am writing my own family photo sharing site that I hope to take
public (like so many others). Anyway, currently, when the user
uploads a picture, I store the picture outside my htdocs folder and
record the image details in a MySQB db. When you browse the picture,
I read the record and build the image by sending an image/jpeg header.

Seems to work but I am a little disappointed with performance.
Granted I am running on a really old machine which might be the
reason. lol

Seriously though, if I take this public and get extremely lucky and
millions of photos are uploaded, would this be the best method?

I've read pros and cons of storing images in a database. I've read
about Flickr, SmugMug, Photobucket having HUNDREDS of millions to over
a BILLION images stored!

Obviously, load balancing plays into this but what other secrets do
you think they use?

One thing I worry about is my file system. I have something like:

pix
-----user1
-------------thumbs
-----user2
-------------thumbs

etc...

Any pointers would be appreciated.
Thanks

cbmeeks

Jun 12 '07 #1
8 1547
cbmeeks wrote:
I am writing my own family photo sharing site that I hope to take
public (like so many others). Anyway, currently, when the user
uploads a picture, I store the picture outside my htdocs folder and
record the image details in a MySQB db. When you browse the picture,
I read the record and build the image by sending an image/jpeg header.

Seems to work but I am a little disappointed with performance.
Granted I am running on a really old machine which might be the
reason. lol

Seriously though, if I take this public and get extremely lucky and
millions of photos are uploaded, would this be the best method?

I've read pros and cons of storing images in a database. I've read
about Flickr, SmugMug, Photobucket having HUNDREDS of millions to over
a BILLION images stored!

Obviously, load balancing plays into this but what other secrets do
you think they use?

One thing I worry about is my file system. I have something like:

pix
-----user1
-------------thumbs
-----user2
-------------thumbs

etc...

Any pointers would be appreciated.
Thanks

cbmeeks
First of all, you should be asking this in a database newsgroup, not a
PHP one. And preferably a newsgroup aimed at the database you're using.

I store pictures in databases. It works quite well. Takes some tuning,
but I find it provides good performance.

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
js*******@attglobal.net
==================
Jun 12 '07 #2
First of all, you should be asking this in a database newsgroup, not a
PHP one. And preferably a newsgroup aimed at the database you're using.
Well, that's assuming I would only use MySQL and not PHP to serve my
files. :-)
I store pictures in databases. It works quite well. Takes some tuning,
but I find it provides good performance.
Yeah, I'm not surprised you replied. I have been reading some of your
posts about images in db's. You really have me thinking about images
in db's. I have to admit, I am walking on top of the fence and could
jump to either side when it comes to file system/db for storing
images. I agree with your postings about actually doing it instead of
quoting theories.

Scalability is very important but it's not the only thing.
Portability is also important. I am thinking of using Amazon's S3
(which I believe is a flat file system). But the bad thing about
using Amazon is that I put all of my eggs in one basket. They just
recently had a price change that made a lot of people happy but not
all...point is, they did that because they can.

I would love to be the fly on the wall at Amazon, eBay, Google, etc
and see how they store images. I know Google has their BigTable.

I guess I should follow by example. SmugMug uses their own internal
system that is helped along with S3. But I have no idea of how much
they serve from S3 or if they just use S3 as a backup.

Oh well, sorry for the rambling.

cbmeeks
http://www.eblarg.com

Jun 12 '07 #3
On Jun 12, 5:09 pm, Jerry Stuckle <jstuck...@attglobal.netwrote:
cbmeeks wrote:
I am writing my own family photo sharing site that I hope to take
public (like so many others). Anyway, currently, when the user
uploads a picture, I store the picture outside my htdocs folder and
record the image details in a MySQB db. When you browse the picture,
I read the record and build the image by sending an image/jpeg header.
Seems to work but I am a little disappointed with performance.
Granted I am running on a really old machine which might be the
reason. lol
Seriously though, if I take this public and get extremely lucky and
millions of photos are uploaded, would this be the best method?
I've read pros and cons of storing images in a database. I've read
about Flickr, SmugMug, Photobucket having HUNDREDS of millions to over
a BILLION images stored!
Obviously, load balancing plays into this but what other secrets do
you think they use?
One thing I worry about is my file system. I have something like:
pix
-----user1
-------------thumbs
-----user2
-------------thumbs
etc...
Any pointers would be appreciated.
Thanks
cbmeeks

First of all, you should be asking this in a database newsgroup, not a
PHP one. And preferably a newsgroup aimed at the database you're using.

I store pictures in databases. It works quite well. Takes some tuning,
but I find it provides good performance.

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
jstuck...@attglobal.net
==================
You should read the Database DOCS. In case of MySQL, if you index your
table and use the right mysql database type, then you will get more
perfomance with storing images in the database.
Also if you run a very huge site, your database server's will run on
SCSI machine's which means that you have often faster Database
Harddrive's then your webserver.

Jun 12 '07 #4
cbmeeks wrote:
>First of all, you should be asking this in a database newsgroup, not a
PHP one. And preferably a newsgroup aimed at the database you're using.

Well, that's assuming I would only use MySQL and not PHP to serve my
files. :-)
>I store pictures in databases. It works quite well. Takes some tuning,
but I find it provides good performance.

Yeah, I'm not surprised you replied. I have been reading some of your
posts about images in db's. You really have me thinking about images
in db's. I have to admit, I am walking on top of the fence and could
jump to either side when it comes to file system/db for storing
images. I agree with your postings about actually doing it instead of
quoting theories.

Scalability is very important but it's not the only thing.
Portability is also important. I am thinking of using Amazon's S3
(which I believe is a flat file system). But the bad thing about
using Amazon is that I put all of my eggs in one basket. They just
recently had a price change that made a lot of people happy but not
all...point is, they did that because they can.

I would love to be the fly on the wall at Amazon, eBay, Google, etc
and see how they store images. I know Google has their BigTable.

I guess I should follow by example. SmugMug uses their own internal
system that is helped along with S3. But I have no idea of how much
they serve from S3 or if they just use S3 as a backup.

Oh well, sorry for the rambling.

cbmeeks
http://www.eblarg.com
Either way you're going to have to use PHP (or PERL or some language) to
serve the images up. But the database design and configuration is the
more important thing here. That's why I suggested a database newsgroup.
It's a better place to discuss these things.
--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
js*******@attglobal.net
==================
Jun 12 '07 #5
Max/Jerry:

Oh believe me, I would like to use the DB and I will certainly try it
and run some performance testing.

serve the images up. But the database design and configuration is the
more important thing here. That's why I suggested a database newsgroup.
It's a better place to discuss these things.
Agreed. I just don't like to cross post and I knew that PHP and MySQL
would be involved. That's why I started here first.

Jun 12 '07 #6
NC
On Jun 12, 5:27 am, cbmeeks <cbme...@gmail.comwrote:
>
I am writing my own family photo sharing site that
I hope to take public (like so many others). Anyway,
currently, when the user uploads a picture, I store
the picture outside my htdocs folder and record the
image details in a MySQB db. When you browse the
picture, I read the record and build the image by
sending an image/jpeg header.

Seriously though, if I take this public and get
extremely lucky and millions of photos are uploaded,
would this be the best method?

I've read pros and cons of storing images in a database.
I've read about Flickr, SmugMug, Photobucket having
HUNDREDS of millions to over a BILLION images stored!

Obviously, load balancing plays into this but what
other secrets do you think they use?
Separating (static) pictures from other (dynamic) content. Say, you
have two servers, one with PHP/MySQL (let's call it www.yoursite.com),
another with nothing but Apache (content.yoursite.com), optimized for
serving static images. The application residing on www.yoursite.com
saves images onto content.yoursite.com and records their full URLs
(http://content.yoursite.com/path/file.jpg) in its database. When
content.yoursite.com gets low on available disk space, you put up a
new server (content2.yoursite.com) for writing and start filling it up
with pictures, while content.yoursite.com still remains accessible for
reading. You can continue to add new content*.yoursite.com servers as
you go. Dynamically generated HTML gets served from www.yoursite.com
(which may eventually outgrow a single server and morph into a server
cluster), static images, from content*.yoursite.com.

A slight variation of this approach is that multiple servers are open
for writing at any given time; images are written onto a randomly
chosen server. This helps ensure that highly popular content will be
spread between multiple servers and can thus be served faster.

Yet another possibility is to hide your application behind a layer of
caching proxies...
One thing I worry about is my file system. I have
something like:

pix
-----user1
-------------thumbs
-----user2
-------------thumbs
There's absolutely no need for the file structure to replicate your
database structure...

Cheers,
NC

Jun 12 '07 #7
cbmeeks wrote:
Max/Jerry:

Oh believe me, I would like to use the DB and I will certainly try it
and run some performance testing.

>serve the images up. But the database design and configuration is the
more important thing here. That's why I suggested a database newsgroup.
It's a better place to discuss these things.

Agreed. I just don't like to cross post and I knew that PHP and MySQL
would be involved. That's why I started here first.
Ah, but cross-posting is the ONLY way to fly! :-)

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
js*******@attglobal.net
==================
Jun 12 '07 #8
That makes sense. I see many of the big sites use
"static123.example.com".


On Jun 12, 1:52 pm, NC <n...@iname.comwrote:
On Jun 12, 5:27 am, cbmeeks <cbme...@gmail.comwrote:


I am writing my own family photo sharing site that
I hope to take public (like so many others). Anyway,
currently, when the user uploads a picture, I store
the picture outside my htdocs folder and record the
image details in a MySQB db. When you browse the
picture, I read the record and build the image by
sending an image/jpeg header.
Seriously though, if I take this public and get
extremely lucky and millions of photos are uploaded,
would this be the best method?
I've read pros and cons of storing images in a database.
I've read about Flickr, SmugMug, Photobucket having
HUNDREDS of millions to over a BILLION images stored!
Obviously, load balancing plays into this but what
other secrets do you think they use?

Separating (static) pictures from other (dynamic) content. Say, you
have two servers, one with PHP/MySQL (let's call itwww.yoursite.com),
another with nothing but Apache (content.yoursite.com), optimized for
serving static images. The application residing onwww.yoursite.com
saves images onto content.yoursite.com and records their full URLs
(http://content.yoursite.com/path/file.jpg) in its database. When
content.yoursite.com gets low on available disk space, you put up a
new server (content2.yoursite.com) for writing and start filling it up
with pictures, while content.yoursite.com still remains accessible for
reading. You can continue to add new content*.yoursite.com servers as
you go. Dynamically generated HTML gets served fromwww.yoursite.com
(which may eventually outgrow a single server and morph into a server
cluster), static images, from content*.yoursite.com.

A slight variation of this approach is that multiple servers are open
for writing at any given time; images are written onto a randomly
chosen server. This helps ensure that highly popular content will be
spread between multiple servers and can thus be served faster.

Yet another possibility is to hide your application behind a layer of
caching proxies...
One thing I worry about is my file system. I have
something like:
pix
-----user1
-------------thumbs
-----user2
-------------thumbs

There's absolutely no need for the file structure to replicate your
database structure...

Cheers,
NC

Jun 18 '07 #9

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

37
by: asj | last post by:
awhile back, eBay decided to switch from a Microsoft/.NET/Windows architecture on the backend to a J2EE one, which might explain why their java backend will handle up to 1 BILLION page views a day!...
0
by: JakeC | last post by:
Hey all, I'm currently redesigning a website that a friend and I started about a year ago. It is a daily surf report so when choosing the best script/language to use for the new design, I found...
34
by: Justin Timberlake | last post by:
I was thinking about all those /Indian Outsourcing/ companies getting those .Net shops set up. 0. Nobody uses .NET in the real world, it's all java. 1. MSFT is about to collapse as witnessed by...
12
by: Wadim Grasza | last post by:
I want to store and display (on a form or a report) multiple pictures per record in an access database. The pictures are not stored within the database. They are stored as files and the database...
13
by: gooze | last post by:
Hello I am working on an applicaion that shows several pictures on a webpage. These pictures are saved in a MySQL DB as BLOB. I noticed, that the web server suffers in its performance by...
13
by: Jim Carlock | last post by:
I have over a hundred pictures I would like to present. Is it practical to create and parse an array of pictures, picture paths, et al using server-side scripting to accomplish this? I...
25
by: Subra | last post by:
Hi, What is the best way to find the 1000 largest numbers from the file having hell lot of entries ? Can you please help me to find out the way ? Do I need to go for B+ trees ?? Please help,...
0
by: eBob.com | last post by:
Here's how I insert and size pictures in an Excel spreadsheet which I create via a VB.NET program (commentary follows): 'add thumb nail objSheet.Rows(XLRow.ToString).rowheight =...
18
by: raylopez99 | last post by:
The maximum int for an array on my machine (a Pentium IV with 2 GB RAM) is < 330 Million...before you get an "out of memory" exception. I simply filled an array of this size with ints...I got as...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: emmanuelkatto | last post by:
Hi All, I am Emmanuel katto from Uganda. I want to ask what challenges you've faced while migrating a website to cloud. Please let me know. Thanks! Emmanuel
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: nemocccc | last post by:
hello, everyone, I want to develop a software for my android phone for daily needs, any suggestions?
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...
0
tracyyun
by: tracyyun | last post by:
Dear forum friends, With the development of smart home technology, a variety of wireless communication protocols have appeared on the market, such as Zigbee, Z-Wave, Wi-Fi, Bluetooth, etc. Each...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.