473,385 Members | 1,769 Online
Bytes | Software Development & Data Engineering Community
Post Job

Home Posts Topics Members FAQ

Join Bytes to post your question to a community of 473,385 software developers and data experts.

How do the large scale sites do it?

I am looking to set up a fail safe site. My idea is to setup three
_duplicate_ sites with three different webhosts. The DNS provider
(Zoneedit) provides a distributed failover system; the DNS host tests the
primary website, and if it fails to respond, the DNS record is altered to
point to an alternate site.

The problem that I face is keeping the databases on the three different
hosts in sync. I'm not talking about a master/slave replication; I need
the data to be consistent across all locations at all times without any
conflicts (duplicate keys, etc). Replication doesn't help me because what
could happen is the master system goes down, users are redirected to a
secondary host, the user makes a change on a secondary system, the primary
system comes back up yet it doesn't know about the changes that took place
on the secondary system.

Looking at the MySQL docs, I can setup a simple Master/Master system, but
this quickly breaks down when the structure of the tables are less than
simplistic (primary key conflicts, etc).

How do the big sites do it? How do they maintain data integrity across
multiple servers when any one of the servers at any time could take over
primary responsibilities?

Any help is greatly appreciated.
Jul 19 '05 #1
3 5133
They host the database on a separate database server.... Look at
www.Pair.com for example....

Their web sites are on about 200 different servers... Their mySQL databases
are on about 45 data servers...

webhost1 --> uses dataserver1
webhost2 --> uses dataserver1
webhost3 --> uses dataserver1
dataserver1

"Jim Jones" <we*******@marketnoize.com> wrote in message
news:9x*******************@news.uswest.net...
I am looking to set up a fail safe site. My idea is to setup three
_duplicate_ sites with three different webhosts. The DNS provider
(Zoneedit) provides a distributed failover system; the DNS host tests the
primary website, and if it fails to respond, the DNS record is altered to
point to an alternate site.

The problem that I face is keeping the databases on the three different
hosts in sync. I'm not talking about a master/slave replication; I need
the data to be consistent across all locations at all times without any
conflicts (duplicate keys, etc). Replication doesn't help me because what could happen is the master system goes down, users are redirected to a
secondary host, the user makes a change on a secondary system, the primary
system comes back up yet it doesn't know about the changes that took place
on the secondary system.

Looking at the MySQL docs, I can setup a simple Master/Master system, but
this quickly breaks down when the structure of the tables are less than
simplistic (primary key conflicts, etc).

How do the big sites do it? How do they maintain data integrity across
multiple servers when any one of the servers at any time could take over
primary responsibilities?

Any help is greatly appreciated.

Jul 19 '05 #2
> Their web sites are on about 200 different servers... Their mySQL
databases
are on about 45 data servers... I'm a bit confused; you state "45 data servers" yet your diagram just
specifies one data server.

If there really are 45 data servers, then I'm back to my original question.
Other than a simplistic website, what do they do when there is a high
interdependence of data across multiple servers?

e.g. If there are 45 servers, each with a table with an autoincrement field,
in order for one row to be inserted into the table on server X, server X
must first verify that servers A,B,C haven't already utilized that key. So
how does this synchronization take place? Or do they just have one massive
MySQL server, and assume the risk of a single point of failure?
"codeWarrior" <GP******@HotMail.com> wrote in message
news:cG******************@news1.news.adelphia.net. .. They host the database on a separate database server.... Look at
www.Pair.com for example....

Their web sites are on about 200 different servers... Their mySQL databases are on about 45 data servers...

webhost1 --> uses dataserver1
webhost2 --> uses dataserver1
webhost3 --> uses dataserver1
dataserver1

"Jim Jones" <we*******@marketnoize.com> wrote in message
news:9x*******************@news.uswest.net...
I am looking to set up a fail safe site. My idea is to setup three
_duplicate_ sites with three different webhosts. The DNS provider
(Zoneedit) provides a distributed failover system; the DNS host tests the primary website, and if it fails to respond, the DNS record is altered to point to an alternate site.

The problem that I face is keeping the databases on the three different
hosts in sync. I'm not talking about a master/slave replication; I need the data to be consistent across all locations at all times without any
conflicts (duplicate keys, etc). Replication doesn't help me because

what
could happen is the master system goes down, users are redirected to a
secondary host, the user makes a change on a secondary system, the primary system comes back up yet it doesn't know about the changes that took place on the secondary system.

Looking at the MySQL docs, I can setup a simple Master/Master system, but this quickly breaks down when the structure of the tables are less than
simplistic (primary key conflicts, etc).

How do the big sites do it? How do they maintain data integrity across
multiple servers when any one of the servers at any time could take over
primary responsibilities?

Any help is greatly appreciated.


Jul 19 '05 #3
Most larger firms use clustering to ensure their databases are always available. With MySQL there are DB nodes that then access the clustered servers. Personally I've not used the MYSQL clustering as most providers and services don't provide it as part of thier standard services. I imagine you would incurr a farely large amount of initial support trying to get two ISP to cordinate enough to push this through on MYSQL. Look here to get more information
Sep 8 '05 #4

This thread has been closed and replies have been disabled. Please start a new discussion.

Similar topics

5
by: Hayden Kirk | last post by:
I've used PHP for many smaller projects, but now im embarking on large corporate websites. After reading this: http://www.ukuug.org/events/linux2002/papers/html/php/index.html PHP does not...
0
by: Constandinos Mavromoustakis | last post by:
CFP: CLADE 2004-Challenges of Large Applications in Distributed Environments ------------------------------------------------- PhD student - Dept.Informatics at Aristotle University of...
36
by: Andrea Griffini | last post by:
I did it. I proposed python as the main language for our next CAD/CAM software because I think that it has all the potential needed for it. I'm not sure yet if the decision will get through, but...
14
by: Dave Patton | last post by:
Does anyone have any experience running Xenu Link Sleuth: http://home.snafu.de/tilman/xenulink.html version 1.2e on very large sites? I'm having problems running it against our site, in that on...
9
by: Da~One | last post by:
This message has been posted to 2 groups, one to the VB.NET group, and the other to C#. I am trying to decide which language to commit to for a large scale project. I am looking for the input of...
57
by: Chris Foote | last post by:
Hi all. I have the need to store a large (10M) number of keys in a hash table, based on a tuple of (long_integer, integer). The standard python dictionary works well for small numbers of keys,...
4
by: raidvvan | last post by:
Hi there, We have been looking for some time now for a database system that can fit a large distributed computing project, but we haven't been able to find one. I was hoping that someone can...
22
by: Jesse Burns | last post by:
I'm about to start working on my first large scale site (in my opinion) that will hopefully have 1000+ users a day. ok, this isn't on the google/facebook scale, but it's going to be have more hits...
5
bilibytes
by: bilibytes | last post by:
Hi, (I don't really know where to post this question) for readability : LSWS = Large Scale Web Sites by LSWS I mean: - flickr -facebook -myspace ... web sites that are quite social...
0
by: taylorcarr | last post by:
A Canon printer is a smart device known for being advanced, efficient, and reliable. It is designed for home, office, and hybrid workspace use and can also be used for a variety of purposes. However,...
0
by: Charles Arthur | last post by:
How do i turn on java script on a villaon, callus and itel keypad mobile phone
0
by: aa123db | last post by:
Variable and constants Use var or let for variables and const fror constants. Var foo ='bar'; Let foo ='bar';const baz ='bar'; Functions function $name$ ($parameters$) { } ...
0
BarryA
by: BarryA | last post by:
What are the essential steps and strategies outlined in the Data Structures and Algorithms (DSA) roadmap for aspiring data scientists? How can individuals effectively utilize this roadmap to progress...
1
by: Sonnysonu | last post by:
This is the data of csv file 1 2 3 1 2 3 1 2 3 1 2 3 2 3 2 3 3 the lengths should be different i have to store the data by column-wise with in the specific length. suppose the i have to...
0
by: Hystou | last post by:
There are some requirements for setting up RAID: 1. The motherboard and BIOS support RAID configuration. 2. The motherboard has 2 or more available SATA protocol SSD/HDD slots (including MSATA, M.2...
0
marktang
by: marktang | last post by:
ONU (Optical Network Unit) is one of the key components for providing high-speed Internet services. Its primary function is to act as an endpoint device located at the user's premises. However,...
0
Oralloy
by: Oralloy | last post by:
Hello folks, I am unable to find appropriate documentation on the type promotion of bit-fields when using the generalised comparison operator "<=>". The problem is that using the GNU compilers,...
0
jinu1996
by: jinu1996 | last post by:
In today's digital age, having a compelling online presence is paramount for businesses aiming to thrive in a competitive landscape. At the heart of this digital strategy lies an intricately woven...

By using Bytes.com and it's services, you agree to our Privacy Policy and Terms of Use.

To disable or enable advertisements and analytics tracking please visit the manage ads & tracking page.