I'm doing a postmortem from an outage at my workplace that looks too
similar to an outage we had last fall to not be related. Both database
outages had the following characteristics:
1) VERY large, frequently-accessed & updated tablespace defined on a
raw device of about 348 GB, with the indexes defined on a separate raw
device, on a Sun machine.
2) Something happened to the device that made it start logging hardware
and I/O errors, which in turn marked the tablespace as bad.
3) There were no errors in the db2diag.log about the tablespace filling
up. The database just crashed one day and came back up with a mangled
tablespace.
While I've got the server ops dude checking out the hardware, I'm
curious about the device. 2 hardware failures under seemingly the same
set of conditions... I'm wondering: is it possible to define a
tablespace on a raw device with a size parameter that is acceptable to
DB2 (therefore letting it create the tablespace successfully), but
that somehow causes an issue with the OS later on? Meaning, if I had a
device that was 300 GB, could I define a tablespace on it for 300 GB,
or should I leave a little wiggle room?
When you create a tablespace on a raw device, DB2 will check to see if it
can access the "end" of the device, presumably to protect against this kind
of problem. So as long as DB2 does not complain when creating the
tablespace, you will be fine.
Granted, both the OS and DB2 have some overhead, so a 300GB physical device
may only have (300GB - xx KB) in usable space, and once a DB2 container is
created on that device, there will only be (300GB - xx KB - yy KB) available
to use for pages.
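
To make that arithmetic concrete, here is a rough sketch in Python. It is not DB2's actual accounting: the function name, the OS-reserved figure, and the assumption that one extent per container goes to overhead are all illustrative placeholders, so substitute the numbers that apply to your platform and DB2 release.

```python
def container_page_budget(device_bytes, page_size=4096, extent_pages=32,
                          os_reserved_bytes=0, overhead_extents=1):
    """Estimate pages for a DMS raw-device container.

    Returns (container_pages, data_pages):
      container_pages - whole-extent page count that fits on the device
      data_pages      - pages left once the assumed per-container overhead
                        is subtracted
    Every overhead value here is an assumption for illustration only.
    """
    available = device_bytes - os_reserved_bytes      # 300GB - xx KB
    if available <= 0:
        return (0, 0)
    total_pages = available // page_size
    # Round down to whole extents; a partial extent at the end is unusable.
    whole_extents = total_pages // extent_pages
    container_pages = whole_extents * extent_pages
    # Subtract the assumed container overhead (the "yy KB" above).
    data_extents = max(whole_extents - overhead_extents, 0)
    return (container_pages, data_extents * extent_pages)


if __name__ == "__main__":
    # Hypothetical 300 GiB device with 1 MiB assumed unavailable to DB2.
    size, data = container_page_budget(300 * 1024**3,
                                       os_reserved_bytes=1024**2)
    print("container pages:", size)
    print("pages available for data:", data)
```

The point is only that the page count you can actually use is a bit below the raw device size, which is why a container definition a little under the device size costs almost nothing.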
--
Matt Emmerton
That makes sense to me. The nature of a DMS container is to claim all
the space you have defined for it up front and let DB2 manage the
space. Still, it's very strange that two of our servers failed in the
same way on two different physical machines. I guess those poor old
RAID 0 arrays just gave out from all the activity.
RAID 0? That's just "marketing" for ***NO*** RAID. In simple language,
you're asking for disaster, and you apparently got what you asked for.
Well, unfortunately, I did not have any say in the matter. I'm just
the one who they run screaming to when it fails. < shakes head > This
has happened to them twice now over the past year...you'd think they'd
have learned their lesson by now. I can't use words to describe how
FRUSTRATING it is when you try to explain to people that DBMS software
does NOT cause hardware failures and they don't believe you. It's the other
way around!!! ( sorry had to vent there. )
FORTUNATELY, we were able to recover some summary tables on a different
device. That satisfied management for the time being.
< sigh >
TW
I sure hope your backups and logs are on a different physical device
from the tablespaces. Your management needs to become educated in RAID
terminology and architecture so they can properly assess the business
consequences of using disk configurations optimized for speed instead of
reliability.
Phil Sherman