Re: [ADSM-L] Recovering Linux TSM server from partial filesystem failure

2014-03-11 16:18:15
From: Skylar Thompson <skylar2 AT U.WASHINGTON DOT EDU>
To: ADSM-L AT VM.MARIST DOT EDU
Date: Tue, 11 Mar 2014 13:15:22 -0700
Since you mentioned Dell, one thing to check would be PERC and hard drive
firmware levels. There have been a number of updates to both over the past
few years concerning silent data corruption under a variety of conditions.

On Tue, Mar 11, 2014 at 03:07:24PM -0400, Zoltan Forray wrote:
> Thanks for finding that but, as you said, it doesn't help much.  The disk
> filled to 100% due to DB2 taking dumps but that doesn't tell me what caused
> the dumping in the first place.
>
> We are still running Dell full diagnostics (started yesterday afternoon and
> was at 78% as of 2pm EDT).  My OS guy is watching the pot boil and will
> report back once it finishes.  Then if that is clean, he will wipe it and
> reinstall RH 6, since it doesn't look like what is left (/TSMDB, /TSMLOG,
> /TSMARCHLOG) will be of any value without the root filesystem.

--
-- Skylar Thompson (skylar2 AT u.washington DOT edu)
-- Genome Sciences Department, System Administrator
-- Foege Building S046, (206)-685-7354
-- University of Washington School of Medicine