Veritas-bu

[Veritas-bu] more on drives being downed

2002-05-22 12:39:55
Subject: [Veritas-bu] more on drives being downed
From: David_Cornely AT intuit DOT com (Cornely, David)
Date: Wed, 22 May 2002 09:39:55 -0700
I don't know if you've done this yet but you should have STK check to make
sure all your drives are at the same firmware level.  I've seen problems
when drives in the same robot are at different firmware levels.

-----Original Message-----
From: danix AT cloud9 DOT net [mailto:danix AT cloud9 DOT net]
Sent: Wednesday, May 22, 2002 8:56 AM
To: veritas-bu AT mailman.eng.auburn DOT edu
Subject: [Veritas-bu] more on drives being downed


I'm learning.

I looked in /opt/openv/netbackup/db/media/errors and found a bunch of read
errors.

I parsed the file with grep/cut/sort/uniq and came up with around 22
different
tapes that present read errors, all since May 14th, a few days before we
made our original system changes.

So, it seems that netbackup is doing the right thing and marking the drives
as down, when it is seeing the read errors.

So, now we are:
- increasing the logging levels 
- checking the storagetek array (9710) for hardware problems.
- going to try new tapes

It's hard to believe that 20+ tapes are all bad, and it's also not a
coincidence
that both arrays were having problems.  Could there be something at the Sun 
level causing read errors?  In my experience, read errors are either bad
tapes or bad heads.

To answer a couple of other questions I received, we've run robtest OK, we
don't
have a separate media server, and we're reinventoried the robot (actually 
reinstalled 4.3 completely yesterday).

I'm pointing to hardware problems at this point, how about you?
_______________________________________________
Veritas-bu maillist  -  Veritas-bu AT mailman.eng.auburn DOT edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu


<Prev in Thread] Current Thread [Next in Thread>