Amanda-Users

Re: Tape error - how 'bad' is this?

2004-05-11 05:20:47
From: Martin Hepworth <martinh AT solid-state-logic DOT com>
To: Dave Ewart <Dave.Ewart AT cancer.org DOT uk>, amanda-users AT amanda DOT org
Date: Tue, 11 May 2004 10:17:20 +0100
Dave

Being the person you are I presume you've...

cleaned the heads,
reseated the connectors,

Sounds like a drive issue to me. There are some diagnostic utilities on the HP web site that they will have you run, but you'll have to dig out a Windows box to run them.

I've got a DLT1 (the same drive, just a little slower) that I could free up for a lunchtime if you need a test unit.

--
Martin Hepworth
Snr Systems Administrator
Solid State Logic
Tel: +44 (0)1865 842300


Dave Ewart wrote:
Hi,

After many months of trouble-free operation, the AMANDA job has suddenly
failed to finish properly on the last two nights. The error in the AMANDA
report says:


These dumps were to tape Daily-014.
*** A TAPE ERROR OCCURRED: [[writing file: Input/output error]].
Some dumps may have been left in the holding disk.
Run amflush to flush them to tape.
The next tape Amanda expects to use is: Daily-015.

FAILURE AND STRANGE DUMP SUMMARY:
 halcyon    /root lev 0 FAILED [out of tape]

[...]

STATISTICS:
                         Total       Full      Daily
                       --------   --------   --------
Estimate Time (hrs:min)    0:45
Run Time (hrs:min)         8:09
Dump Time (hrs:min)        8:15       0:25       7:50
Output Size (meg)       10606.2     1315.9     9290.3
Original Size (meg)     22351.7     2115.1    20236.6
Avg Compressed Size (%)    47.5       62.2       45.9   (level:#disks ...)
Filesystems Dumped           54         28         26   (1:21 2:4 4:1)
Avg Dump Rate (k/s)       365.7      888.3      337.6

Tape Time (hrs:min)        0:14       0:08       0:06
Tape Size (meg)           612.8      332.6      280.2
Tape Used (%)               1.7        1.0        0.7   (level:#disks ...)
Filesystems Taped            36         26         10   (1:10)
Avg Tp Write Rate (k/s)   745.4      745.0      745.9

[...]

taper: tape Daily-014 kb 653792 fm 37 writing file: Input/output error
  driver: going into degraded mode because of tape error.


I notice it mentions "out of tape" above - that's definitely wrong.
There's certainly plenty of room for all the dumps.
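
As a rough check of that claim, here is a small Python sketch that works backwards from the "Tape Size" and "Tape Used" figures in the report. It assumes that "Tape Used (%)" is measured against the tape length configured for the tapetype, and the function name `implied_tape_length_mb` is made up for this example. The percentage is rounded to one decimal place in the report, so the result is only approximate.

```python
def implied_tape_length_mb(taped_mb: float, used_pct: float) -> float:
    """Tape length (MB) at which `taped_mb` would equal `used_pct` percent."""
    return taped_mb / (used_pct / 100.0)


# Figures copied from the "Total" column of the report above.
taped_mb = 612.8      # Tape Size (meg)
used_pct = 1.7        # Tape Used (%), rounded in the report
output_mb = 10606.2   # Output Size (meg): everything this run wanted to write

length_mb = implied_tape_length_mb(taped_mb, used_pct)
remaining_mb = length_mb - taped_mb

print(f"implied tape length:  about {length_mb:.0f} MB")
print(f"unused when it failed: about {remaining_mb:.0f} MB")
print(f"whole run would fit:   {output_mb < length_mb}")
```

If the assumption holds, the run stopped with well over 90% of the tape unused, so the "[out of tape]" entry looks like a side effect of the write error rather than a real capacity problem.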

And in the kernel logs for the AMANDA server:


[...] kernel: st0: Error with sense data: Info fld=0x8000, Deferred st09:00: sns = f1  4
[...] kernel: ASC=15 ASCQ= 1
[...] kernel: Raw sense data:0xf1 0x00 0x04 0x00 0x00 0x80 0x00 0x16 0x00 0x00 0x4f 0xc8 0x15 0x01 0x00 0x00 0x00 0x00 0x82 0x01 0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00
[...] kernel: st0: Error with sense data: Info fld=0x1, Current st09:00: sns = f0  3
[...] kernel: ASC=15 ASCQ= 2
[...] kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x00 0x00 0x01 0x16 0x00 0x00 0x4f 0xc8 0x15 0x02 0x00 0x00 0x00 0x00 0x82 0x01 0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00
[...] kernel: st0: Error on write filemark.
[...] kernel: st0: Error with sense data: Info fld=0x8000, Deferred st09:00: sns = f1  4
[...] kernel: ASC=15 ASCQ= 1
[...] kernel: Raw sense data:0xf1 0x00 0x04 0x00 0x00 0x80 0x00 0x16 0x00 0x00 0x4f 0xc8 0x15 0x01 0x00 0x00 0x00 0x00 0x82 0x01 0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00
[...] kernel: st0: Error with sense data: Info fld=0x1, Current st09:00: sns = f0  3
[...] kernel: ASC=15 ASCQ= 2
[...] kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x00 0x00 0x01 0x16 0x00 0x00 0x4f 0xc8 0x15 0x02 0x00 0x00 0x00 0x00 0x82 0x01 0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00
[...] kernel: st0: Error with sense data: Info fld=0x8000, Deferred st09:00: sns = f1  4
[...] kernel: ASC=15 ASCQ= 1
[...] kernel: Raw sense data:0xf1 0x00 0x04 0x00 0x00 0x80 0x00 0x16 0x00 0x00 0x4f 0xc8 0x15 0x01 0x00 0x00 0x00 0x00 0x82 0x01 0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00
[...] kernel: st0: Error with sense data: Info fld=0x1, Current st09:00: sns = f0  3
[...] kernel: ASC=15 ASCQ= 2
[...] kernel: Raw sense data:0xf0 0x00 0x03 0x00 0x00 0x00 0x01 0x16 0x00 0x00 0x4f 0xc8 0x15 0x02 0x00 0x00 0x00 0x00 0x82 0x01 0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00
[...] kernel: st0: Error on write filemark.
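
For anyone reading those raw bytes by hand, here is a minimal Python sketch of how the two distinct sense blocks in the log break down. The offsets follow the SCSI fixed-format sense layout (byte 0 response code, byte 2 sense key, bytes 3-6 information field, bytes 12-13 ASC/ASCQ). The function name `decode_sense` and the two lookup tables are made up for this example and cover only the codes that appear in this log. How a particular drive uses these codes is vendor-specific, so treat the decoded text as a hint, not a diagnosis.

```python
# Only the codes seen in this log; the full tables are much longer.
SENSE_KEYS = {0x3: "MEDIUM ERROR", 0x4: "HARDWARE ERROR"}
ASC_ASCQ = {
    (0x15, 0x01): "MECHANICAL POSITIONING ERROR",
    (0x15, 0x02): "POSITIONING ERROR DETECTED BY READ OF MEDIUM",
}

# The two distinct "Raw sense data" lines from the kernel log, joined up.
DEFERRED_SAMPLE = (
    "0xf1 0x00 0x04 0x00 0x00 0x80 0x00 0x16 0x00 0x00 "
    "0x4f 0xc8 0x15 0x01 0x00 0x00 0x00 0x00 0x82 0x01 "
    "0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00"
)
CURRENT_SAMPLE = (
    "0xf0 0x00 0x03 0x00 0x00 0x00 0x01 0x16 0x00 0x00 "
    "0x4f 0xc8 0x15 0x02 0x00 0x00 0x00 0x00 0x82 0x01 "
    "0x55 0x00 0x00 0x25 0x3b 0x00 0x96 0x32 0x50 0x00"
)


def decode_sense(raw: str) -> dict:
    """Decode a whitespace-separated hex dump of fixed-format sense data."""
    data = bytes(int(tok, 16) for tok in raw.split())
    if len(data) < 14:
        raise ValueError("need at least 14 bytes to reach ASC/ASCQ")
    response_code = data[0] & 0x7F          # low 7 bits of byte 0
    if response_code not in (0x70, 0x71):   # 0x70 current, 0x71 deferred
        raise ValueError("not fixed-format sense data")
    key = data[2] & 0x0F                    # low nibble of byte 2
    asc, ascq = data[12], data[13]
    return {
        "valid": bool(data[0] & 0x80),      # information field is valid
        "deferred": response_code == 0x71,
        "sense_key": SENSE_KEYS.get(key, f"key 0x{key:x}"),
        "information": int.from_bytes(data[3:7], "big"),
        "asc": asc,
        "ascq": ascq,
        "meaning": ASC_ASCQ.get((asc, ascq), "not in this table"),
    }


if __name__ == "__main__":
    for sample in (DEFERRED_SAMPLE, CURRENT_SAMPLE):
        print(decode_sense(sample))
```

On these samples the deferred error carries sense key 4 (hardware error) with ASC/ASCQ 15h/01h, and the current error carries sense key 3 (medium error) with 15h/02h. Both are positioning errors, which points towards the drive mechanism or the media rather than a full tape.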


What looks like the problem here?  The drive itself?  The SCSI
controller?  We have an HP DLT-40 drive on an Adaptec 29160 controller.
There have been no configuration changes to the server.

Dave.



