Veritas-bu

[Veritas-bu] Media write error

2002-01-03 13:20:35
Subject: [Veritas-bu] Media write error
From: ferbraga AT vento.com DOT br (ferbraga AT vento.com DOT br)
Date: Thu, 3 Jan 2002 18:20:35 +0000
I have sometimes a failed backup with error 84. It occurs after NBU have
done a lot of work in this class (more than 500GB). So, I don't suspect
drive error, drive configuration errors, or media type errors, like Troubleshoot
Guide suggests. The medias are all new, and the drive is clean. This class
use a storage unit with two drives.
My master server is a Solaris8, Netbackup 3.4 with patch 34_2.
The media server, where the error occurs, is a Tru64 5.1 with the same patch
level. The tape drive is a 9840FC. In this media server are another backups
running pretty well.
The bptm logs shows :

05:34:18 [474102] <2> bptm: INITIATING: -U
05:34:18 [474102] <2> db_byid: search for media id 000042
05:34:18 [474102] <2> db_byid: 000042 found at offset 84
05:34:18 [474102] <2> db_lock_media: unable to lock media at offset 84 (000042)
05:34:18 [474102] <2> bptm: EXITING with status 0 <----------
05:34:36 [470438] <2> signal_parent: sending SIGUSR1 to bpbrm (pid = 470410)
05:34:36 [470438] <2> write_data: attempting write error recovery, err =
5
05:34:36 [470438] <2> tape_error_rec: error recovery to block 73580 requested
05:34:36 [470438] <2> tape_error_rec: attempting error recovery, delay 3
minutes before next attempt, tries left = 5
.
.
.
05:37:36 [470438] <2> io_ioctl: command (0)MTWEOF 0 from (overwrite.c.395)
on drive index 0
05:37:36 [470438] <2> io_ioctl: MTWEOF failed during error recovery, I/O
error
05:37:36 [470438] <2> tape_error_rec: attempting error recovery, delay 3
minutes before next attempt, tries left = 4
.
.
.
05:49:36 [470438] <2> io_ioctl: command (0)MTWEOF 0 from (overwrite.c.395)
on drive index 0
05:49:36 [470438] <2> io_ioctl: MTWEOF failed during error recovery, I/O
error
05:49:36 [470438] <2> getsockconnected: host=casra-backserver2 service=bpdbm
address=172.18.100.54 protocol=tcp non-reserved port=13721
05:49:36 [470438] <2> bind_on_port_addr: bound to port 4657
05:49:36 [470438] <2> check_authentication: no authentication required
05:49:37 [470438] <16> write_data: cannot write image to media id 000042,
drive index 0, I/O error
05:49:37 [470438] <2> rewind_after_error: rewinding after EIO error
05:49:38 [475173] <2> bptm: INITIATING: -U
05:49:38 [475173] <2> db_byid: search for media id 000042
05:49:38 [475173] <2> db_byid: 000042 found at offset 84
05:49:38 [475173] <2> db_lock_media: unable to lock media at offset 84 (000042)
05:49:38 [475173] <2> bptm: EXITING with status 0 <----------
05:49:49 [470438] <2> rewind_after_error: tried 1 time(s) to perform rewind
05:49:49 [470438] <2> log_media_error: successfully wrote to error file
- 01/03/02 05:49:49 000042 0 WRITE_ERROR
05:49:49 [470438] <2> check_error_history: called from bptm line 12362,
EXIT_Status = 84
05:49:49 [470438] <2> check_error_history: drive index = 0, media id = 000042,
time = 01/03/02 05:49:49, both_match = 0, media_match = 0, drive_match =
0
05:49:49 [470438] <2> io_close: closing 
/usr/openv/netbackup/db/media/tpreq/000042,
from bptm.c.10850
05:49:49 [470438] <2> tpunmount: tpunmount'ing 
/usr/openv/netbackup/db/media/tpreq/000042
05:49:49 [470438] <2> bptm: EXITING with status 84 <----------


Any help will be welcome!!

Thanks in advanced

Fernando





<Prev in Thread] Current Thread [Next in Thread>