Veritas-bu

[Veritas-bu] consistent error 40's

2000-09-28 01:27:24
Subject: [Veritas-bu] consistent error 40's
From: Mike Andres mike_andres AT cnt DOT com
Date: Thu, 28 Sep 2000 00:27:24 -0500
Hi,

    We keep seeing error 40's in the job monitor when we back up an HP-UX
11.00 client across the network to an HP-UX 11.00 master server connected to
9840 tape drives in a 9710 library.  At first we suspected network problems
but after more investigations it seems to be related to the HP A5158A fibre
channel HBA.  When we run the backup with the tape drives connected via a
SCSI interface we see no problems, but when we are attached to the tapes via
FC we eventually get the error 40 in the job monitor.  The error seems to
always occur when the backup reaches the end of a tape.  Looking at the bptm
logs we consistently see an error 174 occuring when the fragment file number
is 37 which seems to be the last fragment on the tape.  The NBU
troubleshooting guide suggests that an error 174 of this type may be due to
the drive being in fixed lengthed mode, but this doesn't seem to apply to
HP-UX .  Does anyone have a clue as to what may be happening here?  Relevant
bptm log snippet below. 

TIA,
Mike Andres


15:07:32 [2544] <4> write_backup: successfully wrote backup id
ranger_0969898942, copy 1, fragment 36, 2000000 Kbytes at 4197.941
Kbytes/sec
15:07:32 [2544] <2> getsockconnected: host=horizon service=bpdbm
address=100.13.201.5 protocol=tcp non-reserved port=13721
15:07:32 [2544] <4> write_backup: begin writing backup id ranger_0969898942,
copy 1, fragment 37, to media id 000516 on drive index 0
15:07:32 [2544] <2> io_write_back_header: drive index 0, ranger_0969898942,
file num = 37, mpx_headers = 0
15:07:32 [2544] <2> write_data: completed writing backup header, start
writing data when first buffer is available
15:07:32 [2544] <2> write_data: received first buffer (32768 bytes), begin
writing data
15:12:32 [4491] <2> bptm: INITIATING: -count -cmd -rt 8 -rn 1 -stunit
horizon -den 6 -mt 2 
15:12:32 [4491] <2> bptm: EXITING with status 0 <----------
15:12:52 [2544] <2> getsockconnected: host=horizon service=bpdbm
address=100.13.201.5 protocol=tcp non-reserved port=13721
15:12:53 [2544] <16> write_data: write of 32768 bytes indicated only 0 bytes
were written, errno = 0
15:12:53 [2544] <2> wait_for_sigcld: waiting for child to exit, timeout is
300
15:12:53 [2544] <2> check_error_history: called from bptm line 12047,
EXIT_Status = 174
15:12:53 [2544] <2> io_close: closing
/usr/openv/netbackup/db/media/tpreq/000516, from bptm.c.10586
15:12:53 [2544] <2> tpunmount: tpunmount'ing
/usr/openv/netbackup/db/media/tpreq/000516





<Prev in Thread] Current Thread [Next in Thread>
  • [Veritas-bu] consistent error 40's, Mike Andres mike_andres <=