[Veritas-bu] Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)

2007-07-13 13:24:08
Subject: [Veritas-bu] Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
From: "Preston, Douglas L" <dpreston AT landam DOT com>
To: <VERITAS-BU AT mailman.eng.auburn DOT edu>
Date: Fri, 13 Jul 2007 13:07:29 -0400
Starting 3 days ago, I have been seeing many occurrences of the following error
on one of my media servers. Has anyone seen this before, and is there a known
cause or solution?

NBU 6.0 MP4
ADIC Scalar i2000 library
14 SSO drives


7/13/2007 9:26:33 AM - begin Duplicate
7/13/2007 9:26:34 AM - requesting resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:26:34 AM - requesting resource L00882
7/13/2007 9:26:34 AM - reserving resource L00882
7/13/2007 9:26:34 AM - reserving resource 000418
7/13/2007 9:26:34 AM - reserving resource L00723
7/13/2007 9:31:48 AM - reserved resource L00882
7/13/2007 9:31:48 AM - reserved resource 000418
7/13/2007 9:31:48 AM - reserved resource L00723
7/13/2007 9:31:48 AM - granted resource L00882
7/13/2007 9:31:48 AM - granted resource IBMULTRIUM-TD23
7/13/2007 9:31:48 AM - granted resource 000378
7/13/2007 9:31:48 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:31:48 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:31:28 AM - started process bptm (3956)
7/13/2007 9:31:34 AM - started process bptm (3956)
7/13/2007 9:31:34 AM - mounting 000378
7/13/2007 9:31:34 AM - started process bptm (5180)
7/13/2007 9:31:35 AM - started process bptm (4152)
7/13/2007 9:31:37 AM - started process bptm (5736)
7/13/2007 9:31:39 AM - started process bptm (5180)
7/13/2007 9:31:39 AM - mounting L00882
7/13/2007 9:32:48 AM - mounted; mount time: 00:01:09
7/13/2007 9:32:54 AM - positioning L00882 to file 206
7/13/2007 9:33:18 AM - positioned L00882; position time: 00:00:24
7/13/2007 9:33:24 AM - begin reading
7/13/2007 9:33:24 AM - end reading; read time: 00:00:00
7/13/2007 9:33:25 AM - positioning L00882 to file 207
7/13/2007 9:33:25 AM - positioned L00882; position time: 00:00:00
7/13/2007 9:33:26 AM - begin reading
7/13/2007 9:33:39 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:33:40 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:34:40 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:34:19 AM - started process bptm (3956)
7/13/2007 9:34:19 AM - mounting 000378
7/13/2007 9:35:14 AM - granted resource 000378
7/13/2007 9:35:14 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:35:14 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:36:06 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:36:07 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:37:06 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:37:13 AM - started process bptm (3956)
7/13/2007 9:37:13 AM - mounting 000378
7/13/2007 9:38:10 AM - granted resource 000378
7/13/2007 9:38:10 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:38:10 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:39:13 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:39:14 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:40:13 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:40:53 AM - granted resource 000378
7/13/2007 9:40:53 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:40:53 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:39:55 AM - started process bptm (3956)
7/13/2007 9:39:55 AM - mounting 000378
7/13/2007 9:41:49 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:41:50 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:42:50 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:43:22 AM - granted resource 000378
7/13/2007 9:43:22 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:43:22 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:42:26 AM - started process bptm (3956)
7/13/2007 9:42:26 AM - mounting 000378
7/13/2007 9:44:25 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:44:26 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:45:21 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:45:02 AM - started process bptm (3956)
7/13/2007 9:45:02 AM - mounting 000378
7/13/2007 9:45:37 AM - granted resource 000378
7/13/2007 9:45:37 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:45:37 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:46:50 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:46:51 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:46:59 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:47:32 AM - granted resource 000378
7/13/2007 9:47:32 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:47:32 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:47:27 AM - started process bptm (3956)
7/13/2007 9:47:27 AM - mounting 000378
7/13/2007 9:49:39 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:49:40 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:49:48 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:50:24 AM - started process bptm (3956)
7/13/2007 9:50:24 AM - mounting 000378
7/13/2007 9:50:29 AM - granted resource 000378
7/13/2007 9:50:29 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:50:29 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:52:17 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:52:18 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:52:26 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:52:54 AM - started process bptm (3956)
7/13/2007 9:52:54 AM - mounting 000378
7/13/2007 9:52:59 AM - granted resource 000378
7/13/2007 9:52:59 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:52:59 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:54:47 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:54:48 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:55:47 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:55:24 AM - started process bptm (3956)
7/13/2007 9:55:24 AM - mounting 000378
7/13/2007 9:56:22 AM - granted resource 000378
7/13/2007 9:56:22 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:56:22 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:57:22 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:57:23 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 9:58:22 AM - current media 000378 complete, requesting next media Any
7/13/2007 9:58:01 AM - started process bptm (3956)
7/13/2007 9:58:01 AM - mounting 000378
7/13/2007 9:58:57 AM - granted resource 000378
7/13/2007 9:58:57 AM - granted resource IBMULTRIUM-TD28
7/13/2007 9:58:57 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 9:59:48 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 9:59:49 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 10:00:48 AM - current media 000378 complete, requesting next media Any
7/13/2007 10:01:21 AM - granted resource 000378
7/13/2007 10:01:21 AM - granted resource IBMULTRIUM-TD28
7/13/2007 10:01:21 AM - granted resource SCACIBU01-hcart2-robot-tld-0
7/13/2007 10:00:25 AM - started process bptm (3956)
7/13/2007 10:00:25 AM - mounting 000378
7/13/2007 10:02:30 AM - Error bptm(pid=3956) SCSI RESERVE failed (reserve unit scsi command failed)
7/13/2007 10:02:31 AM - Warning bptm(pid=3956) media id 000378 load operation reported an error
7/13/2007 10:03:31 AM - current media 000378 complete, requesting next media Any
7/13/2007 10:03:12 AM - started process bptm (3956)
7/13/2007 10:03:12 AM - mounting 000378
7/13/2007 10:04:10 AM - granted resource 000378
7/13/2007 10:04:10 AM - granted resource IBMULTRIUM-TD28
7/13/2007 10:04:10 AM - granted resource SCACIBU01-hcart2-robot-tld-0 
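
A long job log like the one above is easier to reason about once the repeated
failures are tallied. The following is only a minimal sketch, not a NetBackup
tool: it assumes the job details have been pasted into a list of text lines in
the format shown above, and the names join_wrapped and summarize are made up
for illustration. It rejoins any entries that mail wrapping split across two
lines, then counts SCSI RESERVE failures per bptm pid and load warnings per
media id.

```python
import re
from collections import Counter

# A new log entry starts with a timestamp such as "7/13/2007 9:33:39 AM - ".
ENTRY_START = re.compile(r"^\d{1,2}/\d{1,2}/\d{4} \d{1,2}:\d{2}:\d{2} [AP]M - ")
RESERVE_ERROR = re.compile(r"Error bptm\(pid=(\d+)\) SCSI RESERVE failed")
LOAD_WARNING = re.compile(r"Warning bptm\(pid=\d+\) media id (\S+) load operation")


def join_wrapped(lines):
    """Merge wrapped continuation lines back into single log entries."""
    entries = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if ENTRY_START.match(line) or not entries:
            entries.append(line)
        else:
            # No timestamp: treat it as the tail of the previous entry.
            entries[-1] += " " + line
    return entries


def summarize(lines):
    """Count SCSI RESERVE failures per pid and load warnings per media id."""
    errors_by_pid = Counter()
    warnings_by_media = Counter()
    for entry in join_wrapped(lines):
        match = RESERVE_ERROR.search(entry)
        if match:
            errors_by_pid[match.group(1)] += 1
        match = LOAD_WARNING.search(entry)
        if match:
            warnings_by_media[match.group(1)] += 1
    return errors_by_pid, warnings_by_media
```

Calling summarize(open("job_details.txt").read().splitlines()), where
job_details.txt is a hypothetical file holding the pasted log, returns the two
counters. If every failure maps to the same pid and media id, as appears to be
the case here, that points at one drive or path rather than the whole library.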


Doug Preston
Systems Engineer
Land America Tax and Flood Services
Phone 626-339-5221 Ext 1104
Email  dlpreston AT landam DOT com






-----Original Message-----
From: veritas-bu-bounces AT mailman.eng.auburn DOT edu 
[mailto:veritas-bu-bounces AT mailman.eng.auburn DOT edu] On Behalf Of Renee 
Carlisle
Sent: Friday, July 13, 2007 9:50 AM
To: jlightner AT water DOT com; VERITAS-BU AT mailman.eng.auburn DOT edu; 
david.clooney AT bankofamerica DOT com
Subject: Re: [Veritas-bu] URGENT Friday Afternoon Help

With SSO, check whether it caused a hang when communicating with ltid or
vmd; that could trigger the failover as well.





-----Original Message-----
From: "Clooney, David" [david.clooney AT bankofamerica DOT com]
Date: 07/13/2007 12:32 PM
To: "Jeff Lightner" , VERITAS-BU AT mailman.eng.auburn DOT edu
Subject: Re: [Veritas-bu] URGENT Friday Afternoon Help





Thanks for the input Jeff

 

Unfortunately, the servers are using Microsoft Cluster; I am not sure whether
you get the same functionality.

 

One cluster failed over because one of its nodes was hit; on our other
cluster, however, both nodes were taken out, which is not a good situation.

 

Dave

 

________________________________

From: Jeff Lightner [mailto:jlightner AT water DOT com]
Sent: 13 July 2007 16:58
To: Clooney, David; VERITAS-BU AT mailman.eng.auburn DOT edu
Subject: RE: [Veritas-bu] URGENT Friday Afternoon Help

 

If you put your drives in as "critical" resources for the cluster and
they suddenly couldn't be accessed due to an incommunicative library, it
makes sense the cluster would fail over. We mistakenly did this with some
filesystems early on in our Development cluster. We had multiple environments
on the cluster, and when we unmounted one of the filesystems for one environment,
it failed the entire cluster over. Veritas Cluster lets you mark which items
are "critical" to the cluster (e.g. the underlying NIC might be, but not
individual filesystems in the Dev cluster).

 

________________________________

From: veritas-bu-bounces AT mailman.eng.auburn DOT edu 
[mailto:veritas-bu-bounces AT mailman.eng.auburn DOT edu] On Behalf Of Clooney, 
David
Sent: Friday, July 13, 2007 11:43 AM
To: VERITAS-BU AT mailman.eng.auburn DOT edu
Subject: [Veritas-bu] URGENT Friday Afternoon Help

 

Hi All

 

Apologies for the dramatic subject.

 

We are in a crisis situation at the moment and are at a loss as to what the
problem might be.

 

Scenario: 5.1 MP5 on all servers

Master: Solaris 8, 64-bit

2 x Windows 2003 SP1 media servers

2 x Windows 2003 clustered media servers

Using SSO

 

STK 9940b

 

 

We had a drive go down at 14:00 today, and it caused one of the standalone media
servers and three of the clustered nodes to fall over, all at once. Ouch.

 

Has anyone ever seen this behaviour before, or anything like it?

 

Sorry for the lack of info; I am still trying to get to the bottom of this.

 

Any info or thoughts would be most welcome.

 

Regards

 

Dave

 

 

 



_______________________________________________
Veritas-bu maillist  -  Veritas-bu AT mailman.eng.auburn DOT edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu