[Networker] Endless loop in 7.3.1

Starting sometime on Friday evening, one of the 4 LTO3 drives in our SL500 library decided it needed cleaning, so EBS 7.3.1 went on to grab a cleaning tape from an appropriate slot, then finished the operation with the message "device cleaned notice: Jukebox `jb0' Device `/dev/rmt/3hbn' in jukebox `jb0' cleaned at" <date on Friday>. Ever since then, that drive has been displaying the same message every couple of minutes, non-stop!

After poking around, I found the following:

In the Java Admin Console, the following shows up in the 'Operations' tab under 'Monitoring', one after another:


Origin   Operation Data              Duration Progress Message
nsrmmgd  Clean device /dev/rmt/3hbn  30       retryable

In /nsr/logs/daemon.log, I see the same combination of lines repeat:

03/19/07 18:19:23 nsrd: [Jukebox `jb0', operation 2098]. Initiated operation `Clean device /dev/rmt/3hbn using cleaning slot 373'.

03/19/07 18:19:49 nsrlcpd #1: Jukebox error: Jukebox:jb0 access:/dev/scsi/changer/c2t2d0 failed:MOVE MEDIUM key:4 status:CHECK CONDITION No Additional Sense, Media Load or Eject Failed

03/19/07 18:19:49 nsrmmgd: lcpd 1 at host scotch.tape.gatech.edu reported error 'Jukebox:jb0 access:/dev/scsi/changer/c2t2d0 failed:MOVE MEDIUM key:4 status:CHECK CONDITION No Additional Sense, Media Load or Eject Failed
' for the command `4'.

03/19/07 18:19:49 nsrd: [Jukebox `jb0', operation # 2098]. Jukebox:jb0 access:/dev/scsi/changer/c2t2d0 failed:MOVE MEDIUM key:4 status:CHECK CONDITION No Additional Sense, Media Load or Eject Failed

03/19/07 18:19:49 nsrmmgd: Jukebox:jb0 access:/dev/scsi/changer/c2t2d0 failed:MOVE MEDIUM key:4 status:CHECK CONDITION No Additional Sense, Media Load or Eject Failed

03/19/07 18:19:49 nsrd: device cleaned notice: Jukebox `jb0' Device `/dev/rmt/3hbn' in jukebox `jb0' cleaned at `Mon Mar 19 18:19:49 GMT-0400 2007'.

03/19/07 18:19:53 nsrd: [Jukebox `jb0', operation # 2098]. Finished with status: retryable

03/19/07 18:19:53 nsrmmgd: RAP error: Invalid resource data.

03/19/07 18:19:53 nsrmmgd: Cannot update operation status resource (instance 2098).
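
To put a number on how often the loop repeats, something like the following could tally the "device cleaned notice" entries per device. This is only a rough sketch, not a NetWorker tool: it assumes daemon.log entries look like the excerpt above (MM/DD/YY HH:MM:SS, then the daemon name and message), and the function name count_clean_notices is made up for illustration.

```python
import re
from collections import Counter

# Assumed entry format, inferred from the log excerpt only:
#   MM/DD/YY HH:MM:SS <daemon>: <message>
ENTRY_START = re.compile(r"\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2} ")

# Captures the device path quoted as `...' after the word "Device".
CLEANED = re.compile(r"device cleaned notice:.*?Device\s*`([^']+)'", re.S)


def count_clean_notices(log_text):
    """Count 'device cleaned notice' entries per device path.

    Entries are split on timestamps rather than on newlines, because
    one message can span several lines in the log.
    """
    starts = [m.start() for m in ENTRY_START.finditer(log_text)]
    ends = starts[1:] + [len(log_text)]
    counts = Counter()
    for begin, end in zip(starts, ends):
        match = CLEANED.search(log_text[begin:end])
        if match:
            counts[match.group(1)] += 1
    return counts
```

Feeding it the contents of /nsr/logs/daemon.log returns a Counter keyed by device path, so a drive stuck in the cleaning loop should stand out with a far higher count than the others.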

What do you think is causing this, and how on earth can I stop it? Drive /dev/rmt/3hbn still obeys commands such as "nsrjb -L -S 100 -f /dev/rmt/3hbn", so the drive itself still seems to work.


TIA,
Andrew Dietz
