Bacula-users

[Bacula-users] Help in decoding hardware error

2011-02-17 15:27:23
Subject: [Bacula-users] Help in decoding hardware error
From: Rory Campbell-Lange <rory AT campbell-lange DOT net>
To: bacula-users AT lists.sourceforge DOT net
Date: Thu, 17 Feb 2011 20:23:37 +0000
I'm having a frustrating time trying to work out why our Dell PV124T (LTO4
autoloader with a Quantum drive) is not working well at present.

Kernel 2.6.33-2-amd64 on Debian
Bacula 5.0.2-1~bpo50 (customised)

The latest in a set of failures (can't write to tapes, can read reliably
off tapes) is as follows (from my kernel log):

    [ 2517.323423] INFO: task bacula-sd:21475 blocked for more than 120 seconds.
    [ 2517.323477] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables 
this message.
    [ 2517.323559] bacula-sd     D 0000000000000000     0 21475      1 
0x00000000
    [ 2517.323562]  ffffffff8163e020 0000000000000086 0000000000000000 
0000000000000fe0
    [ 2517.323566]  ffff8803399e24d0 000000000000f8e0 ffff88032a8bdfd8 
0000000000015680
    [ 2517.323569]  0000000000015680 ffff88009d5f8000 ffff88009d5f82f0 
0000000039974000
    [ 2517.323572] Call Trace:
    [ 2517.323586]  [<ffffffffa0016809>] ? scsi_init_sgtable+0x3f/0x5a 
[scsi_mod]
    [ 2517.323590]  [<ffffffff812ed4cb>] ? schedule_timeout+0x2e/0xdd
    [ 2517.323593]  [<ffffffff81171918>] ? blk_peek_request+0x18b/0x19f
    [ 2517.323596]  [<ffffffff812ed366>] ? wait_for_common+0xde/0x15b
    [ 2517.323599]  [<ffffffff81042cc2>] ? default_wake_function+0x0/0x9
    [ 2517.323603]  [<ffffffffa01d609e>] ? st_do_scsi+0x2c4/0x2f4 [st]
    [ 2517.323607]  [<ffffffffa01d9642>] ? st_read+0x396/0x8f4 [st]
    [ 2517.323610]  [<ffffffff812ee88e>] ? common_interrupt+0xe/0x13
    [ 2517.323613]  [<ffffffff81040d61>] ? finish_task_switch+0x3a/0xa7
    [ 2517.323618]  [<ffffffff810ea675>] ? vfs_read+0xa6/0xff
    [ 2517.323620]  [<ffffffff810ea78a>] ? sys_read+0x45/0x6e
    [ 2517.323624]  [<ffffffff81008ac2>] ? system_call_fastpath+0x16/0x1b
    [ 2981.275435] st0: Error 50000 (driver bt 0x0, host bt 0x5).
    [ 2981.278175] st0: Sense Key : Aborted Command [current]
    [ 2981.278180] st0: Add. Sense: Tagged overlapped commands (task tag 0)
    [ 3880.828005] st0: Error 50000 (driver bt 0x0, host bt 0x5).

Can anyone tell me what this means, apart from "your restore job just failed"?

Many thanks
Rory

-- 
Rory Campbell-Lange
rory AT campbell-lange DOT net

------------------------------------------------------------------------------
The ultimate all-in-one performance toolkit: Intel(R) Parallel Studio XE:
Pinpoint memory and threading errors before they happen.
Find and fix more than 250 security defects in the development cycle.
Locate bottlenecks in serial and parallel code that limit performance.
http://p.sf.net/sfu/intel-dev2devfeb
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>