Bacula-users

Re: [Bacula-users] SCSI Errors

2012-01-23 18:18:36
Subject: Re: [Bacula-users] SCSI Errors
From: John Drescher <drescherjm AT gmail DOT com>
To: Nikola Lazic <nl AT vpi DOT us>
Date: Mon, 23 Jan 2012 18:16:52 -0500

On Mon, Jan 23, 2012 at 6:08 PM, Nikola Lazic <nl AT vpi DOT us> wrote:
> I have Bacula 5.0.3 on FreeBSD 8.2 backing up to an IBM Ultrium ULT3580 tape
> drive, using 400~800GB tapes, connected to an Adaptec 39320LPE Ultra320 SCSI
> adapter. I'm using SQLite and have 4GB of RAM.
>
> I used to have a Quantum DLTV4 tape drive, but I was getting SCSI errors
> every few weeks, so I switched to the Ultrium.
>
> I've used 3 other tapes in this drive with no issues. This tape fails:
>
> # cat /var/log/messages
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): WRITE FILEMARKS(6). CDB: 10 0
> 0 0 1 0
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): CAM status: SCSI Status Error
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): SCSI status: Check Condition
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): SCSI sense: MEDIUM ERROR
> asc:31,0 (Medium format corrupted) field replaceable unit: 30
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): WRITE FILEMARKS(6). CDB: 10 0
> 0 0 2 0
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): CAM status: SCSI Status Error
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): SCSI status: Check Condition
> Jan 21 00:44:00 bsd kernel: (sa0:ahd0:0:6:0): SCSI sense: MEDIUM ERROR
> asc:31,0 (Medium format corrupted) field replaceable unit: 30
>
> # cat /var/db/bacula/log
> 21-Jan 00:43 bacula-server-sd JobId 1070: Committing spooled data to Volume
> "BSD2a". Despooling 9,641,289,685 bytes ...
> 21-Jan 00:43 bacula-server-sd JobId 1070: End of Volume "BSD2a" at 21:7164
> on device "ULT3580" (/dev/nsa0). Write of 64512 bytes got 0.
> 21-Jan 00:44 bacula-server-sd JobId 1070: Error: Error writing final EOF to
> tape. This Volume may not be readable.
> dev.c:1745 ioctl MTWEOF error on "ULT3580" (/dev/nsa0). ERR=Input/output
> error.
> 21-Jan 00:44 bacula-server-sd JobId 1070: End of medium on Volume "BSD2a"
> Bytes=13,968,912,384 Blocks=216,531 at 21-Jan-2012 00:44.
> 21-Jan 00:44 bacula-server-sd JobId 1070: Job nti-bsd.2012-01-20_23.05.00_26
> is waiting. Cannot find any appendable volumes.
> Please use the "label" command to create a new Volume for:
>    Storage:      "ULT3580" (/dev/nsa0)
>    Pool:         Ultium
>    Media type:   LTO4
>
> I've tried migrating the jobs from the tape, but I get:
> # cat /var/db/bacula/log
> 23-Jan 15:34 bacula-server-sd JobId 1079: Forward spacing Volume "BSD2a" to
> file:block 20:0.
> 23-Jan 15:35 bacula-server-sd JobId 1079: Error: block.c:1002 Read error on
> fd=5 at file:blk 21:7163 on device "ULT3580" (/dev/nsa0). ERR=Operation not
> permitted.
> 23-Jan 15:35 bacula-server-sd JobId 1079: Error: Unexpected Tape is Off-line
> 23-Jan 15:35 bacula-server-dir JobId 1079: Error: Bacula bacula-server-dir
> 5.0.3 (04Aug10): 23-Jan-2012 15:35:54
>
> I was able to run a multi-tape btape fill test on this tape with no write
> errors. I haven't completed the read test yet.
>
> Here's a sample error from my old Quantum DLTV4 drive, since I feel like
> these might be connected.
>
> Dec 14 03:34:35 bsd kernel: (sa0:ahd0:0:5:0): SCSI status: Check Condition
> Dec 14 03:34:35 bsd kernel: (sa0:ahd0:0:5:0): SCSI sense: UNIT ATTENTION
> csi:e0,40,0,2e asc:29,3 (Bus device reset function occurred)
> Dec 14 05:34:46 bsd kernel: ahd0: Recovery Initiated - Card was not paused
>
> Other things I've done:
> * Replaced the SCSI adapter
> * Replaced the SCSI cable and terminator
>
> Here's the current SCSI adapter info
> # dmesg
> ahd0: <Adaptec 39320LPE Ultra320 SCSI adapter> port
> 0xe400-0xe4ff,0xe800-0xe8ff mem 0xfbd7e000-0xfbd7ffff irq 16 at device 4.0
> on pci4
> ahd0: [ITHREAD]
> aic7902: Ultra320 Wide Channel A, SCSI Id=7, PCI-X 101-133MHz, 512 SCBs
>
> The only other thing I haven't done is move to MySQL, since my bacula.db is
> about 3.6 GB, holding ~18M files in 332 jobs:
>
> select sum(JobFiles) from job;
> +---------------+
> | sum(JobFiles) |
> +---------------+
> | 18685071      |
> +---------------+
> Enter SQL query: select count(*) from job;
> +----------+
> | count(*) |
> +----------+
> | 332      |
> +----------+
>
> I have no idea where else to look. Any help is appreciated!
>

Bad tape? Does the drive need cleaning?
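
To make the kernel messages quoted above easier to read, here is a rough sketch (not part of Bacula or FreeBSD) that pulls the CDB and sense data out of lines like those in the /var/log/messages excerpt. The function names and regular expressions are made up for illustration, and the sample lines are the quoted ones with the mail wrapping undone. It assumes the standard WRITE FILEMARKS(6) layout: opcode 0x10, IMMED in bit 0 of byte 1, and the filemark count in bytes 2-4.

```python
import re

# Lines from the quoted /var/log/messages excerpt, with the mail
# client's line wrapping undone and the syslog prefix dropped.
LOG = """\
(sa0:ahd0:0:6:0): WRITE FILEMARKS(6). CDB: 10 0 0 0 1 0
(sa0:ahd0:0:6:0): SCSI sense: MEDIUM ERROR asc:31,0 (Medium format corrupted) field replaceable unit: 30
(sa0:ahd0:0:6:0): WRITE FILEMARKS(6). CDB: 10 0 0 0 2 0
"""

# Hypothetical patterns written for this log format only.
CDB_RE = re.compile(r"CDB: ((?:[0-9a-fA-F]{1,2} ?)+)")
SENSE_RE = re.compile(
    r"SCSI sense: ([A-Z ]+?) asc:([0-9a-fA-F]+),([0-9a-fA-F]+) \((.*?)\)"
)


def decode_write_filemarks(cdb_text):
    """Decode a 6-byte WRITE FILEMARKS CDB printed as hex bytes."""
    cdb = [int(b, 16) for b in cdb_text.split()]
    if len(cdb) != 6 or cdb[0] != 0x10:
        raise ValueError("not a WRITE FILEMARKS(6) CDB")
    # Bytes 2..4 hold the number of filemarks, most significant byte first.
    count = (cdb[2] << 16) | (cdb[3] << 8) | cdb[4]
    return {"opcode": cdb[0], "immed": bool(cdb[1] & 0x01), "filemarks": count}


def summarize(log):
    """Return a list of ("cdb", ...) and ("sense", ...) events in log order."""
    events = []
    for line in log.splitlines():
        m = CDB_RE.search(line)
        if m:
            events.append(("cdb", decode_write_filemarks(m.group(1))))
        m = SENSE_RE.search(line)
        if m:
            key, asc, ascq, text = m.groups()
            # The kernel prints ASC and ASCQ in hexadecimal.
            events.append(("sense", {
                "key": key,
                "asc": int(asc, 16),
                "ascq": int(ascq, 16),
                "text": text,
            }))
    return events


if __name__ == "__main__":
    for kind, info in summarize(LOG):
        print(kind, info)
```

Read this way, the log shows the drive rejecting a request to write one filemark and then a request to write two, both with a MEDIUM ERROR on ASC/ASCQ 31h/00h, which points at the tape or the heads rather than the SCSI bus.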

John

