Bacula-users

Re: [Bacula-users] Catastrophic error. Cannot write overflow block to device "LTO4"

2011-07-10 14:02:05
From: Dan Langille <dan AT langille DOT org>
To: stevecs AT chaven DOT com
Date: Sun, 10 Jul 2011 13:58:31 -0400
On Jul 10, 2011, at 8:17 AM, Steve Costaras wrote:

> 
> 
> I am trying a full backup/multi-job to a single client and all was going well 
> until this morning when I received the error below.   All other jobs were 
> also canceled.  
> 
> My question is twofold:
> 
> 1) What the heck is this error? I can unmount the drive and run a rawfill 
> on the tape with btape without any problems.

I don't know.  Perhaps someone else will.

> 
> 2) Since everything is spooled first, there should be NO error that cancels 
> a job. A tape drive could fail, a tape could burst into flame; all that 
> should be needed is for Bacula to know there was an issue and ask the admin 
> a simple question: do you want to fix the issue or cancel? The admin would 
> then fix the problem and tell Bacula to restart from the last block that 
> was stored successfully or, if need be, from the beginning of the spooled 
> data file.

This I do know.  Although it seems easy to do at first glance, it is not.  
If it were trivial to do, I assure you, it would already be in place.

> Canceling jobs that run for days for TBs of data is just screwed up.

I suggest running smaller jobs.  I don't mean to sound trite, but that really 
is the solution.  Given that the alternative is non-trivial, the sensible 
choice is, I'm afraid, to cancel the job.
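
As a rough illustration of what "smaller jobs" could look like, here is a 
minimal sketch that splits one large FileSet into two Jobs.  The client, pool, 
and storage names are taken from the job report quoted below; the base path is 
inferred from the FileSet name, and the part1/part2 directories, the new 
Job/FileSet names, and the "Standard" Messages resource are hypothetical 
placeholders, not anything from the original configuration.

```
# Sketch only: adjust names and paths to the real configuration.
Job {
  Name = "JOB-loki_DVD_part1"          # hypothetical name
  Type = Backup
  Level = Full
  Client = "FD-loki"
  FileSet = "FS-loki_DVD_part1"
  Pool = "BackupSetFA"
  Storage = "LTO4"
  Messages = Standard                  # assumed Messages resource name
  Spool Data = yes
}

Job {
  Name = "JOB-loki_DVD_part2"          # hypothetical name
  Type = Backup
  Level = Full
  Client = "FD-loki"
  FileSet = "FS-loki_DVD_part2"
  Pool = "BackupSetFA"
  Storage = "LTO4"
  Messages = Standard                  # assumed Messages resource name
  Spool Data = yes
}

FileSet {
  Name = "FS-loki_DVD_part1"
  Include {
    Options { signature = MD5 }
    File = /var/ftp/pub/Multimedia/DVD/part1   # hypothetical subdirectory
  }
}

FileSet {
  Name = "FS-loki_DVD_part2"
  Include {
    Options { signature = MD5 }
    File = /var/ftp/pub/Multimedia/DVD/part2   # hypothetical subdirectory
  }
}
```

With a split like this, a tape error should cost only the job that was 
writing at the time rather than the whole multi-day run.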

> 
> Steve 
> 
> ----
> 3000 OK label. VolBytes=1024 DVD=0 Volume="FA0016" Device="LTO4" (/dev/nst0)
> Requesting to mount LTO4 ...
> 3905 Bizarre wait state 7
> Do not forget to mount the drive!!!
> 2011-07-10 03SD-loki JobId 6: Wrote label to prelabeled Volume "FA0016" on 
> device "LTO4" (/dev/nst0)
> 2011-07-10 03SD-loki JobId 6: New volume "FA0016" mounted on device "LTO4" 
> (/dev/nst0) at 10-Jul-2011 03:51.
> 2011-07-10 03SD-loki JobId 6: Fatal error: block.c:439 Attempt to write on 
> read-only Volume. dev="LTO4" (/dev/nst0)
> 2011-07-10 03SD-loki JobId 6: End of medium on Volume "FA0016" Bytes=1,024 
> Blocks=0 at 10-Jul-2011 03:51.
> 2011-07-10 03SD-loki JobId 6: Fatal error: Job 6 canceled.
> 2011-07-10 03SD-loki JobId 6: Fatal error: device.c:192 Catastrophic error. 
> Cannot write overflow block to device "LTO4" (/dev/nst0). ERR=Input/output 
> error
> 
> *
> 2011-07-10 03SD-loki JobId 6: Despooling elapsed time = 02:32:53, Transfer 
> rate = 93.64 M Bytes/second
> 2011-07-10 03SD-loki JobId 6: Job write elapsed time = 57:37:54, Transfer 
> rate = 8.278 M Bytes/second
> 2011-07-10 03FD-loki JobId 6: Error: bsock.c:393 Write error sending 65536 
> bytes to Storage daemon:loki:9103: ERR=Connection reset by peer
> 2011-07-10 03FD-loki JobId 6: Fatal error: backup.c:1024 Network send error 
> to SD. ERR=Connection reset by peer
> 2011-07-10 03SD-loki JobId 7: Fatal error: block.c:439 Attempt to write on 
> read-only Volume. dev="LTO4" (/dev/nst0)
> 2011-07-10 03SD-loki JobId 7: Fatal error: spool.c:301 Fatal append error on 
> device "LTO4" (/dev/nst0): ERR=block.c:1015 Read zero bytes at 0:0 on device 
> "LTO4" (/dev/nst0).
> 
> 2011-07-10 03SD-loki JobId 7: Despooling elapsed time = 00:00:01, Transfer 
> rate = 858.9 G Bytes/second
> *
> 2011-07-10 03DIR-loki JobId 6: Error: Bacula DIR-loki 5.0.3 (04Aug10): 10-Jul-2011 03:52:08
>  Build OS:               x86_64-unknown-linux-gnu ubuntu 10.04
>  JobId:                  6
>  Job:                    JOB-loki_var_ftp_pub_Multimedia_DVD.2011-07-07_17.45.01_08
>  Backup Level:           Full
>  Client:                 "FD-loki" 5.0.3 (04Aug10) x86_64-unknown-linux-gnu,ubuntu,10.04
>  FileSet:                "FS-loki_var_ftp_pub_Multimedia_DVD" 2011-07-06 18:00:01
>  Pool:                   "BackupSetFA" (From Run FullPool override)
>  Catalog:                "MyCatalog" (From Client resource)
>  Storage:                "LTO4" (From Pool resource)
>  Scheduled time:         07-Jul-2011 17:45:01
>  Start time:             07-Jul-2011 17:50:30
>  End time:               10-Jul-2011 03:52:08
>  Elapsed time:           2 days 10 hours 1 min 38 secs
>  Priority:               50
>  FD Files Written:       452
>  SD Files Written:       452
>  FD Bytes Written:       1,717,640,639,816 (1.717 TB)
>  SD Bytes Written:       1,717,632,388,872 (1.717 TB)
>  Rate:                   8222.4 KB/s
>  Software Compression:   None
>  VSS:                    no
>  Encryption:             no
>  Accurate:               yes
>  Volume name(s):         FA0011|FA0012|FA0015
>  Volume Session Id:      6
>  Volume Session Time:    1310078212
>  Last Volume Bytes:      1,024 (1.024 KB)
>  Non-fatal FD errors:    1
>  SD Errors:              0
>  FD termination status:  Error
>  SD termination status:  Error
>  Termination:            *** Backup Error ***
> -------
> 
> _______________________________________________
> Bacula-users mailing list
> Bacula-users AT lists.sourceforge DOT net
> https://lists.sourceforge.net/lists/listinfo/bacula-users

-- 
Dan Langille - http://langille.org

