[Bacula-users] Catastrophic error. Cannot write overflow block to device "LTO4"

2011-07-10 08:20:11
From: "Steve Costaras" <stevecs AT chaven DOT com>
To: bacula-users AT lists.sourceforge DOT net
Date: Sun, 10 Jul 2011 12:17:55 +0000

I am running a full backup (multiple jobs) of a single client, and all was
going well until this morning, when I received the error below. All other jobs
were also canceled.

My question is twofold:

1) What the heck is this error? I can unmount the drive and issue a rawfill to
the tape with btape without any problems.

2) Since everything is spooled first, there should be NO error that cancels a
job. A tape drive could fail, a tape could burst into flame; all that would be
needed is for Bacula to know that there was an issue and ask the admin a
simple question: do you want to fix the issue or cancel? The admin would then
fix the problem, and Bacula would be told to restart from the last block that
was stored successfully or, if need be, from the beginning of the spooled data
file.
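
To make the behavior requested in (2) concrete, here is a minimal Python
sketch. It is not how Bacula works and uses no Bacula API; `TapeError`,
`despool`, `write_block` and `ask_operator` are hypothetical names invented
for illustration. The idea is only that the spool file stays on disk, so a
failed write can be retried from the last confirmed offset instead of
canceling the job.

```python
import io


class TapeError(Exception):
    """Hypothetical stand-in for any device failure (I/O error, read-only tape)."""


def despool(spool, write_block, ask_operator, block_size=65536, start=0):
    """Copy a spool file to tape block by block.

    On a write failure the operator is asked whether to retry. Returns
    (offset, done): offset is the first byte not yet confirmed on tape,
    done is True only when the whole spool file has been written.
    """
    offset = start
    while True:
        spool.seek(offset)
        block = spool.read(block_size)
        if not block:
            return offset, True       # nothing left to write
        try:
            write_block(block)
        except TapeError as err:
            if ask_operator(f"{err} at spool offset {offset}: retry?"):
                continue              # problem fixed; retry the same block
            return offset, False      # canceled; offset is the resume point
        offset += len(block)          # advance only after a confirmed write


if __name__ == "__main__":
    tape = []
    offset, done = despool(io.BytesIO(b"spooled data"), tape.append,
                           lambda msg: False, block_size=4)
    print(offset, done)
```

Because the offset only advances after a successful write, a later call with
`start=offset` picks up where the failed run stopped.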

Canceling jobs that run for days and cover TBs of data is just screwed up.

Steve 

----
3000 OK label. VolBytes=1024 DVD=0 Volume="FA0016" Device="LTO4" (/dev/nst0)
Requesting to mount LTO4 ...
3905 Bizarre wait state 7
Do not forget to mount the drive!!!
2011-07-10 03SD-loki JobId 6: Wrote label to prelabeled Volume "FA0016" on 
device "LTO4" (/dev/nst0)
2011-07-10 03SD-loki JobId 6: New volume "FA0016" mounted on device "LTO4" 
(/dev/nst0) at 10-Jul-2011 03:51.
2011-07-10 03SD-loki JobId 6: Fatal error: block.c:439 Attempt to write on 
read-only Volume. dev="LTO4" (/dev/nst0)
2011-07-10 03SD-loki JobId 6: End of medium on Volume "FA0016" Bytes=1,024 
Blocks=0 at 10-Jul-2011 03:51.
2011-07-10 03SD-loki JobId 6: Fatal error: Job 6 canceled.
2011-07-10 03SD-loki JobId 6: Fatal error: device.c:192 Catastrophic error. 
Cannot write overflow block to device "LTO4" (/dev/nst0). ERR=Input/output error

*
2011-07-10 03SD-loki JobId 6: Despooling elapsed time = 02:32:53, Transfer rate 
= 93.64 M Bytes/second
2011-07-10 03SD-loki JobId 6: Job write elapsed time = 57:37:54, Transfer rate 
= 8.278 M Bytes/second
2011-07-10 03FD-loki JobId 6: Error: bsock.c:393 Write error sending 65536 
bytes to Storage daemon:loki:9103: ERR=Connection reset by peer
2011-07-10 03FD-loki JobId 6: Fatal error: backup.c:1024 Network send error to 
SD. ERR=Connection reset by peer
2011-07-10 03SD-loki JobId 7: Fatal error: block.c:439 Attempt to write on 
read-only Volume. dev="LTO4" (/dev/nst0)
2011-07-10 03SD-loki JobId 7: Fatal error: spool.c:301 Fatal append error on 
device "LTO4" (/dev/nst0): ERR=block.c:1015 Read zero bytes at 0:0 on device 
"LTO4" (/dev/nst0).

2011-07-10 03SD-loki JobId 7: Despooling elapsed time = 00:00:01, Transfer rate 
= 858.9 G Bytes/second
*
2011-07-10 03DIR-loki JobId 6: Error: Bacula DIR-loki 5.0.3 (04Aug10): 
10-Jul-2011 03:52:08
  Build OS:               x86_64-unknown-linux-gnu ubuntu 10.04
  JobId:                  6
  Job:                    JOB-loki_var_ftp_pub_Multimedia_DVD.2011-07-07_17.45.01_08
  Backup Level:           Full
  Client:                 "FD-loki" 5.0.3 (04Aug10) x86_64-unknown-linux-gnu,ubuntu,10.04
  FileSet:                "FS-loki_var_ftp_pub_Multimedia_DVD" 2011-07-06 18:00:01
  Pool:                   "BackupSetFA" (From Run FullPool override)
  Catalog:                "MyCatalog" (From Client resource)
  Storage:                "LTO4" (From Pool resource)
  Scheduled time:         07-Jul-2011 17:45:01
  Start time:             07-Jul-2011 17:50:30
  End time:               10-Jul-2011 03:52:08
  Elapsed time:           2 days 10 hours 1 min 38 secs
  Priority:               50
  FD Files Written:       452
  SD Files Written:       452
  FD Bytes Written:       1,717,640,639,816 (1.717 TB)
  SD Bytes Written:       1,717,632,388,872 (1.717 TB)
  Rate:                   8222.4 KB/s
  Software Compression:   None
  VSS:                    no
  Encryption:             no
  Accurate:               yes
  Volume name(s):         FA0011|FA0012|FA0015
  Volume Session Id:      6
  Volume Session Time:    1310078212
  Last Volume Bytes:      1,024 (1.024 KB)
  Non-fatal FD errors:    1
  SD Errors:              0
  FD termination status:  Error
  SD termination status:  Error
  Termination:            *** Backup Error ***
-------
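
The rates in the report above can be cross-checked from the byte counts and
elapsed times it prints. The following is a small Python sketch, assuming
decimal units (1 MB = 10^6 bytes, 1 KB = 10^3 bytes), and assuming that the
storage daemon's rate is based on SD bytes and the Director's rate on FD
bytes. Neither assumption is stated in the log.

```python
from datetime import timedelta

# Figures copied from the job report above.
fd_bytes = 1_717_640_639_816   # "FD Bytes Written"
sd_bytes = 1_717_632_388_872   # "SD Bytes Written"

# "Job write elapsed time = 57:37:54" (storage daemon)
write_elapsed = timedelta(hours=57, minutes=37, seconds=54)
# "Elapsed time: 2 days 10 hours 1 min 38 secs" (Director summary)
job_elapsed = timedelta(days=2, hours=10, minutes=1, seconds=38)

# Assumption: decimal units, as described in the lead-in.
sd_rate_mb = sd_bytes / write_elapsed.total_seconds() / 1e6
dir_rate_kb = fd_bytes / job_elapsed.total_seconds() / 1e3

print(f"SD job write rate: {sd_rate_mb:.3f} MB/s (log reports 8.278)")
print(f"Director rate:     {dir_rate_kb:.1f} KB/s (log reports 8222.4)")
```

Both results land close to the logged values, so the job averaged roughly
8 MB/s over its whole run even though the despooling pass itself is reported
at 93.64 MB/s.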


