Bacula-users

[Bacula-users] copy-job failed on big jobs (other backups use the source jukebox during volume change)

2014-07-30 05:52:36
Subject: [Bacula-users] copy-job failed on big jobs (other backups use the source jukebox during volume change)
From: Udo Lembke <udo.lembke AT albertbauer DOT com>
To: "bacula-users AT lists.sourceforge DOT net" <bacula-users AT lists.sourceforge DOT net>
Date: Wed, 30 Jul 2014 11:50:17 +0200
Hi,
I try to use copy-jobs with an neo200s (2 drives,with logical partition:
2 jukeboxes with one drive each).
Bacula is version 5.2.10.
If I run an copyjob with 4.8 TB the job fails during volume-change on
the normal (source) backup-jukebox.
Both jobs failed at the same time (2:00):

########### copy-job ###########
29-Jul 11:37 backup-srv-dir JobId 2475: The following 1 JobId was chosen
to be copied: 2452
29-Jul 11:37 backup-srv-dir JobId 2475: Copying using JobId=2452
Job=fileserver.2014-07-26_21.35.00_43
29-Jul 11:37 backup-srv-dir JobId 2475: Bootstrap records written to
/var/bacula/working/backup-srv-dir.restore.3.bsr
29-Jul 11:37 backup-srv-dir JobId 2475: Start Copying JobId 2475,
Job=Copy-To-Offsite.2014-07-29_11.37.21_38
29-Jul 11:37 backup-srv-dir JobId 2475: Using Device "LTO6-Drive-2"
29-Jul 11:37 backup-srv-sd JobId 2475: 3307 Issuing autochanger "unload
slot 12, drive 0" command.
29-Jul 11:38 backup-srv-sd JobId 2475: 3304 Issuing autochanger "load
slot 2, drive 0" command.
29-Jul 11:39 backup-srv-sd JobId 2475: 3305 Autochanger "load slot 2,
drive 0", status is OK.
29-Jul 11:39 backup-srv-sd JobId 2475: Ready to read from volume
"BM0014L6" on device "LTO6-Drive-1" (/dev/nst0).
29-Jul 11:39 backup-srv-sd JobId 2475: 3307 Issuing autochanger "unload
slot 2, drive 0" command.
29-Jul 11:40 backup-srv-sd JobId 2475: 3304 Issuing autochanger "load
slot 3, drive 0" command.
29-Jul 11:41 backup-srv-sd JobId 2475: 3305 Autochanger "load slot 3,
drive 0", status is OK.
29-Jul 11:41 backup-srv-sd JobId 2475: Wrote label to prelabeled Volume
"BK0008L6" on device "LTO6-Drive-2" (/dev/nst1)
29-Jul 11:41 backup-srv-sd JobId 2475: Forward spacing Volume "BM0014L6"
to file:block 79:1.
30-Jul 01:55 backup-srv-sd JobId 2475: End of Volume at file 530 on
device "LTO6-Drive-1" (/dev/nst0), Volume "BM0014L6"
30-Jul 01:55 backup-srv-sd JobId 2475: 3307 Issuing autochanger "unload
slot 2, drive 0" command.
30-Jul 01:58 backup-srv-sd JobId 2475: 3304 Issuing autochanger "load
slot 6, drive 0" command.
30-Jul 01:58 backup-srv-sd JobId 2475: 3305 Autochanger "load slot 6,
drive 0", status is OK.
30-Jul 01:58 backup-srv-sd JobId 2475: Ready to read from volume
"BM0019L6" on device "LTO6-Drive-1" (/dev/nst0).
30-Jul 01:58 backup-srv-sd JobId 2475: Forward spacing Volume "BM0019L6"
to file:block 0:1.
30-Jul 02:00 backup-srv-sd JobId 2475: Error: block.c:1001 Read error on
fd=4 at file:blk 2:0 on device "LTO6-Drive-1" (/dev/nst0).
ERR=Input/output error.
30-Jul 02:00 backup-srv-sd JobId 2475: End of Volume at file 2 on device
"LTO6-Drive-1" (/dev/nst0), Volume "BM0019L6"
30-Jul 02:00 backup-srv-sd JobId 2475: Fatal error: acquire.c:71 Acquire
read: num_writers=1 not zero. Job 2475 canceled.
30-Jul 02:00 backup-srv-sd JobId 2475: Fatal error: mount.c:865 Cannot
open Dev="LTO6-Drive-1" (/dev/nst0), Vol=BM0019L6
30-Jul 02:00 backup-srv-sd JobId 2475: End of all volumes.
30-Jul 02:00 backup-srv-sd JobId 2475: Fatal error: mac.c:127 Fatal
append error on device "LTO6-Drive-2" (/dev/nst1): ERR=block.c:1001 Read
error on fd=6 at file:blk 0:0 on device "LTO6-Drive-2" (/dev/nst1).
ERR=Input/output error.


########### normal-job ###########
29-Jul 21:35 backup-srv-dir JobId 2477: Start Backup JobId 2477,
Job=backup-srv.2014-07-29_21.35.00_40
30-Jul 01:56 backup-srv-dir JobId 2477: Using Device "LTO6-Drive-1"
30-Jul 01:58 backup-srv-sd JobId 2477: 3307 Issuing autochanger "unload
slot 6, drive 0" command.
30-Jul 01:59 backup-srv-sd JobId 2477: 3304 Issuing autochanger "load
slot 12, drive 0" command.
30-Jul 02:00 backup-srv-sd JobId 2477: 3305 Autochanger "load slot 12,
drive 0", status is OK.
30-Jul 02:00 backup-srv-sd JobId 2477: Volume "BM0021L6" previously
written, moving to end of data.
30-Jul 02:00 backup-srv-sd JobId 2477: Ready to append to end of Volume
"BM0021L6" at file=2.
30-Jul 02:00 backup-srv-sd JobId 2477: Spooling data ...
30-Jul 02:00 backup-srv-sd JobId 2477: Job write elapsed time =
00:00:01, Transfer rate = 0  Bytes/second
30-Jul 02:00 backup-srv-sd JobId 2477: Committing spooled data to Volume
"BM0021L6". Despooling 452 bytes ...
30-Jul 02:00 backup-srv-sd JobId 2477: Fatal error: block.c:439 Attempt
to write on read-only Volume. dev="LTO6-Drive-1" (/dev/nst0)
30-Jul 02:00 backup-srv-sd JobId 2477: Fatal error: spool.c:301 Fatal
append error on device "LTO6-Drive-1" (/dev/nst0): ERR=block.c:1001 Read
error on fd=4 at file:blk 2:0 on device "LTO6-Drive-1" (/dev/nst0).
ERR=Input/output error.


The drive has "Maximum Concurrent Jobs = 15" because of multible backup
jobs (sequential due spool = yes).
If I must change the Concurrent Jobs to 1, I guess I run in trouble
during normal backup (backup-window).

But why don't prevent the copy-job to using the source-jukebox for other
jobs??

Any hints??

Regards

Udo

------------------------------------------------------------------------------
Infragistics Professional
Build stunning WinForms apps today!
Reboot your WinForms applications with our WinForms controls. 
Build a bridge from your legacy apps to the future.
http://pubads.g.doubleclick.net/gampad/clk?id=153845071&iu=/4140/ostg.clktrk
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>
  • [Bacula-users] copy-job failed on big jobs (other backups use the source jukebox during volume change), Udo Lembke <=