Bacula-users

[Bacula-users] Unable to restore jobs created on one client, on another

2010-10-20 21:27:49
Subject: [Bacula-users] Unable to restore jobs created on one client, on another
From: Richard Scobie <richard AT sauce.co DOT nz>
To: bacula-users <bacula-users AT lists.sourceforge DOT net>
Date: Thu, 21 Oct 2010 14:24:35 +1300
Bacula 5.0.2 Linux.

I have a number of Jobs, written to LTO4 tape on a host, backup1, which 
I am trying to restore from  with the tapes loaded in a host called 
flash. Director is running on host library1.

These restores are failing with either an error:

"RestoreFiles.2010-10-21_13.54.01_08    is waiting on max Storage jobs", 
and there are no other jobs pending, or fail completely with "Fatal Error".

All bacula fds,sds and director have been restarted a number of times. 
Below is a typical fatal error scenario:

"restore jobid="345" client="backup1-fd_arc" 
file="?restore_476707713_16" done

1 file selected to be restored.

Bootstrap records written to /var/bacula/library1-dir.restore.1.bsr

The job will require the following
    Volume(s)                 Storage(s)                SD Device(s)
===========================================================================

     AR0030L4                  HP_MSL2024                HP_MSL2024

NOTE: HP_MSL2024 is attached to backup1 and job is modified to set flash 
as the restore client and storage:

.mod restoreclient="flash-fd_arc" fileset="Full Set" 
storage="flash_LTO-4" replace="always" when="2010-10-21 12:00:30" 
bootstrap="/var/bacula/library1-dir.restore.1.bsr" 
where="/mnt/storage1/share/bacula/restores" priority="10" yes
Job queued. JobId=466

NOTE: A "status" shows:

Select daemon type for status (1-4): library1-dir Version: 5.0.2 (28 
April 2010) x86_64-unknown-linux-gnu redhat
Daemon started 21-Oct-10 11:56, 0 Jobs run since started.
  Heap: heap=135,168 smbytes=85,035 max_bytes=1,096,857 bufs=360 
max_bufs=365
Full              Backup       11    22-Oct-10 09:10       BackupCatalog 
         Catalog2
    466              RestoreFiles.2010-10-21_12.00.58_04    is waiting 
on Storage flash_LTO-4

NOTE: the requested volume is then mounted:

Select Storage resource (1-3): 3001 Mounted Volume: AR0030L4
3001 Device "flash_LTO-4" (/dev/nst0) is mounted with Volume "AR0030L4"

NOTE: At this point the job fails with a fatal error.

21-Oct 12:01 library1-dir JobId 466: Start Restore Job 
RestoreFiles.2010-10-21_12.00.58_04
21-Oct 12:05 library1-dir JobId 466: Using Device "flash_LTO-4"
21-Oct 12:05 flash-sd JobId 466: Fatal error: No Volume names found for 
restore.
21-Oct 12:05 flash-fd JobId 466: Fatal error: job.c:2004 Bad response to 
Read Data command. Wanted 3000 OK data
, got 3000 error

21-Oct 12:05 library1-dir JobId 466: Using Device "Drive-1"
21-Oct 12:05 flash-fd JobId 466: Fatal error: Failed to authenticate 
Storage daemon.
21-Oct 12:05 library1-dir JobId 466: Fatal error: Bad response to 
Storage command: wanted 2000 OK storage
, got 2902 Bad storage

21-Oct 12:05 library1-dir JobId 466: Error: Bacula library1-dir 5.0.2 
(28Apr10): 21-Oct-2010 12:05:07
   Build OS:               x86_64-unknown-linux-gnu redhat
   JobId:                  466
   Job:                    RestoreFiles.2010-10-21_12.00.58_04
   Restore Client:         flash-fd_arc
   Start time:             21-Oct-2010 12:01:00
   End time:               21-Oct-2010 12:05:07
   Files Expected:         1
   Files Restored:         0
   Bytes Restored:         0
   Rate:                   0.0 KB/s
   FD Errors:              0
   FD termination status:
   SD termination status:  Waiting on FD
   Termination:            *** Restore Error ***

21-Oct 12:05 library1-dir JobId 466: Error: Bacula library1-dir 5.0.2 
(28Apr10): 21-Oct-2010 12:05:07
   Build OS:               x86_64-unknown-linux-gnu redhat
   JobId:                  466
   Job:                    RestoreFiles.2010-10-21_12.00.58_04
   Restore Client:         flash-fd_arc
   Start time:             21-Oct-2010 12:01:00
   End time:               21-Oct-2010 12:05:07
   Files Expected:         1
   Files Restored:         0
   Bytes Restored:         0
   Rate:                   0.0 KB/s
   FD Errors:              1
   FD termination status:
   SD termination status:  Waiting on FD
   Termination:            *** Restore Error ***

I don't understand here why it attempts to use "Drive-1", which is the 
drive on the creation host, backup1.

Am I missing something basic here? All the jobs tried restore OK on 
backup1, where they were created.

Regards,

Richard


------------------------------------------------------------------------------
Nokia and AT&T present the 2010 Calling All Innovators-North America contest
Create new apps & games for the Nokia N8 for consumers in  U.S. and Canada
$10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing
Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store 
http://p.sf.net/sfu/nokia-dev2dev
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users