Bacula-users

Re: [Bacula-users] Unable to restore jobs created on one client, on another

2010-10-20 22:48:48
Subject: Re: [Bacula-users] Unable to restore jobs created on one client, on another
From: Dan Langille <dan AT langille DOT org>
To: Richard Scobie <richard AT sauce.co DOT nz>
Date: Wed, 20 Oct 2010 22:45:40 -0400
On 10/20/2010 9:24 PM, Richard Scobie wrote:
> Bacula 5.0.2 Linux.
>
> I have a number of Jobs, written to LTO4 tape on a host, backup1, which
> I am trying to restore from with the tapes loaded in a host called
> flash. Director is running on host library1.

The subject line appears misleading.  It makes me think you're trying to
take a backup of Box A and restore it to Box B.  This is rather
straightforward.

Instead, what you're saying is:

- backup of Box A was done to Storage Daemon backup1
- tape was then moved to Storage Daemon flash

Does the Director know that the tape has moved from one system to another?


> These restores are failing with either an error:
>
> "RestoreFiles.2010-10-21_13.54.01_08  is waiting on max Storage jobs",
> and there are no other jobs pending, or fail completely with "Fatal Error".
>
> All bacula fds,sds and director have been restarted a number of times.
> Below is a typical fatal error scenario:
>
> "restore jobid="345" client="backup1-fd_arc"
> file="?restore_476707713_16" done
>
> 1 file selected to be restored.
>
> Bootstrap records written to /var/bacula/library1-dir.restore.1.bsr
>
> The job will require the following
>      Volume(s)                 Storage(s)                SD Device(s)
> ===========================================================================
>
>       AR0030L4                  HP_MSL2024                HP_MSL2024
>
> NOTE: HP_MSL2024 is attached to backup1 and job is modified to set flash
> as the restore client and storage:

This reads right to me.

>
> .mod restoreclient="flash-fd_arc" fileset="Full Set"
> storage="flash_LTO-4" replace="always" when="2010-10-21 12:00:30"
> bootstrap="/var/bacula/library1-dir.restore.1.bsr"
> where="/mnt/storage1/share/bacula/restores" priority="10" yes
> Job queued. JobId=466
>
> NOTE: A "status" shows:
>
> Select daemon type for status (1-4): library1-dir Version: 5.0.2 (28
> April 2010) x86_64-unknown-linux-gnu redhat
> Daemon started 21-Oct-10 11:56, 0 Jobs run since started.
>    Heap: heap=135,168 smbytes=85,035 max_bytes=1,096,857 bufs=360
> max_bufs=365
> Full              Backup       11    22-Oct-10 09:10       BackupCatalog
>           Catalog2
>      466              RestoreFiles.2010-10-21_12.00.58_04    is waiting
> on Storage flash_LTO-4
>
> NOTE: the requested volume is then mounted:
>
> Select Storage resource (1-3): 3001 Mounted Volume: AR0030L4
> 3001 Device "flash_LTO-4" (/dev/nst0) is mounted with Volume "AR0030L4"
>
> NOTE: At this point the job fails with a fatal error.
>
> 21-Oct 12:01 library1-dir JobId 466: Start Restore Job
> RestoreFiles.2010-10-21_12.00.58_04
> 21-Oct 12:05 library1-dir JobId 466: Using Device "flash_LTO-4"
> 21-Oct 12:05 flash-sd JobId 466: Fatal error: No Volume names found for
> restore.
> 21-Oct 12:05 flash-fd JobId 466: Fatal error: job.c:2004 Bad response to
> Read Data command. Wanted 3000 OK data
> , got 3000 error
>
> 21-Oct 12:05 library1-dir JobId 466: Using Device "Drive-1"
> 21-Oct 12:05 flash-fd JobId 466: Fatal error: Failed to authenticate
> Storage daemon.
> 21-Oct 12:05 library1-dir JobId 466: Fatal error: Bad response to
> Storage command: wanted 2000 OK storage
> , got 2902 Bad storage
>
> 21-Oct 12:05 library1-dir JobId 466: Error: Bacula library1-dir 5.0.2
> (28Apr10): 21-Oct-2010 12:05:07
>     Build OS:               x86_64-unknown-linux-gnu redhat
>     JobId:                  466
>     Job:                    RestoreFiles.2010-10-21_12.00.58_04
>     Restore Client:         flash-fd_arc
>     Start time:             21-Oct-2010 12:01:00
>     End time:               21-Oct-2010 12:05:07
>     Files Expected:         1
>     Files Restored:         0
>     Bytes Restored:         0
>     Rate:                   0.0 KB/s
>     FD Errors:              0
>     FD termination status:
>     SD termination status:  Waiting on FD
>     Termination:            *** Restore Error ***
>
> 21-Oct 12:05 library1-dir JobId 466: Error: Bacula library1-dir 5.0.2
> (28Apr10): 21-Oct-2010 12:05:07
>     Build OS:               x86_64-unknown-linux-gnu redhat
>     JobId:                  466
>     Job:                    RestoreFiles.2010-10-21_12.00.58_04
>     Restore Client:         flash-fd_arc
>     Start time:             21-Oct-2010 12:01:00
>     End time:               21-Oct-2010 12:05:07
>     Files Expected:         1
>     Files Restored:         0
>     Bytes Restored:         0
>     Rate:                   0.0 KB/s
>     FD Errors:              1
>     FD termination status:
>     SD termination status:  Waiting on FD
>     Termination:            *** Restore Error ***
>
> I don't understand here why it attempts to use "Drive-1", which is the
> drive on the creation host, backup1.

My guess: because that's where the Catalog says it is?
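
If it helps to confirm this in a longer job log, here is a minimal sketch that lists which devices the Director reported for each JobId. It assumes only the "Using Device" line format quoted above; devices_per_job is a hypothetical helper name, not part of Bacula, and the sample log is trimmed from the output in this thread.

```python
import re
from collections import defaultdict

# Matches lines such as:
#   21-Oct 12:05 library1-dir JobId 466: Using Device "flash_LTO-4"
# (format assumed from the log excerpt quoted in this thread)
USING_DEVICE = re.compile(r'JobId (\d+): Using Device "([^"]+)"')


def devices_per_job(log_text):
    """Return {jobid: [device, ...]}, devices in order of first appearance."""
    seen = defaultdict(list)
    for match in USING_DEVICE.finditer(log_text):
        jobid, device = int(match.group(1)), match.group(2)
        if device not in seen[jobid]:
            seen[jobid].append(device)
    return dict(seen)


# Sample trimmed from the job output above (wrapped lines rejoined).
LOG = '''\
21-Oct 12:05 library1-dir JobId 466: Using Device "flash_LTO-4"
21-Oct 12:05 flash-sd JobId 466: Fatal error: No Volume names found for restore.
21-Oct 12:05 library1-dir JobId 466: Using Device "Drive-1"
'''

for jobid, devices in devices_per_job(LOG).items():
    if len(devices) > 1:
        print(f"JobId {jobid} was handed more than one device: {devices}")
```

A job that shows up with two devices, as JobId 466 does here, suggests the restore was directed at both the storage you selected and the one recorded for the original backup.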

>
> Am I missing something basic here? All the jobs tried restore OK on
> backup1, where they were created.
>
> Regards,
>
> Richard
>
>
> ------------------------------------------------------------------------------
> Nokia and AT&T present the 2010 Calling All Innovators-North America contest
> Create new apps&  games for the Nokia N8 for consumers in  U.S. and Canada
> $10 million total in prizes - $4M cash, 500 devices, nearly $6M in marketing
> Develop with Nokia Qt SDK, Web Runtime, or Java and Publish to Ovi Store
> http://p.sf.net/sfu/nokia-dev2dev
> _______________________________________________
> Bacula-users mailing list
> Bacula-users AT lists.sourceforge DOT net
> https://lists.sourceforge.net/lists/listinfo/bacula-users
>


-- 
Dan Langille - http://langille.org/
