Bacula-users

Re: [Bacula-users] Error talking to remote storage daemon

2017-07-31 14:26:28
Subject: Re: [Bacula-users] Error talking to remote storage daemon
From: Steve Garcia <sgarcia AT bak.rr DOT com>
To: Ana Emília M. Arruda <emiliaarruda AT gmail DOT com>
Date: Mon, 31 Jul 2017 18:25:30 +0000
---- "Ana Emília M. Arruda" <emiliaarruda AT gmail DOT com> wrote: 
> Hi Steve,
> 
> Sorry, my mistake...
> 
> There is no problem in having a remote Storage Daemon with your tape
> library attached. This is a very usual configuration.
> 
> Before having a try with Bacula, I would recommend you to check if mtx and
> mt are properly working (please use /dev/tape/by-id names when running
> tests).

I've tested mt with both the by-id and with /dev/nst0, and it works 
identically.  Similarly, mtx-changer works perfectly with both the by-id and 
the /dev/autochanger1 names.  I haven't tested the naked mtx command, but since 
mtx-changer uses mtx, I figure that would be a valid test.

btest works perfectly with both the by-id name and with /dev/nst0

Unfortunately, bacula also works identically with both the by-id name and with 
/dev/nst0, and not in a good way.  I have altered my configuration to be 
identical with what you have suggested below, and it fails in exactly the same 
manner as before.

Since the by-id names are themselves symlinks to the same special files that I 
was using before, unless I change the hardware, I can't imagine any churn in 
those devices.  There will only ever be one /dev/nst0, since there will never 
be another tape drive added, and /dev/autochanger1 will always point to the 
correct changer device since it's set by udev based on the specifics of this 
changer.  Using the by-id name isn't a bad idea, but I don't think it's related 
to this problem.  

Is there maybe some way to generate a debug log that has more information than 
what shows up in the director?  "3999 Device "AutochangerOdin" not found or 
could not be opened" isn't very clear.

Could this be a permissions issue?  Both raw devices (/dev/sg3 and /dev/nst0) 
are owned by the "tape" group, and I double checked that the storage daemon 
*is* running as group "tape."

I'm still at a loss here.

> 
> The error messages seems related to the tape drive and not to the tape
> library configuration. So I would try
> "/dev/tape/by-id/scsi-35000e11164c42001-nst"
> for the tape device:
> 
> Device {
>   Name = Drive-1                      #
>   Description = "LT06 inside Dell TL2000 Library"
>   Drive Index = 0
>   Media Type = LT06
>   Archive Device = /dev/tape/by-id/scsi-35000e11164c42001-nst
>   AutomaticMount = yes;               # when device opened, read it
>   AlwaysOpen = yes;
>   RemovableMedia = yes;
>   RandomAccess = no;
>   AutoChanger = yes
>   SpoolDirectory = "/var/spool/bacula"
>   MaximumSpoolSize = 485G
>   Maximum Network Buffer Size = 65536
>   Offline On Unmount = no
>   Alert Command = "sh -c 'smartctl -H -l error %c'"
> }
> 
> You should run btape tests before starting backups as well.
> 
> Best regards,
> 
> Ana
> 
> 
> On Wed, Jul 26, 2017 at 3:20 PM, Steve Garcia <sgarcia AT bak.rr DOT com> 
> wrote:
> 
> > OK, I've got my tape drive working (thanks Ana!) but I'm having trouble
> > connecting to the autochanger it's in using the director.  This is the
> > first time I've tried having a storage daemon on a different machine than
> > the director.  The director is a slightly lower version (7.4.3 on Debian
> > Jessie using backports) than the storage daemon (7.4.4 on stretch) but I
> > had understood that those versions were close enough to work.
> >
> > So I'm hoping this is another configuration issue.
> >
> > Right now what I'm trying to do is label all the tapes in the new library.
> >
> > When I try to access the new storage from the director, it is able to get
> > a listing of all the tapes, but it fails when it tries to actually do the
> > labeling.  I get a "3999 Device not found or could not be opened" error.
> > These errors show up quickly, there is no delay as it tries each slot, so
> > it's obviously not getting far enough to try.  But it *is* obviously
> > connecting to the remote storage, otherwise it wouldn't be able to obtain
> > the slot list.
> >
> > What am I missing?
> >
> > root@sleipnir:/etc/bacula# bconsole
> > Connecting to Director sleipnir:9101
> > 1000 OK: 102 sleipnir-dir Version: 7.4.3 (18 June 2016)
> > Enter a period to cancel a command.
> > *label storage=Library2 barcodes
> > Automatically selected Catalog: MyCatalog
> > Using Catalog "MyCatalog"
> > Connecting to Storage daemon Library2 at odin:9103 ...
> > 3306 Issuing autochanger "slots" command.
> > Device "AutochangerOdin" has 24 slots.
> > Connecting to Storage daemon Library2 at odin:9103 ...
> > 3306 Issuing autochanger "list" command.
> > The following Volumes will be labeled:
> > Slot  Volume
> > ==============
> >    1  000015L6
> >    2  000018L6
> >    3  000021L6
> >    4  CLNU00L1
> >    5  000014L6
> >    6  000017L6
> >    7  000020L6
> >    8  CLN005L3
> >    9  000013L6
> >   10  000016L6
> >   11  000019L6
> >   12  000012L6
> >   13  000009L6
> >   14  000006L6
> >   15  000003L6
> >   16  000011L6
> >   17  000008L6
> >   18  000005L6
> >   19  000002L6
> >   20  000010L6
> >   21  000007L6
> >   22  000004L6
> >   23  000001L6
> > Do you want to label these Volumes? (yes|no):  yes
> > Defined Pools:
> >      1: Default
> >      2: OdinPool
> > Select the Pool (1-2): 2
> > Connecting to Storage daemon Library2 at odin:9103 ...
> > Sending label command for Volume "000015L6" Slot 1 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000015L6.
> > Sending label command for Volume "000018L6" Slot 2 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000018L6.
> > Sending label command for Volume "000021L6" Slot 3 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000021L6.
> > Media record for Slot 4 Volume "CLNU00L1" already exists.
> > Sending label command for Volume "000014L6" Slot 5 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000014L6.
> > Sending label command for Volume "000017L6" Slot 6 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000017L6.
> > Sending label command for Volume "000020L6" Slot 7 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000020L6.
> > Media record for Slot 8 Volume "CLN005L3" already exists.
> > Sending label command for Volume "000013L6" Slot 9 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000013L6.
> > Sending label command for Volume "000016L6" Slot 10 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000016L6.
> > Sending label command for Volume "000019L6" Slot 11 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000019L6.
> > Sending label command for Volume "000012L6" Slot 12 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000012L6.
> > Sending label command for Volume "000009L6" Slot 13 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000009L6.
> > Sending label command for Volume "000006L6" Slot 14 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000006L6.
> > Sending label command for Volume "000003L6" Slot 15 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000003L6.
> > Sending label command for Volume "000011L6" Slot 16 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000011L6.
> > Sending label command for Volume "000008L6" Slot 17 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000008L6.
> > Sending label command for Volume "000005L6" Slot 18 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000005L6.
> > Sending label command for Volume "000002L6" Slot 19 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000002L6.
> > Sending label command for Volume "000010L6" Slot 20 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000010L6.
> > Sending label command for Volume "000007L6" Slot 21 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000007L6.
> > Sending label command for Volume "000004L6" Slot 22 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000004L6.
> > Sending label command for Volume "000001L6" Slot 23 ...
> > 3999 Device "AutochangerOdin" not found or could not be opened.
> > Label command failed for Volume 000001L6.
> > You have messages.
> > *
> >
> > From the bacula-sd.conf on odin (where the library is):
> > Autochanger {
> >   Name = AutochangerOdin
> >   Device = Drive-1
> >   Changer Command = "/etc/bacula/scripts/mtx-changer %c %o %S %a %d"
> >   Changer Device = /dev/autochanger1
> > }
> >
> > Device {
> >   Name = Drive-1                      #
> >   Description = "LT06 inside Dell TL2000 Library"
> >   Drive Index = 0
> >   Media Type = LT06
> >   Archive Device = /dev/nst0
> >   AutomaticMount = yes;               # when device opened, read it
> >   AlwaysOpen = yes;
> >   RemovableMedia = yes;
> >   RandomAccess = no;
> >   AutoChanger = yes
> >   SpoolDirectory = "/var/spool/bacula"
> >   MaximumSpoolSize = 485G
> >   Maximum Network Buffer Size = 65536
> >   Offline On Unmount = no
> >   Alert Command = "sh -c 'smartctl -H -l error %c'"
> > }
> >
> > From bacula-dir.conf on sleipnir (where the director is):
> > Storage {
> >   Name = Library2
> > # Do not use "localhost" here
> >   Address = odin                # N.B. Use a fully qualified name here
> >   SDPort = 9103
> >   Password = "*****************"
> >   Device = AutochangerOdin
> >   Media Type = LTO6
> >   Autochanger = yes                   # enable for autochanger device
> > }
> >
> > From the /dev directory on odin:
> > lrwxrwxrwx 1 root root 3     Jun  5 17:42 /dev/autochanger1 -> sg3
> > crw-rw---- 1 root tape 21, 3 Jun  1 15:01 /dev/sg3
> >
> >
> >
> > --
> > Steve Garcia
> > Ignorance killed the cat, curiosity was framed.
> >
> > ------------------------------------------------------------
> > ------------------
> > Check out the vibrant tech community on one of the world's most
> > engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> > _______________________________________________
> > Bacula-users mailing list
> > Bacula-users AT lists.sourceforge DOT net
> > https://lists.sourceforge.net/lists/listinfo/bacula-users
> >


------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

ADSM.ORG Privacy and Data Security by https://kimlaw.us