Bacula-users

Re: [Bacula-users] How to recover from lost connection

2016-04-21 07:22:19
Subject: Re: [Bacula-users] How to recover from lost connection
From: Josh Fisher <jfisher AT pvct DOT com>
To: bacula-users AT lists.sourceforge DOT net
Date: Thu, 21 Apr 2016 07:21:08 -0400

On 4/21/2016 2:36 AM, Ian Douglas wrote:
> hi All
>
> Is there a way to recover from this situation?
>
> I'm trying to make up a NAS drive with many TB of data. 4th attempt.
>
> I don't know why it lost connection to the SD, box is up and running and has
> not gone down:
>
> ssh tapeserver
> ian@tapeserver's password:
> Last login: Thu Apr 21 08:18:40 2016 from trooper
> [ian@tapeserver ~]$ uptime
>   08:19:19 up 3 days, 15:41,  2 users,  load average: 0.00, 0.01, 0.22
> [ian@tapeserver ~]$
>
> Any advice gratefully received... :-)
>
> Comment: now that I read it again, it appears that SD spooled almost a TB of
> data, without writing any ... do I need a heartbeat?

Yes, most likely. In the 3 hours or so it took to despool to tape, the 
SD->FD TCP connection was dropped for some reason. I am of the opinion 
that not all interfaces interpret IEE 802.3az Energy-Efficient Ethernet 
in the same way, and when one interface puts its transmitter into sleep 
mode, the receiving interface sees it as a dropped connection. Setting a 
heartbeat should work, if that is what is happening.

>
> It feels like I should be setting some sort of "chunk size" that specifies to
> write a block say every 50 GB or so, but I don't see any options like that in
> the manual. Perhaps I'm searching for the wrong words.
>
> 20-Apr 17:03 trooper-dir JobId 10: No prior Full backup Job record found.
> 20-Apr 17:03 trooper-dir JobId 10: No prior or suitable Full backup found in
> catalog. Doing FULL backup.
> 20-Apr 17:03 trooper-dir JobId 10: Start Backup JobId 10, Job=Backup2Nas-to-
> Tape.2016-04-20_17.03.47_06
> 20-Apr 17:04 trooper-dir JobId 10: Using Device "LTO-6" to write.
> 20-Apr 17:04 TapeServer JobId 10: Warning: Director wanted Volume
> "NASFull-0003".
>      Current Volume "NonNASIncDiff-0002" not acceptable because:
>      1998 Volume "NonNASIncDiff-0002" catalog status is Append, not in Pool.
> 20-Apr 17:04 TapeServer JobId 10: Please mount append Volume "NASFull-0003" or
> label a new one for:
>      Job:          Backup2Nas-to-Tape.2016-04-20_17.03.47_06
>      Storage:      "LTO-6" (/dev/nst0)
>      Pool:         NASFullPool
>      Media type:   LTO-6
> 20-Apr 17:09 TapeServer JobId 10: Wrote label to prelabeled Volume
> "NASFull-0003" on tape device "LTO-6" (/dev/nst0)
> 20-Apr 17:09 TapeServer JobId 10: Spooling data ...
> 20-Apr 23:44 TapeServer JobId 10: Writing spooled data to Volume. Despooling
> 940,494,394,596 bytes ...
> 21-Apr 03:16 TapeServer JobId 10: Despooling elapsed time = 03:32:23, Transfer
> rate = 73.80 M Bytes/second
> 21-Apr 07:39 trooper-fd JobId 10: Error: bsock.c:448 Write error sending 65540
> bytes to Storage daemon:192.168.1.60:9103: ERR=Broken pipe
> 21-Apr 07:39 trooper-fd JobId 10: Fatal error: backup.c:853 Network send error
> to SD. ERR=Broken pipe
> 21-Apr 07:40 trooper-dir JobId 10: Error: Director's connection to SD for this
> Job was lost.
> 21-Apr 07:40 trooper-dir JobId 10: Error: Bacula trooper-dir 7.0.5 (28Jul14):
>    Build OS:               x86_64-pc-linux-gnu gentoo
>    JobId:                  10
>    Job:                    Backup2Nas-to-Tape.2016-04-20_17.03.47_06
>    Backup Level:           Full (upgraded from Incremental)
>    Client:                 "nas2" 7.4.0 (16Jan16) amd64-portbld-
> freebsd9.3,freebsd,9.3-RELEASE
>    FileSet:                "Nas2Files" 2016-04-20 16:50:33
>    Pool:                   "NASFullPool" (From Job FullPool override)
>    Catalog:                "MyCatalog" (From Client resource)
>    Storage:                "TapeServer" (From command line)
>    Scheduled time:         20-Apr-2016 17:03:45
>    Start time:             20-Apr-2016 17:04:15
>    End time:               21-Apr-2016 07:40:08
>    Elapsed time:           14 hours 35 mins 53 secs
>    Priority:               10
>    FD Files Written:       369,687
>    SD Files Written:       0
>    FD Bytes Written:       1,593,625,355,624 (1.593 TB)
>    SD Bytes Written:       0 (0 B)
>    Rate:                   30324.2 KB/s
>    Software Compression:   None
>    VSS:                    no
>    Encryption:             no
>    Accurate:               no
>    Volume name(s):         NASFull-0003
>    Volume Session Id:      17
>    Volume Session Time:    1460903935
>    Last Volume Bytes:      939,939,840,000 (939.9 GB)
>    Non-fatal FD errors:    2
>    SD Errors:              0
>    FD termination status:  Error
>    SD termination status:  Error
>    Termination:            *** Backup Error ***
>
> Thanks, Ian


------------------------------------------------------------------------------
Find and fix application performance issues faster with Applications Manager
Applications Manager provides deep performance insights into multiple tiers of
your business applications. It resolves application problems quickly and
reduces your MTTR. Get your free trial!
https://ad.doubleclick.net/ddm/clk/302982198;130105516;z
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users