Bacula-users

[Bacula-users] SD hung

2008-10-23 11:48:16
Subject: [Bacula-users] SD hung
From: Pawel Rybak <pawel.rybak AT yahoo DOT ca>
To: bacula-users AT lists.sourceforge DOT net
Date: Thu, 23 Oct 2008 11:45:05 -0400
Hi,

I run Director and storage daemon on Debian Etch (Bacula ver. 1.38.11)
and backup a bunch of Windows machines (ver. 2.4.2). Last night one of
the jobs crashed with message:

23-Oct 00:00 huntsys81-fd JobId 1008: Generate VSS snapshots.
Driver="VSS WinXP", Drive(s)="C"
23-Oct 00:32 huntsys81-fd JobId 1008: Fatal
error: ../../filed/backup.c:892 Network send error to SD.
ERR=Input/output error
23-Oct 00:32 huntsys81-fd JobId 1008: Error: ../../lib/bsock.c:306 Write
error sending 36522 bytes to Storage daemon:nas2:9103: ERR=Input/output
error
23-Oct 00:34 huntsys81-fd JobId 1008: VSS Writer (BackupComplete):
"Microsoft Writer (Bootable State)", State: 0x1 (VSS_WS_STABLE)
23-Oct 00:34 huntsys81-fd JobId 1008: VSS Writer (BackupComplete):
"Microsoft Writer (Service State)", State: 0x1 (VSS_WS_STABLE)
23-Oct 00:34 huntsys81-fd JobId 1008: VSS Writer (BackupComplete):
"MSDEWriter", State: 0x1 (VSS_WS_STABLE)
23-Oct 00:34 huntsys81-fd JobId 1008: VSS Writer (BackupComplete): "WMI
Writer", State: 0x1 (VSS_WS_STABLE)
23-Oct 00:33 rt-dir: huntsys81-Job.2008-10-22_23.55.15 Error: Bacula
1.38.11 (28Jun06): 23-Oct-2008 00:33:41
  JobId:                  1008
  Job:                    huntsys81-Job.2008-10-22_23.55.15
  Backup Level:           Incremental, since=2008-10-21 23:59:43
  Client:                 "huntsys81-fd" 2.4.2 (26Jul08)
Linux,Cross-compile,Win32
  FileSet:                "AgentSet" 2008-09-15 13:40:17
  Pool:                   "NAS3"
  Storage:                "Part3"
  Scheduled time:         22-Oct-2008 23:55:14
  Start time:             22-Oct-2008 23:59:37
  End time:               23-Oct-2008 00:33:41
  Elapsed time:           34 mins 4 secs
  Priority:               10
  FD Files Written:       13
  SD Files Written:       0
  FD Bytes Written:       592,929,883 (592.9 MB)
  SD Bytes Written:       0 (0 B)
  Rate:                   290.1 KB/s
  Software Compression:   48.3 %
  Volume name(s):         
  Volume Session Id:      929
  Volume Session Time:    1222362986
  Last Volume Bytes:      5,456,661,105 (5.456 GB)
  Non-fatal FD errors:    1
  SD Errors:              0
  FD termination status:  Error
  SD termination status:  Error
  Termination:            *** Backup Error ***

The director threw the error and killed the job. That's okay, it
happens. But SD kept reporting that it was still working on this job so
that refused to accept data from another clients.
What can I do to prevent SD from "locking" in cases like that?



-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>
  • [Bacula-users] SD hung, Pawel Rybak <=