Bacula-users

[Bacula-users] Failed jobs "hang as running" in director status

2010-01-20 05:53:25
Subject: [Bacula-users] Failed jobs "hang as running" in director status
From: Alex Ehrlich <Alex.Ehrlich AT mail DOT ee>
To: bacula-users <bacula-users AT lists.sourceforge DOT net>
Date: Wed, 20 Jan 2010 12:33:59 +0200
Hello,

I have got the following problem (probably since upgrading to v 3): the failed jobs tend to "hang" in the running state from the director's point of view. This causes other jobs to stall (with "... is waiting for higher priority jobs to finish" or "waiting for maximum concurrent").
In configs, "Rerun Failed Levels" = yes, but no Reschedule set in job defaults.
Restarting bacula-dir resolves the problem (no more pseudo-running jobs in "status dir").

In the "status client" and "status dir" output below, look at the job 9219.

Does anybody have an idea what's wrong?

------------------------------------------------------------------
*status client=danillap-fd
Connecting to Client danillap-fd at lap-danil:9102

lap-danil-fd Version: 3.0.2 (18 July 2009)  VSS Linux Cross-compile Win32
Daemon started 19-Jan-10 11:04, 5 Jobs run since started.
 Heap: heap=0 smbytes=174,774 max_bytes=275,357 bufs=89 max_bufs=277
 Sizeof: boffset_t=8 size_t=4 debug=0 trace=1

Running Jobs:
Director connected at: 20-Jan-10 12:17
No Jobs running.
====

Terminated Jobs:
 JobId  Level    Files      Bytes   Status   Finished        Name
======================================================================
...
  9219  Incr          3    29.99 M  Error    20-Jan-10 10:46 danillap_outlook

 
*status dir
backupsrv-dir Version: 3.0.3 (18 October 2009) i686-redhat-linux-gnu redhat
Daemon started 19-Jan-10 08:58, 38 Jobs run since started.
 Heap: heap=8,122,368 smbytes=196,142 max_bytes=350,199 bufs=1,314 max_bufs=2,527
...

Running Jobs:
Console connected at 20-Jan-10 12:12
 JobId Level   Name                       Status
======================================================================
  9219 Increme  danillap_outlook.2010-01-20_10.45.00_57 is running
...
====
------------------------------------------------------------------

Regards,

Alex Ehrlich
------------------------------------------------------------------------------
Throughout its 18-year history, RSA Conference consistently attracts the
world's best and brightest in the field, creating opportunities for Conference
attendees to learn about information security's most important issues through
interactions with peers, luminaries and emerging and established companies.
http://p.sf.net/sfu/rsaconf-dev2dev
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
<Prev in Thread] Current Thread [Next in Thread>
  • [Bacula-users] Failed jobs "hang as running" in director status, Alex Ehrlich <=