Bacula-users

[Bacula-users] Director segfaults then refuses connections

2008-08-15 12:50:41
From: Jason Dixon <jdixon AT omniti DOT com>
To: bacula-users AT lists.sourceforge DOT net
Date: Fri, 15 Aug 2008 12:50:23 -0400
I was updating the Director configuration and reloaded while a job was
running.  Unfortunately, there was a minor typo and it caused the
Director to bail.

After fixing the typo and restarting bacula-dir, the storage daemon
complained of a volume size mismatch:


15-Aug 12:02 vlad-sd JobId 925: Error: Bacula cannot write on disk
Volume "Vol0013" because: The sizes do not match! Volume=11206992819
Catalog=10207056909
15-Aug 12:02 vlad-sd JobId 925: Marking Volume "Vol0013" in Error in
Catalog.
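
For reference, the gap between the two sizes can be worked out directly
from that message.  The Catalog figure is the same size=10207056909 that
vlad-sd reported when JobId 924 began appending at 11:53, which suggests
(but does not prove) that the data written before the crash never made
it into the catalog.  A minimal Python sketch, assuming the message
format shown above; the function name size_mismatch is hypothetical:

```python
import re

# Message text copied from the vlad-sd error above (line wrap removed).
LOG_LINE = ('Volume "Vol0013" because: The sizes do not match! '
            'Volume=11206992819 Catalog=10207056909')

def size_mismatch(message):
    """Return on-disk size minus catalog size in bytes, or None if the
    message does not contain both figures."""
    m = re.search(r"Volume=(\d+)\s+Catalog=(\d+)", message)
    if m is None:
        return None
    volume, catalog = int(m.group(1)), int(m.group(2))
    return volume - catalog

if __name__ == "__main__":
    print(size_mismatch(LOG_LINE))  # prints 999935910
```

So the volume on disk is about 1 GB larger than the catalog expects.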


At this point bconsole would hang any time I ran "status dir".  I
restarted bacula-dir, but this didn't help.  I then rebooted the server
just in case, but this seemed to make it even less happy.  I can no
longer connect with bconsole at all, and the Director is not logging
anything else either.  Here are the log entries:


========================================================================
15-Aug 12:00 vlad-dir: ERROR in parse_conf.c:483 Config error: Could not
find config Resource warehoue referenced on line 130 :   Client =
warehoue


            : line 130, col 20 of file /etc/bacula/bacula-dir.conf
  Client = warehoue

15-Aug 12:00 vlad-dir: Fatal Error because: Bacula interrupted by signal
11: Segmentation violation
15-Aug 11:53 vlad-dir JobId 924: No prior Full backup Job record found.
15-Aug 11:53 vlad-dir JobId 924: No prior or suitable Full backup found
in catalog. Doing FULL backup.
15-Aug 11:53 vlad-dir JobId 924: Start Backup JobId 924,
Job=OSX_miguel.2008-08-15_11.53.06
15-Aug 11:53 vlad-dir JobId 924: Using Device "FileStorage"
15-Aug 11:53 vlad-sd JobId 924: Volume "Vol0013" previously written,
moving to end of data.
15-Aug 11:53 vlad-sd JobId 924: Ready to append to end of Volume
"Vol0013" size=10207056909
15-Aug 12:00 vlad-dir: Fatal Error because: Bacula interrupted by signal
11: Segmentation violation
15-Aug 11:53 vlad-dir JobId 924: No prior Full backup Job record found.
15-Aug 11:53 vlad-dir JobId 924: No prior or suitable Full backup found
in catalog. Doing FULL backup.
15-Aug 11:53 vlad-dir JobId 924: Start Backup JobId 924,
Job=OSX_miguel.2008-08-15_11.53.06
15-Aug 11:53 vlad-dir JobId 924: Using Device "FileStorage"
15-Aug 11:53 vlad-sd JobId 924: Volume "Vol0013" previously written,
moving to end of data.
15-Aug 11:53 vlad-sd JobId 924: Ready to append to end of Volume
"Vol0013" size=10207056909
15-Aug 12:04 vlad-dir JobId 0: Error: bsock.c:306 Write error sending 1
bytes to client:127.0.0.1:36131: ERR=Broken pipe
15-Aug 12:04 vlad-dir JobId 0: Error: bsock.c:306 Write error sending 75
bytes to client:127.0.0.1:36131: ERR=Broken pipe
========================================================================
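
The config error in that log came from a Client reference that no Client
resource defines.  Bacula's own syntax test (bacula-dir -t -c <file>) is
the authoritative check before a reload; the following is only a rough
Python sketch of the same idea for this one kind of typo.  It assumes
one directive per line and no @-included files, it is not how Bacula
parses its configuration, and the function name undefined_client_refs
and the sample config are hypothetical:

```python
import re

# Matches "<keyword> = value", with optional quotes and trailing ';'.
_VALUE = r"\s*=\s*\"?([^\";]+?)\"?\s*;?$"

def undefined_client_refs(conf_text):
    """Return (line_number, name) for every `Client = name` directive
    whose name is not declared by any `Client { Name = name }` resource."""
    defined = set()
    refs = []
    in_client = False
    for lineno, raw in enumerate(conf_text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()   # drop comments
        if re.match(r"(?i)^client\s*\{", line):
            in_client = True                  # entering a Client resource
            continue
        name = re.match(r"(?i)^name" + _VALUE, line)
        if in_client and name:
            defined.add(name.group(1).strip())
        ref = re.match(r"(?i)^client" + _VALUE, line)
        if ref:
            refs.append((lineno, ref.group(1).strip()))
        if "}" in line:
            in_client = False                 # leaving the current resource
    return [(n, name) for n, name in refs if name not in defined]

if __name__ == "__main__":
    sample = """\
Client {
  Name = warehouse
}
Job {
  Name = "Backup_warehouse"
  Client = warehoue
}
"""
    print(undefined_client_refs(sample))
```

Run against the sample, this reports the misspelled reference together
with its line number, much like the parse_conf.c message does.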


The server is CentOS 5 with bacula-sqlite-2.2.8-2.  All services are
running and the bacula files look fine.  I've tried moving
bacula-dir.9101.state aside and restarting, but that hasn't helped.  I'm
out of ideas.  Any suggestions?


$ sudo ls -l /var/lib/bacula/
total 2359532
-rw-r----- 1 bacula bacula 2241090560 Aug 15 12:47 bacula.db
-rw-r----- 1 bacula bacula  172355344 Aug 15 12:17 bacula.db-journal
-rw-r----- 1 bacula bacula        196 Aug 15 12:39 bacula-dir.9101.state
-rw-r----- 1 bacula bacula       2196 Aug 15 12:30 bacula-dir.9101.state.old
-rw-r----- 1 bacula bacula        349 May 23 16:06 bacula-dir.bacula-dir.203465032.mail
-rw-r----- 1 bacula bacula        349 Feb 27 17:18 bacula-dir.bacula-dir.227504280.mail
-rw-r----- 1 bacula bacula        349 May 23 16:04 bacula-dir.bacula-dir.309883208.mail
-rw-r----- 1 bacula bacula        349 May 23 16:05 bacula-dir.bacula-dir.367272264.mail
-rw-r----- 1 root   bacula       2196 Aug 15 12:35 bacula-fd.9102.state
-rw-r----- 1 bacula disk         2196 Aug 15 12:35 bacula-sd.9103.state
-rw------- 1 bacula bacula          0 May 23 17:36 localhost-dir.conmsg
-rw-r----- 1 bacula bacula          0 May 23 16:08 localhost-dir.UnixBackups.2008-05-23_16.08.03.420286552.mail
-rw-r----- 1 bacula bacula          0 May 23 16:57 localhost-dir.Unix_localhost-fd.2008-05-23_16.57.03.305103000.mail
-rw-r----- 1 bacula bacula     460949 Aug 15 12:04 log
-rw-r----- 1 bacula bacula    1094195 Jul 31 23:16 log.1
-rw-r----- 1 bacula bacula     931983 Jun 30 23:14 log.2
-rw-r----- 1 bacula bacula     105932 May 31 23:11 log.3
-rw-r----- 1 bacula bacula      28426 May 23 23:10 log.4
-rw-r----- 1 bacula bacula        349 Feb 27 17:18 log.5
-rw------- 1 bacula bacula          0 Aug 15 12:07 vlad-dir.conmsg
-rw-r----- 1 bacula bacula       1022 Aug 15 12:00 vlad-dir.OSX_miguel.2008-08-15_11.53.06.196295560.mail
-rw-r----- 1 bacula bacula          0 Aug 15 12:02 vlad-dir.OSX_miguel.2008-08-15_12.02.03.14845096.mail
-rw-r----- 1 bacula bacula          0 Aug 15 12:07 vlad-dir.OSX_miguel.2008-08-15_12.07.04.345892760.mail
-rw-r----- 1 bacula bacula        157 Jun  2 16:00 vlad-dir.restore.3.bsr
-rw-r----- 1 bacula bacula        200 Aug 15 12:00 vlad-dir.vlad-dir.199024968.mail
-rw-r----- 1 bacula bacula        668 Jul 30 22:11 vlad-dir.vlad-dir.205992840.mail
-rw-r----- 1 bacula bacula        137 Jun 26 15:16 vlad-dir.vlad-dir.501987768.mail


Thanks,

-- 
Jason Dixon
OmniTI Computer Consulting, Inc.
jdixon AT omniti DOT com
443.325.1357 x.241 
