BackupPC-users

[BackupPC-users] BackupPC spontaneous server exit with SIGPIPE

2011-08-11 14:18:28
Subject: [BackupPC-users] BackupPC spontaneous server exit with SIGPIPE
From: Carl Wilhelm Soderstrom <chrome AT real-time DOT com>
To: backuppc-users AT lists.sourceforge DOT net
Date: Thu, 11 Aug 2011 13:16:38 -0500
BackupPC 3.1.0 on Debian, kernel is 2.6.32, filesystem is xfs.

I have a BackupPC server that has twice now had the BackupPC server process
spontaneously exit. It *did* shut down nicely, interestingly enough. The
server log (/var/lib/backuppc/log/LOG) is this:

2011-08-11 08:00:01 Running 2 BackupPC_nightly jobs from 12..13 (out of
0..15)
2011-08-11 08:00:01 Running BackupPC_nightly -m 192 207 (pid=32271)
2011-08-11 08:00:01 Running BackupPC_nightly 208 223 (pid=32272)
2011-08-11 08:00:01 Next wakeup is 2011-08-11 09:00:00
2011-08-11 08:21:58 Finished full backup on host1.example.com
2011-08-11 09:00:01 Next wakeup is 2011-08-11 10:00:00
2011-08-11 09:51:47 Got signal PIPE... cleaning up
2011-08-11 12:22:30 Reading hosts file
2011-08-11 12:22:30 BackupPC started, pid 2195
2011-08-11 12:22:30 Running BackupPC_link host1.example.com (pid=2201)
2011-08-11 12:22:30 Next wakeup is 2011-08-11 13:00:00
2011-08-11 12:22:32 Finished host1.example.com (BackupPC_link host1.example.com)
2011-08-11 12:22:32 Running BackupPC_trashClean (pid=2205)


I have BackupPC set to start its schedule at 8am; so it will do the
BackupPC_nightly jobs when everyone is starting work, instead of in the
middle of the night when the time is critically needed for doing backups. so
that's why the log starts at 8am instead of the default midnight.

It looks like there's no jobs running when the server exits; but obvously
when it gets restarted (by me) it runs a BackupPC_link job on the host it
was working on before it exited... so it's like the host 'hung' on doing
something in the backup, exited (timeout of some sort?), then when restarted
resumes where it left off and finishes all parts of the backup successfully.

This isn't the per-host LOG (/var/lib/backuppc/pc/host1.example.com/LOG) or
XferLOG (/var/lib/backuppc/pc/host1.example.com/LOG), so it's not obvious
how to turn up the debug level or otherwise try to figure out what's going
on here.

It's happened twice, but I've never seen this host do it before. There's
nothing in dmesg or syslog to indicate a problem. Only thing I can think of
is that BackupPC is running into some sort of filesystem corruption that is
causing it to fail out rather than risk corrupting it further.

Upgrading this host to BackupPC 3.2.1 may be possible (need to get buy-in
from some other admins before doing that); but before I do that
I'm going to fsck the disks. I've never seen BackupPC do this before on any
of the installations I admin and it's done it twice in a few days, so I'm
suspecting hardware problems.


-- 
Carl Soderstrom
Systems Administrator
Real-Time Enterprises
www.real-time.com

------------------------------------------------------------------------------
Get a FREE DOWNLOAD! and learn more about uberSVN rich system, 
user administration capabilities and model configuration. Take 
the hassle out of deploying and managing Subversion and the 
tools developers use with it. 
http://p.sf.net/sfu/wandisco-dev2dev
_______________________________________________
BackupPC-users mailing list
BackupPC-users AT lists.sourceforge DOT net
List:    https://lists.sourceforge.net/lists/listinfo/backuppc-users
Wiki:    http://backuppc.wiki.sourceforge.net
Project: http://backuppc.sourceforge.net/

<Prev in Thread] Current Thread [Next in Thread>