[Bacula-users] Bacula 5.0.3 and Storage hangs on Solaris

2013-01-26 14:58:05
Subject: [Bacula-users] Bacula 5.0.3 and Storage hangs on Solaris
From: "Novosielski, Ryan" <novosirj AT umdnj DOT edu>
To: bacula-users <bacula-users AT lists.sourceforge DOT net>
Date: Sat, 26 Jan 2013 14:53:48 -0500

Hi all,

I've recently upgraded to Bacula 5.0.3 (OpenCSW packages) and have
started to have problems with hangs waiting on the storage daemon.
Here is the current status:

 JobId Level   Name                       Status
======================================================================
 13451 Increme  CFMX-dev.2013-01-23_23.00.00_13 is waiting on Storage helix_DDS
 13452 Increme  Xymon-Display.2013-01-23_23.00.01_14 is waiting on Storage helix_DDS
 13453 Increme  SVNrepos.2013-01-23_23.00.01_15 is waiting on Storage helix_DDS
 13454 Increme  Xymon-NET.2013-01-23_23.00.01_16 is waiting on Storage helix_DDS
 13455 Increme  orlith-PHP.2013-01-23_23.00.01_17 is waiting on Storage helix_DDS
 13456 Full    Catalog.2013-01-23_23.55.00_18 is waiting for higher priority jobs to finish

I ran truss on the storage daemon, and the output is just this pattern
repeating:

/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000)  = 0
/25:    time()                                          = 1359229621
/25:    read(7, 0x00190DBB, 5)                          Err#11 EAGAIN
/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000) (sleeping...)
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000)  = 0
/24:    time()                                          = 1359229628
/24:    read(6, 0x0009D473, 5)                          Err#11 EAGAIN
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000) (sleeping...)
/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000)  = 0
/25:    time()                                          = 1359229631
/25:    read(7, 0x00190DBB, 5)                          Err#11 EAGAIN
/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000) (sleeping...)
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000)  = 0
/24:    time()                                          = 1359229638
/24:    read(6, 0x0009D473, 5)                          Err#11 EAGAIN
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000) (sleeping...)
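
As a rough way to read the excerpt above, the following Python sketch
parses the truss lines and computes how many seconds pass between
successive time() calls on each LWP. It is not part of Bacula; the
function name poll_intervals is made up for illustration, and the
regular expression assumes the lines keep exactly the format shown.

```python
import re
from collections import defaultdict

# Excerpt copied verbatim from the truss output in this message.
TRUSS = """\
/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000)  = 0
/25:    time()                                          = 1359229621
/25:    read(7, 0x00190DBB, 5)                          Err#11 EAGAIN
/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000) (sleeping...)
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000)  = 0
/24:    time()                                          = 1359229628
/24:    read(6, 0x0009D473, 5)                          Err#11 EAGAIN
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000) (sleeping...)
/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000)  = 0
/25:    time()                                          = 1359229631
/25:    read(7, 0x00190DBB, 5)                          Err#11 EAGAIN
/25:    pollsys(0xFE2FB9D8, 1, 0xFE2FBA80, 0x00000000) (sleeping...)
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000)  = 0
/24:    time()                                          = 1359229638
/24:    read(6, 0x0009D473, 5)                          Err#11 EAGAIN
/24:    pollsys(0xFE0FB9E0, 1, 0xFE0FBA80, 0x00000000) (sleeping...)
"""

# Matches lines such as "/25:    time()    = 1359229621".
TIME_CALL = re.compile(r"^/(\d+):\s+time\(\)\s+=\s+(\d+)$")


def poll_intervals(text):
    """Return {lwp_id: [seconds between successive time() calls]}."""
    stamps = defaultdict(list)
    for line in text.splitlines():
        match = TIME_CALL.match(line.strip())
        if match:
            stamps[int(match.group(1))].append(int(match.group(2)))
    # Pair each timestamp with the next one on the same LWP.
    return {lwp: [b - a for a, b in zip(ts, ts[1:])]
            for lwp, ts in stamps.items()}


intervals = poll_intervals(TRUSS)
for lwp, gaps in sorted(intervals.items()):
    print(f"LWP {lwp}: seconds between time() calls = {gaps}")
```

On this excerpt both LWPs wake every 10 seconds, retry a 5-byte read
that returns EAGAIN, and go back to sleep. That would be consistent with
threads idling on a descriptor rather than spinning, though the excerpt
alone does not show what they are waiting for.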

I'm guessing the next step is to run something in debug mode? It takes
days for the hang to occur, though. From what I've observed, the
failures come the day after the full backup runs (which uses a
different, larger-capacity tape drive).

Thanks for any pointers you might have. I'd like to leave the daemon in
this state until someone can tell me how to prod it for more
information, but that might not be practical as it's a production
machine.

-- 
---- _  _ _  _ ___  _  _  _
|Y#| |  | |\/| |  \ |\ |  | |Ryan Novosielski - Sr. Systems Programmer
|$&| |__| |  | |__/ | \| _| |novosirj AT umdnj DOT edu - 973/972.0922 (2-0922)
\__/ Univ. of Med. and Dent.|IST/EI-Academic Svcs. - ADMC 450, Newark

