Bacula-users

Re: [Bacula-users] Solaris "Packet size too big" failures

2009-01-27 13:26:02
Subject: Re: [Bacula-users] Solaris "Packet size too big" failures
From: Jason Dixon <jdixon AT omniti DOT com>
To: Allan Black <Allan.Black AT btconnect DOT com>
Date: Tue, 27 Jan 2009 13:24:07 -0500
On Tue, Jan 27, 2009 at 10:12:03AM +0000, Allan Black wrote:
> Jason Dixon wrote:
> > a "Packet size too big" error.  The Director resides on a global zone in
> > Solaris x86.  I've managed to capture a truss during one of the
> > failures:
> > http://mirrors.omniti.com/bacula/bacula.truss
> 
...
> [This is where it goes wrong]
> 
> Just after half way through the above, this happens:
> 
> 14106/68:     pollsys(0xFE55FE10, 1, 0xFE55FEC8, 0x00000000)  = 1
> 14106/68:             fd=6  ev=POLLRDNORM rev=POLLRDNORM
> 14106/68:             timeout: 5.000000000 sec
> 
> which indicates that a "normal" incoming event has occurred on file 
> descriptor 6,
> which is the connection to the SD. 3 lines later,
> 
> 14106/68:     read(6, 0xFE55FF80, 4)                          Err#131 
> ECONNRESET
> 
> The FD attempts to read from the SD, and gets "Connection reset by peer". From
> the job report you posted, it doesn't look like the SD is crashing/restarting,
> nor is the machine rebooting.
> 
> Something, somewhere though, is interfering with the connection between the FD
> and the SD. Sorry to say this, but you may have to truss the SD!

I've enabled the truss to run on bacula-sd each night.  I'll report back
my findings.

Thanks,

-- 
Jason Dixon
OmniTI Computer Consulting, Inc.
jdixon AT omniti DOT com
443.325.1357 x.241

------------------------------------------------------------------------------
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users