On Tue, Jan 27, 2009 at 10:12:03AM +0000, Allan Black wrote:
> Jason Dixon wrote:
> > a "Packet size too big" error. The Director resides on a global zone in
> > Solaris x86. I've managed to capture a truss during one of the
> > failures:
> > http://mirrors.omniti.com/bacula/bacula.truss
>
...
> [This is where it goes wrong]
>
> Just after half way through the above, this happens:
>
> 14106/68: pollsys(0xFE55FE10, 1, 0xFE55FEC8, 0x00000000) = 1
> 14106/68: fd=6 ev=POLLRDNORM rev=POLLRDNORM
> 14106/68: timeout: 5.000000000 sec
>
> which indicates that a "normal" incoming event has occurred on file
> descriptor 6,
> which is the connection to the SD. 3 lines later,
>
> 14106/68: read(6, 0xFE55FF80, 4) Err#131
> ECONNRESET
>
> The FD attempts to read from the SD, and gets "Connection reset by peer". From
> the job report you posted, it doesn't look like the SD is crashing/restarting,
> nor is the machine rebooting.
>
> Something, somewhere though, is interfering with the connection between the FD
> and the SD. Sorry to say this, but you may have to truss the SD!
I've enabled the truss to run on bacula-sd each night. I'll report back
my findings.
Thanks,
--
Jason Dixon
OmniTI Computer Consulting, Inc.
jdixon AT omniti DOT com
443.325.1357 x.241
------------------------------------------------------------------------------
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
|