Bacula-users

Re: [Bacula-users] [Bacula-devel] Storage Daemon crash backtrace

2010-07-02 12:42:57
Subject: Re: [Bacula-users] [Bacula-devel] Storage Daemon crash backtrace
From: Kern Sibbald <kern AT sibbald DOT com>
To: Robert LeBlanc <robert AT leblancnet DOT us>
Date: Fri, 2 Jul 2010 18:41:45 +0200
On Friday 02 July 2010 17:30:19 Robert LeBlanc wrote:
> On Fri, Jul 2, 2010 at 9:17 AM, Kern Sibbald <kern AT sibbald DOT com> wrote:
> > Hello Robert,
> >
> > Eric and I "finished" Bacula Enterprise version 4.0.0 today, a bit faster
> > than
> > I expected, so I am not running all the final tests, which gave me some
> > time
> > to look at the problem.
> >
> > I downloaded the zlib source code, and I don't immediately see anything
> > in the
> > file that would cause problems -- of course it is quite complicated code.
> >
> > I did look through the Bacula TLS code, and I noticed that the author did
> > not
> > properly set error conditions in Bacula when it finds an error on the
> > comm line.  This could cause Bacula to continue running, and might cause
> > subsequent calls to openssl subroutines, when there is no valid data, and
> > thus the seg fault.  I still must test the changes I made.
> >
> > It is rather a long shot, but if you see that everytime that the SD
> > crashes it
> > is when there is a disrupted comm line problem, then it could well be the
> > problem -- of course, if one has a good solid network, there should never
> > be
> > any "broken pipe" errors, which is possibly why we cannot see the
> > problem.
> >
> > Having said this, I cannot rule out a problem on openssl at this point.
>
> It makes sense to me. I'm transferring the data over the university
> network, so the network isn't terrible, however there are plenty of routers
> and speed changes between the FD and SD. I don't know if some packets are
> coming out of order and may be causing issues (TCP should reorder them
> before the apps sees the data anyway). I was running a bond interface on
> the SD in mode 6 over two different physical switch paths. I have since
> stopped using bonding to see if things got better (not really any change).
>
> If you have branch I can pull, I would be happy to help test this.

I'm testing it now.  I will let you know -- probably tomorrow or Sunday when I 
am sure that I did not break something ...

Best regards,

Kern

>
> Thanks,
>
> Robert LeBlanc
> Life Sciences & Undergraduate Education Computer Support
> Brigham Young University



------------------------------------------------------------------------------
This SF.net email is sponsored by Sprint
What will you do first with EVO, the first 4G phone?
Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users