Bacula-users

Re: [Bacula-users] verify job with differences doesn't finish and blocks storage

2009-03-03 09:29:22
Subject: Re: [Bacula-users] verify job with differences doesn't finish and blocks storage
From: "Ralf Gross" <Ralf.Gross AT STZ-Softwaretechnik DOT de>
To: bacula-users AT lists.sourceforge DOT net
Date: Wed, 18 Feb 2009 12:19:19 +0100 (CET)
Arno Lehmann said:

> 18.02.2009 09:36, Ralf Gross wrote:
>> Ralf Gross said:
>>> lately I've seen that verify jobs that have differences just doesn't
>>> finish.
>>>
>>> bacula 2.4.4-b1, psql
>>>
>>> *st dir
>>>
>>> ...
>>> This is starting to be annoying because the volumes are then locked
>>> until
>>> I cancel thee job.
>>
>>
>> 24h later I tried to cancel the still running and blocking verify job.
>>
>> Running Jobs:
>>  JobId Level   Name                       Status
>> =====================================================================>
>> 9644 VolumeT  VerifyVU0EM003.2009-02-17_07.06.00.27 has been canceled
>>   9652 Full    VU0EM003-FBR.2009-02-17_13.15.54.51 is running
>>   9661 Increme  VU0EM003.2009-02-18_00.06.00.32 is waiting on max Client
>> jobs
>>   9671 VolumeT  VerifyVU0EM003.2009-02-18_07.06.00.58 is waiting on max
>> Client jobs
>> ===>
>>
>> *cancel jobid=9644
>> 2001 Job VerifyVU0EM003.2009-02-17_07.06.00.27 marked to be canceled.
>>
>>
>>
>> And then the console hangs and just doesn't return to the prompt.
>>
>> It seems that the only way to get thing going again is to restart the
>> dir
>> or maybe the fd where the verify job is running. But there is an other
>> job
>> with several TB to backup running right now.
>>
>> Any idea how to resolve this deadlock?
>
> Not really... I'd suggest to create some debug output (preferrably use
> the btraceback script) and see if you can get the developers
> interested in this.
>
> Then you could try killing just the thread / LWP that is stuck - but I
> don't know if this could work. I doubt it, but if it's the only chance
> I'd rather do that soon, so you don't lose too much time with the
> other, big job you might have to re-run.


The problem is that the other job already has backed up 5,5 TB which is
about half the amount of data.

But I see that the fd version on the client that runs the verify job is
2.2.8. I'll update to 2.4.4 after the backup job has finished.

Ralf



------------------------------------------------------------------------------
Open Source Business Conference (OSBC), March 24-25, 2009, San Francisco, CA
-OSBC tackles the biggest issue in open source: Open Sourcing the Enterprise
-Strategies to boost innovation and cut costs with open source participation
-Receive a $600 discount off the registration fee with the source code: SFAD
http://p.sf.net/sfu/XcvMzF8H
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>
  • Re: [Bacula-users] verify job with differences doesn't finish and blocks storage, Ralf Gross <=