[Networker] Saveset backup works in a group and not in another, why?

2009-02-11 04:47:09
Subject: [Networker] Saveset backup works in a group and not in another, why?
From: Manel Rodero <manel AT FIB.UPC DOT EDU>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Wed, 11 Feb 2009 10:43:02 +0100
Hello,

Recently I've noticed that one saveset on a Linux client never finishes its backup when the client runs in its normal group (the saveset backup 'hangs' at roughly 50% of the saveset size).

I moved this client to another group (my test group, which has the same definition as the normal group but backs up to a different pool; both groups run a full backup), and there it works!

I have no idea why this client behaves this way. Does anyone have any clues about what kind of tests I should run to solve this problem?

I tested the client (only the problematic saveset) in my test group with a full backup, and it works OK:

mminfo -av -q "group=LCFIB-Pruebas,client=client1" | find /i "2/10/2009"
LPR101         client1 2/10/2009 10:41:51 AM 17 GB 3482406047 cb full /home2

Then I tried the same backup in my historical group (a full backup with a retention/browse period of one year) and it fails, retrying the backup 3 times:

mminfo -av -q "group=LCFIB-HISTORICOS,client=client1" |find /i "2/10/2009"
HIS111         client1 2/10/2009 1:27:27 PM 7676 MB 3465638930 ca full /home2
HIS111         client1 2/10/2009 2:45:22 PM 7676 MB 3448866034 ca full /home2
HIS111         client1 2/10/2009 3:57:23 PM 7676 MB 3432093139 ca full /home2
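
To put a number on the "more or less 50%" figure, here is a minimal Python sketch that compares the size of the aborted savesets (7676 MB, identical in all three attempts) with the size of the complete one (17 GB). It assumes that mminfo reports binary units (1 GB = 1024 MB) and that the displayed sizes are rounded, so the result is only approximate. The names to_mb and UNITS_IN_MB are made up for this example.

```python
# Assumption: mminfo sizes use binary units (1 GB = 1024 MB).
UNITS_IN_MB = {"KB": 1 / 1024, "MB": 1, "GB": 1024}


def to_mb(size, unit):
    """Convert a size as printed by mminfo (e.g. 17, "GB") to megabytes."""
    return float(size) * UNITS_IN_MB[unit]


complete = to_mb(17, "GB")    # successful saveset in the test group
aborted = to_mb(7676, "MB")   # each aborted attempt in the historical group

pct = 100 * aborted / complete
print(f"aborted saveset reached about {pct:.1f}% of the complete size")
```

Under those assumptions the aborted savesets stop at about 44% of the complete saveset, and always at the same size.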

Any idea?

Here are the group definitions:

                        type: NSR group;
                        name: LCFIB-Pruebas;
                     comment: ;
                    snapshot: False;
                   autostart: Disabled;
                 autorestart: Disabled;
                  start time: "17:12";
                  last start: "Wed Feb 11 10:26:20 2009";
                    last end: ;
                    interval: "24:00";
              restart window: "12:00";
           force incremental: No;
         savegrp parallelism: 0;
              client retries: 1;
                      clones: No;
                  clone pool: Default Clone;
           success threshold: Warning;
                     options: No index save, Verbose, Manual restart;
                       level: full;
                     printer: ;
                    schedule: ;
               schedule time: ;
             expiration time: ;
          inactivity timeout: 60;
   File inactivity threshold: 0;
File inactivity alert threshold: 0;
                   work list: client1, "full:save", /home2;
                  completion: ;
                      status: running;
             Snapshot Policy: Daily;
               Snapshot Pool: Default;


                        type: NSR group;
                        name: LCFIB-Historicos;
                     comment: ;
                    snapshot: False;
                   autostart: Disabled;
                 autorestart: Disabled;
                  start time: "3:33";
                  last start: "Wed Feb 11 09:55:45 2009";
                    last end: "Wed Feb 11 10:16:21 2009";
                    interval: "24:00";
              restart window: "12:00";
           force incremental: No;
         savegrp parallelism: 0;
              client retries: 1;
                      clones: No;
                  clone pool: Default Clone;
           success threshold: Warning;
                     options: Manual restart;
                       level: full;
                     printer: ;
                    schedule: ;
               schedule time: ;
             expiration time: ;
          inactivity timeout: 60;
   File inactivity threshold: 0;
File inactivity alert threshold: 0;
                   work list: client1, "full:index", index;
                  completion: client1, /home2, "failed:full:save",
"* client1:/home2 (interrupted), exiting
* client1:/home2 aborted";
                      status: idle;
             Snapshot Policy: Daily;
               Snapshot Pool: Default;
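
Since the two groups are supposed to have the same definition, one way to check this is to compare the two resources attribute by attribute. The following is a minimal Python sketch, not a NetWorker tool: it assumes the resources have been saved as plain text in the "attribute: value;" layout shown above, and the function names (parse_resource, diff_resources) and the list of attributes to ignore are made up for this example. Only a few of the attributes are reproduced to keep it short.

```python
def parse_resource(text):
    """Parse 'attribute: value;' lines into a dict.

    A value may span several lines; an entry ends at a line ending in ';'.
    """
    attrs = {}
    pending = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        pending.append(line)
        if line.endswith(";"):
            entry = " ".join(pending)[:-1]   # drop the trailing ';'
            pending = []
            # Split at the first ':' only, so values like "17:12" survive.
            name, _, value = entry.partition(":")
            attrs[name.strip()] = value.strip()
    return attrs


def diff_resources(a, b, ignore=()):
    """Return {attribute: (value_a, value_b)} for attributes that differ."""
    keys = sorted(set(a) | set(b))
    return {k: (a.get(k), b.get(k))
            for k in keys
            if k not in ignore and a.get(k) != b.get(k)}


# Attributes expected to differ between any two groups (assumed list).
VOLATILE = {"name", "start time", "last start", "last end",
            "work list", "completion", "status"}

test_group = """
type: NSR group;
name: LCFIB-Pruebas;
start time: "17:12";
client retries: 1;
options: No index save, Verbose, Manual restart;
level: full;
inactivity timeout: 60;
status: running;
"""

hist_group = """
type: NSR group;
name: LCFIB-Historicos;
start time: "3:33";
client retries: 1;
options: Manual restart;
level: full;
inactivity timeout: 60;
completion: client1, /home2, "failed:full:save",
"* client1:/home2 (interrupted), exiting
* client1:/home2 aborted";
status: idle;
"""

diff = diff_resources(parse_resource(test_group),
                      parse_resource(hist_group),
                      ignore=VOLATILE)
for name, (left, right) in diff.items():
    print(f"{name}: {left!r} != {right!r}")
```

For the attributes reproduced here, the only difference left after ignoring the volatile ones is "options" (the test group adds "No index save" and "Verbose"). Whether either option is related to the hang is not something this comparison can tell.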

And here is the command I use to start only the problematic client in the historical group:

savegrp -l full -w "2/1/2010" -y "2/1/2010" -c client1 -G "LCFIB-Historicos"

Thank you very much!

--

o o o  Manel Rodero Blánquez             | LCFIB - FIB - UPC
o o o  IT System Manager                 | Campus Nord - Modul B6
o o o  Barcelona School of Informatics   | Jordi Girona, 1-3
U P C  Technical University of Catalonia | 08034 Barcelona (Spain)
                                         |
       manel AT fib.upc DOT edu                 | Tel: +00 34 93 401 0847
       http://www.fib.upc.edu/           | Fax: +00 34 93 401 7040
