Amanda-Users

Re: Hung processes w/ Amanda over SSH

2007-10-17 11:02:17
Subject: Re: Hung processes w/ Amanda over SSH
From: Mario Silva <msilva AT nmcourts DOT com>
To: Cameron Matheson <cameronm AT webpipe DOT net>, amanda List <amanda-users AT amanda DOT org>
Date: Wed, 17 Oct 2007 08:39:33 -0600
Cameron:

I had a similar problem:

I am backing up Data Base servers with amanda, and because DB Servers need them to be backed up at different schedule I have a configuration for each data base server, and I have a master script that starts/shutdown Database after/before amanda perform the backup.

My script was hunging up all the proccess even when the backup was performed pretty fine, every morning I had to check in the console if amanda ended the data base backups because amanda process was hunged trying to send the email report.

I decided the re-write my script and I took out from my script many switching sessions I had, the start/shutdown part was made with SSH auth, but the backup is made with regular amdump, after re-write my script and delete a lot of SU sessions my script is not hunging amanda sessions.

I hope this can help you Cameron

have a great day !

mario

-- 
Mario Silva
Systems Administrator
Supreme Court of New Mexico
Judicial Information Division
2905 Rodeo Park Dr. East, Bldg. #5
Santa Fe, NM 87505
Phone:  (505) 476-6959 / Mobil: (505) 660-1026
Fax:    (505) 476-6952

Website:  http://www.nmcourts.gov
mailto: msilva AT nmcourts DOT gov

LEGAL DISCLAIMER:

The content of this data transmission is not considered as an offer, proposal, understanding, or agreement unless it is confirmed in a document signed by a legal representative of Supreme Court of the State of New Mexico or the Judicial Information Division. The content of this data transmission is confidential and it is intended to be delivered only to the addresses, therefore, it shall not be distributed and/or disclosed through any mean without the original sender's previous authorization. If you are not the addressee you are forbidden to use it, either totally or partially, for any purpose.

AVISO LEGAL:

El contenido de este mensaje de datos no se considera oferta, propuesta o acuerdo, sino hasta que sea confirmado en  documento por escrito que contenga la firma autógrafa del apoderado legal de La Suprema Corte del Estado de Nuevo Mexico o de la Division de Informatica del Estado. El contenido de este mensaje de datos es confidencial y se entiende dirigido y para uso exclusivo del destinatario, por lo que no podrá distribuirse y/o difundirse por ningún medio sin la previa autorización del emisor original. Si usted no es el destinatario, se le prohíbe su utilización total o parcial para cualquier fin.


Cameron Matheson wrote:
Hi Guys,

I've installed Amanda v2.5.2p1 on my servers using SSH auth.  The
backups are working fine (Dumps come in good, and I can restore w/out
any trouble), but I'm seeing a whole bunch of hung ssh/amandad/tar
processes on my clients.  I'm not real clear on what's causing this
(maybe it's the estimates, since the backups are coming in fine?).  The
only thing I've been able to find in my logs that looks odd is the
following (taken from one of my clients):

  
amandad: time 44.823: security_close(handle=0x9666220, driver=0xd20e60 (SSH))
amandad: time 44.823: security_stream_close(0x9696b80)
amandad: time 59524.148: security_stream_seterr(0x967ead8, write error to : Broken pipe)
amandad: time 59524.163: sending NAK pkt:
<<<<<
ERROR write error on stream 499999: write error to : Broken pipe
  
amandad: time 59524.163: security_stream_close(0x967ead8)
amandad: time 59524.163: security_stream_seterr(0x967ead8, write error to : Broken pipe)
amandad: time 59524.163: security_stream_close(0x9686b10)
amandad: time 59524.163: security_stream_seterr(0x9686b10, write error to : Broken pipe)
amandad: time 59524.163: security_stream_close(0x968eb48)
amandad: time 59524.163: security_stream_seterr(0x968eb48, write error to : Broken pipe)
amandad: time 59524.163: pid 19860 finish time Tue Oct 16 18:30:27 2007

So I can see how that might cause a hung process (strace'ing the
processes generally shows that they're read()ing on something
indefinitely)--but is there anyway to avoid this?

Thanks,
Cameron

  

Attachment: msilva.vcf
Description: Vcard

<Prev in Thread] Current Thread [Next in Thread>