Re: Hung processes w/ Amanda over SSH
2007-10-17 11:02:17
Cameron:
I had a similar problem:
I am backing up Data Base servers with amanda, and because DB Servers
need them to be backed up at different schedule I have a configuration
for each data base server, and I have a master script that
starts/shutdown Database after/before amanda perform the backup.
My script was hunging up all the proccess even when the backup was
performed pretty fine, every morning I had to check in the console if
amanda ended the data base backups because amanda process was hunged
trying to send the email report.
I decided the re-write my script and I took out from my script many
switching sessions I had, the start/shutdown part was made with SSH
auth, but the backup is made with regular amdump, after re-write my
script and delete a lot of SU sessions my script is not hunging amanda
sessions.
I hope this can help you Cameron
have a great day !
mario
--
Mario Silva
Systems Administrator
Supreme Court of New Mexico
Judicial Information Division
2905 Rodeo Park Dr. East, Bldg. #5
Santa Fe, NM 87505
Phone: (505) 476-6959 / Mobil: (505) 660-1026
Fax: (505) 476-6952
Website: http://www.nmcourts.gov
mailto: msilva AT nmcourts DOT gov
LEGAL DISCLAIMER:
The content of this data transmission is not considered as an offer, proposal, understanding, or agreement unless it is confirmed in a document signed by a legal representative of Supreme Court of the State of New Mexico or the Judicial Information Division. The content of this data transmission is confidential and it is intended to be delivered only to the addresses, therefore, it shall not be distributed and/or disclosed through any mean without the original sender's previous authorization. If you are not the addressee you are forbidden to use it, either totally or partially, for any purpose.
AVISO LEGAL:
El contenido de este mensaje de datos no se considera oferta, propuesta o acuerdo, sino hasta que sea confirmado en documento por escrito que contenga la firma autógrafa del apoderado legal de La Suprema Corte del Estado de Nuevo Mexico o de la Division de Informatica del Estado. El contenido de este mensaje de datos es confidencial y se entiende dirigido y para uso exclusivo del destinatario, por lo que no podrá distribuirse y/o difundirse por ningún medio sin la previa autorización del emisor original. Si usted no es el destinatario, se le prohíbe su utilización total o parcial para cualquier fin.
Cameron Matheson wrote:
Hi Guys,
I've installed Amanda v2.5.2p1 on my servers using SSH auth. The
backups are working fine (Dumps come in good, and I can restore w/out
any trouble), but I'm seeing a whole bunch of hung ssh/amandad/tar
processes on my clients. I'm not real clear on what's causing this
(maybe it's the estimates, since the backups are coming in fine?). The
only thing I've been able to find in my logs that looks odd is the
following (taken from one of my clients):
amandad: time 44.823: security_close(handle=0x9666220, driver=0xd20e60 (SSH))
amandad: time 44.823: security_stream_close(0x9696b80)
amandad: time 59524.148: security_stream_seterr(0x967ead8, write error to : Broken pipe)
amandad: time 59524.163: sending NAK pkt:
<<<<<
ERROR write error on stream 499999: write error to : Broken pipe
amandad: time 59524.163: security_stream_close(0x967ead8)
amandad: time 59524.163: security_stream_seterr(0x967ead8, write error to : Broken pipe)
amandad: time 59524.163: security_stream_close(0x9686b10)
amandad: time 59524.163: security_stream_seterr(0x9686b10, write error to : Broken pipe)
amandad: time 59524.163: security_stream_close(0x968eb48)
amandad: time 59524.163: security_stream_seterr(0x968eb48, write error to : Broken pipe)
amandad: time 59524.163: pid 19860 finish time Tue Oct 16 18:30:27 2007
So I can see how that might cause a hung process (strace'ing the
processes generally shows that they're read()ing on something
indefinitely)--but is there anyway to avoid this?
Thanks,
Cameron
|
msilva.vcf
Description: Vcard
|
|
|