Amanda-Users

Re: 2.5.1p2: planner returns error=EOF on read

2006-11-27 10:41:33
Subject: Re: 2.5.1p2: planner returns error=EOF on read
From: Jean-Francois Malouin <Jean-Francois.Malouin AT bic.mni.mcgill DOT ca>
To: Jean-Louis Martineau <martineau AT zmanda DOT com>
Date: Mon, 27 Nov 2006 10:32:40 -0500
* Jean-Louis Martineau <martineau AT zmanda DOT com> [20061127 09:24]:
> Jean-Francois,
> 
> Could you try the attached patch, it should kill the tar process.
> If it still doesn't work, could you attach a debugger to the sendbackup 
> process to know where it is?

Thanks Jean-Louis. 
I'm recompiling and will let you know what happens.

jf

> 
> Jean-Louis
> 
> Jean-Francois Malouin wrote:
> >* Jean-Louis Martineau <martineau AT zmanda DOT com> [20061123 17:20]:
> >  
> >>Do amandad is running?
> >>    
> >
> >To be sure that I would hit the problem again I didn't load new tapes
> >so after the holddding filled up I have a bunch of gtar running and
> >just wasting a lot of cycles. A typical example (sorry for the long lines):
> >
> >  amanda    3555936    3557855  0        - -       0:00 <defunct> 
> >  amanda    3557855          1  0 05:37:39 ?       1:30 
> >  /opt/amanda/amanda1/libexec/sendbackup amandad bsdtcp
> >    root    3568087    3557855  0 05:40:02 ?      277:35 gtar --create 
> >    --file - --directory /data/mafalda/mafalda1/susanita/sandra/DTI -
> >
> >Tracing the sendbackup process yields no output while the gtar one
> >gives just an uninterupted flow of lines:
> >
> >   68mS             tar(3568087): write(1, <fd 63 e3 6c ff 42 3c df 87
> >41 82 bf e9 dd 95 2c>..., 10240) errno = 32 (Broken pipe)
> >
> >Don't know if this is going to be of any help...
> >jf
> >
> >  
> >>Could you try to find what each process (amanda,sendbackup,tar) are 
> >>doing? on which system call they are hung?
> >>On linux, I would use strace for that.
> >>
> >>Jean-Louis
> >>
> >>Jean-Francois Malouin wrote:
> >>    
> >>>Jean-Louis,
> >>>
> >>>The amdump just finished and because amanda ran out of tapes
> >>>(runtapes=10) it completed badly leaving stuff in the holdding
> >>>disk. I'm flushing it at the moment but I noticed that when amanda
> >>>gave up it left a lot of gnutar lying around, with amandad as parent.
> >>>I had to manually kill them. Looks like signals are not doing ok.
> >>>I've attached one example of a DLE's sendbackup and runtar debug file.
> >>>It's not the first time I notice that when a DLE fails to make it
> >>>to tape successfully processes are left running...
> >>>
> >>>* Jean-Louis Martineau <martineau AT zmanda DOT com> [20061122 09:32]:
> >>> 
> >>>      
> >>>>Could you post amandad.<timestamp>.debug file from yorick?
> >>>>
> >>>>Jean-Francois Malouin wrote:
> >>>>   
> >>>>        
> >>>>>On a server running irix-6.5 and amanda 2.5.1p2
> >>>>>planner debug shows:
> >>>>>
> >>>>>security_getdriver(name=bsdtcp) returns 4075aa8
> >>>>>security_handleinit(handle=1001d4a8, driver=4075aa8 (BSDTCP))
> >>>>>security_streaminit(stream=1005e538, driver=4075aa8 (BSDTCP))
> >>>>>security_close(handle=1001cfe0, driver=4075aa8 (BSDTCP))
> >>>>>security_stream_close(10053a80)
> >>>>>security_stream_seterr(1005e538, SOCKET_EOF)
> >>>>>security_seterror(handle=1001d4a8, driver=4075aa8 (BSDTCP) error=EOF
> >>>>>on read from yorick)
> >>>>>security_close(handle=1001d4a8, driver=4075aa8 (BSDTCP))
> >>>>>security_stream_close(1005e538)
> >>>>>
> >>>>>and the amanda report shows things like:
> >>>>>
> >>>>>yorick  DATA_sub101   lev 0  FAILED [hmm, disk was stranded on waitq]
> >>>>>planner: ERROR Request to yorick failed: EOF on read from yorick
> >>>>>
> >>>>>and the backup fails. The funny thing is that I have 2 other 
> >>>>>configurations
> >>>>>running 2.5.1p2 in parallel that doesn't exhibit this behaviour.
> >>>>>Any clues?
> >>>>>jf
> >>>>>
> >>>>>     
> >>>>>          
> >>> 
> >>>------------------------------------------------------------------------
> >>>
> >>>runtar: debug 1 pid 2826413 ruid 666 euid 0: start at Thu Nov 23 
> >>>08:04:13 2006
> >>>runtar: version 2.5.1p2
> >>>/usr/freeware/bin/tar version: tar (GNU tar) 1.13.25
> >>>
> >>>config: stk_80-conf1
> >>>runtar: debug 1 pid 2826413 ruid 0 euid 0: rename at Thu Nov 23 08:04:13 
> >>>2006
> >>>running: /usr/freeware/bin/tar: 'gtar' '--create' '--file' '-' 
> >>>'--directory' 
> >>>'/data/mafalda/mafalda1/susanita/jen/anxiety_version1/sub101' 
> >>>'--one-file-system' '--listed-incremental' 
> >>>'/opt/amanda/amanda1/var/amanda/gnutar-lists/yoricksub101_1.new' 
> >>>'--sparse' '--ignore-failed-read' '--totals' '.' runtar: pid 2826413 
> >>>finish time Thu Nov 23 08:04:13 2006
> >>> 
> >>>------------------------------------------------------------------------
> >>>
> >>>sendbackup: debug 1 pid 2836263 ruid 666 euid 666: start at Thu Nov 23 
> >>>08:03:20 2006
> >>>sendbackup: version 2.5.1p2
> >>>Could not open conf file 
> >>>"/opt/amanda/amanda1/etc/amanda/amanda-client.conf": No such file or 
> >>>directory
> >>>Reading conf file 
> >>>"/opt/amanda/amanda1/etc/amanda/stk_80-conf1/amanda-client.conf".
> >>>sendbackup: debug 1 pid 2836263 ruid 666 euid 666: rename at Thu Nov 23 
> >>>08:03:20 2006
> >>> sendbackup req: <GNUTAR sub101 
> >>> /data/mafalda/mafalda1/susanita/jen/anxiety_version1/sub101 1 
> >>> 2006:11:13:13:21:32 OPTIONS |;auth=bsdtcp;index;>
> >>> parsed request as: program `GNUTAR'
> >>>                    disk `sub101'
> >>>                    device 
> >>>                    
> >>> `/data/mafalda/mafalda1/susanita/jen/anxiety_version1/sub101'
> >>>                    level 1
> >>>                    since 2006:11:13:13:21:32
> >>>                    options `|;auth=bsdtcp;index;'
> >>>sendbackup: start: yorick:sub101 lev 1
> >>>sendbackup-gnutar: time 0.208: doing level 1 dump as listed-incremental 
> >>>      
> >>>from '/opt/amanda/amanda1/var/amanda/gnutar-lists/yoricksub101_0' to 
> >>    
> >>>'/opt/amanda/amanda1/var/amanda/gnutar-lists/yoricksub101_1.new'
> >>>sendbackup-gnutar: time 53.057: doing level 1 dump from date: 2006-11-13 
> >>>13:22:26 GMT
> >>>sendbackup: time 53.246: started index creator: "/usr/freeware/bin/tar 
> >>>-tf - 2>/dev/null | sed -e 's/^\.//'"
> >>>sendbackup: time 53.254: spawning /opt/amanda/amanda1/libexec/runtar in 
> >>>pipeline
> >>>sendbackup: argument list: runtar stk_80-conf1 gtar --create --file - 
> >>>--directory /data/mafalda/mafalda1/susanita/jen/anxiety_version1/sub101 
> >>>--one-file-system --listed-incremental 
> >>>/opt/amanda/amanda1/var/amanda/gnutar-lists/yoricksub101_1.new --sparse 
> >>>--ignore-failed-read --totals .
> >>>sendbackup-gnutar: time 53.292: /opt/amanda/amanda1/libexec/runtar: pid 
> >>>2826413
> >>>sendbackup: time 53.292: started backup
> >>>sendbackup: time 1839.833: index tee cannot write [Broken pipe]
> >>>sendbackup: time 1839.833: pid 2836285 finish time Thu Nov 23 08:34:00 
> >>>2006
> >>>sendbackup: time 1839.926: 118: strange(?): sendbackup: index tee cannot 
> >>>write [Broken pipe]
> >>>sendbackup: time 9284.787: error [/usr/freeware/bin/tar got signal 15]
> >>>sendbackup: time 9284.856: pid 2836263 finish time Thu Nov 23 10:38:05 
> >>>2006
> >>> 
> >>>      
> >
> >  
> 



-- 
<° ><

<Prev in Thread] Current Thread [Next in Thread>