Bacula-users

[Bacula-users] Bacula SD Broken Pipe after 16 minutes after ...

2014-07-07 02:43:58
Subject: [Bacula-users] Bacula SD Broken Pipe after 16 minutes after ...
From: dave <bacula-forum AT backupcentral DOT com>
To: bacula-users AT lists.sourceforge DOT net
Date: Sun, 06 Jul 2014 23:41:34 -0700
Hi !

I'm back from vacation. Smile

thanks for your tips. Unfortunately the Heartbeat won't help. I have upgraded 
meanwhile to 7.0.4. Just came into office to see that the weekend backup failed 
again with Heartbeat set on the client/server on all daemons to 300. As 
suggested.

- It just happens wenn MaxSpoolCache Size gets hist.
- It despools for exactly 16:12 min and then breaks. (what kind of timeout 
would that be ?)

Also did ....
- Switched network card on bacula server
- Removed on LTO drive (running single now)
- Switched SAS Port (on Library)

05-Jul 15:42 srv-bacula-dir JobId 9856: Start Backup JobId 9856, 
Job=cli-bacula-data.2014-07-05_15.42.00_34
05-Jul 15:42 srv-bacula-dir JobId 9856: Using Device "tapelib-drive0" to write.
05-Jul 15:42 srv-bacula-sd JobId 9856: Spooling data ...
05-Jul 23:10 srv-bacula-sd JobId 9856: User specified Device spool size 
reached: DevSpoolSize=800,000,016,969 MaxDevSpoolSize=800,000,000,000
05-Jul 23:10 srv-bacula-sd JobId 9856: Writing spooled data to Volume. 
Despooling 800,000,016,969 bytes ...
05-Jul 23:26 srv-client-fd JobId 9856: Error: bsock.c:428 Write error sending 
65540 bytes to Storage daemon:srv-bacula:9103: ERR=Broken pipe
05-Jul 23:26 srv-client-fd JobId 9856: Fatal error: backup.c:1200 Network send 
error to SD. ERR=Broken pipe
05-Jul 23:26 srv-bacula-sd JobId 9856: Despooling elapsed time = 00:16:12, 
Transfer rate = 823.0 M Bytes/second
05-Jul 23:26 srv-bacula-dir JobId 9856: Error: Director's connection to SD for 
this Job was lost.
05-Jul 23:26 srv-bacula-dir JobId 9856: Error: Bacula srv-bacula-dir 7.0.4 
(04Jun14):


Again - I am desperate. No clue what else todo to get it running.
- Why is this happening when it starts despooling from MaxSpoolCache size ?
- What has the client to do with despooling ? (05-Jul 23:26) cause the data is 
in the cache on the server.
- Why after 16:12min ?

After restarting the job - in 95% of the retries the backup completes.

Many thanks
--> David

+----------------------------------------------------------------------
|This was sent by dwa AT espros DOT ch via Backup Central.
|Forward SPAM to abuse AT backupcentral DOT com.
+----------------------------------------------------------------------



------------------------------------------------------------------------------
Open source business process management suite built on Java and Eclipse
Turn processes into business applications with Bonita BPM Community Edition
Quickly connect people, data, and systems into organized workflows
Winner of BOSSIE, CODIE, OW2 and Gartner awards
http://p.sf.net/sfu/Bonitasoft
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>