Bacula-users

[Bacula-users] backup always fails from only one client with "Packet size too big"

2008-09-17 06:55:42
Subject: [Bacula-users] backup always fails from only one client with "Packet size too big"
From: Georg Weiß <georg.weiss AT avira DOT com>
To: bacula-users AT lists.sourceforge DOT net
Date: Wed, 17 Sep 2008 12:55:35 +0200
Hi

I have a problem with just one bacula-fd where all jobs fails with the
above shown message. All other bacula-fd (we have 8 running on other
servers) perform their duty quite well.

log extract of bacula-fd "-d100"
--8<--
...
guybrush-fd: job.c:1242-0 adj = 0 since_time=1221508800
guybrush-fd: job.c:233-0 <dird: storage
address=calculon.srv.tt.avira.com port=9103 ssl=0
guybrush-fd: job.c:249-0 Executing storage  command.
guybrush-fd: job.c:1297-0 StorageCmd: storage
address=calculon.srv.tt.avira.com port=9103 ssl=0
guybrush-fd: bsock.c:195-0 Current host[ipv4:10.0.0.1:9103] All
host[ipv4:10.0.0.1:9103]
guybrush-fd: bsock.c:149-0 who=Storage daemon
host=calculon.srv.tt.avira.com port=9103
guybrush-fd: cram-md5.c:133-0 cram-get received: auth cram-md5
<56603325.1221648661@calculon-sd> ssl=0
guybrush-fd: cram-md5.c:152-0 sending resp to challenge:
aXcovy+Yb8+5k6tqx8+TBC
guybrush-fd: cram-md5.c:80-0 send: auth cram-md5
<887093967.1221648660@guybrush-fd> ssl=0
guybrush-fd: cram-md5.c:99-0 Authenticate OK vQ/EAy/Id8g8371cpH/u/C
guybrush-fd: job.c:233-0 <dird: backup
guybrush-fd: job.c:249-0 Executing backup command.
guybrush-fd: job.c:1356-0 begin backup ff=1506b840
guybrush-fd: find.c:93-0 Enter set_find_options()
guybrush-fd: find.c:96-0 Leave set_find_options()
guybrush-fd: find.c:198-0 F /local/data/rd/build
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: heartbeat.c:95-0 wait_intr=0 stop=0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: heartbeat.c:95-0 wait_intr=0 stop=0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: heartbeat.c:95-0 wait_intr=0 stop=0
guybrush-fd: heartbeat.c:95-0 wait_intr=0 stop=0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: heartbeat.c:95-0 wait_intr=0 stop=0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: heartbeat.c:95-0 wait_intr=0 stop=0
guybrush-fd: heartbeat.c:90-0 Got BNET_SIG -4 from SD
guybrush-fd: heartbeat.c:95-0 wait_intr=1 stop=1
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: crypto.c:600-0 crypto_digest_new jcr=1506b2c0
guybrush-fd: heartbeat.c:139-0 Send kill to heartbeat id
guybrush-fd: backup.c:197-0 end blast_data ok=0
guybrush-fd: job.c:252-0 Quit command loop. Canceled=1
guybrush-fd: job.c:343-0 Calling term_find_files
guybrush-fd: job.c:346-0 Done with term_find_files
guybrush-fd: job.c:348-0 Done with free_jcr
...
--8<--

log extract of bacula-sd "-d200"
--8<--
...
calculon-sd: mount.c:363-0 Want dirVol=AVIRA_SALES_S3_T1 dirStat=Append
calculon-sd: mount.c:370-0 Vol OK name=AVIRA_SALES_S3_T1
calculon-sd: mount.c:267-0 Device previously written, moving to end of data
calculon-sd: dev.c:837-0 eod
calculon-sd: dev.c:884-0 Using EOM for EOM
calculon-sd: dev.c:907-0 EOD file=3
calculon-sd: dev.c:964-0 EOD dev->file=3
calculon-sd: mount.c:283-0 update volinfo mounts=16
calculon-sd: askdir.c:345-0 Update cat VolFiles=3
calculon-sd: askdir.c:368-0 >dird CatReq
Job=GUYBRUSH-BUILD.2008-09-17_12.50.43 UpdateMedia
VolName=AVIRA_SALES_S3_T1 VolJobs=3 VolFiles=3 VolBlocks=881
VolBytes=56899584 VolMounts=16 VolErrors=0 VolWrites=39117657
MaxVolBytes=0 EndTime=1221648661 VolStatus=Append Slot=1 relabel=0
InChanger=1 VolReadTime=0 VolWriteTime=2866018383 VolFirstWritten=0
VolParts=0
calculon-sd: askdir.c:182-0 <dird 1000 OK VolName=AVIRA_SALES_S3_T1
VolJobs=3 VolFiles=3 VolBlocks=881 VolBytes=56899584 VolMounts=16
VolErrors=0 VolWrites=39117657 MaxVolBytes=0 VolCapacityBytes=0
VolStatus=Append Slot=1 MaxVolJobs=0 MaxVolFiles=0 InChanger=1
VolReadTime=0 VolWriteTime=2866018383 EndFile=2 EndBlock=313 VolParts=0
LabelType=0 MediaId=111
calculon-sd: askdir.c:205-0 do_reqest_vol_info return true slot=1
Volume=AVIRA_SALES_S3_T1
calculon-sd: mount.c:293-0 set APPEND, normal return from
mount_next_write_volume. dev="Quantum_SDLT320" (/dev/nst0)
calculon-sd: acquire.c:389-0 Output pos=3:0
calculon-sd: askdir.c:345-0 Update cat VolFiles=3
calculon-sd: askdir.c:368-0 >dird CatReq
Job=GUYBRUSH-BUILD.2008-09-17_12.50.43 UpdateMedia
VolName=AVIRA_SALES_S3_T1 VolJobs=4 VolFiles=3 VolBlocks=881
VolBytes=56899584 VolMounts=16 VolErrors=0 VolWrites=39117657
MaxVolBytes=0 EndTime=1221648661 VolStatus=Append Slot=1 relabel=0
InChanger=1 VolReadTime=0 VolWriteTime=2866018383 VolFirstWritten=0
VolParts=0
calculon-sd: askdir.c:182-0 <dird 1000 OK VolName=AVIRA_SALES_S3_T1
VolJobs=4 VolFiles=3 VolBlocks=881 VolBytes=56899584 VolMounts=16
VolErrors=0 VolWrites=39117657 MaxVolBytes=0 VolCapacityBytes=0
VolStatus=Append Slot=1 MaxVolJobs=0 MaxVolFiles=0 InChanger=1
VolReadTime=0 VolWriteTime=2866018383 EndFile=2 EndBlock=313 VolParts=0
LabelType=0 MediaId=111
calculon-sd: askdir.c:205-0 do_reqest_vol_info return true slot=1
Volume=AVIRA_SALES_S3_T1
calculon-sd: reserve.c:490-0 Dec reserve=0 dev="Quantum_SDLT320" (/dev/nst0)
calculon-sd: jcr.c:603-0 OnEntry JobStatus=R set=R
calculon-sd: jcr.c:623-0 OnExit JobStatus=R set=R
calculon-sd: append.c:96-0 Begin append device="Quantum_SDLT320" (/dev/nst0)
calculon-sd: append.c:101-0 Just after acquire_device_for_append
calculon-sd: label.c:725-0 session_label record=16b28298
calculon-sd: label.c:770-0 Write sesson_label record JobId=28109
FI=SOS_LABEL SessId=1 Strm=28109 len=174 remainder=0
calculon-sd: label.c:774-0 Leave write_session_label Block=0d File=3d
calculon-sd: bnet.c:667-0 who=client host=10.0.0.1 port=36643
calculon-sd: dircmd.c:173-0 Conn: Hello Director calculon-dir calling
calculon-sd: dircmd.c:188-0 Got a DIR connection at 17-Sep-2008 12:51:32
calculon-sd: jcr.c:603-0 OnEntry JobStatus=calculon-sd: jcr.c:623-0
OnExit JobStatus=C set=C
calculon-sd: cram-md5.c:73-0 send: auth cram-md5
<1732267464.1221648692@calculon-sd> ssl=0
calculon-sd: cram-md5.c:133-0 cram-get received: auth cram-md5
<95948430.1221648692@calculon-dir> ssl=0
calculon-sd: cram-md5.c:152-0 sending resp to challenge:
g8h5hA++d5/OE2MAv7wk6D
calculon-sd: dircmd.c:210-0 Message channel init completed.
calculon-sd: dircmd.c:217-0 <dird: cancel
Job=GUYBRUSH-BUILD.2008-09-17_12.50.43
calculon-sd: dircmd.c:231-0 Do command: cancel
calculon-sd: jcr.c:603-0 OnEntry JobStatus=R set=A
calculon-sd: jcr.c:623-0 OnExit JobStatus=A set=A
calculon-sd: mem_pool.c:377-0 garbage collect memory pool
calculon-sd: jcr.c:603-0 OnEntry JobStatus=A set=T
calculon-sd: jcr.c:623-0 OnExit JobStatus=A set=T
calculon-sd: fd_cmds.c:160-0 <filed: 39 6 0calculon-sd: fd_cmds.c:173-0
<filed: Command not found: 39 6 0
calculon-sd: append.c:284-0 Write EOS label JobStatus=A
calculon-sd: label.c:725-0 session_label record=16b25898
calculon-sd: label.c:770-0 Write sesson_label record JobId=28109
FI=EOS_LABEL SessId=1 Strm=28109 len=210 remainder=0
calculon-sd: label.c:774-0 Leave write_session_label Block=1055d File=3d
calculon-sd: append.c:301-0 back from write_end_session_label()
calculon-sd: acquire.c:427-0 release_device device "Quantum_SDLT320"
(/dev/nst0) is tape
calculon-sd: acquire.c:446-0 There are 0 writers in release_device
calculon-sd: acquire.c:449-0 dir_create_jobmedia. Release
vol=AVIRA_SALES_S3_T1 dev="Quantum_SDLT320" (/dev/nst0)
calculon-sd: askdir.c:412-0 >dird CatReq
Job=GUYBRUSH-BUILD.2008-09-17_12.50.43 CreateJobMedia FirstIndex=1
LastIndex=38 StartFile=3 EndFile=3 StartBlock=0 EndBlock=1055 Copy=0
Strip=0 MediaId=111
calculon-sd: askdir.c:414-0 create_jobmedia error bnet_recv
calculon-sd: jcr.c:603-0 OnEntry JobStatus=A set=f
calculon-sd: jcr.c:623-0 OnExit JobStatus=A set=f
calculon-sd: jcr.c:603-0 OnEntry JobStatus=A set=f
calculon-sd: jcr.c:623-0 OnExit JobStatus=A set=f
calculon-sd: dev.c:1648-0 weof_dev
calculon-sd: askdir.c:345-0 Update cat VolFiles=4
calculon-sd: askdir.c:368-0 >dird calculon-sd: askdir.c:177-0 getvolname
error bnet_recv
calculon-sd: jcr.c:603-0 OnEntry JobStatus=A set=f
calculon-sd: jcr.c:623-0 OnExit JobStatus=A set=f
calculon-sd: askdir.c:374-0 Didn't get vol info vol=AVIRA_SALES_S3_T1:
ERR=Network error on bnet_recv in req_vol_info.
calculon-sd: acquire.c:464-0 dir_update_vol_info. Release
vol=AVIRA_SALES_S3_T1 dev="Quantum_SDLT320" (/dev/nst0)
calculon-sd: reserve.c:553-0 jid=28109 === set not reserved
vol=AVIRA_SALES_S3_T1 num_writers=0 dev_reserved=0 dev="Quantum_SDLT320"
(/dev/nst0)
calculon-sd: reserve.c:554-0 === clear in_use vol=AVIRA_SALES_S3_T1
calculon-sd: acquire.c:481-0 0 writers, 0 reserve, dev="Quantum_SDLT320"
(/dev/nst0)
calculon-sd: reserve.c:189-0 jid=28109 List acquire:release_device():
AVIRA_SALES_S3_T1 in_use=0 on device "Quantum_SDLT320" (/dev/nst0)
calculon-sd: acquire.c:519-0 JobId=28109 broadcast wait_device_release
at 17-Sep-2008 12:51:58
calculon-sd: acquire.c:534-0 ===== Device "Quantum_SDLT320" (/dev/nst0)
released by JobId=28109
calculon-sd: append.c:341-0 return from do_append_data() ok=1
calculon-sd: jcr.c:603-0 OnEntry JobStatus=A set=E
calculon-sd: jcr.c:623-0 OnExit JobStatus=A set=E
calculon-sd: jcr.c:603-0 OnEntry JobStatus=A set=T
calculon-sd: jcr.c:623-0 OnExit JobStatus=A set=T
calculon-sd: dircmd.c:234-0 Command run reqeusts quit
calculon-sd: mem_pool.c:377-0 garbage collect memory pool
...
--8<--

The network setup is quite easy. All machines are connected to a special
"backup" vlan (10.0.0.0/16).

All fd's,sd's and the dir are running on gentoo linux using the lastest
bacula version 2.4.2.

I can provide more/full logs on request.

Thanks for any help.

-- 
Mit freundlichen Gruessen

Georg Weiss, Systemadministrator
Tel: +49 (0) 7542-500 700 ServiceDesk
Tel: +49 (0) 7542-500 708 Durchwahl
Fax: +49 (0) 7542-500 724

Avira GmbH
Lindauer Str. 21, D-88069 Tettnang, Germany


Gesch�ftsf�hrender Gesellschafter: Tjark Auerbach
Sitz der Gesellschaft: Tettnang
Handelsregister: Amtsgericht Ulm, HRB 630992

==================================================
Dieser Virenschutz ist so gut, dass wir ihn erweitert haben: Avira AntiVir
gibt es jetzt auch f�r Microsoft Exchange Cluster! www.avira.de
==================================================
ALLGEMEINE GESCH�FTSBEDINGUNGEN
Es gelten unsere Allgemeinen Gesch�ftsbedingungen
(AGB). Sie finden sie in der jeweils g�ltigen Fassung
im Internet unter http://www.avira.de/agb
***************************************************

-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
<Prev in Thread] Current Thread [Next in Thread>