Bacula-users

Re: [Bacula-users] bacula-sd crashing on Suse 10SP2

2009-10-02 07:04:13
Subject: Re: [Bacula-users] bacula-sd crashing on Suse 10SP2
From: Evan Fraser <Evan.Fraser AT rms DOT com>
To: Evan Fraser <Evan.Fraser AT rms DOT com>, "bacula-users AT lists.sourceforge DOT net" <bacula-users AT lists.sourceforge DOT net>
Date: Fri, 2 Oct 2009 12:02:37 +0100

I’ve pasted a backtrace below, many thanks for any help.

 

Using host libthread_db library "/lib64/libthread_db.so.1".

[Thread debugging using libthread_db enabled] [New Thread 47010999105392 (LWP 23567)] [New Thread 1140881728 (LWP 24523)] [New Thread 1132489024 (LWP 23871)] [New Thread 1124096320 (LWP 23867)] [New Thread 1082132800 (LWP 23862)] [New Thread 1115703616 (LWP 23856)] [New Thread 1107310912 (LWP 23850)] [New Thread 1098918208 (LWP 23845)] [New Thread 1090525504 (LWP 23570)]

0x00002ac199ce8a02 in select () from /lib64/libc.so.6

$1 = "uklxhome0-sd", '\0' <repeats 17 times>

$2 = 0x54e088 "bacula-sd"

$3 = 0x54e0c8 "/usr/sbin/bacula-sd"

$4 = 0x0

$5 = 0x2ac1990f8dc2 "3.0.2 (18 July 2009)"

$6 = 0x2ac1990f8dd7 "x86_64-suse-linux-gnu"

$7 = 0x2ac1990f8ded "suse"

$8 = 0x2ac1990f8df2 "10"

$9 = "uklxhome0", '\0' <repeats 40 times> #0  0x00002ac199ce8a02 in select () from /lib64/libc.so.6

#1  0x00002ac1990cd517 in bnet_thread_server (addrs=0x54eb98, max_clients=41,

    client_wq=0x54d620,

    handle_client_request=0x41cb40 <handle_connection_request(void*)>)

    at bnet_server.c:161

#2  0x0000000000407cb9 in main (argc=<value optimized out>,

    argv=<value optimized out>) at stored.c:306

#3  0x00002ac199c4b164 in __libc_start_main () from /lib64/libc.so.6

#4  0x0000000000406fe9 in _start ()

 

Thread 9 (Thread 1090525504 (LWP 23570)):

#0  0x00002ac19951c3b7 in pthread_cond_timedwait@@GLIBC_2.3.2 ()

   from /lib64/libpthread.so.0

#1  0x00002ac1990f083b in watchdog_thread (arg=<value optimized out>)

    at watchdog.c:308

#2  0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#3  0x00002ac199ceebed in clone () from /lib64/libc.so.6

#4  0x0000000000000000 in ?? ()

 

Thread 8 (Thread 1098918208 (LWP 23845)):

#0  0x00002ac19951e8bb in write () from /lib64/libpthread.so.0

#1  0x0000000000415260 in DEVICE::write (this=0x5679b8, buf=0x2aaaab46e050,

    len=2000000) at dev.c:2340

#2  0x0000000000413940 in write_block_to_dev (dcr=0x574798) at block.c:545

#3  0x0000000000414504 in write_block_to_device (dcr=0x574798) at block.c:382

#4  0x000000000040eaed in do_append_data (jcr=0x56d848) at append.c:202

#5  0x00000000004215a5 in append_data_cmd (jcr=0x56d848) at fd_cmds.c:199

#6  0x0000000000420eac in do_fd_commands (jcr=0x56d848) at fd_cmds.c:162

#7  0x0000000000421716 in run_job (jcr=0x56d848) at fd_cmds.c:124

#8  0x0000000000421c07 in run_cmd (jcr=0x56d848) at job.c:211

#9  0x000000000041ceb0 in handle_connection_request (arg=<value optimized out>)

    at dircmd.c:233

#10 0x00002ac1990f106e in workq_server (arg=<value optimized out>)

    at workq.c:346

#11 0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#12 0x00002ac199ceebed in clone () from /lib64/libc.so.6

#13 0x0000000000000000 in ?? ()

 

Thread 7 (Thread 1107310912 (LWP 23850)):

#0  0x00002ac19951f72f in waitpid () from /lib64/libpthread.so.0

#1  0x00002ac1990e7e3c in signal_handler (sig=11) at signal.c:211

#2  <signal handler called>

#3  e_msg (file=0x2ac1990fa5ed "smartall.c", line=217, type=1,

    level=<value optimized out>,

    fmt=0x2ac1990fa560 "Buffer overrun called from %s:%d\n") at message.c:1075

#4  0x00002ac1990e8c36 in sm_free (file=0x2ac1990f934d "mem_pool.c", line=213,

    fp=0x2aaaab657038) at smartall.c:217

#5  0x00002ac1990e0bfd in sm_free_pool_memory (fname=0x43bf34 "block.c",

    lineno=171, obuf=0x2aaaab657050 "ÂH") at mem_pool.c:213

#6  0x0000000000411853 in free_block (block=0x56fca0) at block.c:171

#7  0x000000000040cea0 in free_dcr (dcr=0x56f808) at acquire.c:682

#8  0x000000000040d30a in release_device (dcr=0x56f808) at acquire.c:546

#9  0x000000000040f03f in do_append_data (jcr=0x56eec8) at append.c:318 #10 0x00000000004215a5 in append_data_cmd (jcr=0x56eec8) at fd_cmds.c:199

#11 0x0000000000420eac in do_fd_commands (jcr=0x56eec8) at fd_cmds.c:162

#12 0x0000000000421716 in run_job (jcr=0x56eec8) at fd_cmds.c:124

#13 0x0000000000421c07 in run_cmd (jcr=0x56eec8) at job.c:211

#14 0x000000000041ceb0 in handle_connection_request (arg=<value optimized out>)

    at dircmd.c:233

#15 0x00002ac1990f106e in workq_server (arg=<value optimized out>)

    at workq.c:346

#16 0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#17 0x00002ac199ceebed in clone () from /lib64/libc.so.6

#18 0x0000000000000000 in ?? ()

 

Thread 6 (Thread 1115703616 (LWP 23856)):

#0  bcrc32 (buf=<value optimized out>, len=1999996) at crc32.c:137

#1  0x0000000000411d02 in ser_block_header (block=0x571790) at block.c:211

#2  0x00000000004136d9 in write_block_to_dev (dcr=0x5712f8) at block.c:469

#3  0x0000000000414504 in write_block_to_device (dcr=0x5712f8) at block.c:382

#4  0x000000000040eaed in do_append_data (jcr=0x570478) at append.c:202

#5  0x00000000004215a5 in append_data_cmd (jcr=0x570478) at fd_cmds.c:199

#6  0x0000000000420eac in do_fd_commands (jcr=0x570478) at fd_cmds.c:162

#7  0x0000000000421716 in run_job (jcr=0x570478) at fd_cmds.c:124

#8  0x0000000000421c07 in run_cmd (jcr=0x570478) at job.c:211

#9  0x000000000041ceb0 in handle_connection_request (arg=<value optimized out>)

    at dircmd.c:233

#10 0x00002ac1990f106e in workq_server (arg=<value optimized out>)

    at workq.c:346

#11 0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#12 0x00002ac199ceebed in clone () from /lib64/libc.so.6

#13 0x0000000000000000 in ?? ()

Current language:  auto; currently c++

 

Thread 5 (Thread 1082132800 (LWP 23862)):

#0  0x00002ac199ce8097 in ioctl () from /lib64/libc.so.6

#1  0x0000000000416759 in DEVICE::weof (this=0x56c158, num=1) at dev.c:1713

#2  0x0000000000413dd9 in write_block_to_dev (dcr=0x5a7468) at block.c:498

#3  0x0000000000414504 in write_block_to_device (dcr=0x5a7468) at block.c:382

#4  0x000000000040eaed in do_append_data (jcr=0x5a6438) at append.c:202

#5  0x00000000004215a5 in append_data_cmd (jcr=0x5a6438) at fd_cmds.c:199

#6  0x0000000000420eac in do_fd_commands (jcr=0x5a6438) at fd_cmds.c:162

#7  0x0000000000421716 in run_job (jcr=0x5a6438) at fd_cmds.c:124

#8  0x0000000000421c07 in run_cmd (jcr=0x5a6438) at job.c:211

#9  0x000000000041ceb0 in handle_connection_request (arg=<value optimized out>)

    at dircmd.c:233

#10 0x00002ac1990f106e in workq_server (arg=<value optimized out>)

    at workq.c:346

#11 0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#12 0x00002ac199ceebed in clone () from /lib64/libc.so.6

#13 0x0000000000000000 in ?? ()

 

Thread 4 (Thread 1124096320 (LWP 23867)):

#0  0x00002ac19951e93b in read () from /lib64/libpthread.so.0

#1  0x00002ac1990cc296 in read_nbytes (bsock=0x5bc658,

    ptr=0x5bce00 "\003\004", nbytes=65536) at bnet.c:80

#2  0x00002ac1990cf9e8 in BSOCK::recv (this=0x5bc658) at bsock.c:509

#3  0x00002ac1990cbd6f in bget_msg (sock=0x5bc658) at bget_msg.c:60

#4  0x000000000040ea53 in do_append_data (jcr=0x5b8ab8) at append.c:186

#5  0x00000000004215a5 in append_data_cmd (jcr=0x5b8ab8) at fd_cmds.c:199

#6  0x0000000000420eac in do_fd_commands (jcr=0x5b8ab8) at fd_cmds.c:162

#7  0x0000000000421716 in run_job (jcr=0x5b8ab8) at fd_cmds.c:124

#8  0x0000000000421c07 in run_cmd (jcr=0x5b8ab8) at job.c:211

#9  0x000000000041ceb0 in handle_connection_request (arg=<value optimized out>)

    at dircmd.c:233

#10 0x00002ac1990f106e in workq_server (arg=<value optimized out>)

    at workq.c:346

#11 0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#12 0x00002ac199ceebed in clone () from /lib64/libc.so.6

#13 0x0000000000000000 in ?? ()

 

Thread 3 (Thread 1132489024 (LWP 23871)):

#0  0x00002ac19951e8bb in write () from /lib64/libpthread.so.0

#1  0x0000000000415260 in DEVICE::write (this=0x56e128, buf=0x2aaaab250050,

    len=2000000) at dev.c:2340

#2  0x0000000000413940 in write_block_to_dev (dcr=0x5bbe78) at block.c:545

#3  0x0000000000414504 in write_block_to_device (dcr=0x5bbe78) at block.c:382

#4  0x000000000040eaed in do_append_data (jcr=0x5ba878) at append.c:202

#5  0x00000000004215a5 in append_data_cmd (jcr=0x5ba878) at fd_cmds.c:199

#6  0x0000000000420eac in do_fd_commands (jcr=0x5ba878) at fd_cmds.c:162

#7  0x0000000000421716 in run_job (jcr=0x5ba878) at fd_cmds.c:124

#8  0x0000000000421c07 in run_cmd (jcr=0x5ba878) at job.c:211

#9  0x000000000041ceb0 in handle_connection_request (arg=<value optimized out>)

    at dircmd.c:233

#10 0x00002ac1990f106e in workq_server (arg=<value optimized out>)

    at workq.c:346

#11 0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#12 0x00002ac199ceebed in clone () from /lib64/libc.so.6

#13 0x0000000000000000 in ?? ()

 

Thread 2 (Thread 1140881728 (LWP 24523)):

#0  0x00002ac19951e93b in read () from /lib64/libpthread.so.0

#1  0x00002ac1990cc296 in read_nbytes (bsock=0x5df318, ptr=0x44006ad4 "",

    nbytes=4) at bnet.c:80

#2  0x00002ac1990cf7ff in BSOCK::recv (this=0x5df318) at bsock.c:451

#3  0x00002ac1990cbd6f in bget_msg (sock=0x5df318) at bget_msg.c:60

#4  0x000000000040ea53 in do_append_data (jcr=0x5de6b8) at append.c:186

#5  0x00000000004215a5 in append_data_cmd (jcr=0x5de6b8) at fd_cmds.c:199

#6  0x0000000000420eac in do_fd_commands (jcr=0x5de6b8) at fd_cmds.c:162

#7  0x0000000000421716 in run_job (jcr=0x5de6b8) at fd_cmds.c:124

#8  0x0000000000421c07 in run_cmd (jcr=0x5de6b8) at job.c:211

#9  0x000000000041ceb0 in handle_connection_request (arg=<value optimized out>)

    at dircmd.c:233

#10 0x00002ac1990f106e in workq_server (arg=<value optimized out>)

    at workq.c:346

#11 0x00002ac199518143 in start_thread () from /lib64/libpthread.so.0

#12 0x00002ac199ceebed in clone () from /lib64/libc.so.6

#13 0x0000000000000000 in ?? ()

 

Thread 1 (Thread 47010999105392 (LWP 23567)):

#0  0x00002ac199ce8a02 in select () from /lib64/libc.so.6

#1  0x00002ac1990cd517 in bnet_thread_server (addrs=0x54eb98, max_clients=41,

    client_wq=0x54d620,

    handle_client_request=0x41cb40 <handle_connection_request(void*)>)

    at bnet_server.c:161

#2  0x0000000000407cb9 in main (argc=<value optimized out>,

    argv=<value optimized out>) at stored.c:306

#3  0x00002ac199c4b164 in __libc_start_main () from /lib64/libc.so.6

#4  0x0000000000406fe9 in _start ()

#0  0x00002ac199ce8a02 in select () from /lib64/libc.so.6 #0  0x00002ac199ce8a02 in select () from /lib64/libc.so.6 No symbol table info available.

#1  0x00002ac1990cd517 in bnet_thread_server (addrs=0x54eb98, max_clients=41,

    client_wq=0x54d620,

    handle_client_request=0x41cb40 <handle_connection_request(void*)>)

    at bnet_server.c:161

161         if ((stat = select(maxfd + 1, &sockset, NULL, NULL, NULL)) < 0) {

maxfd = 5

sockset = {fds_bits = {32, 0 <repeats 15 times>}} newsockfd = <value optimized out> stat = <value optimized out> clilen = 16 cli_addr = {sa_family = 2,

  sa_data = "à~¬\020\001\021\000\000\000\000\000\000\000"}

tlog = <value optimized out>

turnon = 1

p = (IPADDR *) 0x54f188

fd_ptr = <value optimized out>

buf = "172.16.1.17\000\000*\000\000 \206\001", '\0' <repeats 13 times>, "\210\025R\231Á*\000\000\000\000\000\000\000\000\000\000 ½\v\231Á*\000\000\000p\v\231Á*\000\000\000\000\000\000\000\000\000\000`²T\000\000\000\000\000\001", '\0' <repeats 15 times>, "\030õT", '\0' <repeats 21 times>, "¥°È\230Á*\000"

sockfds = {<SMARTALLOC> = {<No data fields>}, head = 0x7fff11e29130,

  tail = 0x7fff11e29130, loffset = 0, num_items = 1} allbuf = "\016\000\000\000\000\000\000\000\f\000\000\000\000\000\000\000 \223â\021ÿ\177\000\000\030Í\v\231Á*\000\000,\033}\f\000\000\000\000\221,\f\231Á*\000\000\016\000\000\000\000\000\000\0008ÌÃ\231Á*\000\000à\034Ä\231Á*\000\000Hs\v\231", '\0' <repeats 12 times>, "àáæ\231Á*\000\000\002\000Ú", '\0' <repeats 21 times>, "Hs\v\231Á*\000\000`\223â\021ÿ\177\000\000 \223â\021ÿ\177\000\000,\033}\f\000\000\000\000¸ãæ\231Á*\000\000\000\000\000\000\000\000\000\000wzÈ\230Á*\000\000¸ãæ\231Á*\000\000\001", '\0' <repeats 15 times>, "\001\000\000\000Á*\000\000`\223â\021ÿ\177\000\000\000p"...

#2  0x0000000000407cb9 in main (argc=<value optimized out>,

    argv=<value optimized out>) at stored.c:306

306                         &dird_workq, handle_connection_request);

ch = <value optimized out>

no_signals = false

test_config = false

thid = 1082132800

uid = 0x0

gid = 0x0

#3  0x00002ac199c4b164 in __libc_start_main () from /lib64/libc.so.6 No symbol table info available.

#4  0x0000000000406fe9 in _start ()

No symbol table info available.

#0  0x0000000000000000 in ?? ()

No symbol table info available.

#0  0x0000000000000000 in ?? ()

No symbol table info available.

#0  0x0000000000000000 in ?? ()

No symbol table info available.

 

 

Evan Fraser

 

Senior Systems Analyst

Peninsular House, 30 Monument Street

London EC3R 8NB, United Kingdom

Tel +44 20 7444 7860

Mobile +44 75 9024 5788

evan.fraser AT rms DOT com

www.rms.com

 

From: Evan Fraser [mailto:Evan.Fraser AT rms DOT com]
Sent: 02 October 2009 10:58
To: bacula-users AT lists.sourceforge DOT net
Subject: [Bacula-users] bacula-sd crashing on Suse 10SP2

 

Hello,

 

I’ve just installed bacula 3.0.2 on a SLES 10 SP2 system.  I’m using a Dell ML-6000 tape library with 6 LTO4 tape drives.

 

My problem is that the bacula-sd process keeps crashing during backups.  Has anyone else had a similar problem?

 

I compiled bacula from the SRPMs on the target system.  The RPM’s claim to require GLIBC 2.5, but SLES 10SP2 only has GLIBC 2.4.  Is this likely to be the source of my problem?  If so, should I use an older version of bacula?

 

Many thanks,

 

Evan.

 

Evan Fraser

 

Senior Systems Analyst

Peninsular House, 30 Monument Street

London EC3R 8NB, United Kingdom

Tel +44 20 7444 7860

Mobile +44 75 9024 5788

evan.fraser AT rms DOT com

www.rms.com

 

 


This message and any attachments contain information that may be RMS Inc. confidential and/or privileged. If you are not the intended recipient (or authorized to receive for the intended recipient), and have received this message in error, any use, disclosure or distribution is strictly prohibited. If you have received this message in error, please notify the sender immediately by replying to the e-mail and permanently deleting the message from your computer and/or storage system.



This message and any attachments contain information that may be RMS Inc. confidential and/or privileged. If you are not the intended recipient (or authorized to receive for the intended recipient), and have received this message in error, any use, disclosure or distribution is strictly prohibited. If you have received this message in error, please notify the sender immediately by replying to the e-mail and permanently deleting the message from your computer and/or storage system.
------------------------------------------------------------------------------
Come build with us! The BlackBerry&reg; Developer Conference in SF, CA
is the only developer event you need to attend this year. Jumpstart your
developing skills, take BlackBerry mobile applications to market and stay 
ahead of the curve. Join us from November 9&#45;12, 2009. Register now&#33;
http://p.sf.net/sfu/devconf
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
<Prev in Thread] Current Thread [Next in Thread>