Bacula-users

Re: [Bacula-users] bacula-sd: page allocation failure

2012-05-31 14:44:08
Subject: Re: [Bacula-users] bacula-sd: page allocation failure
From: Martin Simmons <martin AT lispworks DOT com>
To: Uwe Schuerkamp <uwe.schuerkamp AT nionex DOT net>
Date: Thu, 31 May 2012 19:42:13 +0100
>>>>> On Tue, 29 May 2012 16:20:18 +0200, Uwe Schuerkamp said:
> 
> Hi folks, 
> 
> recently we've been seeing more and more problems with bacula-fd
> messages in dmesg about a page allocation failure. 
> 
> Platform is centos 6.2 64 bit, Version 5.2.6 compiled from Source
> using the stock distro gcc. 
> 
> We're using MariaDB 5.x as the db backend, here are some stats about
> the bacula installation itself: 
> 
> Total clients:                128     Total bytes stored:     77.33 TB
> Total files:          76905116      Database size:    111.72 GB
> 
> The server has 18G RAM, backup performance is generally rather good. 
> 
> Online backups are going to disk (33TB full of 39TB, xfs based FS). 
> 
> Here's the message from dmesg: 
> 
> May 28 12:24:16 bacula-server kernel: bacula-sd: page allocation failure. 
> order:1, mode:0x20
> May 28 12:24:16 bacula-server kernel: Pid: 21923, comm: bacula-sd Not tainted 
> 2.6.32-71.29.1.el6.x86_64 #1
> May 28 12:24:16 bacula-server kernel: Call Trace:
> May 28 12:24:16 bacula-server kernel: <IRQ>  
> [<ffffffff8111eab6>]__alloc_pages_nodemask+0x706/0x850
> May 28 12:24:16 bacula-server kernel: [<ffffffff81156212>] 
> kmem_getpages+0x62/0x170
> May 28 12:24:16 bacula-server kernel: [<ffffffff81156e2a>] 
> fallback_alloc+0x1ba/0x270
> May 28 12:24:16 bacula-server kernel: [<ffffffff8115687f>] 
> ?cache_grow+0x2cf/0x320
> May 28 12:24:16 bacula-server kernel: [<ffffffff81156ba9>] 
> ____cache_alloc_node+0x99/0x160
> May 28 12:24:16 bacula-server kernel: [<ffffffff8115750b>] 
> kmem_cache_alloc+0x11b/0x190
> May 28 12:24:16 bacula-server kernel: [<ffffffff81404448>] 
> sk_prot_alloc+0x48/0x180
> May 28 12:24:16 bacula-server kernel: [<ffffffff81404692>] sk_clone+0x22/0x2a0
> May 28 12:24:16 bacula-server kernel: [<ffffffff8144c276>] 
> inet_csk_clone+0x16/0xd0
> May 28 12:24:16 bacula-server kernel: [<ffffffff814651c3>] 
> tcp_create_openreq_child+0x23/0x450
> May 28 12:24:16 bacula-server kernel: [<ffffffff81462c0d>] 
> tcp_v4_syn_recv_sock+0x4d/0x280
> May 28 12:24:16 bacula-server kernel: [<ffffffff81464f81>] 
> tcp_check_req+0x201/0x420
> May 28 12:24:16 bacula-server kernel: [<ffffffff8146262b>] 
> tcp_v4_do_rcv+0x35b/0x430
> May 28 12:24:16 bacula-server kernel: [<ffffffff8105c484>] 
> ?try_to_wake_up+0x284/0x380
> May 28 12:24:16 bacula-server kernel: [<ffffffff81463e40>] 
> tcp_v4_rcv+0x5b0/0x7e0
> May 28 12:24:16 bacula-server kernel: [<ffffffff8105c592>] 
> ?default_wake_function+0x12/0x20
> May 28 12:24:16 bacula-server kernel: [<ffffffff81441e7d>] 
> ip_local_deliver_finish+0xdd/0x2d0
> May 28 12:24:16 bacula-server kernel: [<ffffffff81442108>] 
> ip_local_deliver+0x98/0xa0
> May 28 12:24:16 bacula-server kernel: [<ffffffff814415cd>] 
> ip_rcv_finish+0x12d/0x440
> May 28 12:24:16 bacula-server kernel: [<ffffffff81441b55>] ip_rcv+0x275/0x350
> May 28 12:24:16 bacula-server kernel: [<ffffffff8140ffeb>] 
> netif_receive_skb+0x38b/0x670
> May 28 12:24:16 bacula-server kernel: [<ffffffff8126ce48>] 
> ?is_swiotlb_buffer+0x18/0x50
> May 28 12:24:16 bacula-server kernel: [<ffffffffa0269238>] 
> bnx2_poll_work+0xd18/0x1240 [bnx2]
> May 28 12:24:16 bacula-server kernel: [<ffffffff8134a57a>] 
> ?scsi_next_command+0x4a/0x60
> May 28 12:24:16 bacula-server kernel: [<ffffffff8134b36e>] 
> ?scsi_io_completion+0x35e/0x550
> May 28 12:24:16 bacula-server kernel: [<ffffffff8105c846>] 
> ?update_curr+0xe6/0x1e0
> May 28 12:24:16 bacula-server kernel: [<ffffffffa026979d>] 
> bnx2_poll_msix+0x3d/0xc0 [bnx2]
> May 28 12:24:16 bacula-server kernel: [<ffffffff81410b73>] 
> net_rx_action+0x103/0x210
> May 28 12:24:16 bacula-server kernel: [<ffffffff81073d67>] 
> __do_softirq+0xb7/0x1e0
> May 28 12:24:16 bacula-server kernel: [<ffffffff810d8a10>] 
> ?handle_IRQ_event+0x60/0x170
> May 28 12:24:16 bacula-server kernel: [<ffffffff81073dc4>] 
> ?__do_softirq+0x114/0x1e0
> May 28 12:24:16 bacula-server kernel: [<ffffffff810142cc>] 
> call_softirq+0x1c/0x30
> May 28 12:24:16 bacula-server kernel: [<ffffffff81015f35>] 
> do_softirq+0x65/0xa0
> May 28 12:24:16 bacula-server kernel: [<ffffffff81073b65>] irq_exit+0x85/0x90
> May 28 12:24:16 bacula-server kernel: [<ffffffff814d0945>] do_IRQ+0x75/0xf0
> May 28 12:24:16 bacula-server kernel: [<ffffffff81013ad3>] 
> ret_from_intr+0x0/0x11
> 
> Any idea what's going wrong here? I don't see any significant swapping
> or memory usage when this happens. I can provide a full dmesg dump if
> that's helpful, I just didn't want to spam the list beyond measure
> this time. 

Looks like a kernel or driver issue to me, not a problem with bacula-sd.  You
could ask on the mailing list.

__Martin

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users

<Prev in Thread] Current Thread [Next in Thread>