Bacula-users

Re: [Bacula-users] Extremely Slow Performance on VMware

2012-09-28 14:41:17
Subject: Re: [Bacula-users] Extremely Slow Performance on VMware
From: Rodrigo Abrantes Antunes <rodrigoantunes AT pelotas.ifsul.edu DOT br>
To: bacula-users AT lists.sourceforge DOT net
Date: Fri, 28 Sep 2012 15:38:42 -0300

Citando Rodrigo Abrantes Antunes <rodrigoantunes AT pelotas.ifsul.edu DOT br>:

I have a physical machine that is a Vmware ESX node wich hosts only one vm, the one with bacula-director, bacula-sd and bacula-fd (called bacula-server), this vm has an RDM with the fibre channel storage where the volumes partition is mounted. Then I have all my clients (some are physycal machines and others are vms in other ESX nodes) with bacula-fd that are backed up. When I manually run a job to backup one of these clients in bacula-server I can see (with the command htop) that bacula-sd is using 100% of the cpu, I also noted that the backup starts at around 4MB/s and after some time it is around 300KB/s. If I simple send the same files to be backed up with scp for example the transfer goes around 100MB/s. The vms don't have vmstat, I use linux own commands.

Citando Geert Stappers <Geert.Stappers AT vanadgroup DOT com>:

Op 20120927 om 21:55 schreef Rodrigo Abrantes Antunes:

Citando Davide Franco <dfranco AT dflc DOT ch>:
On Thu, Sep 27, 2012 at 7:04 PM, Rodrigo Abrantes Antunes wrote:

> Hi, I'm having extremely slow performance when backing up, a 10gb
> backup took around 3h. Where should I start looking?
>
> During a backup I can see bacula-sd using 100% cpu, memory usage is
> normal. My system is a vmware virtual machine with a cpu with two
> 2,5ghz cores and 2gb ram. The vm have a separate partition for the
> volumes using ext4 and mounted via RDM in a fc storage array with
> raid6. I tried mapping more cores and more ram for the vm with the same
> result so I don't think the problem is lack of resources.
>

giving more informations like

<snip/>

will give us more ideas to troubleshoot your problem.


Director: 5.0.1-1ubuntu1
Storage: 5.0.1-1ubuntu1
FD: 5.0.1-1ubuntu1 (some clients have lower version)
Database: mysqI
OS: Ubuntu 10.04.4 x64 Server
FC Storage 4 GBits/s.
All my network is Gigabit Ethernet.

Yes, and how is the further design?

In others words: The provided list can read as
One physical computer with fibre channel disk hosts all the VMs.
If it is so, then tell so. Otherwise eloborate the setup, the design.

One question: can vmware tools have any influence in this case? From what I
know vmware tools is mainly for better graphics performance  isn't it? I din't
installed it.

There could also be beter disk drivers. The virtio drivers come to mind.


Back to

> During a backup I can see bacula-sd using 100% cpu,

And where did you see the "100%"? ( Which tool was used to read that
performance valule? )

I would like to see the output of

            vmstat 2 5

during non-back-up-time and also the output of

            vmstat 2 5

during back-up-time. Thing I'm interrested in, are the CPU columns.
Especial the colums "system" and "wait".

<screenshot>
$ vmstat 2 3
procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu----
r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa
0  0   6712  11360 192584 159120    0    0     5     4    6   11  5  6 90  0
0  0   6712  11344 192584 159120    0    0     0     0   67  342  7 15 79  0
0  0   6712  11344 192584 159120    0    0     0     0   65  340  8 14 78  0
</screenshot>


And to avoid an extra e-mail exchange:
I'm asking for 2 to the power 3, so 8 measurements.

So 2 moments (during backup or outside backup)
on 2 Bacula compoments ( storage deamon and file deamon ) on the VMs
on 2 physical hosts.

Yes, that means that I assume the VMware hosts have a 'vmstat' command.
That is because I'm not familair with VMware, I'm from the Xen world :-)


Cheers
Geert Stappers
--
http://www.vanadcimplicity.com/

------------------------------------------------------------------------------
Got visibility?
Most devs has no idea what their production app looks like.
Find out how fast your code is with AppDynamics Lite.
http://ad.doubleclick.net/clk;262219671;13503038;y?
http://info.appdynamics.com/FreeJavaPerformanceDownload.html
_______________________________________________
Bacula-users mailing list
[email protected].nethttps://lists.sourceforge.net/lists/listinfo/bacula-users


 

One thing I noted now, in the vm htop says that 100% cpu is used and actually the machine is very slow when backing up so I think this value is accurate but in VSphere Client in the performance chart it says that the vm is using only 400Mhz of the 5000Mhz that were allocated, but the node cpu usage is low so I don't know why it isn't aloccating more MHZ to the bacula-server.

I installed vmstat, I can't do vmstat during non backup time because it is currently backing up my mail server, about 200gb, it is doing this for almost 15h:

vmstat 2 5 during backing-up on the bacula-server (director, storagedaemon)

procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa
 3  0      0  16576  15592 1763272    0    0    10    55    2    7  0  8 92  0
 1  0      0  15916  15592 1763964    0    0     0     0  204   49  1 50 49  0
 2  0      0  16812  15588 1762996    0    0     0     2  172   72  1 78 21  0
 2  0      0  17820  15604 1767348    0    0     0    18  193  114  3 66 32  0
 1  0      0  16296  15604 1769924    0    0     0     0  277   29  0 55 45  0

vmstat 2 5 during backing-up on the physical mail-server (filedaemon)

procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in   cs us sy id wa
 0  0  16824  38496 143856 5019424    0    0    45    42    3    4  1  0 98  1
 0  0  16824  30928 143868 5020004    0    0   336   434 1328  981  1  1 96  2
 2  0  16824  35844 143868 5020768    0    0   458   116 1121  635  0  1 99  0
 0  0  16824  32180 143876 5024996    0    0  2060   116 1861  686  0  1 97  2
 0  0  16824  30192 143912 5026304    0    0   640   521 1505 1032  1  1 86 13






------------------------------------------------------------------------------
Got visibility?
Most devs has no idea what their production app looks like.
Find out how fast your code is with AppDynamics Lite.
http://ad.doubleclick.net/clk;262219671;13503038;y?
http://info.appdynamics.com/FreeJavaPerformanceDownload.html
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users