Re: [Bacula-users] Slow Attribute spooling
2015-06-08 14:16:10
On 08/06/15 15:58, Dimitri Maziuk wrote:
> On 2015-06-08 05:25, Denis Witt wrote:
>
>> I'm using normal (7.200 RPM) HDDs. But, as pointed out in
>> my mail, there is only one job running ("despooling Attributes") at the
>> moment. All other jobs are still waiting for execution ("waiting on
>> max storage jobs").
>
> Did you see my "Fixed: Re: concurrent jobs not working?" post from 4
> days ago? Take a look.
The problem with adding more concurrent jobs (up from one) is that the
increased seek load will badly affect despooling time.
I benchamrked multidrive 7200rpm raid0 arrays several years ago and
posted results to this list.
The short version is:
0: Spooling areas _must_ be on seperate spindles to everythng else.
1: If you have spindles, you will see an exponential-"plus a bit"
dropoff in overall throughput for each extra task added to the spool
array from what's seen with a single job - output divided amongst
jobs-in-progress, plus you've added effectively random seek latency of
8-12ms on top.
2: A raid-0 mechanical disk array is the _only_ way 7200rpm mechanical
disks can keep up with LTO 1-2 tape drives for concurrent backups
(simultaneous reads and writes because one spool builds whilst another
is despooling to tape.)
3: You need at least 3 striped drives and no concurrent backups to keep
up with LTO 3-4 - the benchmarked sequential read/write speed of 3-drive
raid0 maxes out at about 360Gb/s - which is just enough if you are
feeding any form of compressible data to the tape drive - the disks
_must_ be able to keep well ahead of what's being fed to the tape drive
or it will start shoeshining if there is the slightest hiccup.
4: When you add concurrency, you need more spindles to compensate for
the seek penalties. Lots more spindles.
5: A good datacenter SSD will replace a bunch of spindles because it has
virtually no seek penalties, plus decent write speeds. Datacentre SSDs
continue to perform well under heavy random r/w load. Consumer ones
don't - the write speeds suffer particularly badly and cheap consumer
SSD used to have slower sequential write performance than spinning
media, often falling to kB/s write speeds under random load (it's better
now but still often falls apart quickly under heavy load)
6: No matter what you build, test, test and test again. If you think
it's overspecified it probably isn't. Concurrent job spooling is not the
kind of load drive designers had in mind even when considering extreme
random r/w performance.
7: If you can afford it, tossing 100GB of ramdrive at the task is even
better than SSD.
8: It doesn't hurt to put the database on a dedicated raid10 SSD set.
My current spool area is a 6-way raid0 of Intel X25E drives. These feed
7 LTO5 drives via FC with the limiting factor being the 8Gb/s FC
connections out of the bacula-sd box. Over the last 5 years that's
worked pretty well at keeping up but I still get shoeshining from time
to time.
Concurrency plus spooling is a big win if you get it right and a major
pain in the backside if you don't.
Don't be tempted to attempt to run multiple concurrent jobs without
spooling. It may write quickly, but the penalty on the read side is so
severe as to make the tape effectively unuseable. My personal feeling is
that it will shoeshine so badly the tape or drive may end up destroyed
before being sucessfuly read, and if you do suceed in reading, you're
looking at throughputs in the tens of kB/s range.
------------------------------------------------------------------------------
_______________________________________________
Bacula-users mailing list
Bacula-users AT lists.sourceforge DOT net
https://lists.sourceforge.net/lists/listinfo/bacula-users
|
|
|