Hi Jeff,
We are currently backing up a Windows file server with 5 drives. Most
of these are over 1TB, 2 are 3TB, and 1 is 7TB; obviously these take a
long time to back up.
So I have some questions:
1. Is there a maximum recommended backup size?
I'm not aware of a limit on saveset size in NetWorker. I imagine it
would be in the petabytes at least.
2. As I have 5 tape drives, is there a way to allow one saveset (f:\,
for example) to stream to multiple tapes?
Not directly. We're still waiting for multistreaming an individual
saveset to be perfected. (It's been toyed with at least once to my
knowledge, but there were recovery issues so it never became available.)
3. As we don't control the filesystem layout, does anyone have a good
example of how to split up the drive into separate directories
(F:\dir1, F:\dir2, etc.) and then run a catchall F:\ with skip
statements for F:\dir1, F:\dir2, etc.?
In the past when I've approached this, I've considered using
heuristics where I first work out the size of the various directories,
but the time taken to calculate those sizes negates a lot of the
effort. Nor do I think calculating previous backup sizes is much of an
option, unless anyone can point out a reliable mechanism to have
NetWorker report the size of a subset of a saveset; I'm not aware of
any mechanism to do so.
Instead when I've done this, I've relied on auto-building client
definitions at "good" directory points.
For instance, when I had a customer that had very, very dense
fileserver filesystems (e.g., 400GB to start with, but easily
40,000,000+ files at that size), the filesystems were structured along
the lines of:
X:\Department\Dept A
X:\Department\Dept B
X:\Department\Dept C
etc
X:\HomeDirs\usera
X:\HomeDirs\userb
X:\HomeDirs\userc
etc
X:\otherdir1
X:\otherdir2
X:\otherdir3
In this scenario, I had one client definition that backed up those
minor "otherdir" directories.
For the departmental and user directories, I had a custom backup
command that constructed a directory listing of each, and populated
client instances set for higher levels of parallelism (8) with a list
of the individual "master" directories - i.e., you'd get client
instances with savesets of say:
X:\Department\Dept A
X:\Department\Dept B
X:\Department\Dept C
etc
These were auto-populated each time the backup was run, so there was
no risk of missing new directories.
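As a rough illustration of that auto-population step (a sketch only, not my actual script - the function name and the chunking parameter here are mine for this example; the 250 figure comes from the saveset limit discussed below), the enumerate-and-chunk logic might look like:

```python
import os

def build_saveset_lists(root, max_per_client=250):
    """Enumerate the immediate subdirectories of root and chunk them
    into lists, one list per client instance, so that no single
    client definition carries more than max_per_client savesets."""
    dirs = sorted(
        os.path.join(root, name)
        for name in os.listdir(root)
        if os.path.isdir(os.path.join(root, name))
    )
    # One chunk per client instance; each chunk becomes that
    # instance's saveset list.
    return [dirs[i:i + max_per_client]
            for i in range(0, len(dirs), max_per_client)]
```

Run at the start of each backup against X:\Department and X:\HomeDirs, newly created department or user directories are picked up automatically.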
One thing to note in this strategy: NetWorker has limits on the
number of savesets, or more precisely on the size of the client
saveset field. To be on the safe side, I usually limited the number
of savesets per client to around 250, based on the relatively flat
initial directory structure outlined above. If you do the break-up
further down in the directory tree, your mileage will vary.
Obviously that meant having (potentially) multiple client definitions
and a modicum of intelligence to populate/refresh them. A good
understanding of nsradmin is close to essential in this process.
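To give a feel for the refresh step (the client name and savesets here are hypothetical, and you should verify the attribute names against your NetWorker version with a print command first), an nsradmin input script to rewrite a client instance's saveset list might look something like:

```
. type: NSR client; name: fileserver01
update save set: "X:\\Department\\Dept A", "X:\\Department\\Dept B", "X:\\Department\\Dept C"
```

fed in non-interactively via something like nsradmin -i refresh_savesets.txt from the auto-population script.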
Obviously the downside of this is that your backups are written with
much higher levels of multiplexing; however, if you're having problems
streaming media due to density of filesystems, this can be a solution
if block level backups can't be used. For the customer where I did
this, block level backup in NetWorker was still very immature (e.g.,
it couldn't do a complete filesystem recovery across a tape boundary!
- that's since been fixed), and because these were dense and highly
active fileserver filesystems, the filesystems were also quite
fragmented. Doing file level recoveries from block level backups via
cache rebuilds and tape scans was prohibitively slow, and the customer
already used a series of array level replication options, so it was
decided to go with massive multiplexing to tape at higher streaming
speeds - i.e., the option above.
4. Does anyone have any other great ideas I should be thinking of?
One very, very important piece of advice.
DON'T USE SKIP for this style of backup - use the NULL asm instead.
The reason for this is very important. Skip will not only omit a
file/directory during the backup, it will also omit it from the
index. Null, on the other hand, does not.
What that means is that if you skip directories one night, you'll
need to change your browse time in order to find them for recovery.
If you use null to avoid backing up directories one night, you'll
still see, at all times, all the directories that have been backed
up. This makes recoveries a lot easier - i.e., no guessing about
which savesets were backed up on which days, and, more importantly,
being able to run one recovery rather than multiple recoveries.
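By way of illustration (directive syntax can vary a little between NetWorker versions, and dir1/dir2 are placeholders for whichever subdirectories your other client instances already cover), the difference in the directive file is just the ASM name. Instead of:

```
<< "F:\" >>
+skip: dir1 dir2
```

use:

```
<< "F:\" >>
+null: dir1 dir2
```

The + applies the ASM recursively; only the second form leaves the index entries intact for browsing.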
Good luck!
Cheers,
Preston.
--
Preston de Guise
"Enterprise Systems Backup and Recovery: A Corporate Insurance
Policy", due out September 17 2008:
http://www.crcpress.com/shopping_cart/products/product_detail.asp?sku=AU6396&isbn=9781420076394&parent_id=&pc=
http://www.enterprisesystemsbackup.com
To sign off this list, send email to listserv AT listserv.temple DOT edu and type
"signoff networker" in the body of the email. Please write to networker-request
AT listserv.temple DOT edu if you have any problems with this list. You can access the
archives at http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER