Hi all.
Environment:
Master Server - Win NT Server running Netbackup 3.4 (aklad10w)
Slave Server - Solaris 5.6 running Netbackup 3.4 (aklss02u)
Client - NetApp F840 NAS Filer with NDMP enabled (aklns010-ga)
We're use a separate scheduling application to kick of the NDMP backups from
the Filer. The Unix Netbackup slave server is also a client for the
scheduling software, and it's here that we want to initiate the backups as
we don't have direct access to the command line on the Windows box.
Currently I'm using the following command to start a backup:
bpbackup -c classname -i -h aklns012-ga -w -s full
This works fine, but I don't get a decent log of the backup, and by adding
the -L arg the bpbackup command terminates with error 23:
bpbackup -c aklns012 -L /tmp/backup.log -i -h aklns012-ga -w -s full
EXIT STATUS 23: socket read failed
Running the command via truss shows a failure to read from file handle #7
which according to lsof is a socket connection to the bprd daemon on the
Netbackup master:
sigaction(SIGPIPE, 0xEFFFCF70, 0xEFFFCFF0) = 0
write(7, "\0\0\018", 4) = 4
sigaction(SIGPIPE, 0xEFFFCF70, 0xEFFFCFF0) = 0
write(7, " / n a s _ m n t / a k l".., 24) = 24
sigaction(SIGPIPE, 0xEFFFCFE8, 0xEFFFD068) = 0
sigaction(SIGPIPE, 0xEFFFCFE8, 0xEFFFD068) = 0
sigaction(SIGPIPE, 0xEFFFCF70, 0xEFFFCFF0) = 0
write(7, "\0\0\0\b", 4) = 4
sigaction(SIGPIPE, 0xEFFFCF70, 0xEFFFCFF0) = 0
write(7, " C O N T I N U E", 8) = 8
sigaction(SIGPIPE, 0xEFFFCFE8, 0xEFFFD068) = 0
read(7, 0xEFFFBEDB, 1) (sleeping...)
read(7, "\0", 1) = 1
read(7, "\0", 1) = 1
read(7, "\0", 1) = 1
read(7, "0E", 1) = 1
read(7, 0xEFFFD0A7, 14) Err#131 ECONNRESET
time() = 1013990100
getpid() = 6053 [6052]
write(4, " 1 2 : 5 5 : 0 0 [ 6 0".., 91) = 91
time() = 1013990100
getpid() = 6053 [6052]
write(4, " 1 2 : 5 5 : 0 0 [ 6 0".., 108) = 108
close(7) = 0
open("/usr/openv/msg/C/netbackup/CerMsgs", O_RDONLY) Err#2 ENOENT
write(2, " E X I T S T A T U S ".., 34) = 34
write(2, "\n", 1) = 1
llseek(0, 0, SEEK_CUR) = 164582
_exit(23)
aklss02u # lsof -p 7385
COMMAND PID USER FD TYPE DEVICE SIZE/OFF INODE NAME
bpbackup 7385 root cwd VDIR 205,2532 36864 636640
/nas_mnt/aklns012-ga/etc
bpbackup 7385 root txt VREG 16383,262143
/opt/openv/netbackup/bin/bpbackup
bpbackup 7385 root txt VREG 85,1 27000 40842
/usr/lib/nss_files.so.1
bpbackup 7385 root txt VREG 85,1 1015636 40794
/usr/lib/libc.so.1
bpbackup 7385 root txt VREG 85,1 36512 40875
/usr/lib/libaio.so.1
bpbackup 7385 root txt VREG 85,1 19304 40814
/usr/lib/libmp.so.2
bpbackup 7385 root txt VREG 85,1 726968 40887
/usr/lib/libnsl.so.1
bpbackup 7385 root txt VREG 85,1 16932 164719
/usr/platform/sun4u/lib/libc_psr.so.1
bpbackup 7385 root txt VREG 85,1 33588 40817
/usr/lib/libposix4.so.1
bpbackup 7385 root txt VREG 85,1 14756 40892
/usr/lib/libsec.so.1
bpbackup 7385 root txt VREG 85,1 53656 40826
/usr/lib/libsocket.so.1
bpbackup 7385 root txt VREG 85,1 4304 41215
/usr/lib/libdl.so.1
bpbackup 7385 root txt VREG 85,1 181840 40790 /usr/lib/ld.so.1
bpbackup 7385 root 0u VCHR 24,1 0t227133 47938
/devices/pseudo/pts@0:1->ttcompat->ldterm->ptem->pts
bpbackup 7385 root 1u VCHR 24,1 0t227133 47938
/devices/pseudo/pts@0:1->ttcompat->ldterm->ptem->pts
bpbackup 7385 root 2u VCHR 24,1 0t227133 47938
/devices/pseudo/pts@0:1->ttcompat->ldterm->ptem->pts
bpbackup 7385 root 3r DOOR 0x61ba8318
(FA:->0x60df5c60)
bpbackup 7385 root 4w VREG 1,105912
/opt/openv/netbackup/logs/bpbackup/log.021802
bpbackup 7385 root 5u VCHR 24,1 0t227133 47938
/devices/pseudo/pts@0:1->ttcompat->ldterm->ptem->pts
bpbackup 7385 root 7u inet 0x63d92460 0t170 TCP
aklss02u-pub.airnz.co.nz:537->aklad10w.airnz.co.nz:bprd (ESTABLISHED)
The bprd daemon on the master shows a failure to open the log file on the
client aklns012-ga, i.e. the Filer, not the slave server where the bpbackup
command is run:
13:27:02 [2912.2532] <2> getsockconnected: host=aklad10w service=bpdbm
address=10.65.50.82 protocol=tcp non-reserved port=13721
13:27:02 [2912.2532] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no
entity was found 227
13:27:02 [2912.2532] <2> getsockconnected: host=aklns012-ga service=bpcd
address=10.65.47.91 protocol=tcp reserved port=13782
13:27:02 [2912.2532] <2> getsockconnected: Connect to aklns012-ga on port
791
13:27:04 [2912.2532] <2> getsockconnected: Connect to aklns012-ga on port
804
13:27:08 [2912.2532] <2> getsockconnected: Connect to aklns012-ga on port
831
13:27:13 [2912.2532] <2> getsockconnected: Connect to aklns012-ga on port
728
13:27:22 [2912.2532] <2> getsockconnected: Connect to aklns012-ga on port
575
13:27:39 [2912.2532] <2> getsockconnected: Connect to aklns012-ga on port
935
13:27:40 [2912.2532] <16> getsockconnected: unable to connect() to
aklns012-ga, No connection could be made because the target machine activ
ely refused it.
13:27:40 [2912.2532] <16> bpcr_connect: Can't connect to client aklns012-ga
13:27:40 [2912.2532] <16> append_to_client_log: cannot connect to bpcd on
client aklns012-ga, No connection could be made because the target
machine actively refused it.
13:27:40 [2912.2532] <8> bkarfiles: failure writing progress log on client
aklns012-ga in log /backup.log: can't connect to client (58)
13:27:40 [2912.2532] <2> getsockconnected: host=aklad10w service=bpdbm
address=10.65.50.82 protocol=tcp non-reserved port=13721
13:27:40 [2912.2532] <2> getsockconnected: host=aklad10w service=bpdbm
address=10.65.50.82 protocol=tcp non-reserved port=13721
13:27:40 [2912.2532] <16> bkarfiles: Unable to write progress log
</backup.log> on client aklns012-ga. Class=aklns012 Sched=full
13:27:40 [2912.2532] <16> bkarfiles: Immediate backup failed (client =
aklns012-ga user = root group = other): can't connect to client
13:27:40 [2912.2532] <2> mail_msg: entered; status = 58
13:27:40 [2912.2532] <16> mail_msg: BACKUP EXIT STATUS = 58
Am I attempting something that's not allowed, or is it just a config issue?
Any help appreciated.
Regards, Gavin
|