Veritas-bu

[Veritas-bu] help with failing backups

2002-04-02 12:50:35
Subject: [Veritas-bu] help with failing backups
From: MSyed AT xo DOT com (Syed, Mukarram)
Date: Tue, 2 Apr 2002 11:50:35 -0600
Hi All.

I have a problem with my initiating backups since yesterday.
The manual backups don't work.  The bpsched deamon runs but there is nothing
produced in the activity monitor that the backup is running.  No tapes get
loaded etc.  All my scheduled backups did not start last night.
The bpsched starts the process but there is no development after that.  I
can't even kill the bpsched/bprd child process using the kill command.
I cleaned up all the log files and started a fresh manual backup.  Below is
the output of the bpsched log file after the manual backup was started.
Bprd did not produce any logs.
The class from which I started the manual backup is pinkfloyd_test.
My master/media server is an E450 running Solaris 7.
I am in the process of killing the NBU/media manager deamons and starting
them.
I hope that solves my problem but I would like to know why my backups are
not working now.  They were before yesterday.
Please help me if you can.  If you need any more log outputs, please let me
know.
Thanks in Advance.


Mukarram Syed
UNIX Systems Administrator
XO Communications

----------------------------------------------------------------------------
----------------

09:27:18 [14330] <4> bpsched: INITIATING (verbose=11) ...
09:27:18 [14330] <2> logparams: bpsched -ru root -rg other -ct 0 -class
pinkfloyd_test -IB 
09:27:18 [14330] <4> bpsched_main: wait_on_que=0, timeout_in_que=36000,
reread_interval=300,queue_on_error=0, bptm_query_timeout=480
09:27:18 [14330] <2> LOCAL CLASS_ATT_DEFS: Product ID = 6
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 12, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 6, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 7, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 17, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 18, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 11, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 25, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 24, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 20, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 26, was 2, now 1
09:27:18 [14330] <4> bpsched_main: VSMInit () failed: 2d
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: CONFIG
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:18 [14330] <4> ?:   mail admin:            msyed AT xo DOT com
09:27:18 [14330] <4> ?:   wakeup interval:       10 minutes
09:27:18 [14330] <4> ?:   max jobs/client:       20
09:27:18 [14330] <4> ?:   backup tries:          2 times in 12 hours
09:27:18 [14330] <4> ?:   keep logs:             14 days
09:27:18 [14330] <4> ?:   hours ago:             24 hours
09:27:18 [14330] <4> ?:   max drives this master: 0
09:27:18 [14330] <4> ?:   compact database:      yes
09:27:18 [14330] <4> ?:   media mnt timeout:     0 seconds
09:27:18 [14330] <4> ?:   multihost mnt timeout: 0 seconds
09:27:18 [14330] <4> ?:   post process images:   yes
09:27:18 [14330] <4> ?:   keep tir:   yes
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: RETENTION
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:18 [14330] <4> ?:   retention[0] = 7 days
09:27:18 [14330] <4> ?:   retention[1] = 14 days
09:27:18 [14330] <4> ?:   retention[2] = 21 days
09:27:18 [14330] <4> ?:   retention[3] = 31 days
09:27:18 [14330] <4> ?:   retention[4] = 62 days
09:27:18 [14330] <4> ?:   retention[5] = 93 days
09:27:18 [14330] <4> ?:   retention[6] = 186 days
09:27:18 [14330] <4> ?:   retention[7] = 279 days
09:27:18 [14330] <4> ?:   retention[8] = 365 days
09:27:18 [14330] <4> ?:   retention[9] = 24855 days
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: CLASSES
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> get_db_info: Searching for class <pinkfloyd_test> sched
<>
09:27:18 [14330] <4> get_db_info: not found
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: STORAGE UNITS
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:19 [14330] <8> get_type_of_client_port: db_getCLIENT() failed: no
entity was found (227)
09:27:19 [14330] <4> add_to_worklists: adding clientjob for pinkfloyd to
worklist[1]
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 9 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 8 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 7 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 6 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 5 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 4 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 3 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 2 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 1 WORKLIST
09:27:19 [14330] <4> ?:   client: pinkfloyd  class: pinkfloyd_test  sched:
pinkfloyd_test (FULL)
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 0 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> bpsched_main: empty-main bpsched already started, pid:
19693
09:27:19 [14330] <4> write_worklists_to_file: writing work to
/usr/openv/netbackup/bin/bpsched.d/worklist.14330
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <4> send_reqQ_msg_list: queue full, suspending process
14330
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:32:53 [14405] <4> bpsched: INITIATING (verbose=11) ...
09:32:53 [14405] <2> logparams: /usr/openv/netbackup/bin/bpsched 
09:32:53 [14405] <4> bpsched_main: wait_on_que=0, timeout_in_que=36000,
reread_interval=300,queue_on_error=0, bptm_query_timeout=480
09:32:53 [14405] <2> LOCAL CLASS_ATT_DEFS: Product ID = 6
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 12, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 6, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 7, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 17, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 18, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 11, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 25, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 24, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 20, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 26, was 2, now 1
09:32:53 [14405] <4> bpsched_main: VSMInit () failed: 2d
09:32:53 [14405] <8> bpsched_main: another regular bpsched is already
examining the class configuration
09:32:53 [14405] <4> bpsched: scheduler exiting - regular bpsched is already
running (214)

<Prev in Thread] Current Thread [Next in Thread>