Veritas-bu

[Veritas-bu] help with failing backups

2002-04-02 13:15:00
Subject: [Veritas-bu] help with failing backups
From: MSyed AT xo DOT com (Syed, Mukarram)
Date: Tue, 2 Apr 2002 12:15:00 -0600
Thanks for responding Penelope.
The server I am trying to backup is also a Solaris box.
We don't have the current patch level installed for veritas netbackup.
Is the bpsched error log useful?
Here is the shared memory parameters I have in my /etc/system file in the
Solaris 7 master server:

set shmsys:shminfo_shmmax=4294967295
set shmsys:shminfo_shmmin=1
set shmsys:shminfo_shmmni=100
set shmsys:shminfo_shmseg=50
set semsys:seminfo_semmni=100
set semsys:seminfo_semmsl=210
set semsys:seminfo_semmns=400

However, this was a working system, never had such a problem with it in
1-1/2 years.

Thanks

-Mukarram Syed.


-----Original Message-----
From: Penelope Carr [mailto:Penelope.Carr AT veritas DOT com]
Sent: Tuesday, April 02, 2002 9:57 AM
To: Syed, Mukarram
Subject: RE: [Veritas-bu] help with failing backups


Is the server you are trying to backup a NT machine??  If so, you probably
have OTM enabled.  You have to make a change in the registry and set OTM
MAX_CACHE from hex to decimal and change it to zero.  If not then,

It looks as though you have a resource issue with message queues.  I have
attached a doc with minimum settings for NBU to properly function.  Even
though the doc is for HP it will work for Solaris.

If you require further assistance please do not hesitate to ask.

Penelope




-----Original Message-----
From: Syed, Mukarram [mailto:MSyed AT xo DOT com]
Sent: Tuesday, April 02, 2002 12:51 PM
To: veritas-bu AT mailman.eng.auburn DOT edu
Subject: [Veritas-bu] help with failing backups



Hi All.

I have a problem with my initiating backups since yesterday.
The manual backups don't work.  The bpsched deamon runs but there is nothing
produced in the activity monitor that the backup is running.  No tapes get
loaded etc.  All my scheduled backups did not start last night.
The bpsched starts the process but there is no development after that.  I
can't even kill the bpsched/bprd child process using the kill command.
I cleaned up all the log files and started a fresh manual backup.  Below is
the output of the bpsched log file after the manual backup was started.
Bprd did not produce any logs.
The class from which I started the manual backup is pinkfloyd_test.
My master/media server is an E450 running Solaris 7.
I am in the process of killing the NBU/media manager deamons and starting
them.
I hope that solves my problem but I would like to know why my backups are
not working now.  They were before yesterday.
Please help me if you can.  If you need any more log outputs, please let me
know.
Thanks in Advance.


Mukarram Syed
UNIX Systems Administrator
XO Communications

----------------------------------------------------------------------------
----------------

09:27:18 [14330] <4> bpsched: INITIATING (verbose=11) ...
09:27:18 [14330] <2> logparams: bpsched -ru root -rg other -ct 0 -class
pinkfloyd_test -IB 
09:27:18 [14330] <4> bpsched_main: wait_on_que=0, timeout_in_que=36000,
reread_interval=300,queue_on_error=0, bptm_query_timeout=480
09:27:18 [14330] <2> LOCAL CLASS_ATT_DEFS: Product ID = 6
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 12, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 6, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 7, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 17, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 18, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 11, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 25, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 24, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 20, was 2, now 1
09:27:18 [14330] <2> adjust_Vnydef: Change for class type 26, was 2, now 1
09:27:18 [14330] <4> bpsched_main: VSMInit () failed: 2d
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: CONFIG
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:18 [14330] <4> ?:   mail admin:            msyed AT xo DOT com
09:27:18 [14330] <4> ?:   wakeup interval:       10 minutes
09:27:18 [14330] <4> ?:   max jobs/client:       20
09:27:18 [14330] <4> ?:   backup tries:          2 times in 12 hours
09:27:18 [14330] <4> ?:   keep logs:             14 days
09:27:18 [14330] <4> ?:   hours ago:             24 hours
09:27:18 [14330] <4> ?:   max drives this master: 0
09:27:18 [14330] <4> ?:   compact database:      yes
09:27:18 [14330] <4> ?:   media mnt timeout:     0 seconds
09:27:18 [14330] <4> ?:   multihost mnt timeout: 0 seconds
09:27:18 [14330] <4> ?:   post process images:   yes
09:27:18 [14330] <4> ?:   keep tir:   yes
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: RETENTION
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:18 [14330] <4> ?:   retention[0] = 7 days
09:27:18 [14330] <4> ?:   retention[1] = 14 days
09:27:18 [14330] <4> ?:   retention[2] = 21 days
09:27:18 [14330] <4> ?:   retention[3] = 31 days
09:27:18 [14330] <4> ?:   retention[4] = 62 days
09:27:18 [14330] <4> ?:   retention[5] = 93 days
09:27:18 [14330] <4> ?:   retention[6] = 186 days
09:27:18 [14330] <4> ?:   retention[7] = 279 days
09:27:18 [14330] <4> ?:   retention[8] = 365 days
09:27:18 [14330] <4> ?:   retention[9] = 24855 days
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: CLASSES
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> get_db_info: Searching for class <pinkfloyd_test> sched
<>
09:27:18 [14330] <4> get_db_info: not found
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <4> ?: STORAGE UNITS
09:27:18 [14330] <4> ?: ----------------
09:27:18 [14330] <2> getsockconnected: host=moonunit service=bpdbm
address=216.112.37.106 protocol=tcp non-reserved port=13721
09:27:18 [14330] <2> bind_on_port_addr: bound to port 4800
09:27:18 [14330] <2> check_authentication: no authentication required
09:27:19 [14330] <8> get_type_of_client_port: db_getCLIENT() failed: no
entity was found (227)
09:27:19 [14330] <4> add_to_worklists: adding clientjob for pinkfloyd to
worklist[1]
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 9 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 8 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 7 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 6 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 5 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 4 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 3 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 2 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 1 WORKLIST
09:27:19 [14330] <4> ?:   client: pinkfloyd  class: pinkfloyd_test  sched:
pinkfloyd_test (FULL)
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> ?: RETENTION LEVEL 0 WORKLIST
09:27:19 [14330] <4> ?: --------------------------
09:27:19 [14330] <4> bpsched_main: empty-main bpsched already started, pid:
19693
09:27:19 [14330] <4> write_worklists_to_file: writing work to
/usr/openv/netbackup/bin/bpsched.d/worklist.14330
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:27:19 [14330] <2> send_reqQ_msg: msgsnd returned with stat -1  errno 11
(Resource temporarily unavailable)
09:27:19 [14330] <2> recv_reqQ_msg: msgrcv(nodelay) stat -1  errno 35 (No
message of desired type).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <2> recv_runQ_msg: before msgrcv
09:27:19 [14330] <2> recv_runQ_msg: msgrcv(nodelay) stat -1  errno 22
(Invalid argument).  sigcld=0 sigalrm=0 sigusr1=0 sigusr2=0
09:27:19 [14330] <4> send_reqQ_msg_list: queue full, suspending process
14330
09:27:19 [14330] <2> send_reqQ_msg: sending msg: dpid=1 spid=14330 rqtyp=1
stat=0 class=NULL clnt=NULL jobid=NULL
09:32:53 [14405] <4> bpsched: INITIATING (verbose=11) ...
09:32:53 [14405] <2> logparams: /usr/openv/netbackup/bin/bpsched 
09:32:53 [14405] <4> bpsched_main: wait_on_que=0, timeout_in_que=36000,
reread_interval=300,queue_on_error=0, bptm_query_timeout=480
09:32:53 [14405] <2> LOCAL CLASS_ATT_DEFS: Product ID = 6
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 12, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 6, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 7, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 17, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 18, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 11, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 25, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 24, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 20, was 2, now 1
09:32:53 [14405] <2> adjust_Vnydef: Change for class type 26, was 2, now 1
09:32:53 [14405] <4> bpsched_main: VSMInit () failed: 2d
09:32:53 [14405] <8> bpsched_main: another regular bpsched is already
examining the class configuration
09:32:53 [14405] <4> bpsched: scheduler exiting - regular bpsched is already
running (214)
_______________________________________________
Veritas-bu maillist  -  Veritas-bu AT mailman.eng.auburn DOT edu
http://mailman.eng.auburn.edu/mailman/listinfo/veritas-bu


<Prev in Thread] Current Thread [Next in Thread>