Steve,
You win the prize for giving the best clue to solving this problem.
The problem, it seems, was a restore job which failed because of a bad tape
and left the following process active but hung:
bprd -dontfork -mpxmain
I don't know why it stayed around even after doing
K77netbackup/S77netbackup but it did, so I didn't think anything of it
being around when I did ps -ef | grep bp during all my failed restore
attempts.
Your clue, "If bprd thinks it's tied up it can't process your restore
request and *will* just hang," made me think more about what was going on
during the restore process.
I killed the "hung" process and now my restores work fine.
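For anyone hitting the same symptom, here is a minimal sketch of the check
described above. It assumes the leftover process shows up in ps -ef with
exactly the arguments quoted in this message (bprd -dontfork -mpxmain) and
that the PID is the second column of the ps -ef output. The function name
find_mpxmain_pids is made up for illustration; it is not a NetBackup command.

```shell
# Hypothetical helper (not part of NetBackup): read `ps -ef` output on
# standard input and print the PID (second column) of every process whose
# command line contains "bprd -dontfork -mpxmain".
find_mpxmain_pids() {
    # The !/awk/ guard drops this awk filter itself, whose own command
    # line also contains the search text.
    awk '/bprd -dontfork -mpxmain/ && !/awk/ { print $2 }'
}

# Usage, as root on the master server:
#   ps -ef | find_mpxmain_pids
#   kill <pid>                    # only after confirming no restore is running
#   ps -ef | find_mpxmain_pids    # should now print nothing
```

Listing the PIDs first, rather than killing in one step, leaves room to
confirm that the process really is a leftover from a failed job and not a
restore that is still making progress.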
Thanks again, Steve, and everyone else who responded so promptly with
questions, ideas, and suggestions. This is a valuable problem-solving resource
for all of us NBU folk.
James
Steve Curry <scurry505 AT yahoo DOT com>
07/18/2001 10:50 AM
To: James Laurie <jlaurie AT csu DOT org>
cc: scurry AT yahoo-inc DOT com
bcc:
Subject: RE: [Veritas-bu] unable to restore
James,
bprd (the request daemon) seems to be busy doing
something else. Have you tried shutting down
NetBackup and restarting it again to clear this? If
bprd thinks it's tied up it can't process your
restore request and *will* just hang. Make sure you
have the latest Jumbo patch installed and all other
client and server patches.
Hope this helps,
Steve Curry
<QUOTE>
07:35:47 [5089] <4> restorefiles: VSMJobBegin () failed: 4d
07:35:47 [5089] <4> restorefiles: VSMAlarm (BPVSM_ALARM_NBUMASTER_RDSTARTJOB) failed: 4d
07:35:47 [5089] <2> start_mpx_main_bprd: server = fims00
07:35:47 [5089] <2> getsockconnected: host=fims00 service=bpcd address=172.20.77.215 protocol=tcp reserved port=13782
</QUOTE>
--- James Laurie <jlaurie AT csu DOT org> wrote:
> Below is a portion of the bpdb log file for the time frame. It's long but
> the most relevant entry seems to be 2 lines extracted out of context
> (scroll through the rest to see surrounding context if needed):
>
> 07:35:47 [5089] <4> restorefiles: VSMJobBegin () failed: 4d
> 07:35:47 [5089] <4> restorefiles: VSMAlarm (BPVSM_ALARM_NBUMASTER_RDSTARTJOB) failed: 4d
>
> I can't find any reference in my documentation to "failed: 4d"
> In any event the restore job is still hung.
>
> 07:34:47 [5022] <4> connected_peer: Connection from host fims00, 172.20.77.215, on non-reserved port 46136
> 07:34:47 [5022] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:34:47 [5022] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:34:47 [5022] <4> process_request: EXIT STATUS 0
> 07:34:48 [5036] <4> connected_peer: Connection from host fims00, 172.20.77.215, on non-reserved port 46138
> 07:34:48 [5036] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:34:48 [5036] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:34:48 [5036] <4> get_type_of_client_list_restore: db_getCLIENT() failed: no entity was found (227)
> 07:34:48 [5036] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:34:48 [5036] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:34:48 [5036] <4> get_type_of_client_list_restore: db_getCLIENT_by_hostname() failed: no entity was found (227)
> 07:34:48 [5036] <4> get_type_of_client_free_browse: db_getCLIENT() failed: no entity was found (227)
> 07:34:48 [5036] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:34:49 [5036] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:34:49 [5036] <4> get_type_of_client_free_browse: db_getCLIENT_by_hostname() failed: no entity was found (227)
> 07:34:49 [5036] <4> fileslist: sockfd = 6
> 07:34:49 [5036] <4> fileslist: owner = root
> 07:34:49 [5036] <4> fileslist: group = other
> 07:34:49 [5036] <4> fileslist: client = fw00
> 07:34:49 [5036] <4> fileslist: sched_type = 12
> 07:34:49 [5036] <4> fileslist: starttime = 993859733
> 07:34:49 [5036] <4> fileslist: endtime = 993859733
> 07:34:49 [5036] <4> fileslist: filepath = /opt/CPfw1-41/lib
> 07:34:49 [5036] <4> fileslist: recursion_level = 2
> 07:34:49 [5036] <4> fileslist: timetype = 4
> 07:34:49 [5036] <4> fileslist: rqtimetype = 4
> 07:34:49 [5036] <4> fileslist: user_interface = 1
> 07:34:49 [5036] <4> fileslist: full_listing = 0
> 07:34:49 [5036] <4> fileslist: pc1_listing = 0
> 07:34:49 [5036] <4> fileslist: long_listing = 1
> 07:34:49 [5036] <4> fileslist: raw_partition_search = 0
> 07:34:49 [5036] <4> fileslist: client_type = 0
> 07:34:49 [5036] <4> fileslist: class = NONE
> 07:34:49 [5036] <4> fileslist: keyword =
> 07:34:49 [5036] <4> fileslist: bplist_format = 3
> 07:34:49 [5036] <4> fileslist: true_image = 0
> 07:34:49 [5036] <4> fileslist: list_lc_messages = C
> 07:34:49 [5036] <4> fileslist: list_lc_time = C
> 07:34:49 [5036] <4> fileslist: list_lc_ctype = C
> 07:34:49 [5036] <4> fileslist: list_lc_collate = C
> 07:34:49 [5036] <4> fileslist: list_lc_numeric = C
> 07:34:49 [5036] <4> fileslist: directories_only = 0
> 07:34:49 [5036] <4> fileslist: include_extra_info = 0
> 07:34:49 [5036] <4> fileslist: client_uid = 0
> 07:34:49 [5036] <4> fileslist: client_gid = 1
> 07:34:49 [5036] <4> fileslist: ignore_case = 0
> 07:34:49 [5036] <2> fileslist: fims00
> 07:34:49 [5036] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:34:49 [5036] <2> fileslist: begin db communication
> 07:34:49 [5036] <2> fileslist: criteria sent to db mgr
> 07:34:52 [5036] <4> process_request: EXIT STATUS 0
> 07:34:52 [18205] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:35:44 [5089] <4> connected_peer: Connection from host fims00, 172.20.77.215, on non-reserved port 46146
> 07:35:44 [5089] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:35:44 [5089] <4> get_type_of_client_port: db_getCLIENT() failed: no entity was found (227)
> 07:35:44 [5089] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:35:44 [5089] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:35:45 [5089] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:35:45 [5089] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:35:45 [5089] <4> get_type_of_client_list_restore: db_getCLIENT() failed: no entity was found (227)
> 07:35:45 [5089] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:35:45 [5089] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:35:45 [5089] <4> get_type_of_client_list_restore: db_getCLIENT_by_hostname() failed: no entity was found (227)
> 07:35:45 [5089] <4> get_type_of_client_free_browse: db_getCLIENT() failed: no entity was found (227)
> 07:35:45 [5089] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:35:45 [5089] <2> db_getCLIENT_by_hostname: db_CLIENTreceive: no entity was found 227
> 07:35:45 [5089] <4> get_type_of_client_free_browse: db_getCLIENT_by_hostname() failed: no entity was found (227)
> 07:35:45 [5089] <2> adjust_clientname: adjust_clientname(fims00, fims00, fims00, fims00, 1)
> 07:35:45 [5089] <2> getsockconnected: host=fims00 service=bpdbm address=172.20.77.215 protocol=tcp non-reserved port=13721
> 07:35:45 [5089] <2> adjust_clientname: adjusted clientname = fims00, client_bp_conf_name = fims00
> 07:35:45 [5089] <2> getsockconnected: host=fwac00 service=bpcd
=== message truncated ===