RE: [nv-l] graph and daemon problems
2004-04-28 09:16:50
OVs_NON_WELL_BEHAVED is not
an error condition but an indication that these daemons can stay up independently
of the core daemons like ovwdb, pmd, and trapd. The ones you cite
will always be OVs_NON_WELL_BEHAVED, as will others like nvpagerd.
Your high ovwdb CPU rate, on
the other hand, is not good. Is your cache size properly adjusted?
Run "ovobjprint -S" and see what your object count is. Then
make certain that your cache size on ovwdb is at least 20% greater. Otherwise
the daemon will spend a lot of time allocating and reallocating storage
to hold your object database in memory.
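To make the 20% rule concrete, here is a minimal sketch of the arithmetic. The function name and the sample object count are made up for illustration; the real count has to be read from the "ovobjprint -S" output by hand, since this sketch does not parse that output or change any ovwdb setting.

```python
def min_cache_size(object_count, headroom_percent=20):
    """Smallest whole number that is at least headroom_percent above object_count."""
    if object_count < 0:
        raise ValueError("object_count must not be negative")
    # Integer arithmetic avoids float rounding; adding 99 rounds up to the next whole object.
    return (object_count * (100 + headroom_percent) + 99) // 100


# Hypothetical object count, for illustration only.
print(min_cache_size(10000))
```

Anything at or above the printed value satisfies the "at least 20% greater" guideline for that object count.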
And you really should plan
to move to a supported level of the code. 7.1.1 has not been supported
ever since 7.1.3 came out over two years ago. The current maintenance
level on 7.1.3 is FixPack2, which means that you are 5 versions of the code
downlevel so far. FixPack3 for 7.1.3 is in the works now.
But none of these things has
anything to do with your problem. Graphing is done by xnmgraph on
data collected by snmpCollect.
I am not an expert in this
area but I do know that data will not graph if there are gaps in the data,
and I'll bet that if you look in the snmpCollect log you will find lots
of errors. Collections get deferred when an error
occurs, and that is what produces the gaps. Offhand, I would guess that you
cannot successfully poll 200 routers for data in 3 minutes. If you
quadruple the number of devices to collect from, I'd suspect you
have to more than quadruple the collection interval.
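One way to check for such gaps is to compare the spacing of consecutive samples against the collection interval. The sketch below assumes you have already extracted the sample timestamps (in seconds) for one interface into a list. The function name, the tolerance value, and the sample data are all hypothetical, and it does not read snmpCollect's own files.

```python
def find_gaps(timestamps, interval, tolerance=0.5):
    """Return (previous, next) timestamp pairs that are further apart
    than interval * (1 + tolerance) seconds."""
    limit = interval * (1 + tolerance)
    ordered = sorted(timestamps)
    # Compare each sample with the one that follows it.
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b - a > limit]


# Made-up samples for a 180-second (3-minute) collection interval.
samples = [0, 180, 360, 900, 1080]
print(find_gaps(samples, 180))  # [(360, 900)]
```

Each pair returned marks a stretch where at least one collection was missed or deferred.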
Anybody else?
James Shanks
Level 3 Support for Tivoli NetView for UNIX and Windows
Tivoli Software / IBM Software Group
"Ioannis Yerou"
<ioannisi AT ieee DOT org>
Sent by: owner-nv-l AT lists.us.ibm DOT com
04/28/2004 08:11 AM
To: <nv-l AT lists.us.ibm DOT com>
Subject: RE: [nv-l] graph and deamon problems
Hi to all,
It has been some days now since
the grapher stopped displaying the BandwidthUtilIn and BandwidthUtilOut
(from MIB Data Collection) for the interfaces of the selected routers,
even though I can see that the data is collected for all the routers. I
changed the time interval from 1 min to 3 min, but it had no effect. I should
also mention that I was collecting data for about 50 routers without problems,
and in the last month that number was increased to 200. I also noticed the
following things, which may or may not be related. Two daemons, trapgend
and mgragentd, are OVs_NON_WELL_BEHAVED even though they are running. The last thing
is that the ovwdb process takes about 99 to 100% of CPU continuously.
Any Ideas?
Netview 7.1.1
AIX 4.3
ARN Nortel Routers
Regards,
Ioannis