RE: [nv-l] graph and daemon problems
2004-04-29 09:40:54
It's hard to imagine what might be wrong
since you are so far downlevel. Lots of maintenance has been delivered
since then, and as I said, I am not expert in this area. I don't
work on data collection nor graphing.
My only suggestion is that if you have
lots and lots of data then you might go into /usr/OV/app-defaults/Xnm and
increase you maxMallocPoints from the default of 5120 to something much
greater, say four times that amount.
But the real answer may lie in what
happens when you try to graph. Do you get an error message when you
try to graph or what?
Other than this, I have no suggestions.
I am hoping that some other user who does graphing all the time will
have some suggestions. Because if not, you will have to open a problem
to Support.
James Shanks
Level 3 Support for Tivoli NetView for UNIX and Windows
Tivoli Software / IBM Software Group
"Ioannis Yerou"
<ioannisi AT ieee DOT org>
Sent by: owner-nv-l AT lists.us.ibm DOT com
04/29/2004 07:31 AM
|
To
| <nv-l AT lists.us.ibm DOT com>
|
cc
|
|
Subject
| RE: [nv-l] graph and daemon
problems |
|
James,
Thanks a lot for the input.
First, I checked for the ovwdb and you were right. The number in cache
was 10000 and the objects number was 12000. Therefore, I changed the number
to 15000. Now, regarding the graph I changed the collection time to 10
min but still it does not seem to work. What I have noticed in the log
is maybe what you are calling gaps. For many routers it says is not up.
Any other input regarding the
graph problem will be appreciated.
We are planning to upgrade
the version of Netview to the latest.
Thanks again,
Regards,
Ioannis
-----Original Message-----
From: owner-nv-l AT lists.us.ibm DOT com [mailto:owner-nv-l AT lists.us.ibm DOT com]
On Behalf Of James Shanks
Sent: Wednesday, April 28, 2004 4:16 PM
To: nv-l AT lists.us.ibm DOT com
Subject: RE: [nv-l] graph and daemon problems
OVs_NON_WELL_BEHAVED is not an error condition but an indication that these
daemons can stay up independently of the core daemons like ovwdb, pmd,
and trapd. The ones you cite will always be OVs_NON_WELL_BEHAVED,
as will others like nvpagerd.
You high ovwdb CPU rate on the other hand is not good. Is your cache
size properly adjusted? Do "ovobjprint -S" and see what
you object count is. Then make certain that your cache size on ovwdb
is at least 20% greater. Otherwise the daemon will spend a lot of
time allocating and reallocating storage to hold your object database in
memory.
And you really should plan to move to a supported level of the code. 7.1.1
has not been supported ever since 7.1.3 came out over two years ago. The
current maintenance level on 7.1.3 is FixPack2, which means that you are
5 version of the code downlevel so far. FixPack3 for 7.1.3 is in
the works now.
But none of these things has anything to do with your problem. graphing
is done by xnmgraph on data collected by snmpCollect.
I am not an expert in this area but I do know that data will not graph
if there are gaps in the data, and I'll bet that if you look in the snmpCollect
log you will find lots of errors, producing those gaps. Collections
get deferred if an error occurs, which produces those gaps. Offhand,
I would guess that you cannot successfully poll 200 routers for data in
3 minutes. If you quadruple the number of devices to collect from,
I'd suspect you have to more than quadruple the collection interval.
Anybody else?
James Shanks
Level 3 Support for Tivoli NetView for UNIX and Windows
Tivoli Software / IBM Software Group
"Ioannis Yerou"
<ioannisi AT ieee DOT org>
Sent by: owner-nv-l AT lists.us.ibm DOT com
04/28/2004 08:11 AM
|
To
| <nv-l AT lists.us.ibm DOT com>
|
cc
|
|
Subject
| RE: [nv-l] graph and deamon
problems |
|
Hi to all,
It has been some days now that the grapher stopped to display the BandwidthUtilIn
and BandwidthUtilOut (from MIB Data Collection) for the interfaces of the
routers selected. Even though I can see that the data is collected for
all the routers. I have changed the time interval from 1 min to 3 min but
no effect. I should also mention that I was collecting data for about 50
routers without problems and the last month that number was increased to
200. I noticed also the following things which I don’t know if they are
related. Two deamons trapgend and mgragentd even though running are OVs_NON_WELL_BEHAVED.
The last thing is that the process running the ovwdb takes about 99 to
100% of CPU continuously.
Any Ideas?
Netview 7.1.1
AIX 4.3
ARN Nortel Routers
Regards,
Ioannis
|
|
|