ADSM-L

Re: List of Error Messages

1997-05-22 18:52:00
Subject: Re: List of Error Messages
From: "Siska, Bob" <BSISKA AT NCS DOT COM>
Date: Thu, 22 May 1997 15:52:00 PDT
Here's the Email about monitoring error messages:
>Subject: Monitoring ADSM for errors.

>

>Howdy All,

>I am currently revisiting the way we monitor ADSM's activity.

>Our environment is 20 nodes backing up to 1 server (all on an SP)
overnight.
>We use NetView to monitor the file dsmerror.log. This issues all sorts of

>messages that don't need to be passed onto NetView.

>The manual "ADSM for AIX: Advanced Topics" recommends monitoring the
console
>log instead of the dsmerror.log (but doesn't say why).

>The console log is piped out to /dev/null. Is this the norm?

>Is there a mechanism for pruning the console log or dsmerror.log?

>It also says to monitor for specific messages & ignore the rest.

>I think it would be better to take a sample of the most common messages &

>filter out the messages you don't need to see. This way should ADSM issue
any
>new messages (in future versions) we won't be ignoring them.

>

>What's the consensus, monitor the console log or dsmerror.log?

>What does everyone else do?

>

>I'd welcome any opinions/advise before I start scripting.
>

What I do here is I have a perl script that runs the dsmserv process, grabs

the

console information and writes it to a file.  At the same time it checks the

error messages to decide which messages to notify on... either with E-mail,

pages or as in your case passing to NetView to do the notification.  Some of

the basic rules to classify the errors is to use the error typing of ADSM

(information, warning, errors, and critical errors).  Then on an exception

basis handle specific error codes.

 Information            ANR....I

 Warnings               ANR....W

 Errors                 ANR....E

 Critical Errors        ANR....D  &  ANR....S

I myself have set up to take the following actions (classes).

 Information            - take no action.

 Warnings               - only log in a warings log file.

 Errors                 - log in an error log file and send e-mail of
problem.
 Critical Errors        - log in an error log file , send e-mail, & send
page.
Now ss I stated, I do group specific ANR messsage into one of the above

classes
that would be different then its default class.  I case you are wondering my

break down is as follows......

Critical Errors Class:

ANR0130E ANR0131E ANR0132E ANR0359E ANR0360E ANR0361E ANR2700E ANR2707E

ANR2708E ANR4565E ANR4570E ANR4571E ANR4573E ANR4575E ANR4576E

ANR4577E ANR4578E ANR4579E ANR4580E ANR4582E ANR4583E ANR7823E ANR8469E

ANR0202W ANR0204W ANR0205W ANR0362W ANR0437W ANR0438W ANR0439W ANR0485W

ANR0522W ANR1025W ANR2574W ANR2575W ANR4581W

ANR4561I ANR4562I ANR4563I ANR4564I

Errors Class:

ANR2572W ANR0206W ANR0208W ANR0214W ANR0215W ANR4550I ANR8326I

Information Class: (basically these will be ignored)

ANR5124E ANR5233E ANR5241E ANR5307E ANR5308E ANR5311E ANR7821E ANR8304E

ANR8447E ANR2000E

Now this is the setup that works for me.  You may have other ANR messages
that
you wish to take a different action on then what I am taking.  But hopefully

this will give you a start in looking at the the message groupings.

Hope this helps;

Chuck T.




 ----------
From: owner-adsm-l
To: ADSM-L
Subject: List of Error Messages
Date: Thursday, May 22, 1997 2:26PM

     I had an email concerning a list of error messages that IBM
     recommended be monitored/captured and have subsequently lost it.

     Does anyone have this email or know where the list of the "top" error
     messages to catch are printed?  They are in a book and I've even seen
     them but can't find them now, when they are needed the most.

     Any suggestions please.

     Sherry Jackson
     Whirlpool Corporation
<Prev in Thread] Current Thread [Next in Thread>