Var Adm Messages Test

This test reports the count of new CPU and memory errors that have occurred between two test runs.

Target of the test : A Solaris

Agent deploying the test : An internal agent

Outputs of the test : One set of results for the every error pattern configured

Configurable parameters for the test
  1. TEST PERIOD - How often should the test be executed
  2. Host - The host for which the test is to be configured
  3. port – The port at which the host listens
  4. alertfile - The full path to the alert file that needs to be monitored. By default, /var/adm/messages will be displayed therein.

    Also, instead of a specific log file, the path to the directory containing alert files can be provided - eg., /user/logs. This ensures that eG monitors the most recent files in the specified directory. If while monitoring a log file in a directory a newer log file gets added to that directory, then eG will first finish monitoring the original log file and then start monitoring the new one. Specific log file name patterns can also be specified, so that the log file(s) monitored are restricted to files that match the specified patterns. For example, to monitor the latest log files with names containing the strings 'dblogs' and 'applogs', the parameter specification can be, /tmp/db/*dblogs*,/tmp/app/*applogs*. Here, '*' indicates leading/trailing spaces (as the case may be).

    The eG monitor interface will report one set of measurements for every configured path. You can also configure the path in the following format:Name@logfilepath. Here, Name represents the display name of the path being configured. Accordingly, the parameter specification for the 'dblogs' and 'applogs' example discussed above can be: dblogs@/tmp/db/*dblogs*,applogs@/tmp/app/*applogs*. In this case, the display names 'dblogs' and 'applogs' will alone be displayed as descriptors of the test, and not the individual paths.

  5. searchpattern - input the error patterns to search for in the specified alert file in the following format: PatternName:ErrorPattern. Here, PatternName refers to the display name of the error pattern. In other words, it is this name that will be displayed as an info (descriptor) of the VarAdmMsgsTest in the eG monitor interface. The ErrorPattern refers to the pattern of errors to search for in the alert file. An error pattern can be expressed in any of the following forms - *expr* or expr or *expr or expr* or *expr1*expr2*... or expr1*expr2, etc. A leading '*' signifies any number of leading characters, while a trailing '*' signifies any number of trailing characters. For example, by providing the entry Kernel_errors:*kern* here, you can instruct the eG Enterprise system to search for errors containing the text 'kern'. Statistics related to these errors will be displayed in the eG monitor interface when the info Kernel_errors is clicked on. Multiple error patterns can be monitored as a comma-separated list. For example, Kernel_errors:*kern*,Memory_errors:*AFT*. A single pattern may also be of the form e1+e2, where + signifies an OR condition. That is, the PatternName is matched if either e1 is true or e2 is true.
  6. lines - To enable eG to provide additional information about the errors in the detailed diagnosis page, you can specify in the LINES text box the number of lines of text below and above the 'error line' (in the alert file) that the detailed diagnosis page should display. This specification should be in the format: No. of lines above:No. of lines below. By default, this is set to '0:0', which will display only the error line in the detailed diagnosis page. If you set it to 2:3, then besides the error line, 2 lines above and 3 lines below the error line will also be displayed in the detailed diagnosis page.
  7. excludepattern - Provide a comma-separated list of patterns to be excluded from monitoring in the EXCLUDEPATTERN text box. For example *critical*,*exception*. By default, this parameter is set to 'none'.
  8. uniquematch - By default, the UNIQUEMATCH parameter is set to FALSE, indicating that, by default, the test checks every line in the log file for the existence of each of the configured SEARCHPATTERNS. By setting this parameter to TRUE, you can instruct the test to ignore a line and move to the next as soon as a match for one of the configured patterns is found in that line. For example, assume that Pattern1:*fatal*,Pattern2:*error* is the SEARCHPATTERN that has been configured. If UNIQUEMATCH is set to FALSE, then the test will read every line in the log file completely to check for the existence of messages embedding the strings 'fatal' and 'error'. If both the patterns are detected in the same line, then the number of matches will be incremented by 2. On the other hand, if UNIQUEMATCH is set to TRUE, then the test will read a line only until a match for one of the configured patterns is found and not both. This means that even if the strings 'fatal' and 'error' follow one another in the same line, the test will consider only the first match and not the next. The match count in this case will therefore be incremented by only 1.
  9. rotatingfile - By default, the ROTATINGFILE parameter is set to FALSE. To instruct the eG Enterprise system to monitor newer log files also, set this parameter to TRUE. Otherwise, set it to FALSE.
  10. DETAILED DIAGNOSIS – To make diagnosis more efficient and accurate, the eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option.

    The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

    • The eG manager license should allow the detailed diagnosis capability
    • Both the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.
Measurements made by the test
Measurement Description Measurement Unit Interpretation

Recent errors:

Indicates the number of new CPU and memory errors that occurred between two test runs.

Number

The detailed diagnosis of this measure, if enabled, will list the recent errors and additional lines of information pertaining to the errors (if configured).