VNXe Storage Processors Test

This test auto-discovers the storage processors on the target storage system and for each storage processor, reports the current health, uptime and the temperature. Using this test, administrators may be alerted to storage processors that failed/degraded, rebooted too often and operating at abnormal temperature range and take remedial steps to replace those storage processors before end users start complaining.

Target of the test : An EMC VNXe Storage system

Agent deploying the test : A remote agent

Outputs of the test : One set of results for each Storage Processor on the target storage system being monitored

Configurable parameters for the test
Parameters Description

Test Period

How often should the test be executed.

Host

The host for which the test is to be configured.

Unisphere CLI Path

The eG agent uses the command-line utility, UEMCli.exe, which is part of the UniSphere Management Suite, to communicate with and monitor the storage device. To enable the eG agent to invoke the CLI, configure the full path to the CLI in the UniSphere CLI Path text box.

Username and Password

Provide the credentials of a user possessing monitor role to access the storage device in the User Name and Password text boxes.

Confirm Password

Confirm the password by retyping it here.

VNXe Version

Select the version of the target EMC VNXe storage system that is to be monitored from this list. By default, 1600 will be chosen from this list.

Detailed Diagnosis

To make diagnosis more efficient and accurate, the eG Enterprise suite embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option.

The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

  • The eG manager license should allow the detailed diagnosis capability
  • Both the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.
Measurements made by the test
Measurement Description Measurement Unit Interpretation

Health state

Indicates the current health of this storage processor.

 

The values reported by this measure and its numeric equivalents are mentioned in the table below:

Measure value Numeric Value
Normal 0
Attention 1
Unknown 2
Degraded/Warning 3
Minor failure 4
Major failure 5
Critical failure 6
Non-recoverable error 7

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the current health of this storage processor. The graph of this measure however is represented using the numeric equivalents only - 0 to 7.

The detailed diagnosis of this measure lists the storage processor name, the health , the name of the disk processor enclosure, the manufacturer model and mode of the storage processor.

Memory size

Indicates the current memory of this storage processor.

GB

 

Peak utilization

Indicates the maximum percentage of core utilization detected on this storage processor.

Percentage

A consistently high value for this measure is a cause of concern.

The detailed diagnosis of this measure lists the name of the storage processor, the VNX time and the CPU utilization of the storage processor.

Average utilization

Indicates the average percentage of core utilization detected on this storage processor.

Percentage

 

Uptime

Indicates the total time that this storage processor has been up since its last reboot.

 

This measure displays the number of years, months, days, hours, minutes and seconds since the last reboot. Administrators may wish to be alerted if the storage processor has been running without a reboot for a very long period. Setting a threshold for this metric allows administrators to determine such conditions.

Is rebooted?

Indicates whether this storage processor has been rebooted during the last measurement period or not.

 

If the value of this measure is Yes, it means that the storage processor was rebooted during the last measurement period. By checking the time periods when this metric changes from No to Yes, an administrator can determine the times when this storage processor was rebooted.

 

Temperature

Indicates the current temperature of this storage processor.

Celsius

Ideally, the temperature of this storage processor should be within permissible range. An abnormally high temperature of may lead to irreversible damage of the storage processor