EMC Unity Storage Processors Test

This test auto-discovers the storage processors on the target storage system and, for each storage processor, reports the current health, uptime and temperature. Using this test, administrators can be alerted to storage processors that have failed or degraded, reboot too often, or operate outside the normal temperature range, and can take remedial steps to replace those storage processors before end users start complaining.

Target of the test : An EMC Unity Storage System

Agent deploying the test : A remote agent

Outputs of the test : One set of results for each Storage Processor on the target storage system being monitored.

Configurable parameters for the test

Parameter

Description

Test Period

How often the test should be executed.

Host

The IP address of the storage system for which this test is to be configured.

UniSphere CLI Path

The eG agent uses the command-line utility, UEMCli.exe, which is part of the UniSphere Management Suite, to communicate with and monitor the storage device. To enable the eG agent to invoke the CLI, configure the full path to the CLI here.

Username and Password

In the Username and Password text boxes, provide the credentials of a user with the 'monitor' role for accessing the storage system.

Confirm Password

Confirm the Password by retyping it in the Confirm Password text box.

Timeout

Specify the duration (in seconds) beyond which this test should time out in the Timeout text box. The default is 300 seconds.

Detailed Diagnosis

To make diagnosis more efficient and accurate, eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, choose the Off option.

The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

  • The eG manager license should allow the detailed diagnosis capability
  • Both the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.
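
As a rough illustration of how the UniSphere CLI Path, Host, Username, Password and Timeout parameters fit together, the sketch below assembles a CLI command line and runs it with a timeout. It is not the eG agent's actual implementation. The `-d`, `-u` and `-p` switches and the `/env/sp show -detail` object path are assumed from the usual Unisphere CLI syntax, and the function names are hypothetical; verify the syntax against the CLI guide for your release.

```python
import subprocess


def build_sp_command(cli_path, host, username, password):
    """Assemble the argument list for a storage processor query.

    Assumption: the -d/-u/-p switches and the '/env/sp show -detail'
    object path follow the usual Unisphere CLI syntax.
    """
    return [cli_path, "-d", host, "-u", username, "-p", password,
            "/env/sp", "show", "-detail"]


def run_sp_query(cli_path, host, username, password, timeout=300):
    """Run the query and return its text output, or None on timeout.

    The 300-second default mirrors the default of the Timeout parameter.
    """
    command = build_sp_command(cli_path, host, username, password)
    try:
        result = subprocess.run(command, capture_output=True, text=True,
                                timeout=timeout)
    except subprocess.TimeoutExpired:
        return None
    return result.stdout
```

Passing the arguments as a list, rather than as a single shell string, avoids quoting problems when the CLI path or the password contains spaces or special characters.
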
Measurements made by the test

Measurement

Description

Measurement Unit

Interpretation

Health state

Indicates the current health of this storage processor.

 

The values reported by this measure and its numeric equivalents are mentioned in the table below:

Measure value Numeric Value
Normal 0
Attention 1
Unknown 2
Degraded/Warning 3
Minor failure 4
Major failure 5
Critical failure 6
Non-recoverable error 7

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the current health of this storage processor. The graph of this measure, however, is represented using only the numeric equivalents: 0 to 7.

The detailed diagnosis of this measure lists the storage processor name, the health, the name of the disk processor enclosure, the manufacturer model and mode of the storage processor.

Memory size

Indicates the memory size of this storage processor.

GB

 

Peak utilization

Indicates the maximum percentage of core utilization detected on this storage processor.

Percentage

A consistently high value for this measure is a cause for concern.

The detailed diagnosis of this measure lists the name of the storage processor, the VNX time and the CPU utilization of the storage processor.

Average utilization

Indicates the average percentage of core utilization detected on this storage processor.

Percentage

 

Uptime

Indicates the total time that this storage processor has been up since its last reboot.

 

This measure displays the number of years, months, days, hours, minutes and seconds since the last reboot. Administrators may wish to be alerted if the storage processor has been running without a reboot for a very long period. Setting a threshold for this metric allows administrators to determine such conditions.

Is rebooted?

Indicates whether this storage processor was rebooted during the last measurement period.

 

The values reported by this measure and its numeric equivalents are mentioned in the table below:

Measure value Numeric Value
Yes 1
No 0

Note:

By default, this measure reports the Measure Values listed in the table above to indicate whether this storage processor was rebooted or not. The graph of this measure, however, is represented using only the numeric equivalents: 0 or 1.

If the value of this measure is Yes, it means that the storage processor was rebooted during the last measurement period. By checking the time periods when this metric changes from No to Yes, an administrator can determine the times when this storage processor was rebooted.

Temperature

Indicates the current temperature of this storage processor.

Celsius

Ideally, the temperature of this storage processor should be within the permissible range. An abnormally high temperature may lead to irreversible damage to the storage processor.
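
For readers who export these measures for their own analysis, the following sketch shows one way the numeric equivalents of the Health state and Is rebooted? measures might be interpreted. The function names and the sample timestamps are hypothetical; only the value tables come from this page.

```python
# Numeric equivalents of the Health state measure, as tabulated above.
HEALTH_STATES = {
    0: "Normal",
    1: "Attention",
    2: "Unknown",
    3: "Degraded/Warning",
    4: "Minor failure",
    5: "Major failure",
    6: "Critical failure",
    7: "Non-recoverable error",
}


def health_label(value):
    """Translate a graphed numeric value back to its measure value."""
    return HEALTH_STATES.get(value, "Unrecognized value")


def reboot_times(samples):
    """Return the timestamps at which Is rebooted? changed from No (0) to Yes (1).

    `samples` is a chronologically ordered list of (timestamp, value) pairs.
    """
    times = []
    previous = 0  # assumption: treat the period before the first sample as "No"
    for timestamp, value in samples:
        if previous == 0 and value == 1:
            times.append(timestamp)
        previous = value
    return times


# Made-up sample data for illustration only.
samples = [("10:00", 0), ("10:05", 1), ("10:10", 0), ("10:15", 0), ("10:20", 1)]
print(health_label(3))        # Degraded/Warning
print(reboot_times(samples))  # ['10:05', '10:20']
```

Recording only the No-to-Yes transitions, rather than every Yes sample, keeps a reboot that is reported across several consecutive measurement periods from being counted more than once.
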