XIO SC InfiniBand Switches Test

This test reports whether/not each Infiniband Switch is enabled. This test also helps administrators to determine the availability of each Infiniband Switch, the current health of the switches and the state of the ports on the Infiniband Switches. Using this test, administrators can easily identify the infiniband switch that is prone to errors.

Target of the test : An EMC XtremIO Storage array

Agent deploying the test : A remote agent

Outputs of the test : One set of results for each Storage Controller:InfiniBand Switch of the target EMC XtremIO being monitored

Configurable parameters for the test
Parameter Description

Test Period

How often should the test be executed.

Host

The IP address of the storage device for which this test is to be configured.

Port

The port number at which the storage array listens. The default is NULL.

XtremIO User and XtremIO Password

Provide the credentials of a user who has read only privileges to access the XtremIO storage array in the XtremIO User and XtremIO Password text boxes.

Confirm Password

Confirm the password by retyping it here.

SSL

The eG agent collects performance metrics by invoking Restful APIs on the target Storage array. Typically, the Restful APIs can be invoked through the HTTP or the HTTPS mode. By default, the eG agent invokes the Restful APIs using the HTTPS mode. This is why, the SSL flag is set to Yes by default. If the target storage array is not SSL-enabled, then the Restful APIs can be accessed through the HTTP mode only. In this case, set the SSL flag to No.

XMS IP

This parameter is applicable only for EMC XtremIO 4.x. By default, None will be chosen from this list. If the target EMC XtremIO storage array is within a XMS Management Server that is auto-discovered, then the IP or host name of that XMS Management Server will be displayed in this list. Select that particular XMS IP to configure this test. If you wish to monitor an EMC XtremIO Storage Array that is either not an integral part of the auto-discovered XMS Management Server or a brand new EMC XtremIO Storage Array, choose the Other option. This will enable you to add a new XMS Managament Server. To know how to add a new XMS Management Server, refer to Adding a new XMS.

DD Frequency

Refers to the frequency with which detailed diagnosis measures are to be generated for this test. The default is 2:1. This indicates that, by default, detailed measures will be generated every second time this test runs, and also every time the test detects a problem. You can modify this frequency, if you so desire. Also, if you intend to disable the detailed diagnosis capability for this test, you can do so by specifying none against DD frequency.

Measurements made by the test
Measurement Description Measurement Unit Interpretation

InfiniBand link health level

Indicates the health of this InfiniBand Switch when linked to this Storage Controller.

 

The values reported by this measure and its numeric equivalents are mentioned in the table below:

Measure value Numeric Value
Level_1_clear 0
Level_2_unknown 1
Level_3_warning 2
Level_4_minor 3
Level_5_major 4
Level_6_critical 5

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the health of this InfiniBand Switch in the Storage Controller. The graph of this measure however is represented using the numeric equivalents only - 0 to 5.

InfiniBand port state

Indicates the state of the port available on this Storage Controller that is linked to this InfiniBand Switch.

 

The values reported by this measure and its numeric equivalents are mentioned in the table below:

Measure value Numeric Value
Up 0
Down 1

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the state of the port available in this InfiniBand Switch of the Storage Controller. The graph of this measure however is represented using the numeric equivalents only - 0 or 1.

Connection between Storage Controllers

Indicates the health of this InfiniBand Switch connection between the Storage Controllers.

 

The values reported by this measure and its numeric equivalents are mentioned in the table below:

Measure value Numeric Value
Healthy 0
Not_node 1
Wrong_port 2

Note:

By default, this measure reports the Measure Values listed in the table above to indicate the health of this InfiniBand Switch connection between the Storage Controllers. The graph of this measure however is represented using the numeric equivalents only - 0 to 2.

Number of times InfiniBand port was down

Indicates the number of times the port to which this Infiniband Switch was connected was down.

Number

Ideally, the value of this measure should be zero.

InfiniBand link error recoveries

Indicates the number of times the port to which this InfiniBand Switch was connected successfully completed a link error recovery procedure.

Number

 

InfiniBand local link integrity errors

Indicates the number of times logical link integrity errors were encountered by the port to which this InfiniBand Switch was connected.

Number

Ideally, the value of this measure should be zero.

InfiniBand port received errors

Indicates the number of packets that were received by the port of this InfiniBand Switch of the Storage Controller with errors.

Number

Ideally, the value of this measure should be zero.

Remote physical errors

Indicates the number of remote physical errors encountered by the port to which this InfiniBand Switch was connected.

Number

Ideally, the value of this measure should be zero.

InfiniBand symbol errors

Indicates the number of symbol errors encountered by the port of this InfiniBand Switch of the Storage Controller.

Number

Ideally, the value of this measure should be zero.