Nutanix Alerts Test

Nutanix Prism Central is a management solution that provides a single-pane-of-glass interface to manage and monitor Nutanix clusters and resources. Alerts in Nutanix Prism Central are notifications that inform administrators and operators about the status of the infrastructure and applications managed by Nutanix. These alerts are crucial for proactive monitoring and troubleshooting. Nutanix Prism Central generates various types of alerts to keep you informed about the health and performance of your infrastructure. These alerts can include hardware issues, software errors, performance anomalies, and other critical events.

This test monitors the alerts and collect the key metrics like open alerts, critical alerts, warnings etc. This information is vital for administrators to understand the problem areas and start acting if there is any problem.

Target of the test : A Nutanix Prism Central

Agent deploying the test : A remote agent

Outputs of the test : One set of results for Prism central node being monitored.

Configurable parameters for the test
Parameter Description

Test Period

How often should the test be executed

Host

The host for which the test is to be configured.

Port

The port at which the specified host listens. By default, this is 9440.

Nutanix Prism Central User, Nutanix Prism Central Password and Confirm Password

To connect to the Nutanix Prism Element and collect metrics from it, the eG agent should be configured with the credentials of a Prism Element user with the Viewer role. The steps for creating such a user are detailed in the Pre-requisites for monitoring Nutanix Prism Central

Confirm the Nutanix Prism Element password by retyping it in Confirm Password textbox.

SSL

By default, the Nutanix Prism Element server is SSL-enabled. Accordingly, the SSL flag is set to Yes by default. This indicates that the eG agent will communicate with the Prism Element server via HTTPS by default.

WebPort

By default, the Nutanix Prism Element server listens on port 9440. This implies that while monitoring a Nutanix AHV server via the Prism server, the eG agent connects to port 9440.

DD Frequency

Refers to the frequency with which detailed diagnosis measures are to be generated for this test. The default is 1:1. This indicates that, by default, detailed measures will be generated every time this test runs, and also every time the test detects a problem. You can modify this frequency, if you so desire. Also, if you intend to disable the detailed diagnosis capability for this test, you can do so by specifying none against DD frequency.

Detailed Diagnosis

To make diagnosis more efficient and accurate, the eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option.

The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

  • The eG manager license should allow the detailed diagnosis capability
  • Both the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.
Measurements made by the test
Measurement Description Measurement Unit Interpretation

All open alerts

Indicates the total number of open alert during the last measurement period.

Number

If the number of alerts is more than the threshold, it needs t be investigated.

Recently open alerts

Indicates the alerts opened during the last measurement period.

Number

The most recently open alerts can make the most impact when investigated.

The detailed diagnosis of this measure lists the additional metrics including Title, Source Entity and Impact Type.

Open critical alerts

Indicates the total number of open critical alerts during the last measurement period.

Number

Critical alerts should be investigated immediately as they may be indicating something which might run into error.

Recently open critical alerts

Indicates the critical alerts opened during the last measurement period.

Number

These should be the top priority as the recent alerts are the ones which make the most impact.

The detailed diagnosis of this measure lists the additional metrics including Title, Source Entity and Impact Type.

Open warning alerts

Indicates the total number of open warning alerts during the last measurement period.

Number

Warning alerts can provide very important insights into the current problems in the system and can help avoiding any issues.

Recently open warning alerts

Indicates the warning alerts opened during the last measurement period.

Number

The recently open ones are the most critical, so these should be investigates on priority.

The detailed diagnosis of this measure lists the additional metrics including Title, Source Entity and Impact Type.

Open info alerts

Indicates the total number of open information alerts during the last measurement period.

Number

Info alerts are only about information like completion of some event, arrival of file etc.

Recently open info alerts

Indicates the information alerts opened during the last measurement period.

Number

Recently open ones can be used to record information and provide the same to users.

The detailed diagnosis of this measure lists the additional metrics including Title, Source Entity and Impact Type.