Panorama Device Health Test

This test monitors the devices connected to the Panorama and reports the state of each device. This way, administrators can figure out whether/not the devices connected to the target panorama is active, and if not, can also determine where the source of the problem lies – is it with the memory? the fans? the CPU? or the PSUs? Once the area of concern is isolated, administrators can use the Panorama Device Health test that deep dives into that realm of performance to accurately diagnose the root-cause of the problem.

Target of the test: Palo Alto Panorama

Agent deploying the test: A Remote Agent

Outputs of the test: One set of results for each device connected to the Palo Alto Panorama that is being monitored.

Configurable parameters for the test

Parameter

Description

Test period

How often should the test be executed.

Host

The IP address of the target host to be monitored.

Port

Specify the port at which the specified host listens to.

API Key

The eG agent collects the required metrics from the target Palo Alto Panorama by executing API commands using XML API and pulls out critical metrics. In order to collect metrics, the eG agent should be provided with a valid API key.

SSL

By default, this flag is set to Yes indicating that the SSL (Secured Socket Layer) is used to connect to the target Palo Alto Panorama. If not so, set the SSL flag to No .

Detailed Diagnosis

To make diagnosis more efficient and accurate, the eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option.

The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

  • The eG manager license should allow the detailed diagnosis capability
  • Both the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.
Measurements made by the test

Measurement

Description

Measurement Unit

Interpretation

Throughput

Indicates the rate at which this device processes network traffic.

Kbps

A consistent drop in the value of this measure could indicate that this device connected to the target panorama does not have adequate bandwidth resources for processing network traffic.

Connections per second

Indicates the number of peer devices connected to this device per second.

Number

An abnormally high value for this measure could indicate a probable virus or spam attack to this device.

Sessions count

Indicates the number of sessions currently open for this device.

Number

 

CPU utilization - Data plane

Indicates the total CPU utilization of this device on the data plane.

Percent

The data plane is responsible for forwarding data packets between devices in the network.

This measure indicates the percentage of CPU utilized in this device while forwarding data packets.

CPU utilization - Management plane

Indicates the total CPU utilization of this device on the management plane.

Percent

The management plane focuses on managing and monitoring the network’s operations. It monitors device performance, network health, and resource utilization. The management plane handles tasks like software updates, security, and ensuring reliable operations.

This measure denotes the percentage of CPU utilized for managing and monitoring network operations.

Memory utilization - Management plane

Indicates the total memory utilized by this device on the management plane.

Percent

This measure denotes the percentage of memory utilized for managing and monitoring network operations.

Total fans

Indicates the total number of fans available in this device.

Number

 

Fans in use

Indicates the number of fans in this device that are currently in use.

Number

 

Total power supplies

Indicates the total number of power supplies on this device.

Number

 

Power supplies in use

Indicates the number of PSUs in this device that are currently in use.

Number

 

Total ports

Indicates the total number of ports on this device.

Number

 

Ports in use

Indicates the number of ports in this device that are currently in use.

Number

 

State

Indicates the high availability state of this device.

 

The values reported by this measure and its numeric equivalent are mentioned in the table below:

Measure Value Numeric Value
Passive 0
Active 1

Note:

By default, this measure reports the above-mentioned Measure Values while indicating the high availability of the device connected to the target Palo Alto Panorama. However, in the graph of this measure, states will be represented using the corresponding numeric equivalents only - i.e., 0 or 1.

Synchronization status

Indicates the synchronization status of the HA device connected to this device.

 

The values reported by this measure and its numeric equivalent are mentioned in the table below:

Measure Value Numeric Value
Unsynchronized 0
Synchronized 1

Note:

By default, this measure reports the above-mentioned Measure Values while indicating the synchronization status of the HA device connected to this device in the target Palo Alto Panorama. However, in the graph of this measure, states will be represented using the corresponding numeric equivalents only - i.e., 0 or 1.

Has the system been rebooted?

Indicates whether this device has been rebooted or not.

 

The values reported by this measure and its numeric equivalent are mentioned in the table below:

Measure Value Numeric Value
No 0
Yes 1

Note:

By default, this measure reports the above-mentioned Measure Values while indicating the reboot state of the device connected to the target Palo Alto Panorama. However, in the graph of this measure, states will be represented using the corresponding numeric equivalents only - i.e., 0 or 1.

Uptime during the last measure period

Indicates the time period that this device has been up since the last time this test ran.

Seconds

If the device has not been rebooted during the last measurement period and the agent has been running continuously, this value will be equal to the measurement period. If the panorama was rebooted during the last measurement period, this value will be less than the measurement period of the test. For example, if the measurement period is 300 secs, and if the panorama was rebooted 120 secs back, this metric will report a value of 120 seconds. The accuracy of this metric is dependent on the measurement period - the smaller the measurement period, greater the accuracy.

Total uptime of the system

Indicates the total time that this device has been up since its last reboot.

Minutes

Administrators may wish to be alerted if the panorama has been running without a reboot for a very long period. Setting a threshold for this metric allows administrators to determine such conditions.