Machine Reliability Test

Machine stability refers to the ability of a machine or mechanical system to perform its intended tasks effectively and reliably, without any unwanted disturbances that could affect its function or lifespan. Stable machines operate more efficiently and effectively, delivering higher precision and output while instability can lead to dangerous situations, such as equipment breakdowns, which could be harmful to workers and lead to accidents. By identifying potential problems before they escalate, predictive maintenance techniques can be employed. Maintenance activities can be scheduled at the right time (before a failure occurs), rather than relying on reactive maintenance, which can be both expensive and inefficient.

This test reports the percentage of machines that are stable. The detailed diagnosis helps administrator to drill down and analyze the reasons for instability in machines. By detecting and correcting minor issues before they evolve into major ones, the machine's availability is increased, contributing to more continuous and productive operations.

Target of the test :Any host system

Agent deploying the test : An internal agent

Outputs of the test : One set of results for every host system being monitored.

Configurable parameters for the test:
Parameter Description

Test Period

How often should the test be executed

Host

The host for which the test is to be configured.

Port

The port at which the specified Host listens. By default, this is NULL.

Stability Interval Minutes

Specify the time duration (in minutes) over which the test should evaluate system events to determine the machine’s stability in this text box. This test analyze the events occurring within the specified time interval to identify patterns of instability, such as application crashes, system hangs, or other critical issues. By default, this set to 120 minutes.

DD Frequency

Refers to the frequency with which detailed diagnosis measures are to be generated for this test. The default is 1:1. This indicates that, by default, detailed measures will be generated every time this test runs, and also every time the test detects a problem. You can modify this frequency, if you so desire. Also, if you intend to disable the detailed diagnosis capability for this test, you can do so by specifying none against DD FREQUENCY.

Detailed Diagnosis

To make diagnosis more efficient and accurate, eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option.

The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

  • The eG manager license should allow the detailed diagnosis capability

  • Both the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.

Measurements made by the test
Measurement Description Measurement Unit Interpretation

Machine stability index

Indicates the percentage of machines that were stable.

Percent

A higher value (close to 100%) for this measure means the machine is highly stable, with few or no significant issues.

A low value may be reported for this measure when the machine encounters frequent Application crashes, OS hangs, driver failures, unexpected reboots and system errors logged in the Event Viewer. In such cases, the machine may require user's immediate attention to maintain or restore stability.

The detailed diagnosis of this measure provides the date and time, event ID, event type, source name, application name, application path, and event message associated with the events impacting the machine's stability index.