Crash Details - OS Test
Event logs on Windows systems capture critical error conditions such as application and service crashes on the VMs, application and service hangs, and service errors. Since a crash or slowdown in any mission-critical program/service on a Windows system may affect the uptime of the dependent business services, administrators should be able to instantly capture these serious problem conditions, investigate the reasons for their occurrence, and promptly resolve them. This is exactly what the Crash Details - OS test helps administrators achieve! This test periodically scans the event logs on each Windows system and reports the count of crashes, hangs, and errors that occurred recently on that system. The detailed diagnostics provided by this test pinpoint the applications/services that crashed, hung, or encountered errors, and thus enable quick and efficient troubleshooting.
Target of the test : A Windows Systems Group
Agent deploying the test : A remote agent
Outputs of the test : One set of results for every Windows system being monitored
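The periodic scan described above can be approximated by hand when you want to cross-check what the test reports. The following is a minimal sketch, not the eG agent's actual implementation: it assumes the standard Windows `wevtutil qe` command and the event log XPath filter syntax, and the function names `build_event_query` and `build_wevtutil_command` are hypothetical. It builds a query for the event IDs listed in the measurements table of this document, limited to a window equal to the default Test Period of 5 minutes.

```python
def build_event_query(event_ids, window_seconds=300):
    """Return an event log XPath filter for the given event IDs.

    Only events logged within the last `window_seconds` match.
    The 300-second default mirrors the 5-minute default Test Period.
    """
    if not event_ids:
        raise ValueError("at least one event ID is required")
    id_clause = " or ".join(f"EventID={int(i)}" for i in event_ids)
    window_ms = int(window_seconds) * 1000  # timediff() works in milliseconds
    return (f"*[System[({id_clause}) and "
            f"TimeCreated[timediff(@SystemTime) <= {window_ms}]]]")


def build_wevtutil_command(log_name, event_ids, window_seconds=300):
    """Return an argument list suitable for subprocess.run on a Windows host."""
    query = build_event_query(event_ids, window_seconds)
    return ["wevtutil", "qe", log_name, f"/q:{query}", "/f:xml"]


if __name__ == "__main__":
    # Application crashes (1000) and hangs (1002) are written to the Application log.
    print(build_wevtutil_command("Application", [1000, 1002]))
    # Service crashes, hangs, and errors are written to the System log.
    print(build_wevtutil_command("System", [7031, 7022, 7023, 7024, 7026]))
```

Passing the command as an argument list avoids shell quoting problems with the `<=` operator in the filter.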
Parameter | Description |
---|---|
Test Period | How often the test should be executed. By default, this is set to 5 mins. |
Host | The nick name of the Windows Systems Group component for which this test is to be configured. |
Port | The port at which the specified Host listens. By default, this is NULL. |
Inside View Using | To obtain the 'inside view' of performance of the systems - i.e., to measure the internal performance of the systems - this test uses a light-weight eG VM Agent software deployed on each of the systems. Accordingly, this parameter is by default set to eG VM Agent. |
Report By User | This flag is set to No by default. This implies that the Windows systems in the environment will always be identified using the system name. In other words, this test will, by default, report measures for every systemname. On the other hand, if you want this test to report the measures for every user on a system, then set this flag to Yes. In such a case, this test will report the measures for every username_on_systemname. |
Report Powered OS | By default, this flag is set to Yes. In this case, the 'inside view' tests will report measures even for those Windows systems that do not have any users logged in currently. Such systems will be identified by their name and not by the username_on_systemname. On the other hand, if this flag is set to No, then this test will not report measures for those systems to which no users are logged in currently. |
Is Cloud VMs? | This flag is set to Yes by default. The value of this flag cannot be changed. This implies that the cloud-based Windows systems in the environment will always be identified using the login name of the user. In other words, in cloud environments, this test will, by default, report measures for every username_on_systemname. |
Ignore Applications and Services | By default, this parameter is set to none. This means that the test will monitor all applications and services running on the system, and will alert you every time any of these applications/services crashes or hangs. If you want to exclude specific applications/services on the systems from the monitoring scope of this test, then provide a comma-separated list of application/service names here. |
DD Frequency | Refers to the frequency with which detailed diagnosis measures are to be generated for this test. The default is 1:1. This indicates that, by default, detailed measures will be generated every time this test runs, and also every time the test detects a problem. You can modify this frequency, if you so desire. Also, if you intend to disable the detailed diagnosis capability for this test, you can do so by specifying none against DD Frequency. |
Detailed Diagnosis | To make diagnosis more efficient and accurate, eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option. The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled: |
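To illustrate how the Ignore Applications and Services, Report By User, and Report Powered OS parameters interact, here is a minimal sketch. It is not eG Enterprise code: the function names are hypothetical, and the case-insensitive name matching is an assumption made for the example, since the parameter description does not say how names are compared.

```python
def parse_ignore_list(value):
    """Turn the Ignore Applications and Services value into a set of names.

    'none' (the default) or an empty value means nothing is excluded.
    Names are lowercased for comparison; that is an assumption of this sketch.
    """
    if value is None or value.strip().lower() in ("", "none"):
        return set()
    return {name.strip().lower() for name in value.split(",") if name.strip()}


def is_ignored(name, ignore_set):
    """Return True if the application/service name is in the exclusion set."""
    return name.strip().lower() in ignore_set


def descriptor(system_name, user_name=None, report_by_user=False,
               report_powered_os=True):
    """Return the name under which measures are reported, or None if skipped.

    report_by_user=True   -> username_on_systemname when a user is logged in
    report_powered_os=False -> systems with no logged-in user are not reported
    """
    if user_name is None and not report_powered_os:
        return None
    if report_by_user and user_name:
        return f"{user_name}_on_{system_name}"
    return system_name


if __name__ == "__main__":
    ignored = parse_ignore_list("notepad.exe, Spooler")
    print(is_ignored("spooler", ignored))
    print(descriptor("win10-01", "alice", report_by_user=True))
    print(descriptor("win10-01", None, report_powered_os=False))
```

With the documented defaults (Report By User set to No, Report Powered OS set to Yes), `descriptor` always returns the plain system name.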
Measurement | Description | Measurement Unit | Interpretation |
---|---|---|---|
Recent application crashes | Indicates the number of application crash events that occurred on this system during the last measurement period. | Number | An event with the ID 1000 is logged in the Application event log every time a program terminates unexpectedly on this system. This measure reports the number of events in the event log with event ID 1000. Use the detailed diagnosis of this measure to know which programs and modules stopped suddenly. |
Recent service crashes | Indicates the number of service crash events that occurred on this system during the last measurement period. | Number | An event with the ID 7031 is logged by the Service Control Manager in the System event log every time a service terminates ungracefully. This measure reports the number of events in the event log with event ID 7031. Use the detailed diagnosis of this measure to know the complete details of such events. |
Recent application hangs | Indicates the number of application hang events that occurred on this system during the last measurement period. | Number | An event with the ID 1002 is logged in the Application event log every time an application hangs. This measure reports the number of events in the event log with event ID 1002. Use the detailed diagnosis of this measure to know the complete details of the recent application hang events. |
Recent service hangs | Indicates the number of service hang events that occurred on this system during the last measurement period. | Number | An event with the ID 7022 is logged by the Service Control Manager in the System event log every time a service hangs. This measure reports the number of events in the event log with event ID 7022. Use the detailed diagnosis of this measure to know the complete details of the recent service hang events. |
Recent service errors | Indicates the number of service errors that occurred on this system during the last measurement period. | Number | Events with the IDs 7023, 7024, and 7026 are logged by the Service Control Manager in the System event log every time a service error occurs. This measure reports the number of events in the event log with these event IDs. Use the detailed diagnosis of this measure to know the complete details of the recent service errors. |
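The mapping from event IDs to measures in the table above can be summarized in a few lines of code. This is a minimal sketch for illustration only, not the test's actual logic: `MEASURE_BY_EVENT_ID` and `count_measures` are hypothetical names, and the input is assumed to be a plain list of event IDs already collected for one measurement period. A real implementation would also need to check the event source, because event IDs are not unique across providers.

```python
# Event ID -> measure name, taken from the measurements table.
MEASURE_BY_EVENT_ID = {
    1000: "Recent application crashes",
    7031: "Recent service crashes",
    1002: "Recent application hangs",
    7022: "Recent service hangs",
    7023: "Recent service errors",
    7024: "Recent service errors",
    7026: "Recent service errors",
}


def count_measures(event_ids):
    """Count events per measure; IDs not in the table are skipped."""
    # Start every measure at zero so quiet periods still report a value.
    counts = {measure: 0 for measure in MEASURE_BY_EVENT_ID.values()}
    for event_id in event_ids:
        measure = MEASURE_BY_EVENT_ID.get(event_id)
        if measure is not None:
            counts[measure] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical IDs seen in one measurement period.
    for measure, count in count_measures([1000, 1000, 7031, 7024, 7026, 42]).items():
        print(f"{measure}: {count}")
```

Initializing every measure to zero means a system with no matching events still yields a value of 0 for each measure rather than no result at all.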