Crash Details Test

Event logs on Windows servers capture critical error conditions such as application and service crashes, application and service hangs, and service errors. Because a crash or slowdown of any mission-critical program or service on a Windows server may affect the uptime of the dependent business services, administrators should be able to capture these problem conditions instantly, investigate why they occurred, and resolve them promptly. This is exactly what the Crash Details test helps administrators achieve. This test periodically scans the event logs on a Windows server and reports the count of crashes, hangs, and errors that occurred on that server during the last measurement period. The detailed diagnostics provided by this test pinpoint the applications/services that crashed, hung, or encountered errors, and thus enable quick and efficient troubleshooting.

Note:

This test will not report metrics on Windows 2000/2003/XP systems.

Target of the test: A Windows host

Agent executing the test: An internal agent

Output of the test: One set of results for the Windows host being monitored.

Configurable parameters for the test
  1. Test period - How often the test should be executed.
  2. Host - The host for which the test is to be configured.
  3. Port - The port at which the specified host listens. By default, this is NULL.
  4. DD FREQUENCY - The frequency with which detailed diagnosis measures are to be generated for this test. The default is 1:1, which indicates that detailed measures will be generated every time this test runs, and also every time the test detects a problem. You can modify this frequency if required. To disable the detailed diagnosis capability for this test, specify none against DD FREQUENCY.
  5. DETAILED DIAGNOSIS - To make diagnosis more efficient and accurate, eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, choose the Off option.

    The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:

    • The eG manager license should allow the detailed diagnosis capability.
    • Neither the normal nor the abnormal frequency configured for the detailed diagnosis measures should be 0.
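
The DD FREQUENCY value and the two availability conditions above can be illustrated with a short Python sketch. This is not eG Enterprise code. The names DDFrequency, parse_dd_frequency, and dd_toggle_available are hypothetical, and the sketch assumes that the value has the form normal:abnormal (in that order) and that a setting of none also makes the On/Off option unavailable.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DDFrequency:
    """Hypothetical holder for a DD FREQUENCY value such as 1:1."""
    normal: int    # assumed: frequency used while the test reports no problem
    abnormal: int  # assumed: frequency used while the test detects a problem


def parse_dd_frequency(value: str) -> Optional[DDFrequency]:
    """Parse 'normal:abnormal'; return None when the value is 'none'."""
    text = value.strip()
    if text.lower() == "none":
        return None  # detailed diagnosis disabled for this test
    normal_text, separator, abnormal_text = text.partition(":")
    if not separator:
        raise ValueError(f"expected 'normal:abnormal' or 'none', got {value!r}")
    normal, abnormal = int(normal_text), int(abnormal_text)
    if normal < 0 or abnormal < 0:
        raise ValueError("frequencies must not be negative")
    return DDFrequency(normal, abnormal)


def dd_toggle_available(license_allows_dd: bool,
                        frequency: Optional[DDFrequency]) -> bool:
    """Apply the two conditions listed above for the On/Off option."""
    if frequency is None:
        # Assumption: 'none' is treated like a frequency of 0.
        return False
    return license_allows_dd and frequency.normal != 0 and frequency.abnormal != 0
```

With the default value, dd_toggle_available(True, parse_dd_frequency("1:1")) evaluates to True, while a value of 0:1 or a license without detailed diagnosis yields False.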
Measurements made by the test

Measurement: Recent application crashes
Description: Indicates the number of application crash events that occurred during the last measurement period.
Measurement unit: Number
Interpretation: An event with the ID 1000 is logged in the Application event log every time a program terminates unexpectedly on a Windows server. This measure reports the number of events in the event log with event ID 1000. Use the detailed diagnosis of this measure to know which programs and modules stopped suddenly.

Measurement: Recent service crashes
Description: Indicates the number of service crash events that occurred during the last measurement period.
Measurement unit: Number
Interpretation: An event with the ID 7031 is logged by the Service Control Manager in the System event log every time a service terminates ungracefully. This measure reports the number of events in the event log with event ID 7031. Use the detailed diagnosis of this measure to know the complete details of such events.

Measurement: Recent application hangs
Description: Indicates the number of application hang events that occurred during the last measurement period.
Measurement unit: Number
Interpretation: An event with the ID 1002 is logged in the Application event log every time an application hangs. This measure reports the number of events in the event log with event ID 1002. Use the detailed diagnosis of this measure to know the complete details of the recent application hang events.

Measurement: Recent service hangs
Description: Indicates the number of service hang events that occurred during the last measurement period.
Measurement unit: Number
Interpretation: An event with the ID 7022 is logged by the Service Control Manager in the System event log every time a service hangs. This measure reports the number of events in the event log with event ID 7022. Use the detailed diagnosis of this measure to know the complete details of the recent service hang events.

Measurement: Recent service errors
Description: Indicates the number of service errors that occurred during the last measurement period.
Measurement unit: Number
Interpretation: Events with the IDs 7023, 7024, and 7026 are logged by the Service Control Manager in the System event log every time a service error occurs. This measure reports the number of events in the event log with these event IDs. Use the detailed diagnosis of this measure to know the complete details of the recent service errors.
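
The relationship between the event IDs and the measures described above can be sketched in Python as follows. This is only an illustration, not the way the eG agent is implemented. EventRecord, MEASURE_BY_EVENT_ID, and count_recent_events are hypothetical names, and the events are assumed to have already been read from the event logs. A real reader would probably also filter by log and event source, since an event ID alone is not unique across sources.

```python
from datetime import datetime, timedelta
from typing import Dict, Iterable, NamedTuple


class EventRecord(NamedTuple):
    """Hypothetical, simplified view of one event log entry."""
    event_id: int
    logged_at: datetime


# Event IDs taken from the measure descriptions above.
MEASURE_BY_EVENT_ID = {
    1000: "Recent application crashes",
    7031: "Recent service crashes",
    1002: "Recent application hangs",
    7022: "Recent service hangs",
    7023: "Recent service errors",
    7024: "Recent service errors",
    7026: "Recent service errors",
}


def count_recent_events(events: Iterable[EventRecord],
                        now: datetime,
                        period: timedelta) -> Dict[str, int]:
    """Count events per measure that were logged within the last period."""
    # Start every measure at 0 so that quiet periods still report a value.
    counts = {measure: 0 for measure in MEASURE_BY_EVENT_ID.values()}
    window_start = now - period
    for event in events:
        measure = MEASURE_BY_EVENT_ID.get(event.event_id)
        if measure is not None and window_start < event.logged_at <= now:
            counts[measure] += 1
    return counts
```

Events with other IDs, and events logged before the start of the measurement period, are ignored, so each measure reflects only the most recent period.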