DB2 DPF Backup Status Test

This test auto-discovers the logical partitions on the target DB2 database serverand for each logical partition, this test reports the statistics pertaining to the backups performed. Using this test, administrators can figure out the count of backups that failed and the count of backup jobs that were completed successfully. In addition, this test reports the maximum time taken to complete the backup jobs on each logical partition. By closely monitoring the measures reported by this test, administrators can pin point the logical partition on which maximum number of backup jobs failed and further analyze the reason behind such failures.

Target of the test : A DB2 database server with DPF enabled

Agent deploying the test : An internal agent

Outputs of the test : One set of results for every logical partition of each database on the DB2 database server

Configurable parameters for the test
  1. TEST PERIOD – How often should the test be executed
  2. HostThe IP address of the DB2 server
  3. PortThe port number through which the DB2 server communicates. The default port is 50000.
  4. user - Specify the name of the user who has any of the following privileges to the specified DATABASE: SYSADM or SYSCTRL or SYSMAINT or SYSMON. You can create a separate user on the OS hosting the DB2 server for this purpose, and assign any of the aforesaid privileges to that user. The steps for the same are detailed in the Creating a Special User for Monitoring DB2.
  5. password - Enter the password of the specified USER in the PASSWORD text box.
  6. confirm password – Confirm the password by retyping it here.
  7. database - Specify the name of the database on the monitored DB2 server to be used by this test
  8. DETAILED DIAGNOSIS - To make diagnosis more efficient and accurate, the eG Enterprise suite embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option.The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:The eG manager license should allow the detailed diagnosis capabilityBoth the normal and abnormal frequencies configured for the detailed diagnosis measures should not be 0.
Measurements made by the test

Measurement

Description

Measurement Unit

Interpretation

Total backup

Indicates the total number of backups performed for this logical partition during the last measurement period.

Number

This measure is a good indicator of backup load on the logical partition.

Completed backup

Indicates the number of backups completed successfully for this logical partition during the last measurement period.

Number

The detailed diagnosis of this measure lists the DB partition number EID, Start time, End Time, Duration(Minutes), Backup ID, Comment text, Number of tablespaces, Tablespace Names, Operation type, Object type, Location, Device type, SQL code, SQL warn and SQL state.

Failed backup

Indicates the number of backups failed for this logical partition during the last measurement period.

Number

Ideally, the value of this measure should be zero.

Any value greater than zero, is a cause of concern, as it indicates the existence of a failed job. To know which job(s) has failed, use the detailed diagnosis capability of this measure.

The detailed diagnosis of this measure lists the DB partition number EID, Start time, End Time, Duration(Minutes), Backup ID, Comment text, Number of tablespaces, Tablespace Names, Operation type, Object type, Location, Device type, SQL code, SQL warn and SQL state.

Compare the value of this measure across the logical partitions to figure out the logical partition on which the maximum number of backup jobs failed.

Maximum duration of completed backup

Indicates the maximum time taken to complete the backup jobs on this logical partition during the last measurement period.

Minutes

Compare the value of this measure across logical partitions to figure out the logical partition on which the backup took too long to be completed.