Maria Cluster Availability Test
This test monitors the overall availability and responsiveness of a Maria database cluster. It validates the ability of the cluster to accept connections, process queries, and return results within acceptable response times. By tracking connection availability, connection latency, query processor status, query execution time, number of records returned, and average response time, the test provides administrators with insights into both availability and performance. This enables the early detection of connectivity issues, slow responses, or query failures that could impact database reliability and user experience.
Target of the test : A MariaDB Cluster
Agent deploying the test : An external agent
Outputs of the test : One set of results for each node on the MariaDB Cluster being monitored.
| Parameter | Description |
|---|---|
|
Test Period |
How often should the test be executed. |
|
Host |
The IP address of the MariaDB Cluster. |
|
Port |
The port on which the server is listening. By default, this is set to 3306. |
|
Database |
Specify the name of the database that is to be monitored on the target MariaDB Cluster. |
|
User and Password |
The eG agent has to be configured with the credentials of a user who has server-wide PROCESS and SELECT privileges on the monitored MariaDB Cluster. To know how to create such a user, refer to Configuring the eG Agent with Access Privileges |
|
Confirm Password |
Confirm the Password (if any) by retyping it here. |
|
SSL |
This indicates that the eG agent will communicate with the MariaDB Cluster via HTTPS or not. By default, this flag is set to No, as the target Maria database is not SSL-enabled by default. If the target cluster is SSL-enabled, then set this flag to Yes. |
|
Verify CA |
If the eG agent is required to establish an encrypted connection with the target MariaDB Cluster by authenticating the server's identity through verifying the server CA certificate, set Verify CA flag to Yes. By default, this flag is set to No. |
|
Available Nodes |
In the Available Nodes text box, provide a comma-separated list of all the available nodes to be included for monitoring. This way, the test monitor and collect metrics from all the available nodes in the cluster. By default, this parameter is set to none. The format of this configuration is: HOSTNAME:PORT, for example, 172.16.8.136:3306,172.16.8.139:3306 |
|
Display Nodes |
If you want the test to report metrics individually for each node configured in the Available Nodes parameter then set Display Nodes value to Yes. In this case, each node in the cluster will be a descriptor of this test. By default, this value is set to No, indicating that this test reports metrics for only a Summary descriptor revealing the database connection availability of the cluster. |
|
Detailed Diagnosis |
To make diagnosis more efficient and accurate, the eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option. The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled:
|
|
Measurement |
Description |
Measurement Unit |
Interpretation |
|---|---|---|---|
|
Database connection availability |
Indicates whether the database connection to this node is available or not. For the Summary descriptor this measure reveals the database connection availability of the target cluster. |
Percent |
If this measure reports the value 100, it indicates that the database connection is available. The value 0 on the other hand indicates that the database connection is unavailable. A connection to the database may be unavailable if the database is down or if the database is listening on a port other than the one configured for it in the eG manager or owing to a poor network link. If the Node availability measure reports the value 0, then, you can check the value of this measure to determine whether/not it is due to the unavailability of a connection to the server. |
|
Database connection response time |
Indicates the time taken to connect to the cluster node. |
Seconds |
This measure is not reported for Summary descriptor. A high value could indicate a connection bottleneck. Whenever the Node response time measure soars, you may want to check the value of this measure to determine whether a connection latency is causing the poor responsiveness of the node. |
|
Query processor availability |
Indicates whether the query to this node is executed successfully or not. |
Percent |
This measure is not reported for Summary descriptor. If this measure reports the value 100, it indicates that the query executed successfully. The value 0 on the other hand indicates that the query failed. In the event that the Node availability measure reports the value 0, check the value of this measure to figure out whether the failed query is the reason why that measure reported a node unavailability. |
|
Query execution time |
Indicates the time taken for query execution on this node. |
Seconds |
This measure is not reported for Summary descriptor. A high value could indicate that one/more queries to the node are taking too long to execute. Inefficient/badly designed queries to the database often take too long to execute. If the value of this measure is higher than that of the Connection time measure, you can be rest assured that long running queries are causing the node to respond slowly to requests. |
|
Number of records |
Indicates the number of records fetched from the database for this node. |
Number |
This measure is not reported for Summary descriptor. The value 0 indicates that no records are fetched from the database. |
|
Average Response Time |
Indicates the average time taken by the cluster to respond to a query. |
Seconds |
This measure is reported only for the Summary descriptor. A sudden increase in response time is indicative of a potential performance bottleneck on the database cluster. |