Maria Cluster Member Status Test
A MariaDB Cluster is built on multiple database nodes that work together using Galera synchronous replication. In this setup, each node can accept queries, and updates made on one node are immediately replicated to others to ensure consistency. Nodes can operate in primary or secondary roles, and their health directly affects the overall stability of the cluster. If a node goes into read-only mode, becomes unsynced, or gets disconnected, it can disrupt workload distribution, delay transactions, or even threaten data reliability. Issues such as nodes failing to sync, frequent role switches, uneven distribution of read/write nodes, or an increase in disconnected or joining nodes can all degrade cluster performance and availability.
This test tracks critical parameters like node roles, health state, number of synced/unsynced nodes, and the availability of read-write vs. read-only nodes. By doing so, it helps administrators detect early warnings, isolate failing nodes, and ensure that the MariaDB Cluster remains balanced, resilient, and performant.
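How the test gathers these values internally is not described here. As a rough sketch only, a node's state and access mode could be derived from Galera's standard status variables (`wsrep_connected`, `wsrep_local_state`) and the `read_only` system variable. The function name `classify_node`, the input dictionaries, and the returned keys are hypothetical and are not part of the eG agent. Node roles are left out because this page does not say how the test determines them.

```python
# Numeric codes reported by Galera in the wsrep_local_state status variable.
WSREP_LOCAL_STATES = {1: "Joining", 2: "Donor/Desynced", 3: "Joined", 4: "Synced"}


def classify_node(status, variables):
    """Classify one node (hypothetical helper, not the eG agent's logic).

    status    -- dict built from SHOW GLOBAL STATUS LIKE 'wsrep_%'
    variables -- dict built from SHOW GLOBAL VARIABLES LIKE 'read_only'
    """
    connected = str(status.get("wsrep_connected", "OFF")).upper() == "ON"
    state_code = int(status.get("wsrep_local_state", 0))
    if connected:
        state = WSREP_LOCAL_STATES.get(state_code, "Unknown")
    else:
        # Assumption: a node without a cluster connection counts as disconnected,
        # whatever local state it last reported.
        state = "Disconnected"
    read_write = str(variables.get("read_only", "OFF")).upper() == "OFF"
    return {"connected": connected, "state": state, "read_write": read_write}
```

A caller would fill the two dictionaries from a database connection to each node and pass them in, which keeps the classification logic testable without a live cluster.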
Target of the test : A MariaDB Cluster
Agent deploying the test : An external agent
Outputs of the test : One set of results for each node on the MariaDB Cluster being monitored.
| Parameter | Description |
|---|---|
| Test Period | How often should the test be executed. |
| Host | The IP address of the MariaDB Cluster. |
| Port | The port on which the server is listening. By default, this is set to 3306. |
| Database | Specify the name of the database that is to be monitored on the target MariaDB Cluster. |
| User and Password | The eG agent has to be configured with the credentials of a user who has server-wide PROCESS and SELECT privileges on the monitored Maria Cluster database server. To know how to create such a user, refer to Configuring the eG Agent with Access Privileges. |
| Confirm Password | Confirm the Password (if any) by retyping it here. |
| SSL | Indicates whether or not the eG agent should communicate with the Maria cluster over SSL. By default, this flag is set to No, as the target Maria database is not SSL-enabled by default. If the target cluster is SSL-enabled, then set this flag to Yes. |
| Verify CA | If the eG agent is required to establish an encrypted connection with the target MariaDB Cluster and to authenticate the server's identity by verifying the server CA certificate, set the Verify CA flag to Yes. By default, this flag is set to No. |
| Available Nodes | In the Available Nodes text box, provide a comma-separated list of all the available nodes to be included for monitoring. This way, the test monitors and collects metrics from all the available nodes in the cluster. By default, this parameter is set to none. Each entry takes the format HOSTNAME:PORT, for example, 172.16.8.136:3306,172.16.8.139:3306 |
| Detailed Diagnosis | To make diagnosis more efficient and accurate, eG Enterprise embeds an optional detailed diagnostic capability. With this capability, the eG agents can be configured to run detailed, more elaborate tests as and when specific problems are detected. To enable the detailed diagnosis capability of this test for a particular server, choose the On option. To disable the capability, click on the Off option. The option to selectively enable/disable the detailed diagnosis capability will be available only if the following conditions are fulfilled: |
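To illustrate the HOSTNAME:PORT format expected by the Available Nodes parameter, the following is a minimal sketch of how such a value could be validated before it is entered. The function `parse_available_nodes` is hypothetical and is not part of the eG agent. It assumes that the default value none means that no additional nodes are listed.

```python
def parse_available_nodes(value):
    """Parse 'HOST:PORT,HOST:PORT' into a list of (host, port) tuples.

    Hypothetical helper for checking the Available Nodes value.
    """
    text = value.strip()
    # Assumption: the default value 'none' (or an empty box) means no nodes are listed.
    if not text or text.lower() == "none":
        return []
    nodes = []
    for entry in text.split(","):
        entry = entry.strip()
        # Split on the last colon so the port is always the final component.
        host, sep, port = entry.rpartition(":")
        if not sep or not host or not port.isdigit():
            raise ValueError(f"expected HOSTNAME:PORT, got {entry!r}")
        nodes.append((host, int(port)))
    return nodes
```

For example, `parse_available_nodes("172.16.8.136:3306,172.16.8.139:3306")` yields two `(host, port)` pairs, while an entry without a port raises `ValueError`.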
| Measurement | Description | Measurement Unit | Interpretation |
|---|---|---|---|
| Status | Indicates the current status of this cluster node. | | This measure is not reported for the Summary descriptor. The values reported by this measure and their numeric equivalents are available in the table below. Note: This measure reports the Measure Values listed in the table above to indicate the status of this node. However, in the graph, this measure is indicated using the Numeric Values listed in the table above. |
| Is this primary node? | Indicates whether or not this node is the primary node in the cluster. | | This measure is not reported for the Summary descriptor. The values reported by this measure and their numeric equivalents are available in the table below. Note: This measure reports the Measure Values listed in the table above to indicate whether/not the node is a primary node. However, in the graph, this measure is indicated using the Numeric Values listed in the table above. |
| Has the role of this node changed? | Indicates whether or not this node has switched roles (e.g., primary to secondary or vice versa). | | This measure is not reported for the Summary descriptor. The values reported by this measure and their numeric equivalents are available in the table below. Note: This measure reports the Measure Values listed in the table above to indicate whether/not the role of the node has switched. However, in the graph, this measure is indicated using the Numeric Values listed in the table above. Use the detailed diagnosis of this measure to find out the Host, Current state, Previous state, Port, and Access mode. |
| Is this node in read-write mode? | Indicates whether or not this node is currently operating in read-write mode. | | This measure is not reported for the Summary descriptor. The values reported by this measure and their numeric equivalents are available in the table below. Note: This measure reports the Measure Values listed in the table above to indicate whether/not the node is in read-write mode. However, in the graph, this measure is indicated using the Numeric Values listed in the table above. |
| Total number of nodes | Indicates the total number of nodes configured in the cluster. | Number | This measure is reported only for the Summary descriptor. A sudden drop in the total number of nodes may indicate node failure or connectivity issues. |
| Primary count | Indicates the number of primary nodes currently available in the cluster. | Number | This measure is reported only for the Summary descriptor. Use the detailed diagnosis of this measure to find out the Host, Current state, Previous state, Port, and Access mode. |
| Secondary count | Indicates the number of nodes currently serving as secondary nodes in the cluster. | Number | This measure is reported only for the Summary descriptor. Use the detailed diagnosis of this measure to find out the Host, Current state, Previous state, Port, and Access mode. |
| Disconnected count | Indicates the number of nodes currently disconnected from the cluster. | Number | This measure is reported only for the Summary descriptor. A high count may affect data replication and cluster reliability. Use the detailed diagnosis of this measure to find out the Host, Current state, Previous state, Port, and Access mode. |
| Synced nodes | Indicates the number of nodes that are fully synchronized with the primary node. | Number | This measure is reported only for the Summary descriptor. A low number of synced nodes may result in stale reads or replication delays. |
| Joined nodes | Indicates the number of nodes that have successfully joined the cluster and are operational. | Number | This measure is reported only for the Summary descriptor. |
| Donor nodes | Indicates the number of nodes acting as donors during state transfers. | Number | This measure is reported only for the Summary descriptor. |
| Waiting nodes | Indicates the number of nodes currently waiting to join or resync with the cluster. | Number | This measure is reported only for the Summary descriptor. |
| Joining nodes | Indicates the number of nodes currently in the process of joining the cluster. | Number | This measure is reported only for the Summary descriptor. |
| Initialized nodes | Indicates the number of nodes that are initialized and ready but not yet joined to the cluster. | Number | This measure is reported only for the Summary descriptor. |
| Non-joined nodes | Indicates the number of nodes that failed to join the cluster. | Number | This measure is reported only for the Summary descriptor. |
| Read-write nodes | Indicates the number of nodes that are accepting both read and write operations in the cluster. | Number | This measure is reported only for the Summary descriptor. A drop in the value of this measure may cause transactional delays or write contention. |
| Read-only nodes | Indicates the number of nodes restricted to read-only operations in the cluster. | Number | This measure is reported only for the Summary descriptor. A higher count helps with query load balancing. A sudden drop may affect reporting performance. |
| Health state | Indicates the current health state of this cluster. | | This measure is reported only for the Summary descriptor. The values reported by this measure and their numeric equivalents are available in the table below. Note: By default, this measure reports the current health of the cluster. The graph of this measure, however, is represented using the numeric equivalents only, that is, 0 to 2. A status other than healthy could impact cluster performance or availability. |
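The Summary descriptor measures above are counts taken across the per-node results. As an illustration only, the sketch below rolls per-node records up into a few of these counts. The record keys (`state`, `read_write`) and the state labels are assumptions made for this example, and the mapping of Galera's Donor/Desynced state to the Donor nodes measure is also an assumption. The test's actual logic is not documented on this page.

```python
from collections import Counter


def summarize(nodes):
    """Roll per-node records up into Summary-style counts (hypothetical helper).

    nodes -- list of dicts with assumed keys 'state' (str) and 'read_write' (bool)
    """
    states = Counter(node["state"] for node in nodes)
    read_write = sum(1 for node in nodes if node["read_write"])
    return {
        "Total number of nodes": len(nodes),
        "Synced nodes": states["Synced"],
        "Joined nodes": states["Joined"],
        "Joining nodes": states["Joining"],
        "Donor nodes": states["Donor/Desynced"],  # assumed mapping
        "Disconnected count": states["Disconnected"],
        "Read-write nodes": read_write,
        "Read-only nodes": len(nodes) - read_write,
    }


if __name__ == "__main__":
    sample = [
        {"state": "Synced", "read_write": True},
        {"state": "Synced", "read_write": False},
        {"state": "Disconnected", "read_write": False},
    ]
    for name, value in summarize(sample).items():
        print(f"{name}: {value}")
```

Comparing such counts between two consecutive measurement periods is one way to notice the sudden drops that the Interpretation column warns about.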