Monitoring Couchbase Cluster
Monitoring a Couchbase Cluster helps ensure performance, detect issues early, maintain data integrity, support scalability, and enable secure, reliable, and efficient database operations across distributed systems. The layer model used for monitoring is shown in Figure 1.
Figure 1: Couchbase Cluster layer model
Each layer of Figure 1 is mapped to tests that report a wide variety of metrics related to Couchbase Cluster health. Using these metrics, administrators can find quick and accurate answers to the following performance queries:
- Are all nodes in the cluster online and reachable?
- Is the cluster balanced with no pending rebalance operations?
- Are any nodes experiencing high CPU or memory usage?
- Is the disk usage within safe thresholds on all nodes?
- Are all services (Data, Index, Query, etc.) running as expected on their assigned nodes?
- Is there any replication or XDCR lag reported for any bucket?
- Are there any failed-over nodes that haven’t been recovered?
- Are all indexes in a healthy and online state?
- Are N1QL queries responding within acceptable latency levels?
- Are there any frequent or recent auto-failover events in the cluster?
- Is the current number of connections within the acceptable limits for the workload?
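
A few of the questions above (node reachability, failed-over nodes, CPU pressure, rebalance state) can also be spot-checked by hand against Couchbase's own cluster REST endpoint. The following is a minimal sketch, not the way the tests in this model collect their metrics. It assumes the JSON returned by `GET /pools/default` on the cluster admin port carries per-node `status`, `clusterMembership`, and `systemStats.cpu_utilization_rate` fields plus top-level `balanced` and `rebalanceStatus` fields; field names can vary by Couchbase version, so verify them against your release. The function names, the 85% CPU limit, and the `SAMPLE` payload are hypothetical and exist only for illustration.

```python
from __future__ import annotations

import base64
import json
import urllib.request
from typing import Any


def fetch_pool(base_url: str, user: str, password: str) -> dict[str, Any]:
    """Fetch cluster details from <base_url>/pools/default using HTTP Basic auth.

    Hypothetical helper: base_url is assumed to look like http://host:8091.
    """
    token = base64.b64encode(f"{user}:{password}".encode()).decode()
    req = urllib.request.Request(
        f"{base_url}/pools/default",
        headers={"Authorization": f"Basic {token}"},
    )
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.load(resp)


def summarize_cluster(pool: dict[str, Any], cpu_limit: float = 85.0) -> dict[str, Any]:
    """Answer a few health questions from a parsed /pools/default payload.

    cpu_limit is an illustrative threshold (percent), not a recommended value.
    """
    nodes = pool.get("nodes", [])
    # Nodes whose reported status is anything other than "healthy".
    unhealthy = [n.get("hostname") for n in nodes if n.get("status") != "healthy"]
    # Nodes that were failed over and not yet recovered or removed.
    failed_over = [
        n.get("hostname")
        for n in nodes
        if n.get("clusterMembership") == "inactiveFailed"
    ]
    # Nodes whose CPU utilization exceeds the chosen limit.
    high_cpu = [
        n.get("hostname")
        for n in nodes
        if n.get("systemStats", {}).get("cpu_utilization_rate", 0.0) > cpu_limit
    ]
    return {
        "all_nodes_healthy": not unhealthy,
        "unhealthy_nodes": unhealthy,
        "failed_over_nodes": failed_over,
        "high_cpu_nodes": high_cpu,
        "balanced": bool(pool.get("balanced", False)),
        "rebalance_running": pool.get("rebalanceStatus", "none") != "none",
    }


# Hand-written sample payload (not real cluster output) for offline use.
SAMPLE = {
    "balanced": False,
    "rebalanceStatus": "none",
    "nodes": [
        {"hostname": "a:8091", "status": "healthy", "clusterMembership": "active",
         "systemStats": {"cpu_utilization_rate": 12.5}},
        {"hostname": "b:8091", "status": "unhealthy", "clusterMembership": "inactiveFailed",
         "systemStats": {"cpu_utilization_rate": 0.0}},
        {"hostname": "c:8091", "status": "healthy", "clusterMembership": "active",
         "systemStats": {"cpu_utilization_rate": 91.0}},
    ],
}

summary = summarize_cluster(SAMPLE)
```

Keeping the evaluation in `summarize_cluster` separate from the network call in `fetch_pool` lets the checks be exercised against a saved payload without a live cluster. Against a real cluster, the same summary would come from `summarize_cluster(fetch_pool(base_url, user, password))`.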