Monitoring Couchbase Cluster

Monitoring Couchbase Cluster ensures performance, detects issues early, maintains data integrity, supports scalability, and enables secure, reliable, and efficient database operations across distributed systems, refer .

Figure 1 : Couchbase Cluster layer model

 

Each layer of is mapped to tests that report a wide variety of metrics related to Couchbase Cluster health. Using these metrics, administrators can find quick and accurate answers to the following performance queries:

  1. Are all nodes in the cluster online and reachable?

  2. Is the cluster balanced with no pending rebalance operations?

  3. Are any nodes experiencing high CPU or memory usage?

  4. Is the disk usage within safe thresholds on all nodes?

  5. Are all services (Data, Index, Query, etc.) running as expected on their assigned nodes?

  6. Is there any replication or XDCR lag reported for any bucket?

  7. Are there any failed over nodes that haven’t been recovered?

  8. Are all indexes in a healthy and online state?

  9. Are N1QL queries responding within acceptable latency levels?

  10. Are there any frequent or recent auto-failover events in the cluster?

  11. Is the current number of connections within the acceptable limits for the workload?