AWS MSK Cluster Test

AWS MSK Cluster uses Amazon EBS volume which is a durable, block-level storage device that you can attach to your instances. After you attach a volume to an instance, you can use it as you would use a physical hard drive. EBS volumes are flexible. Amazon EBS volume types that use burst performance, such as gp2, st1, and sc1, have a baseline performance depending on volume size. If your workload is driving I/O traffic to one of these volume types beyond its baseline performance, then burst credit gets spent.

This test helps the administrators to monitor whether/not the input-output burst balance is running out of credits for EBS volumes in the cluster such that the volume stops and no further reads or writes will occur until more credits are earned.

Target of the test : AWS Managed Service Kafka

Agent deploying the test : A remote agent

Outputs of the test : One set of results for each cluster executing in the target AWS Managed Service Kafka server.

Configurable parameters for the test
Parameter Description

Test Period

How often should the test be executed.

Host

The IP address of the AWS Managed Service Kafka Broker that is being monitored.

Port

Specify the port number at which the specified HOST listens. By default, this is NULL.

AWS Default Region

This test uses AWS CLI to interact with AWS Managed Service Kafka and pull relevant metrics. To enable the test to connect to AWS, you need to configure the test with the name of the region to which all requests for metrics should be routed, by default. Specify the name of this AWS Default Region, here.

AWS Access Key ID, AWS Secret Access Key and Confirm Password

To monitor AWS Managed Service Kafka, the eG agent has to be configured with the access key and secret key of a user with a valid AWS account. For this purpose, we recommend that you create a special user on the AWS cloud, obtain the access and secret keys of this user, and configure this test with these keys. The procedure for this has been detailed in the Obtaining an Access key and Secret key topic. Make sure you reconfirm the access and secret keys you provide here by retyping it in the corresponding Confirm Password text box.

Timeout Seconds

Specify the maximum duration (in seconds) for which the test will wait for a response from the server. The default is 10 seconds.

Measurements made by the test
Measurement Description Measurement Unit Interpretation

Remaining burst balance

Indicates the remaining balance of input-output burst credits for EBS volumes in the cluster.

Number

Burst Balance is not reported for EBS volumes when the baseline performance of a volume is higher than the maximum burst performance.

Messages in

Indicates the rate at which the messages are incoming to the broker.

Messages/Sec

 

Average data in

Indicates the average rate at which data sent from producers is consumed by the broker.

MB

 

Average data out

Indicates the average rate at which data is fetched and read from the broker.

MB

 

Percentage of disk space used for APP logs

Indicates the percentage of disk space used for application logs.

Percent

 

Leader count

Indicates the number of replicas for which this broker is the leader.

Number

 

Partition count

Indicates the number of partitions in the broker.

Number

 

Request data

Indicates the amount of data received during the last measurement period.

MB

 

Request time

Indicates the total amount of time taken for reading the request and sending the result.

Seconds

 

Percentage of root disk used

Indicates the percentage of the root disk used by the broker.

Percent

 

Swap free

Indicates the size of swap memory that is available for the broker.

MB

 

Swap used

Indicates the size of swap memory that is in use for the broker.

MB

 

Under minISR partitions

Indicates the number of partitions under the minimum In-Sync Replica (ISR) count.

Number

The ISR count indicates the set of replicas that are up-to-date with the leader.

Under replicated partitions

Indicates the number of partitions that have not been fully replicated in the follower replicas.

Number

 

Inbound aggregate bandwidth exceeded

Indicates the number of packets shaped because the inbound aggregate bandwidth exceeded the maximum for the broker.

Number

 

Outbound aggregate bandwidth exceeded

Indicates the number of packets shaped because the outbound aggregate bandwidth exceeded the maximum for the broker.

Number

 

Packets shaped because the bidirectional PPS exceed

Indicates the number of packets shaped because the bidirectional PPS exceeded the maximum for the broker.

Number

 

Percentage of idle request handler threads

Indicates the average percentage of the time the request handler threads are idle.

Percent