Kong Load Test

Load refers to the amount of computational work the gateway is handling at any given time. This includes processing incoming API requests, executing plugins, managing routing logic, handling authentication, logging, and rate limiting. Load is typically measured through system metrics like CPU usage, memory consumption, and request rate per second. High load can indicate increased traffic or inefficient configurations, potentially leading to latency, timeouts, or service disruption. Monitoring load helps administrators ensure the gateway scales effectively, maintains performance, and avoids resource exhaustion. Balancing load across multiple nodes also supports high availability and system reliability in production environments.

Monitoring Kong API Load provides insights into system performance and traffic patterns. It helps identify spikes, bottlenecks, or inefficient plugin behavior that could degrade API responsiveness. By tracking load, admins can proactively scale infrastructure, optimize configurations, and maintain high availability. It also aids in capacity planning and troubleshooting, ensuring the gateway handles requests efficiently without overloading resources, ultimately supporting a stable and reliable API delivery environment.

This test reports key metrics related to API load which are instrumental in identifying the issues and help admins plan and handle the load.

Target of the test : A Kong API Gateway

Agent deploying the test : An internal/remote agent

Outputs of the test : One set of results for the target Kong API Server being monitored.

Configurable parameters for the test

Parameter

Description

Test Period

How often should the test be executed.

Host

The IP address of the host for which this test is to be configured.

Port

Specify the port at which the specified Host listens. By default, this is 8001.

SSL Enabled

If the Kong API Gateway uses the HTTPS protocol, set the SSL ENABLED test parameter to Yes. By default, this parameter is set to No.

Measurements made by the test

Measurement

Description

Measurement Unit

Interpretation

Kong request rate

Indicates the number of requests received by this Kong API Gateway per minute.

Requests/min

Number of requests received by Kong Gateway per minute for a specific service and route during the last measurement period.

Processing time

Indicates the average processing time taken by this Kong API Gateway for each request.

Milliseconds/request

Measures the time Kong takes to process each request internally, excluding backend and network latency.

Upstream request rate

Indicates the number of requests forwarded to the upstream by this Kong API Gateway per minute

Requests/min

Number of backend (upstream) requests made per minute by Kong for a specific service and route during the last measurement period.

Upstream response time

Indicates the average response time from the upstream service for each request passing through this Kong API Gateway.

Milliseconds/request

Measures the time taken by the backend service (upstream) to respond to a request forwarded by Kong.

Total response time

Indicates the average response time for each request passing through this Kong API Gateway.

Milliseconds/request

Represents the average total time taken to serve each request. This includes Kong’s processing time, upstream (backend) response time, and network delay.

Network delay

Indicates the average network delay per request, for each request passing through this Kong API Gateway.

Milliseconds/request

Measures the average time lost due to network latency when communicating between Kong and the client or upstream.