Latency-Based Loadbalancing in Upstreams

Ross_Sbriscia · August 20, 2018, 3:30pm

Hi Kong Team,

Currently, load balancing across the targets of an upstream uses a weighted round-robin pattern.

It would be very beneficial for certain deployment configurations (particularly multi-DC) if we could support load balancing based on data center or (preferably) healthcheck ping latency.

What I suggest is to add a new load-balancing option to Kong in which a node locally caches the most recent healthcheck ping response time for each target in an upstream and routes exclusively to the target with the lowest response time. This would let users customize their routing considerably, since the behavior would largely be determined by the frequency and concurrency of healthchecks.
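To make the proposal concrete, here is a minimal sketch of the caching and selection step. It is written in Python purely for illustration (Kong itself is built on Lua), and the class name `LatencyCache` and its methods are hypothetical, not part of Kong or its healthcheck library.

```python
class LatencyCache:
    """Hypothetical per-node cache: most recent healthcheck latency per target."""

    def __init__(self):
        self._latest = {}  # target -> latency in milliseconds

    def record(self, target, latency_ms):
        # A newer probe result simply overwrites the previous one.
        self._latest[target] = latency_ms

    def pick(self):
        """Return the target with the lowest cached latency, or None if empty."""
        if not self._latest:
            return None
        return min(self._latest, key=self._latest.get)


cache = LatencyCache()
cache.record("10.0.0.1:8000", 42.0)
cache.record("10.0.1.1:8000", 7.5)
cache.record("10.0.0.1:8000", 5.0)  # later probe replaces the 42.0 ms reading
print(cache.pick())  # 10.0.0.1:8000
```

Because only the latest probe is kept, a single slow or fast healthcheck can flip all traffic to another target, which is why the healthcheck frequency matters so much in this scheme.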

I’d love some guidance from the Kong team on where to start looking to add some of this functionality!

Tieske · August 21, 2018, 10:49am

Yes, that is absolutely an interesting feature to add, but not an easy one to implement.

There are multiple components involved in this whole chain.

The healthcheck library would need to track latencies.
Kong would need to use the latencies to update weights in the LB.

Some things that require attention:

Currently, weight = 0 means the target is not added to the LB.
Targets need to be tracked by the name they were added under (a single name might add multiple IP addresses to the balancer; there is a related issue with the healthchecks currently).
How many datapoints should be used for the latency calculation? Incidental latency spikes might otherwise have a big impact.
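One possible way to address the weight and datapoint concerns above is sketched here in Python for illustration only. The names `WINDOW`, `MAX_WEIGHT`, and `latency_weights` are hypothetical, and using the median of a small window is an assumption chosen to damp isolated spikes, not something Kong does today.

```python
from collections import deque
from statistics import median

WINDOW = 5        # hypothetical: number of recent probes kept per target
MAX_WEIGHT = 100  # hypothetical: weight given to the fastest target


def latency_weights(samples):
    """Map {target: recent latencies in ms} to integer balancer weights."""
    # The median ignores a single outlier in the window; the 0.1 ms floor
    # avoids dividing by zero for unrealistically fast probes.
    typical = {t: max(median(s), 0.1) for t, s in samples.items() if s}
    if not typical:
        return {}
    fastest = min(typical.values())
    # Weight is inversely proportional to latency. The floor of 1 keeps slow
    # targets in rotation, since weight 0 would drop them from the balancer.
    return {t: max(1, round(MAX_WEIGHT * fastest / lat))
            for t, lat in typical.items()}


samples = {
    "a": deque([10, 11, 9, 250, 10], maxlen=WINDOW),  # one spike at 250 ms
    "b": deque([40, 42, 38, 41, 39], maxlen=WINDOW),
    "c": deque([5000, 5100, 4900, 5000, 5000], maxlen=WINDOW),
}
print(latency_weights(samples))  # {'a': 100, 'b': 25, 'c': 1}
```

Here the 250 ms spike on target `a` does not change its weight, and the very slow target `c` still receives a weight of 1 rather than 0.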

Ross_Sbriscia · August 21, 2018, 4:57pm

Thanks! Those are some very solid pointers. I’ll see what I can do here. I’d love to ultimately submit a PR for this.

jeremyjpj0916 · June 16, 2022, 2:48pm

Via active health checks:

Take the average latency over the last X latency checks of each target (with X configurable) to dictate where to route. Until latency-aware routing can kick in (after that number of checks), requests can be routed across the various healthy targets, or to the lowest-latency target seen so far. Whatever makes sense there is fine.

The requirement is that the targets are healthy too, of course; we don’t want to route to a super fast target that is throwing errors back at us on the health checks.
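The behavior described above could look roughly like the following Python sketch, offered for illustration only. `LatencyAwarePicker`, `report`, and `pick` are hypothetical names, and plain round-robin during warm-up is just one of the fallback options mentioned.

```python
from collections import deque
from itertools import count


class LatencyAwarePicker:
    """Hypothetical picker: lowest average latency among healthy targets."""

    def __init__(self, window=3):
        self.window = window  # the configurable "X" recent checks
        self.samples = {}     # target -> deque of recent latencies (ms)
        self.healthy = {}     # target -> result of the latest health check
        self._rr = count()    # round-robin counter used during warm-up

    def report(self, target, ok, latency_ms=None):
        """Record one active health-check result."""
        self.healthy[target] = ok
        if ok and latency_ms is not None:
            window = self.samples.setdefault(target, deque(maxlen=self.window))
            window.append(latency_ms)

    def pick(self):
        candidates = sorted(t for t, ok in self.healthy.items() if ok)
        if not candidates:
            return None
        warmed_up = all(len(self.samples.get(t, ())) == self.window
                        for t in candidates)
        if not warmed_up:
            # Not enough checks yet: rotate over the healthy targets.
            return candidates[next(self._rr) % len(candidates)]
        return min(candidates, key=lambda t: sum(self.samples[t]) / self.window)


picker = LatencyAwarePicker(window=2)
for latency in (30, 20):
    picker.report("dc1", True, latency)
for latency in (5, 7):
    picker.report("dc2", True, latency)
print(picker.pick())          # dc2 (average 6 ms vs 25 ms)
picker.report("dc2", False)   # dc2 starts failing its health checks
print(picker.pick())          # dc1
```

Unhealthy targets are filtered out before latency is considered, so a fast target that returns errors is never selected.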

Sreeram_Repaka · January 11, 2023, 9:09pm

Wondering if there’s an update on this? We are looking for similar functionality in Kong.
