We’ve got an intermittent issue that causes Kong API Gateway to stop responding to requests. The root cause is not clear at this point, but we have made a few observations that may help with troubleshooting. First, this is our environment:
Kong 3.3.0 in DB-less mode
KIC 2.8.1
Python Kong-PDK 0.33
k8s 1.24.12
Kong Helm Chart 2.23.0
Everything seems to be running fine from the API perspective, but the Kong logs are full of messages like:
declarative reconfigure was started on worker #0
[DB cache] purging (local) cache
building a new plugins iterator
AFAIU a reconfigure can be triggered by k8s infrastructure changes, but what is unclear is why it purges the whole cache, which results in rebuilding the plugins iterator. This can happen a dozen times in a single second, and occasionally Kong becomes completely unresponsive; when that happens, we start to see:
Could not claim instance_id for {{PLUGIN_NAME}} (key: {{PLUGIN_ID}})
Memory and CPU usage is stable and below 50%.
Any idea what the root cause could be? What k8s changes trigger a reconfigure?
It turns out that if the declarative configuration has changed, all caches are purged and the previous information about plugins is invalidated, which leads to new plugin instances being created. The thing is that our upstream services run on spot instances and their IPs change fairly often, which causes frequent updates to the declarative configuration; the plugins themselves, however, never change, so it does not make sense to reload them every time. The problem we are observing, where one of the pods becomes unresponsive (it gets stuck on Could not claim instance_id), could be mitigated by avoiding frequent plugin reloads: it makes sense to reload plugins only when the plugins hash has changed.
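To illustrate the "reload only when the plugins hash has changed" idea, here is a minimal Python sketch. All names here (Reconfigurer, plugins_hash, the config shape) are hypothetical, not Kong's actual API (Kong's internals are Lua); the point is only that a stable hash over the plugins section lets upstream IP churn flow through without touching plugin instances.

```python
import hashlib
import json

def plugins_hash(declarative_config: dict) -> str:
    """Stable digest over only the plugins section of the config."""
    plugins = declarative_config.get("plugins", [])
    canonical = json.dumps(plugins, sort_keys=True)
    return hashlib.sha256(canonical.encode()).hexdigest()

class Reconfigurer:
    """Illustrative only: skip plugin reloads when the plugins hash is unchanged."""

    def __init__(self):
        self._last_plugins_hash = None

    def reconfigure(self, new_config: dict) -> bool:
        """Apply a new declarative config; return True if plugins were reloaded."""
        new_hash = plugins_hash(new_config)
        if new_hash == self._last_plugins_hash:
            # Spot-instance IP churn lands here: upstreams/targets still get
            # updated elsewhere, but plugin instances stay untouched.
            return False
        self._last_plugins_hash = new_hash
        # ...rebuild the plugins iterator / recreate plugin instances here...
        return True
```

With this in place, two consecutive configs that differ only in upstream IPs would trigger a single plugin reload rather than one per config push.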
A potential issue could be that reset_instance is called only for the "not ready" or "no plugin instance" errors. If any other error occurs, the non-initialized plugin instance is never cleaned up, and other threads will never make it through the while loop in get_instance_id.
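A small Python sketch of that failure mode, under stated assumptions: this is not Kong's actual code (which is Lua), and PluginInstanceCache, FlakyRPC, and the PENDING sentinel are invented for illustration. It shows how cleaning up only on a known error list lets an unexpected error leave a half-claimed instance behind, so callers spin in the wait loop until they time out.

```python
import time

PENDING = object()  # sentinel: a claim attempt is in flight

class FlakyRPC:
    """Fails the first start_instance call with the given error, then succeeds."""

    def __init__(self, error):
        self.error, self.calls = error, 0

    def start_instance(self):
        self.calls += 1
        if self.calls == 1:
            raise RuntimeError(self.error)
        return 42

class PluginInstanceCache:
    """Illustrative model of a get_instance_id wait loop with selective cleanup."""

    RESETTABLE = ("not ready", "no plugin instance")

    def __init__(self, rpc):
        self.rpc = rpc
        self.instance_id = None

    def reset_instance(self):
        self.instance_id = None

    def get_instance_id(self, timeout=1.0):
        deadline = time.monotonic() + timeout
        while True:
            if self.instance_id is PENDING:
                # Someone is (supposedly) still claiming; wait for them.
                if time.monotonic() > deadline:
                    raise TimeoutError("Could not claim instance_id")
                time.sleep(0.01)
                continue
            if self.instance_id is not None:
                return self.instance_id
            self.instance_id = PENDING
            try:
                new_id = self.rpc.start_instance()
            except RuntimeError as err:
                if str(err) in self.RESETTABLE:
                    self.reset_instance()  # clears PENDING; next pass retries
                # The bug being described: any other error skips the reset,
                # PENDING is never cleared, and every caller spins above.
                continue
            self.instance_id = new_id
            return new_id
```

A "not ready" failure recovers on the retry, while an unlisted error (say, a closed socket) leaves the cache stuck at PENDING, matching the observed "Could not claim instance_id" hang.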