Hello,
we would like to use the rate-limiting plugin to protect against a request burst.
Our setup is as follows, we are using docker based architecture (marathon)
2 docker containers using Kong (0.13 CE) and a cluster based policy backed up by a PostgreSQL 9.6 database.
2 back-end REST services which expose our database.
The problem is that a request can take up to 10 seconds, and if the client will perform several requests in multiple threads during that interval only the first one will be recorded by the rate limiting plugin, the rest of them will pass without any limitation.
This can lead to degrading of performance of our REST services, and even denial of service.
Can anybody give us any pointers on how to configure this ?