'Elastic Cloud APM Server - Queue is full
I have many Java microservices running in a Kubernetes Cluster. All of them are APM agents sending data to an APM server in our Elastic Cloud Cluster.
Everything was working fine but suddenly every microservice received the error below showed in the logs.
I tried to restart the cluster, increase the hardware power and I tried to follow the hints but no success.
Obs: The disk is almost empty and the memory usage is ok. Everything is in 7.5.2 version
Solution 1:[1]
I deleted all the indexes related to APM and everything worked after some minutes.
Solution 2:[2]
for better performance u can fine tune these fields in apm-server.yml file
- internal queue size increase
queue.mem.events=output.elasticsearch.worker * output.elasticsearch.bulk_max_size
default is 4096 - output.elasticsearch.worker (increase) default is 1
- output.elasticsearch.bulk_max_size (increase) default is 50 very less
Example : for my use case i have used following stats for 2 apm-server nodes and 3 es nodes (1 master 2 data nodes )
queue.mem.events=40000
output.elasticsearch.worker=4
output.elasticsearch.bulk_max_size=10000
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|---|
Solution 1 | Edgar Peixoto |
Solution 2 | Paulo |