How To Identify And Tune Worker Bottlenecks

I’ve used the above guides to tune worker performance.

Grafana Panels to monitor the metrics that matter

avg by(task_queue) (temporal_sticky_cache_size{}) Gives you the sticky cache size, which may or may not be shared across your workers depending on the SDK

avg by(task_queue) (temporal_worker_task_slots_available{worker_type="WorkflowWorker"}) Gives you avg workflow tasks slots available per worker

avg by(task_queue) (temporal_worker_task_slots_available{worker_type="ActivityWorker"}) Gives you avg activity tasks slots available per worker

sum(rate(temporal_workflow_task_schedule_to_start_latency_seconds_sum[5m])) / sum(rate(temporal_workflow_task_schedule_to_start_latency_seconds_count[5m])) Gives you the average schedule-to-start latency for workflow tasks (the _count series on its own only counts observations, not latency)

sum(rate(temporal_activity_schedule_to_start_latency_seconds_sum[5m])) / sum(rate(temporal_activity_schedule_to_start_latency_seconds_count[5m])) Gives you the average schedule-to-start latency for activity tasks

100 * avg((poll_success + poll_success_sync) / (poll_success + poll_success_sync + poll_timeouts)) Gives you the poll success rate as a percentage
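
These panels assume the SDK metrics are already being scraped by Prometheus. If they are not, here is a minimal sketch, assuming the Go SDK and its Tally contrib package, of exposing the worker metrics on a Prometheus endpoint; the listen address, port, and timer type are assumptions to adjust for your own setup:

```go
package main

import (
	"log"
	"time"

	prom "github.com/prometheus/client_golang/prometheus"
	"github.com/uber-go/tally/v4"
	"github.com/uber-go/tally/v4/prometheus"
	"go.temporal.io/sdk/client"
	sdktally "go.temporal.io/sdk/contrib/tally"
)

// newPrometheusScope builds a Tally scope that serves Prometheus metrics
// on the configured listen address.
func newPrometheusScope(cfg prometheus.Configuration) tally.Scope {
	reporter, err := cfg.NewReporter(prometheus.ConfigurationOptions{
		Registry: prom.NewRegistry(),
		OnError:  func(err error) { log.Println("prometheus reporter error:", err) },
	})
	if err != nil {
		log.Fatalln("error creating prometheus reporter:", err)
	}
	scope, _ := tally.NewRootScope(tally.ScopeOptions{
		CachedReporter:  reporter,
		Separator:       prometheus.DefaultSeparator,
		SanitizeOptions: &sdktally.PrometheusSanitizeOptions,
	}, time.Second)
	// Adjusts metric naming for Prometheus (e.g. the _seconds suffix on latency timers).
	return sdktally.NewPrometheusNamingScope(scope)
}

func main() {
	// The metrics handler is attached to the client; every worker created
	// from this client reports the temporal_* metrics queried in the panels above.
	c, err := client.Dial(client.Options{
		MetricsHandler: sdktally.NewMetricsHandler(newPrometheusScope(prometheus.Configuration{
			ListenAddress: "0.0.0.0:9090", // assumption: scrape target for Prometheus
			TimerType:     "histogram",    // emit latencies as histograms (_bucket/_sum/_count)
		})),
	})
	if err != nil {
		log.Fatalln("unable to create Temporal client:", err)
	}
	defer c.Close()
	// ... create and run workers with this client as usual.
}
```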

Actions to take based on the metrics observed above

If worker resource consumption (CPU, RAM) is low and…

  • if sticky_cache_size hits workerCacheSize -> increase the worker cache size (see the worker options sketch after this list)
  • if available_slots regularly falls toward zero -> increase the slots per worker
  • if the poll success rate AND schedule_to_start_latency are both low -> you have too many workers competing for the same task queue
  • if available_slots is high AND schedule_to_start_latency is abnormally high even with long polling -> increase the poller count per worker; this is rarely needed and should be your last resort
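
Where exactly these knobs live varies by SDK. As a rough illustration only, here is a minimal Go SDK sketch showing where the sticky cache size, execution slots, and poller counts are set; the task queue name and every numeric value are placeholders, not recommendations:

```go
package main

import (
	"log"

	"go.temporal.io/sdk/client"
	"go.temporal.io/sdk/worker"
)

func main() {
	// Sticky cache: in the Go SDK this is a single cache shared by all workers
	// in the process. Raise it when temporal_sticky_cache_size keeps hitting the limit.
	worker.SetStickyWorkflowCacheSize(4096)

	c, err := client.Dial(client.Options{}) // defaults to localhost:7233
	if err != nil {
		log.Fatalln("unable to create Temporal client:", err)
	}
	defer c.Close()

	w := worker.New(c, "your-task-queue", worker.Options{
		// Slots: raise these when temporal_worker_task_slots_available keeps
		// falling toward zero while CPU/RAM are still low.
		MaxConcurrentWorkflowTaskExecutionSize: 200,
		MaxConcurrentActivityExecutionSize:     400,

		// Pollers: raise these only as a last resort, when slots stay free but
		// schedule-to-start latency remains abnormally high.
		MaxConcurrentWorkflowTaskPollers: 4,
		MaxConcurrentActivityTaskPollers: 4,
	})

	// w.RegisterWorkflow(...) and w.RegisterActivity(...) omitted for brevity.

	if err := w.Run(worker.InterruptCh()); err != nil {
		log.Fatalln("worker exited:", err)
	}
}
```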