Recommended metrics to use for autoscaling temporal server pods

jwang97 · April 28, 2023, 8:57pm

Currently, we are just scaling off of CPU. Still, we were wondering if there are any official recommendations on what metrics to best scale the various deployments of the temporal server.

Seems like task_schedule_to_start_latency would be a reasonable candidate.

sushil_singh · May 4, 2023, 1:06am

task_schedule_to_start_latency is useful for scaling temporal workers. But you have to look at other 3 servers as well (history, frontend and matching) servers. Scaling those servers should be based on CPU and memory like you would scale any horizontally scaled service

Topic		Replies	Views
Suggested metrics to autoscale Temporal workers on Community Support general-impl , metrics , kubernetes	9	7980	January 3, 2024
What are the best metrics to autoscale each cluster service on? Community Support	5	907	May 8, 2023
Auto scaling worker deployment Community Support python-sdk , scaling , deployment	9	2636	April 4, 2024
Kubernates autoscaling of workers Community Support scaling , kubernetes	0	135	July 10, 2024
Automatically scale server components e.g. using `HorizontalPodAutoscaler` Community Support	2	1807	March 23, 2021

Recommended metrics to use for autoscaling temporal server pods

Related topics