How to make dynamic activity workers / auto scaling to 0

Artem_Kazakov · September 22, 2023, 12:02pm

Hi Temporal team,

I have a use case where activity is executed relatively infrequently (~ once an hour), but requires a lot of cpu and ram. Currently we deploy a static pool of workers (1-2 replicas), that are unused 90% of the time. This is obviously very wasteful, especially when we have many such activities.

What I would like to have: workers spin down to 0 when there are no tasks assigned for that activity type, and scale them up to necessary number of replicas to process the tasks in the queue.

How would you do it with temporal? Is there a plan to have a native support for k8s workers to enable such capabilities?

tihomir · September 22, 2023, 7:08pm

I think there are plans to add worker scaling feature but no ETA on that as of yet.
One idea currently could be to look at task_schedule_to_start_latency server metric for Activity task_type, sample query

histogram_quantile(0.95, sum(rate(task_schedule_to_start_latency_bucket{namespace="$namespace", task_type="Activity"}[$__rate_interval])) by (task_type, le))

if this latency goes up and you don’t have any activity pollers, sample query

sum(rate(service_pending_requests{namespace="$namespace", operation="PollActivityTaskQueue", service_name="frontend"}[1m])) or vector(0)

would mean you need to start your activity worker(s) to process pending activity tasks

Topic		Replies	Views
Need suggestion in scale worker from 0 to 1 with keda Community Support metrics , worker , kubernetes	1	648	March 20, 2024
Temporal with K8S job pattern Community Support general-impl , kubernetes	2	2015	January 17, 2024
Suggested metrics to autoscale Temporal workers on Community Support general-impl , metrics , kubernetes	9	7914	January 3, 2024
Kubernates autoscaling of workers Community Support scaling , kubernetes	0	125	July 10, 2024
Scaling Temporal Workers with K8S Community Support metrics , worker , typescript-sdk , kubernetes	1	482	February 22, 2024

How to make dynamic activity workers / auto scaling to 0

Related topics