Automatically scale server components, e.g. using a `HorizontalPodAutoscaler`

In the provided Helm chart for deploying Temporal on Kubernetes, a fixed-size replica set is deployed for each server component. Is it recommended to keep the same number of instances running for longer periods, or is it fine to autoscale on short time scales, for example by employing a `HorizontalPodAutoscaler`?
If so, what is the recommended metric to autoscale the different server components on (frontend, matching, history)?

Autoscaling is a great idea!

We don’t include that in the Helm chart, but you can totally deploy an HPA and scale on metrics from k8s-prometheus-adapter.
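For reference, a k8s-prometheus-adapter rule maps a Prometheus series onto a custom metric the HPA can consume. This is a minimal sketch, assuming a histogram named `service_latency_bucket` is scraped from the server pods; the series and exposed metric name are placeholders to adapt to whatever your deployment actually exports:

```yaml
# prometheus-adapter rule sketch (assumptions: series name, label layout)
rules:
- seriesQuery: 'service_latency_bucket{namespace!="",pod!=""}'
  resources:
    overrides:
      namespace: {resource: "namespace"}
      pod: {resource: "pod"}
  name:
    as: "frontend_latency_p95"   # hypothetical metric name exposed to the HPA
  metricsQuery: >-
    histogram_quantile(0.95,
      sum(rate(<<.Series>>[5m])) by (le, <<.GroupBy>>))
```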

As for how to scale, that really depends on your workload. It’s fairly likely, though, that your persistence layer will be the main bottleneck in overall system performance. You could tune things like the number of workers based on sync match rates. As for the other components, I haven’t had a chance to try autoscaling them on a metric yet.
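To make the sync match rate idea concrete, a query along these lines computes the fraction of task polls that were matched synchronously. The counter names below are assumptions based on the matching service's `poll_success`/`poll_success_sync` metrics; verify against your deployment's actual `/metrics` output, since reporters may prefix or suffix the names:

```promql
# Fraction of polls matched synchronously over the last 5 minutes
# (assumed metric names — check your scrape output)
sum(rate(poll_success_sync[5m])) / sum(rate(poll_success[5m]))
```

A rate near 1.0 suggests workers are keeping up; a falling rate is a hint to add workers or scale the matching service.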

If you go this route, we’d love to hear what you find out!

Cool, I think then I’ll start with something dead simple like:

```yaml
apiVersion: autoscaling/v2beta2
kind: HorizontalPodAutoscaler
metadata:
  name: temporal-frontend
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: temporal-frontend
  minReplicas: 2
  maxReplicas: 10
  metrics:
  - type: Resource
    resource:
      name: cpu
      target:
        type: Utilization
        averageUtilization: 50
```
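If CPU turns out to be a poor signal, the same HPA can target a custom metric served by k8s-prometheus-adapter instead. A sketch of the `metrics` stanza, where `frontend_latency_p95` is a hypothetical metric name that has to match whatever your adapter rules expose:

```yaml
# Custom-metric variant (assumption: adapter exposes frontend_latency_p95
# as a per-pod metric, in seconds)
metrics:
- type: Pods
  pods:
    metric:
      name: frontend_latency_p95
    target:
      type: AverageValue
      averageValue: "150m"   # scale out when p95 latency exceeds ~150ms
```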

From there I have to run load tests (I’ll check out this: link) to see where my bottleneck is and which metric I should scale on, but it’s good to know there’s no reason not to autoscale.

If I have any significant findings I’ll report here.