HA of Temporal Server(Cluster)

whitecrow · March 15, 2022, 3:18am

Hi experts,
I am trying to setup a temporal environment on k8s with high availability.
On the offical document, I see we recommend a cluster deployment through helm chart. But that seems a little bit complicated for me. We don’t need to setup different numbers of front-ends or workers for now.
In a simplest manner, can I achieve that just by running multiple replicas of the temporal server?
is there any risk for temporal synchronization?
Here is my k8s manifest for temporal server:

apiVersion: apps/v1
kind: Deployment
metadata:
  name: temporal-server-deployment
spec:
  replicas:2 # ??? Does this work for HA or is there any risk for temporal synchronization???
  selector:
    matchLabels:
      app: temporal-server
  template:
    metadata:
      labels:
        app: temporal-server
    spec:
      containers:
        - name: temporal-server
          image: temporalio/auto-setup:1.14.0
          env:
            - name: LANG
              value: "en_US.UTF-8"
            - name: DB
              value: postgresql
            - name: DB_PORT
              value: "5432"
            - name: POSTGRES_USER
              value: postgres
            - name: POSTGRES_PWD
              value: postgres
            - name: POSTGRES_SEEDS
              value: 10.168.168.221
            - name: DYNAMIC_CONFIG_FILE_PATH
              value: config/dynamicconfig/development.yaml
            - name: PROMETHEUS_ENDPOINT
              value: 0.0.0.0:8000
          ports:
            - containerPort: 7233
            - containerPort: 8000
          volumeMounts:
            - name: config
              mountPath: /etc/temporal/config/dynamicconfig

          resources:
            requests:
              cpu: 100m
      volumes:
        - name: config
          configMap:
            name: temporal-server-configmap
            items:
              - key: development.yaml
                path: development.yaml

BTW, if this is not a good practise, can anyone share how to generated the k8s manifests for each services?

tihomir · March 16, 2022, 3:53am

image: temporalio/auto-setup:1.14.0

Using auto-setup image for prod clusters is typically not recommended but if you have a single node cluster it should be ok for small scale app.
See also relevant forum post here.

Have you tried using replicas:2 and did you run into any issues? Not sure if this type of deployment would run into issue with cross-pod communications. Worth giving it a try.

whitecrow · March 18, 2022, 6:31am

I tried it and no issues found for now.
There is one thing I want to confirm from the implementation mechanism :
no matter a workflow runs on which server, when I query the workflow list from web, we can always get it, right?
I think the scenario is the same in a typical temporal cluster which has different number of internal services, for example, 2 frontends, 3 instances each for matching, history, and worker.

tihomir · March 18, 2022, 3:23pm

Temporal relies on the db being fully consistent in all failure scenarios. I believe what you said is correct if all your replicas are configured to a single fully consistent db (and visibility db).

derek · March 24, 2022, 2:24pm

Hi! just wanted to followup on this point in particular. In a production environment we do recommend that the individual services be run in their own binary. there shouldn’t be any risk to data integrity, but you may have some negative performance impact from doing this. that said, I haven’t done much experimenting with this and I’m interested in your experience if you try it and you are willing to share!

To run services in their own binary, even though you don’t want to use helm, you can generate the manifests using helm template and then just pair them down to what you want and use those directly.

One more comment on production in general - you will not want to use auto-setup but instead should use the non-auto images and the temporal-*-tool database tools we provide to create and update schema.

whitecrow · June 2, 2022, 2:41am

Wish I can share our experience in a near future. Our system is still under development for now. Our product is a data analyze system, which serves for individual organizations. Requests and workflows numbers are not so big as SAAS platforms. While, ease of deployment means much for us. That is why I choose such a solution.

Topic		Replies	Views
Running temporal across multiple Kubernetes clusters Community Support multicluster , kubernetes	6	1752	September 1, 2022
Some easier way to deploy temporal cluster in Kubernates. May be some direct yaml files? Server Deployment	2	5474	March 11, 2024
How to scale temporal to run across multiple hosts for HA Server Deployment python-sdk , production , postgresql	5	922	February 24, 2025
Any docs for install temporal as a multi-node cluster? Community Support	2	684	December 15, 2020
Production Deployment - Kubernetes Discussion Community Support	1	493	February 19, 2023

HA of Temporal Server(Cluster)

Related topics