How to scale temporal to run across multiple hosts for HA

wshv · November 3, 2023, 5:39pm

Hello, I’m looking forward to running temporal in a production-ready way.

What we’ve done
In the initial experiment, we currently run temporal using docker-compose and each major component corresponds to a container

temporal
temporal-admin-tools
temporal-postgres
temporal-ui

What we want
Running on a single host, things are working fine. Now we start to think about HA. Ideally, we want such kind of scenario: suppose the active temporal server is running on host A and there are some workflows currently running in the middle. Now host A is shut down. The previous standby temporal server running on host B can pick up the unfinished workflows and resume them seamlessly. For some reason, we don’t want to run temporal on top of K8s.

What we think
To accomplish this goal, we are thinking there are a few points that need to be ensured

We need to replicate the state of the temporal-postgres container. Containers running on different hosts should have eventual consistent copies of data.
We deploy multiple copies of temporal services on different hosts. Those copies have to be aware of each other and establish leadership and membership so that only one of them is active at a time ==> Does temporal provide this out-of-box?

Please advise if we are on the right track and if there is anything that we need to be mindful of. Also, does Temporal provide any documentation we can follow for such kind of scaled deployment?

I know I’ve asked lots of questions but I would really appreciate your help

tihomir · November 27, 2023, 3:58am

Check out Self-hosted Multi-Cluster Replication | Temporal Documentation

maxim · November 27, 2023, 8:06pm

The Temporal cluster is already HA if its DB is HA (we recommend Cassandra for this). You can add and remove nodes for each of its roles without downtime.

If you want multi-region availability, then the multi-cluster setup @tihomir mentioned is the way to go.

ysl · March 27, 2024, 4:35pm

@maxim, if we run multiple nodes for the Temporal cluster, and we use the Schedule feature in Temporal, could workflows get double-scheduled?

maxim · March 27, 2024, 5:37pm

No, they will be scheduled only once, as they rely on a database for consistency.

genek · February 24, 2025, 11:57am

Hello!

I’m also interested in multicluster temporal hosting, but I have not undesrtand this case correctly:

imagine we have 2 datacenters with k8s cluster on each
every k8s cluster runs 3 pods of each termporal service (frontend, history, matching, worker) - 24 pods total
each pod in every datacenter is connected to a signle cassandra keyspace

Is this topology correct or pods from different clusters will interfer each other? Or it is possible only with multicluster feature?

Thank you for help.

Topic		Replies	Views
How to easily deploy a temporal server cluster in production？ Community Support production	17	3987	February 18, 2025
Active-active deployment of temporal services on multiple kubernetes cluster cross DC Community Support deployment	1	1644	November 16, 2022
Running temporal across multiple Kubernetes clusters Community Support multicluster , kubernetes	6	1818	September 1, 2022
Temporal Multi-Server Deployment on Openshift Server Deployment deployment	5	1121	November 1, 2023
HA of Temporal Server(Cluster) Community Support deployment , kubernetes	5	1486	June 2, 2022

How to scale temporal to run across multiple hosts for HA

Related topics