Planning a production deployment

jmac · April 5, 2021, 5:28pm

I am in the process of setting up a production deployment of Temporal, and I have a few questions.

Is there an acceptable amount of latency between server nodes? We have multiple DCs connected with private fiber, and I’m wondering if I can have a single cross-DC cluster, or if I should be designing this as a multiple-cluster setup using the replication/failover functionality. Is there any rule of thumb here?
Is Kafka still required for the replication system? Is this system still considered experimental?

Wenquan_Xing · April 5, 2021, 6:54pm

what is the latency between physical DCs?
do you have absolute strong requirement for availability, event if it means losing consistency?

NOTE:

kafka is not required for cross DC
cross DC is still experimental

jmac · April 6, 2021, 1:37pm

I’m going to work on getting better numbers, but for now, I would assume 30ms would be the absolute worst case scenario, and in most cases we’d be seeing <10ms.

Consistency is probably a higher priority for us. Is there any latency number at which we would have to worry about seeing erratic behavior in the cluster?

I’m trying to determine if a multi-cluster replication setup is something I want to try to tackle right now, given this cluster should see relatively light use out of the gate.

Wenquan_Xing · April 6, 2021, 4:24pm

if latency is ~10ms then i guess it is ok to have a single Temporal cluster on top of multiple physical DCs.

About cross DC, this feature by design does not guarantee consistency, but availability. e.g. what if an entire DC is down.

You need to setup 2 Temporal cluster, each on top of a dedicated physical DC.
You also need to configure dedicated worker fleet per above Temporal cluster.
When a DC is down, you need to manually failover to the still functional DC.

Topic		Replies	Views
Temporal cluster cross-dc deployment Community Support cross-dc , deployment	7	1035	September 17, 2021
Active-active deployment of temporal services on multiple kubernetes cluster cross DC Community Support deployment	1	1644	November 16, 2022
Postgres stretched cluster along with Temporal stretched cluster across 2 DCs with ~7ms latency Community Support postgresql	5	1782	September 22, 2021
How to deploy in production environment Community Support	4	781	January 15, 2021
Production HA setup Community Support mysql , production	9	3960	November 23, 2020

Planning a production deployment

Related topics