I am in the process of setting up a production deployment of Temporal, and I have a few questions.
Is there an acceptable amount of latency between server nodes? We have multiple DCs connected with private fiber, and I’m wondering if I can have a single cross-DC cluster, or if I should be designing this as a multiple-cluster setup using the replication/failover functionality. Is there any rule of thumb here?
Is Kafka still required for the replication system? Is this system still considered experimental?
I’m going to work on getting better numbers, but for now, I would assume 30ms would be the absolute worst case scenario, and in most cases we’d be seeing <10ms.
Consistency is probably a higher priority for us. Is there any latency number at which we would have to worry about seeing erratic behavior in the cluster?
I’m trying to determine if a multi-cluster replication setup is something I want to try to tackle right now, given this cluster should see relatively light use out of the gate.