Hi,
I am looking for recommendations around Temporal history service configuration. We are running Temporal 1.26 against a PostgreSQL backend (AWS RDS). Twice a day we see a latency spike caused by a burst of traffic when a large number of schedules fire. After some time latencies recover and the cluster comfortably serves requests again, even though the frontend request rate only drops slightly.
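For context, one mitigation we have been considering (not yet tried) is capping persistence QPS via Temporal's dynamic config, so the schedule burst is absorbed by Temporal's own rate limiter instead of hammering RDS. A minimal sketch of what I mean; the values are illustrative, not tuned for our workload:

```yaml
# dynamicconfig/production.yaml -- illustrative values, not a recommendation
# Cap history persistence QPS so schedule bursts queue inside Temporal
# instead of overloading the database.
history.persistenceMaxQPS:
  - value: 3000
    constraints: {}
# Same idea for the frontend service.
frontend.persistenceMaxQPS:
  - value: 2000
    constraints: {}
```

Is that a sensible knob for this failure mode, or does it just move the latency elsewhere?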
We also use horizontal autoscaling on CPU and memory, and autoscaling does kick in in these situations. However, we are wondering whether autoscaling makes things worse rather than better here: the cluster is already under increased load and DB latencies go up, and bringing up another history node causes shard rebalancing, which means even more traffic to the database to load the shard contexts onto the new node.
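For reference, our autoscaler is a plain CPU/memory HPA on the history deployment. One idea we have been toying with is damping scale-up so that a short spike does not immediately trigger shard rebalancing. A sketch under our assumptions (the name temporal-history and all thresholds are placeholders, not our actual values):

```yaml
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: temporal-history   # placeholder name
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: temporal-history
  minReplicas: 3
  maxReplicas: 6
  behavior:
    scaleUp:
      # Wait out short spikes before adding pods, so a transient schedule
      # burst does not immediately trigger shard rebalancing.
      stabilizationWindowSeconds: 300
      policies:
        - type: Pods
          value: 1
          periodSeconds: 300   # add at most one history pod per 5 minutes
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 75
    - type: Resource
      resource:
        name: memory
        target:
          type: Utilization
          averageUtilization: 80
```

Does damping scale-up like this make sense for history, or would you rather pin the history replica count and only autoscale the stateless services?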
Has anyone had a similar experience? Any other ideas about what the problem could be?
Thanks for any feedback,
–Hardy