Capacity Planning for a Higher Throughput Temporal Cluster

matthew.hou.paytm · April 13, 2023, 12:56am

Hi folks. We are investigating the feasibility and resource requirements of a fairly high-throughput, low-latency Temporal cluster. Here are the high-level requirements:

Throughput:
- Short term: 100 workflow completions per second
- Mid to long term: 300-500 workflow completions per second
Latency:
- Workflow schedule to start: less than 10 seconds
- Activity / task schedule to start: less than 10 seconds

For ease of maintenance, we’d like to use RDS for our persistence layer. Will MySQL / Aurora provide enough performance for our use case? What would be a good number of history shards? Is 2048 too much?

In addition, what are the recommended resource settings for the Temporal service pods? (We are hosting them on K8s.) And what other dynamic configs should we tune to achieve the performance goals?

I noticed that there isn’t much documentation about operating and tuning a self-managed Temporal cluster in production. Are there any good articles about it?

Cheers folks! Thank you so much for the help.
