We are 1M workflows that triggers timers simultaneously, and this is causing a significant spike in DB CPU utilization—reaching ~90% for about an hour. Our Temporal DB is running on a db.r8g.2xlarge instance.
Is there a recommended way to handle or mitigate this issue?
Yes adding jitter is definitely an option and we can implement that.
Are there any additional approaches or best practices you would recommend for handling large timer bursts ??
Also, are there any configuration changes we could consider if firing all timers at the same time is unavoidable?