Hello!
While load testing Temporal, I ran into a problem with workflow execution time being too high.
The workflow consists of a single activity with the simplest possible logic (just a log.info() call).
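For illustration, a minimal sketch of such a workflow and activity (Java SDK assumed from the log.info() call; all names here are hypothetical):

```java
import io.temporal.activity.ActivityInterface;
import io.temporal.activity.ActivityMethod;
import io.temporal.activity.ActivityOptions;
import io.temporal.workflow.Workflow;
import io.temporal.workflow.WorkflowInterface;
import io.temporal.workflow.WorkflowMethod;
import java.time.Duration;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

@WorkflowInterface
interface LoadTestWorkflow {
  @WorkflowMethod
  void run();
}

@ActivityInterface
interface LogActivity {
  @ActivityMethod
  void logOnce();
}

// The activity's only "work" is a single log statement.
class LogActivityImpl implements LogActivity {
  private static final Logger log = LoggerFactory.getLogger(LogActivityImpl.class);

  @Override
  public void logOnce() {
    log.info("activity executed");
  }
}

// The workflow schedules exactly one activity and waits for it to complete.
class LoadTestWorkflowImpl implements LoadTestWorkflow {
  private final LogActivity activity =
      Workflow.newActivityStub(
          LogActivity.class,
          ActivityOptions.newBuilder()
              .setStartToCloseTimeout(Duration.ofSeconds(10))
              .build());

  @Override
  public void run() {
    activity.logOnce();
  }
}
```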
Judging by resource utilization, all components run without hitting their limits.
The load on Temporal is 200 RPS.
All settings are listed below. Please help.
Resources of the Temporal server components:
| Component name | CPU | RAM | Replicas |
|---|---|---|---|
| Frontend | 1600m | 1024Mi | 1 |
| History | 6400m | 8192Mi | 1 |
| Matching | 1600m | 1024Mi | 1 |
| Worker | 200m | 512Mi | 1 |
Number of application workers = 2
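For context, a rough sketch of how each of the two application worker processes might be wired up, reusing the workflow and activity from the sketch above (the task queue name and frontend address are assumptions):

```java
import io.temporal.client.WorkflowClient;
import io.temporal.serviceclient.WorkflowServiceStubs;
import io.temporal.serviceclient.WorkflowServiceStubsOptions;
import io.temporal.worker.Worker;
import io.temporal.worker.WorkerFactory;

class LoadTestWorker {
  public static void main(String[] args) {
    WorkflowServiceStubs service =
        WorkflowServiceStubs.newServiceStubs(
            WorkflowServiceStubsOptions.newBuilder()
                .setTarget("temporal-frontend:7233") // assumed frontend address
                .build());
    WorkflowClient client = WorkflowClient.newInstance(service);
    WorkerFactory factory = WorkerFactory.newInstance(client);

    // Both worker processes poll the same task queue.
    Worker worker = factory.newWorker("load-test-task-queue"); // assumed name
    worker.registerWorkflowImplementationTypes(LoadTestWorkflowImpl.class);
    worker.registerActivitiesImplementations(new LogActivityImpl());

    factory.start();
  }
}
```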
I would look at:
- An increase in service requests:
  sum(rate(service_requests[5m]))
  See if it goes up around the same time that your workflow and activity schedule-to-start latencies go up.
- Whether you had any resource-exhausted issues during this time (this also shows whether you were rate limited by the frontend hosts' RPS limits):
  sum(rate(service_errors_resource_exhausted{}[1m])) by (operation, resource_exhausted_cause)
- Your sync match rate:
  sum(rate(poll_success_sync{}[1m])) / sum(rate(poll_success{}[1m]))
If you did not get rate limited and your sync match rate looks OK (around 100%, no dips), I would check whether you have “stuck” executions (workflow tasks failing or timing out).
The server metric for that is workflow_task_attempt (a histogram metric); also look at the history hosts' logs, specifically for "Critical attempts processing workflow task".
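To chart that histogram, something along these lines should work, assuming the Prometheus reporter exposes it with the standard _bucket suffix (adjust to your setup):
histogram_quantile(0.95, sum(rate(workflow_task_attempt_bucket[5m])) by (le))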
Also check for workflow task start-to-close timeouts; the server metric for that is:
sum(rate(start_to_close_timeout{operation="TimerActiveTaskWorkflowTaskTimeout"}[1m]))