How to understand workflow task metrics

Roman · May 4, 2021, 11:54am

Good day! During testing we measure workflow metrics such as:

rate of Started Tasks
rate of Completed Tasks
rate of Scheduled Tasks
I thought when system can handle load, all of these metrics should be equal. But Scheduled Tasks’ rate is always larger than Completed and Started. So can you explain this behavior and what shall we see on our
dashboard when the system that can handle some load without problems?

Снимок экрана 2021-05-04 в 14.34.412770×1868 454 KB

Wenquan_Xing · May 4, 2021, 11:03pm

can you provide the metrics below?

poll_success_sync
poll_success

and persistence_requests for CreateTask operation?

Roman · May 5, 2021, 6:33am

Thank You for answer! I’ll attach here necessary metrics within the same time range.
By the way can You tell me, what is the difference between poll_success and poll_success_sync.

samar · May 5, 2021, 4:15pm

Hey @Roman,
Based on the graphs it looks like you don’t have enough pollers to keep up with the load. Can you bring up more workers and see if that helps. The metric poll_success_sync is the counter which tells how many sync matches are happening on the matching engine. This is a special optimization which allows us to dispatch a task to the poller without writing to the database. If this is lower than poll_success it typically means you don’t have enough concurrent pollers.
Another thing is you might be running into scalability bottleneck on number of TaskQueue partitions. By default we create 4 partitions for a TaskQueue. For a throughput of 2.5k tasks per second I recommend you should have atleast 10 partitions (250 tasks per partition). Here are the dynamic config knobs which controls number of partitions.

Nick_Laros · November 23, 2021, 12:24pm

Hi @samar thanks for your answer, really helpful.

Hi @Roman I am curious about your temporal deployment setup (replicaCount, resource request / limit, persistence setting, etc) to achieve average 2k transaction completion. I’ll be grateful if you could share the setup here for my reference.

Topic		Replies	Views
Workflow Performance with Java SDK Community Support java-sdk	1	743	February 20, 2023
Meaning of worker task slots available metric Community Support	7	2563	August 29, 2023
Workflows getting stuck after some N workflows with timers Community Support go-sdk , helm , cassandra , cadence	6	1650	May 4, 2021
Workflow Task Schedule To Start Latency High Community Support java-sdk , deployment	11	3950	February 8, 2025
What are the recommended settings for workflow and activity pollers count? Developer Corner general-impl	0	4587	August 8, 2022

How to understand workflow task metrics

Related topics