Temporal and Cadence performance comparison

Sunkwan.Kwon · April 8, 2022, 1:41am

Hello.

I ran a performance comparison test between Temporal and Cadence and saw different patterns in the results. I would appreciate some advice to help me understand them.

If you need more information to analyze the results, please let me know.

[Test environment]

3 hosts
OS version: CentOS 8.5.2111
Cassandra version: 3.11.12-1
Elasticsearch version: 7.17.1
Temporal-server version: 1.15.2
Cadence-server version: 0.23.1
Kafka/zookeeper for Cadence
12 workers (4 workers per host)

[Temporal/Cadence configuration]

Number of history shards: 100
Enable advanced visibility
history,matching,worker.persistenceMaxQPS: 20000

[Test method]

Create 1000, 5000, and 8000 cron workflows, each triggered every minute.
Measure the workflow latency (end time - execution time).

[Test result]

The average latency is similar for both.
However, Temporal's maximum latency was longer than Cadence's: the Temporal results showed a long tail.

[Test data]

Note that the vertical axis of the 5k and 1k charts is log-scale; the 8k chart is not.

[Image: temporal-cadence-performance-result]

Wenquan_Xing · April 8, 2022, 4:12am

As far as I remember, at least the default configs for task queue partitions are different (if your worker poller count is not large enough, this config can affect latency):

By default, Cadence uses 1 task queue partition: cadence/config.go at v0.23.1 · uber/cadence · GitHub

By default, Temporal uses 4 task queue partitions:

temporalio/temporal/blob/v1.15.2/service/matching/config.go#L123-L124

    NumTaskqueueWritePartitions: dc.GetIntPropertyFilteredByTaskQueueInfo(dynamicconfig.MatchingNumTaskqueueWritePartitions, dynamicconfig.DefaultNumTaskQueuePartitions),
    NumTaskqueueReadPartitions:  dc.GetIntPropertyFilteredByTaskQueueInfo(dynamicconfig.MatchingNumTaskqueueReadPartitions, dynamicconfig.DefaultNumTaskQueuePartitions),

temporalio/temporal/blob/v1.15.2/common/dynamicconfig/constants.go#L611

        TaskQueueName
        // TaskType is the task type (0:Workflow, 1:Activity)
        TaskType
        // ShardID is the shard id
        ShardID

        // lastFilterTypeForTest must be the last one in this const group for testing purpose
        lastFilterTypeForTest
    )

    const DefaultNumTaskQueuePartitions = 4

    // FilterOption is used to provide filters for dynamic config keys
    type FilterOption func(filterMap map[Filter]interface{})

    // TaskQueueFilter filters by task queue name
    func TaskQueueFilter(name string) FilterOption {
        return func(filterMap map[Filter]interface{}) {
            filterMap[TaskQueueName] = name
        }
    }

Can you try using the same number for both? Either 4 for both or 1 for both.

Sunkwan.Kwon · April 8, 2022, 7:52am

@Wenquan_Xing Thank you for your comment.
I changed the number of task queue partitions to 1, and the result is now almost the same as Cadence's.

Here is the comparison data and chart.

Wenquan_Xing · April 8, 2022, 6:57pm

One more thing to adjust is the rate limit for task dispatching, which defaults to 1000: temporal/config.go at v1.15.2 · temporalio/temporal · GitHub

Judging from the chart, I believe this limit should be increased.

Sunkwan.Kwon · April 11, 2022, 2:13am

That worked. Changing that configuration improved performance.
Thank you for the suggestion.

During the test, I found another issue regarding advanced visibility. I'll open a new post about it.
