Ordering Guarantee of messages / Sequencing

PARIKSHIT · August 16, 2023, 9:49am

We have use cases for ordering of messages.
Needed to understand if there is any case of ordering/sequencing of messages with Temporal?
Similar to Kafka topic partition, is there anything on temporal for sequencing.

As per documentation by temporal: it is not provided, can somehow confirm this or is there some workaround for this

→ do not have any ordering guarantees
→ Task Queues support server-side throttling, which enables you to limit the Task dispatching rate to the pool of Worker Processes while still supporting Task dispatching at higher rates when spikes happen.

maxim · August 16, 2023, 4:30pm

@PARIKSHIT it is hard to give recommendation without understanding your requirements.

Do you need global ordering? Or per some business level ID like a customer?
What is the maximum rate of events per such ID/partition?
Why do you need the ordering in the first place?

PARIKSHIT · August 29, 2023, 3:44am

Hi Maxim,

We need to processes messages in our queue in a defined order: Since all messages update the same document. If all the messages are processed at once, we land up into optimistic locking exception on the same document. Hence we were using a Kafka topic partition to guarantee ordering of messages.
But now we want to move to Temporal.

Rate of events: x > 0 and x < 2000 (These events/updates can be triggered concurrently, we need to control the processing on the consuming side)

I am trying a POC with temporal currently, where i have defined multiple activiy methods: @ActivityMethod which gets called under a @WorkflowMethod when a worker picks up a message.
Here i see that the acitivities are always executed in sequence. Even I bring down the worker and then bring it up and replay the workflow again, it starts from the activity it left off.
Just wondering is this the way to potentially order the execution of events, Or do we have better way in Temporal to order the events?

maxim · August 29, 2023, 5:05pm

Is 2000 messages per second for a single document or across all the documents?

PARIKSHIT · August 30, 2023, 9:29am

Hi @maxim

Yes, in a worst case scenario, we can have upto 100 messages coming to the backend at once.

maxim · August 30, 2023, 1:05pm

To the same document?

PARIKSHIT · August 30, 2023, 4:52pm

Yes this can happen to the same document

maxim · August 30, 2023, 5:17pm

The usual pattern is to have a workflow per business entity (document in your case) and send signals to it. Then, the workflow can process them in order. But this design doesn’t work for high update rates. If you need 100 requests/second to the same workflow, then it wouldn’t work. If you can rate limit them to something like 20 then it could work.

PARIKSHIT · August 31, 2023, 12:29am

Understood.

Additionally If I were to create multiple workflows for all the updates (1 workflow per update which would be carrying the JSON payload to update single document), can I have ordering at a workflow level

maxim · August 31, 2023, 3:15pm

No, we don’t support ordering across workflows yet.

Benedito_Marques · December 20, 2024, 8:48pm

Hello @maxim ! Temporal currently support ordering across workflows? I have the same demand here

maxim · December 20, 2024, 9:22pm

What is your use case?

Benedito_Marques · December 20, 2024, 9:43pm

I have process that sends a lot of events (signals). All signals are sent to respective tenant long running workflow, in which acumulate this signals in a array inside workflow for ordering.

For each tenant workflow was created 2 limits check:

Per signal
Per workflow size

When limit is reached, a continue as new is performed.

The problem is: when workers are restarted for some reason (reach pod CPU/memory limit, for example) the new worker don’t pickup the workflow immediately. I see that a time is spended until new workers pickup workflow and show logs again (I don’t know the cause of this)

So from the time of restart until the workers continue process signal again, the workflow already has reached the signal or workflow size limit, cause the continue as new is just performed if worker execute it and check the size of array signals or workflow size.

I want to not use a long running workflow anymore, and open 1 workflow per signal, but for this, I need to ensure that workflows will be executed in order.

maxim · December 21, 2024, 6:00pm

What is the average and peak rate of signals per workflow ID?

Benedito_Marques · December 21, 2024, 8:03pm

Analyzing the json history file, this is the current rate, but we has a projection to increase this for 500x (currently has just 1 tenant on the cluster, and the projection is 500 in max):

Average rate: 0.91 signals/second
Peak rate: 1.00 signals/second

maxim · December 21, 2024, 8:07pm

I’m confused by your answer. I asked about the maximum rate per ID (tenant). You said that the rate is going to increase because you are going to increase the number of tenants. How does it relate to the rate per tenant?

Benedito_Marques · December 21, 2024, 10:22pm

Let me explain:

When I need to scale, the number of “senders” are increased. This senders are pbx’s
Each sender has data about all tenants in same time. For example:
In pod 1, the number of received calls of an agent is 10.
In pod 2, the number of calls of this same agent is 5
In pod 3, the number of calls of this same agent is 1

this happens because the loadbalancer can send the call to pod1, pod2, or pod3, making the counters different in each pod.

So if I scale one more pbx, I’ll have this same agent in pod4, increasing the number of signals that will be sent to this tenant long running workflow.

maxim · December 21, 2024, 10:53pm

So, what is the average and peak rate per single tenant workflow you need when the system is fully scaled?

Benedito_Marques · December 21, 2024, 10:56pm

Something around this:

Average rate : 455 signals/second
Peak rate : 500 signals/second

maxim · December 21, 2024, 11:19pm

A single Temporal workflow cannot support such a high rate of signals. Currently, Temporal is not the right technology if you need to guarantee the complete ordering of messages at such a rate.

Temporal scales out with the number of open workflows. For example, it can process hundreds of thousands of events per second if they are distributed over many workflow instances, and each instance processes a maximum of a few signals per second.

Topic		Replies	Views
Temporal and Kafka Community Support java-sdk , task-queue , kafka	10	13801	June 21, 2023
Async execution of activities with guaranteed order outside of the current workflow Community Support java-sdk , feature-request	1	1138	June 27, 2022
Thoughts on implementing Kafka exactly-once messaging integration Community Support	9	963	February 10, 2025
Order for picking scheduled workflow/activity task Community Support go-sdk , matching-service , task-queue , worker	4	1538	July 19, 2022
Concurrent execution of same activity across different workers Community Support	3	99	August 30, 2024

Ordering Guarantee of messages / Sequencing

Related topics