Extremely high rate of database calls

We have set up Temporal with a MySQL database. Currently we have around 2000 workflows in the open state, and we are observing an extremely high rate of SQL calls to the database, in requests per minute (rpm):

1
INSERT INTO buffered_events(shard_id, namespace_id, workflow_id, run_id, data, data_encoding)
VALUES (?, ?, ?, ?, ?, ?)
63292 rpm

2
SELECT
shard_id, namespace_id, workflow_id, run_id, create_request_id, state, status, start_version, last_write_version
FROM current_executions WHERE shard_id = ? AND namespace_id = ? AND workflow_id = ?
72181 rpm

3
SELECT next_event_id FROM executions
WHERE shard_id = ? AND namespace_id = ? AND workflow_id = ? AND run_id = ? FOR UPDATE
77756 rpm

4
UPDATE executions SET
next_event_id = ?, last_write_version = ?, data = ?, data_encoding = ?, state = ?, state_encoding = ?
WHERE shard_id = ? AND namespace_id = ? AND workflow_id = ? AND run_id = ?
77756 rpm

5
SELECT
shard_id, namespace_id, workflow_id, run_id, create_request_id, state, status, start_version, last_write_version
FROM current_executions WHERE shard_id = ? AND namespace_id = ? AND workflow_id = ? FOR UPDATE
77757 rpm

6
UPDATE current_executions SET
run_id = ?,
create_request_id = ?,
state = ?,
status = ?,
start_version = ?,
last_write_version = ?
WHERE
shard_id = ? AND
namespace_id = ? AND
workflow_id = ?
77788 rpm

7
SELECT range_id FROM shards WHERE shard_id = ? LOCK IN SHARE MODE
77824 rpm

What tuning should we look into?


What is the rate at which your application is doing state transitions? Looks like you are doing updates at a very high rate. Can you describe what you are doing within your workflows?

We have 4 workflows that are causing all the update queries.

The workflow is going into an infinite loop of signaling, and even though we have terminated the workflow, it gets started again.

Can you look at the WorkflowExecutionStarted event to see who is starting the workflow execution? It has an identity field on it.
Signals are coming from outside the workflow implementation, so you need to chase who is sending all those signals.
But this would explain the high database load.

Signals also have an identity field identifying the process sending them.
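
If it helps, here is a minimal sketch using the Go SDK (untested, with a hypothetical workflow ID) that walks the receiver workflow's history and prints the identity recorded on the WorkflowExecutionStarted and WorkflowExecutionSignaled events:

package main

import (
	"context"
	"fmt"
	"log"

	enumspb "go.temporal.io/api/enums/v1"
	"go.temporal.io/sdk/client"
)

func main() {
	// Assumes a locally reachable frontend (default localhost:7233).
	c, err := client.Dial(client.Options{})
	if err != nil {
		log.Fatalln("unable to create client:", err)
	}
	defer c.Close()

	// "receiver-workflow-id" is a placeholder; an empty run ID targets the latest run.
	iter := c.GetWorkflowHistory(context.Background(), "receiver-workflow-id", "",
		false, enumspb.HISTORY_EVENT_FILTER_TYPE_ALL_EVENT)
	for iter.HasNext() {
		event, err := iter.Next()
		if err != nil {
			log.Fatalln(err)
		}
		// Who started this run.
		if a := event.GetWorkflowExecutionStartedEventAttributes(); a != nil {
			fmt.Println("started by identity:", a.GetIdentity())
		}
		// Who sent each signal.
		if a := event.GetWorkflowExecutionSignaledEventAttributes(); a != nil {
			fmt.Println("signal", a.GetSignalName(), "sent by identity:", a.GetIdentity())
		}
	}
}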

We terminated the sender workflow, which was generating the events. However, the sender workflow generated signals in the millions using the SignalWithStartWorkflow API, and this is causing the receiver workflow to start again (even if we terminate it manually). Is there a way to clear the pending signals to the receiver workflow? The source workflow is not starting any more on its own.

This is the receiver workflow history event count in the current execution

[Continued]

We terminated the receiver workflow manually many times.

Since you are using the SignalWithStartWorkflow API, it will start a new workflow execution if one is not running. So terminating the receiver workflow execution will not have any effect, as the next call to SignalWithStart will spin up a new workflow execution. You need to chase down the source which is sending all those signals and clear it. Can you provide more information about the source that is generating those signals?
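
For reference, this is roughly the shape of a SignalWithStart call in the Go SDK (all names below are hypothetical). Each call either delivers the signal to the open run or starts a fresh run first, which is why terminating the receiver only helps until the next call arrives:

package sender

import (
	"context"

	"go.temporal.io/sdk/client"
)

// signalOrStartReceiver shows what the sender side effectively does on every event.
// If "receiver-workflow-id" has no open run, a new run is started before the
// signal is delivered, so a manual terminate is undone by the very next call.
func signalOrStartReceiver(ctx context.Context, c client.Client, payload string) error {
	_, err := c.SignalWithStartWorkflow(
		ctx,
		"receiver-workflow-id", // hypothetical workflow ID
		"receiver-signal",      // hypothetical signal name
		payload,                // signal argument
		client.StartWorkflowOptions{TaskQueue: "receiver-task-queue"},
		"ReceiverWorkflow", // workflow type to start if no run is open
	)
	return err
}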

The source has a bug that was generating extra events; we found the issue and fixed it. But the existing workflows running on the system are still making a lot of DB calls, causing SLA degradation in the Temporal service (it is delaying the execution of other workflows).
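
With the source bug fixed, one possible cleanup step (a minimal Go SDK sketch, with a hypothetical workflow ID and reason) is to terminate the current receiver run once more; since no further SignalWithStart calls should arrive, it should stay terminated:

package example

import (
	"context"

	"go.temporal.io/sdk/client"
)

// terminateReceiver terminates the currently open run of the receiver workflow.
// An empty run ID targets the latest run.
func terminateReceiver(ctx context.Context, c client.Client) error {
	return c.TerminateWorkflow(ctx, "receiver-workflow-id", "",
		"terminating runaway signal loop after fixing the sender")
}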