Workflow history size / count exceeds limit

Ramprasad_Indarapu · April 24, 2021, 2:37pm

Hi

We are doing some performance testing, where we have a main flow calling subflow(as childflow). The subflow execution is called in the loop, when we increased loop count from 500 to 600 it is failing.
The workflow is terminated with the message “ Workflow history size / count exceeds limit”. Attached the screenshot.

Below is from the event history

4239
WorkflowExecutionTerminated
4m 20s
reason
Workflow history size / count exceeds limit.
identity
history-service

maxim · April 24, 2021, 6:17pm

Ramprasad_Indarapu · April 25, 2021, 11:36am

Thanks. @maxim
And let me explain my use case.

The payload for the workflow is below

main:
foreach

call subflow1 100 times

subflow1:
steps:
step1
step2

call subflow2
step3

subflow2:
steps:
step1

Current implementation
Payload is sent to main flow execution
When the subflow is invoked the complete payload is again sent to subflow, as the nested subflows are possible
This is the main reason I guess for the history size increase.

Thinking of the below solution. Can you please validate the following solution?

Solution1
Maintain the complete payload in the main workflow.
Implement a query method in the main flow to return by taking a subflow name as an argument and return subflow payload and cache it as one of the flow instance variable
When a child flow requires subflow payload, use the query method of the main workflow to get the same.
We have a case where the subflow1 can be executed in parallel(using child flows) where the work is distributed say 10 child flows each handles 10 iterations). I am using the promise to complete the child flow as below.

JiffyWorkflow workflow = WorkFlowUtils.buildChildWorkFlowObject();
executions.add(Async.function(workflow::execute, flowInput));
Promise.allOf(executions).get();

One question I have is “will this work in case of child flow crashes also”?

maxim · April 25, 2021, 5:07pm

If the payload is large we recommend storing it in some external store (like S3) and pass only references to it through workflow and activity arguments.

One question I have is “will this work in case of child flow crashes also”?

I’m not sure I correctly understand the question. What do you mean by “child flow crashes”? If the worker process that hosts the child workflow crashes then the recovery will be seamless and nothing should be done. If the child workflow throws an exception the parent workflow will need to handle its failure.

Ramprasad_Indarapu · April 25, 2021, 5:13pm

Thanks @maxim . The payload is not more than 2mb, but as we are passing to childflows, which are executing in loop, the history size is increasingly. Just to confirm my question is is it better to pass only reference to childflows and get the subflow/childflow payload from the parentflow by query method.

maxim · April 25, 2021, 6:25pm

The query approach works in your case.

I filed an issue to see if there is a way to optimize this use case.

Topic		Replies	Views
Workflow flowload history size issue Community Support history	1	880	April 25, 2021
Querying active workflows by their history size Community Support history , workflow-config	5	2911	June 16, 2023
The workflow history in the UI throws error with max GRPC message limit Community Support java-sdk	6	1174	March 12, 2022
Clarification of "Transaction size exceeds limit" Workflow Error Community Support python-sdk , failures	1	114	November 4, 2024
How can I catch the exception related to exceeding the workflow history size or count limit in Java? Community Support java-sdk	3	353	June 24, 2024

Workflow history size / count exceeds limit

Related topics