Design for coordinator workflow with potentially large history

maxim · August 31, 2021, 3:12am

Also, what are the implications of having more than 50000 history events? I didn’t see it being configurable, so there must be some restrictions on your side.

The recovery time of a workflow gets longer and longer with history size. In some situations, the frontends can run out of memory if history is too large (which should eventually be fixed).

I recommend the following workaround. Do not rely on the childFuture to get notified about the child’s completion (until the issue #680 is implemented) as it doesn’t play nice with continue-as-new. Instead, the children can use a signal to report its completion to the parent by the parent WorkflowID. In this case, the parent can call continue-as-new and still wait for all its children’s completion in the form of signals. Make sure to start the children asynchronously for them to continue executing after the parent’s continue-as-new call.

Topic		Replies	Views
Workflow Performance with Java SDK Community Support java-sdk	1	730	February 20, 2023
Possibility of scalable buffered counter Community Support go-sdk , design	2	1124	July 13, 2020
Continue as new when reaching 5 000 events limit Community Support retries , web-ui	9	3358	July 6, 2022
Question about Temporal worker starvation + scalability Community Support java-sdk	4	2169	January 26, 2022
Human dependent long running Workflows Community Support general-impl	11	5165	March 2, 2023

Design for coordinator workflow with potentially large history

Related topics