Misuse (?) of activity retry logic

sdonovan · April 6, 2023, 12:27am

Greetings. There are many AWS action “pairs” where you issue a command and then poll for the response to detect completion (yes, yes, there are alternatives to polling, you could hook up to AWS cloud events and use the activity doNotComplete capability but ignore that for now).

Examples include:

EC2/EBS: AttachVolume, DescribeVolumes (e.g. until those volumes becomes Attached).
SSM: SendCommand, GetCommandInvocation (until the command becomes Completed).

Ideally, you want to poll for the response every N seconds, because you really don’t want to have a long-running activity sitting a the worker stack for no reason. The ideal pattern is setting ActivityOptions with RetryOptions. In the activity, you check three states:

Completion, you just return.
Not completed, i.e. waiting – throw newFailure(..).
Will never complete (e.g. failure, cancelled, etc.), throw newNonRetryableFailure(..).

Except, in the UI. Whenever it sees a failed activity, it gets plastered on the workflow.

So. it is possible to set a failure exception to indicate “it’s not really a failure, we just want to retry”?

I realize it’s a slight misuse of exception handling, but it works well otherwise.

Obviously, the alternative is to do the logic myself in the workflow using a timer, but then I have code the retry logic too.

maxim · April 6, 2023, 2:31pm

You are absolutely correct that this is not the best user experience. Here is the issue to get this fixed.

Topic		Replies	Views
Activity retries without exception Community Support java-sdk , activity , best-practices	12	3677	August 16, 2023
Re-execute activity till specific status is reached Community Support	2	349	September 21, 2023
How to rerun an activity when its async output produces an error downstream Community Support java-sdk	4	762	August 10, 2023
Child workflow call on failed activity Community Support retries , activity	4	1097	October 27, 2020
Recommendation on Activity definition Community Support java-sdk	4	23	December 10, 2024

Misuse (?) of activity retry logic

Related topics