We are seeing intermittent errors in activity execution. From the logs, we can see that the activity execution completes, but something breaks while sending the completion event to the Temporal server and the exception below is thrown:
WARN i.i.internal.worker.ActivityWorker - Failure during reporting of activity result to the server
These exceptions are very generic and don't shed any light on what caused them.
Once the StartToClose timeout is reached, the workflow retries and the activity call then succeeds. Since the first retry works, we don't suspect a code issue here.
Could you please help troubleshoot this issue? Please suggest if we can tweak any settings.
The StartToClose timeout is set to 2 hours because this service sometimes takes longer to respond in the worst case. In this particular case, the activity completed within seconds, but something failed while sending the completion event to Temporal. As a result, Temporal didn't receive any event and waited 2 hours before initiating the retry, which is expected given the 2-hour StartToClose timeout.
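To make the timeline concrete, here is a minimal sketch in plain Java (no Temporal SDK calls) of how long the server sits idle when the completion report is lost. The class and method names are hypothetical, the 5-second runtime is an assumed stand-in for "within seconds", and any retry backoff interval is ignored.

```java
import java.time.Duration;

public class ReportFailureTimeline {
    // Time the server keeps waiting after the activity has actually finished,
    // when the completion report never arrives. Hypothetical helper for illustration.
    static Duration idleWait(Duration startToClose, Duration actualRuntime) {
        Duration remaining = startToClose.minus(actualRuntime);
        // If the activity ran past the timeout, there is no idle wait.
        return remaining.isNegative() ? Duration.ZERO : remaining;
    }

    public static void main(String[] args) {
        Duration startToClose = Duration.ofHours(2);     // value from this post
        Duration actualRuntime = Duration.ofSeconds(5);  // assumed: "within seconds"
        System.out.println("Idle wait before retry: "
                + idleWait(startToClose, actualRuntime));
    }
}
```

With these numbers, almost the entire 2-hour window is spent waiting for a result that was already computed, which is why we want to understand the reporting failure itself.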
The question is what went wrong while sending the activity completion event, since the logs are not helpful for diagnosing the issue.