Long running activity with auto_heartbeater failing

inishchith · September 18, 2024, 12:53am

Hi,

I am running an activity where a SQL query is run - the execution can take from a few seconds to several hours. For this reason I have kept my start_to_close start_to_close_timeout to be a large value, while having a heartbeat_timeout.

I am using the auto_heartbeater to avoid long wait times in case of worker crashes or any other issues, where there would be a heartbeat timeout and the activity would be retried.

What I am observing is that - In case of a long running query (say 10 minutes) - the heartbeat is produced, but then halfway through I am seeing this error and the activity is retried.

2024-09-18T00:45:49.402378Z  WARN temporal_sdk_core::worker::activities::activity_heartbeat_manager: Error when recording heartbeat: Status { code: Cancelled, message: "operation was canceled", source: Some(tonic::transport::Error(Transport, hyper::Error(Canceled, "connection closed"))) }

I am unable to catch what could cause this. could someone please help understand this better.

Thank you!

inishchith · September 22, 2024, 8:15pm

Hi @maxim, any thoughts on how can I proceed further on this or where I redirect this to find some help?

maxim · September 22, 2024, 8:30pm

I don’t know Python. My guess is that something in your python code blocks heartbeating.

Chad_Retz · September 23, 2024, 12:08pm

Which SDK version? There was an issue recently fixed in 1.7.1 that looks similar.

inishchith · September 23, 2024, 12:35pm

Hi @Chad_Retz - thanks for sharing reference to the issue.

I am currently on 1.7.0 - I’ll review the issue and check with the new version if I am still seeing this.

Topic		Replies	Views
Observing issues with heartbeat in case of processing tasks Community Support python-sdk , general-impl , activity , heartbeat , best-practices	1	22	February 28, 2025
Activity timeout questions Community Support	3	1360	November 9, 2020
Python SDK, workflow terminated but activity still running Community Support	3	939	August 18, 2023
Activity HeartBeat Issue Community Support go-sdk , activity , heartbeat	11	2187	January 15, 2021
Best way to support a long-running Activity without running into an Activity Timeout Error? Community Support timeout , workflow-options	1	1170	July 23, 2021

Long running activity with auto_heartbeater failing

Related topics