Connection failure

mohits · October 15, 2021, 2:34pm

Hi,
I have seen an issue while using temporal is that the worker is unable to connect to the temporal server and all the workflows got stuck in queue with heartbeat failing in the workflow.
Can i know if there is a solution like worker automatic reconnection or something like worker heartbeat to know the connection health.

tihomir · October 15, 2021, 4:50pm

For health checks, SDKs perform a gRPC health check when you create a client.
You can also run a health check via tctl with:

tctl cluster health

and can health checks for different services with code, see for example:
https://github.com/temporalio/temporal/blob/master/tools/cli/clusterCommands.go#L39
where fullWorkflowServiceName
can be changed depending on which service you want to health check:

temporal.api.workflowservice.v1.WorkflowService
temporal.api.workflowservice.v1.HistoryService
temporal.api.workflowservice.v1.MatchingService

You can also look at some other good posts with more health check information:

Hope this helps.

Topic		Replies	Views
Helath Check failure Community Support go-sdk	6	1395	February 22, 2021
Temporal Server Logs location & Health check context url Community Support go-sdk	6	3730	September 14, 2021
Temporal Client/Worker health-check Community Support java-sdk	8	3838	January 21, 2021
Temporal Server Health Check Community Support production	6	7967	January 13, 2024
Worker Service Pod Crashed Community Support kubernetes	13	2623	September 15, 2021

Connection failure

Related topics