I have seen an issue while using temporal is that the worker is unable to connect to the temporal server and all the workflows got stuck in queue with heartbeat failing in the workflow.
Can i know if there is a solution like worker automatic reconnection or something like worker heartbeat to know the connection health.
For health checks, SDKs perform a gRPC health check when you create a client.
You can also run a health check via tctl with:
tctl cluster health
and can health checks for different services with code, see for example:
can be changed depending on which service you want to health check:
temporal.api.workflowservice.v1.WorkflowService temporal.api.workflowservice.v1.HistoryService temporal.api.workflowservice.v1.MatchingService
You can also look at some other good posts with more health check information:
Hope this helps.