Worker trying to connect to old frontend?

RichRM · January 10, 2023, 12:06am

Our worker services are struggling to start with errors like:

{"level":"fatal","ts":"2023-01-09T23:45:59.585Z","msg":"error creating sdk client","service":"worker","error":"failed reaching server: last connection error: connection error: desc = \"transport: Error while dialing dial tcp 10.0.11.207:7233: connect: connection refused\"","logging-call-

That looks like an old IP address of a frontend service. The actual running frontend is on a different IP address.

Running the following tcl (I think) indicates that 356 services are registered (only 3 are running, heh). I think these are old registrations from where I was trial-and-erroring getting the services stood up:

tctl admin membership list_db | grep role | wc -l
     356

Is there some way to “purge” the list of registered servers, so that the worker can connect to one that’s actually alive? Or do I need to wait for them to timeout?

Many thanks

RichRM · January 10, 2023, 12:17am

Although not 100% on that theory, because I also see this in the logs, which is the correct IP of the frontend

{"level":"info","ts":"2023-01-10T00:16:02.876Z","msg":"Current reachable members","service":"worker","component":"service-resolver","service":"frontend","addresses":["10.0.11.215:7233"],"logging-call-at":"rpServiceResolver.go:283"}

Not sure what its missing as matcher/history connect OK.

I think it’s trying to connect to a frontend service on its own IP (even tho its not running a frontend) instead of using the one that’s already running

RichRM · January 10, 2023, 12:47am

nvm, setting the PUBLIC_FRONTEND_ADDRESS var helped steer this.

The only strange thing now is the worker says this

"service":"worker","component":"worker","address":"10.0.12.188:7239",

But refuses all TCP connections on port 7239

Topic		Replies	Views
[SERVER] Set FrontEnd IP on Worker Service Community Support server	2	1034	July 6, 2021
Temporal worker not able to connect to internal frontend Community Support worker	3	472	March 5, 2025
Temporal worker failing to connect to frontend in 1.18.2 post removing publicClient from config Community Support server	5	2061	October 25, 2022
Temporal-worker pod fails to connect to frontend after changing node IP Server Deployment helm , kubernetes	2	129	April 3, 2025
Error starting temporal-sys-tq-scanner-workflow workflow Community Support mysql	6	2042	August 13, 2020

Worker trying to connect to old frontend?

Related topics