Struggling to deploy self-hosted Temporal on Amazon EKS

Brendan_Falk · May 5, 2025, 3:05am

Hi there - we are trying to deploy self-hosted Temporal on Amazon EKS and are really struggling.

We keep getting hit with error after error. Some examples:

When we set the password through AWS Secrets Manager, it says it is unable to read the secret name
We are unable to deploy the temporal schema job
We are unable to deploy the temporal frontend pod (and the logs aren’t showing anything)

We just continue to get this error in the developer console of the temporal web frontend

last connection error: connection error: desc = "transport: Error while dialing: dial tcp 172.20.41.63:7233: connect: connection refused"

I know this is an incredibly broad issue, but does anyone in the community have any advice? Any best practice guides, common bugs, or “gotchas” we should be aware of? Alternatively, would someone on the Temporal team be open to getting on a debugging call?

grant · May 5, 2025, 3:43am

Currently running into this error with Job/temporal-schema, it seems that it is not able to connect to the db.

2025-05-05T03:38:47.790Z	ERROR	sql handle: unable to refresh database connection pool	{"error": "unable to connect to DB, tried default DB names: postgres,defaultdb, errors: [dial tcp 10.0.x.x:5432: connect: connection timed out dial tcp 10.0.x.x:5432: connect: connection timed out]", "logging-call-at": "/home/runner/work/docker-builds/docker-builds/temporal/common/persistence/sql/sqlplugin/db_handle.go:128"}

hferentschik · May 5, 2025, 7:58am

Hi,

I think you need to provide more context to get useful advice. How are you trying to install Temporal? Through its Helm charts or some custom method?

The errors seem to indicate connection problems. Either services are not up or not allowing access. The Temporal UI needs to access the Temporal frontend. If the latter is not running, you won’t get much useful information, if any at all.

If the schema job has not run yet, the frontend would not be able to come up, so no surprise there.
I’d take a step back and only move on once the previous step has succeeded. So to start with, I’d look at the DB and the schema job. Can you access the DB via the CLI or some other means? What is the error when running the schema job? I am not sure if and how AWS Secrets Manager is supported, but assuming that’s the problem, you could try running with explicit username/password secrets to narrow it down.

–Hardy
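
As a rough illustration of the "check the DB first" step: the two errors in this thread differ in a useful way. The schema job reports "connection timed out" on port 5432, while the web UI reports "connection refused" on port 7233. A timeout usually means packets are being dropped somewhere (on AWS, often a security group or routing rule), whereas a refusal usually means the address is reachable but nothing is listening on that port. The sketch below is a minimal Python probe that tells these cases apart. The function name `probe_tcp` and the host names in the usage comment are hypothetical placeholders, not anything from the Temporal Helm charts, and it assumes you can run Python from a pod in the same cluster and namespace.

```python
import socket


def probe_tcp(host: str, port: int, timeout: float = 3.0) -> str:
    """Try one TCP connection and classify the outcome.

    Returns "open", "refused", "timeout", or "error: <details>".
    """
    try:
        # create_connection resolves the host name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # The address answered, but nothing accepts connections on this port.
        return "refused"
    except (socket.timeout, TimeoutError):
        # No answer within the timeout: traffic is probably dropped on the way.
        return "timeout"
    except OSError as exc:
        # DNS failures, unreachable networks, and similar problems.
        return f"error: {exc}"


# Example usage (placeholder host names, replace with your own):
#   print(probe_tcp("my-db.example.internal", 5432))  # database endpoint
#   print(probe_tcp("temporal-frontend", 7233))       # Temporal frontend service
```

If the database probe returns "timeout" from inside the cluster, the network path to the database is worth checking before looking at credentials or the schema job itself. A "refused" result on 7233 would be consistent with the frontend simply not running yet.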
