Temporal Performance

Prats · January 29, 2024, 10:52am

Hi,

Lately we have been doing PERF tests. The same workflow runs which used to take very short time earlier, now they are taking quite a bit of time.

On further investigation noticed these error logs below.

Tasks are just left with “ActivityTaskScheduled”, I could see from the logs that activity processing starts but then lot of grpc errors (eg: Exception in oDataRead() as shown in log below), ultimately it retries and times out.

Backend kubernetes temporal history pod logs.

in temporal matching pod

So is this because of resource contention w.r.t temporal cluster? I am not so sure, because in kubernetes I could see all the temporal pods running and healthy. No restarts or degradations.

Could you please let me know what might be the issue here, do I have to tune any config?
If not, how to debug and pinpoint what exactly is causing this.

Prats · January 31, 2024, 6:00am

Hi Team,

Any insights here.

Topic		Replies	Views
Java Grpc high latencies Community Support java-sdk	8	1028	March 1, 2022
Bad performance when deployed in Kubernetes - How to diagnose bottleneck? Community Support java-sdk , helm	2	1657	March 25, 2022
Temporal performance issues Community Support java-sdk , performance , worker , kubernetes	1	1837	April 26, 2023
Potential deadlock detected Community Support java-sdk	4	4228	December 2, 2022
Restarted and New Pods Not Picking Up Old Workflows from Task Queue in K8s Cluster Community Support general-impl , typescript-sdk	11	820	September 5, 2023

Temporal Performance

Related topics