How to best handle mysterious context deadline exceeded/502 errors

Tristan_Fletcher · August 10, 2021, 5:50pm

Thank you for the reply.

I have access to all server logs. The only suspicious events in the same timeframe are “history size exceeds warn limit” but they are on a different workflow in a different namespace.

I do not think these errors are caused by requests running past the deadline in the passed context. I have been setting high deadlines (e.g. 30-60s) and validating them. The calls still come back with deadline exceeded 10s after I initiate the request, regardless of the 30-60s deadline set in the passed context.

I am usually able to reproduce this when I blast the system with 1000 workflow starts within a few seconds. I do see spikes in request volume 1-2 minutes ahead of a failure (from about 5k requests/20s to 9k requests/20s), but not at the same time as a failure. There is no visible impact on CPU or memory in the window.

Perhaps I should be increasing some limits to allow it to consume more resources? Are there tuning parameters I can adjust to absorb the request volume spikes better?
