Greetings,
We enabled archiving earlier this week on a high volume namespace. We initially experienced high levels of failure on inline archiving and we were able to resolve that by configuring timeouts per this post: Weird Behaviour with Archival - Requires HttpGet before archiving.
However we are still seeing 100% failure of all the signal based archiving, all logging “failed to send signal to archival system workflow” with an error of “context deadline exceeded”.
I found this topic: SignalWithStartWorkflow behaviour with context.WithTimeout - #2 by tihomir where this bug was opened (Archival attempted for already archived workflow · Issue #2464 · temporalio/temporal · GitHub) which this pr (Handle history not found error when archiving history by yycptt · Pull Request #2465 · temporalio/temporal · GitHub) presumably resolves. However that PR doesn’t actually address the timeout issue by making that timeout configurable, it only resolves how attempts to subsequently delete and already archived workflow are handled.
I have established that connectivity is possible between the history and the frontend services so I don’t think it is networking. Is there any other explanation for why we might get 100% context deadline exceeded when trying to archive workflows?
We are on temporal version 1.13.3
THank you,
{
"content": {
"timestamp": "2022-03-26T00:07:36.821Z",
"host": "i-06416f2c0019966d2",
"service": "history.temporal.infra",
"attributes": {
"service": "history",
"level": "error",
"service_name": "history.temporal.infra",
"wf-id": "temporal-archival-867",
"meta": {
"log_processor": "global"
},
"msg": "failed to send signal to archival system workflow",
"stacktrace": "go.temporal.io/server/common/log.(*zapLogger).Error\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/common/log/zap_logger.go:142\ngo.temporal.io/server/service/worker/archiver.(*client).sendArchiveSignal\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/worker/archiver/client.go:297\ngo.temporal.io/server/service/worker/archiver.(*client).Archive\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/worker/archiver/client.go:190\ngo.temporal.io/server/service/history.(*timerQueueTaskExecutorBase).archiveWorkflow\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/timerQueueTaskExecutorBase.go:210\ngo.temporal.io/server/service/history.(*timerQueueTaskExecutorBase).executeDeleteHistoryEventTask\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/timerQueueTaskExecutorBase.go:129\ngo.temporal.io/server/service/history.(*timerQueueActiveTaskExecutor).execute\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/timerQueueActiveTaskExecutor.go:109\ngo.temporal.io/server/service/history.(*timerQueueActiveProcessorImpl).process\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/timerQueueActiveProcessor.go:307\ngo.temporal.io/server/service/history.(*taskProcessor).processTaskOnce\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/taskProcessor.go:269\ngo.temporal.io/server/service/history.(*taskProcessor).processTaskAndAck.func1\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/taskProcessor.go:221\ngo.temporal.io/server/common/backoff.Retry.func1\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/common/backoff/retry.go:104\ngo.temporal.io/server/common/backoff.RetryContext\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/common/backoff/retry.go:125\ngo.temporal.io/server/common/backoff.Retry\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/common/backoff/retry.go:105\ngo.temporal.io/server/service/history.(*taskProcessor).processTaskAndAck\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/taskProcessor.go:248\ngo.temporal.io/server/service/history.(*taskProcessor).taskWorker\n\t/go/pkg/mod/go.temporal.io/server@v1.13.3/service/history/taskProcessor.go:171",
"archival-request-namespace-id": "f15ed5e9-b7e6-49ea-9b11-707424096c00",
"archival-request-run-id": "e6c1fb4f-0ab9-45cc-866e-711bffacb352",
"archival-request-namespace": "payments_prod",
"address": "172.30.112.211:7234",
"logging-call-at": "client.go:297",
"shard-id": 4085,
"ts": "2022-03-26T00:07:36.819Z",
"shard-item": "0xc006982680",
"error": "context deadline exceeded",
"archival-request-workflow-id": "money-allocation-alloc_od_6497773539130607",
"archival-archive-attempted-inline": false,
"archival-caller-service-name": "history"
}
}
}