Detecting business errors by listing for workflows with high retry attempts


I want to be alerted when there is a workflow with activities that have retried for more than N attempts, so I can investigate if it is a bug in our business logic.

Is there a quick way to list such workflows?

If not, what is the recommended way for such scenarios?

Would be curious to know how others do it too.

We’ve been using metrics on error logs to monitor. But, am aware there are now interceptors in Go SDK for activity errors, see Workflow / activity middleware