Metrics to monitor regarding activity failures

I have activities that will retry for up to a day. I’d like to be alerted whenever the failure rate for these activities gets too high, which likely points to some underlying code or data issue. My workers are running version 1.8.6 of the Temporal TypeScript SDK and are emitting SDK worker metrics.

Are the right metrics to use here temporal_activity_execution_failed / temporal_activity_task_received, to get the percentage of activities failing? The Temporal SDK metrics reference in the Temporal documentation lists only the first metric, not the second. sdk-core/core/src/telemetry/metrics.rs on master in the temporalio/sdk-core GitHub repository seems to indicate that my metric names are right, but when I tried them, the results didn’t quite look right, so I want to check whether I’m misunderstanding something. Thank you!
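
To make the ratio I have in mind concrete, here is a minimal sketch of the calculation. It assumes both metrics are cumulative counters sampled at two points in time; the `CounterSample` type and the `failurePercent` helper are hypothetical names for illustration and are not part of the Temporal SDK.

```typescript
// Hypothetical shape: cumulative counter totals scraped at one point in time.
interface CounterSample {
  activityExecutionFailed: number; // value of temporal_activity_execution_failed
  activityTaskReceived: number; // value of temporal_activity_task_received
}

// Failure percentage over the window between two samples.
// Returns null when no activity tasks were received in the window,
// so an alert rule does not divide by zero.
function failurePercent(
  earlier: CounterSample,
  later: CounterSample
): number | null {
  const failed = later.activityExecutionFailed - earlier.activityExecutionFailed;
  const received = later.activityTaskReceived - earlier.activityTaskReceived;
  if (received <= 0) {
    return null;
  }
  return (failed / received) * 100;
}
```

If the counters are incremented per attempt rather than per activity (an assumption I have not confirmed), a single activity that retries for a day would contribute many failures and many received tasks to this ratio.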