Hi wonderful Temporal folks. I’m interested to know if it makes sense for Temporal to provide metrics on each task queue (or workflow). I looked at defs.go based this recommendation. At a glance, it looks like Temporal provides overall server health, but not on task queue/workflow specific metrics.
I’m a novice at the operational side of things so I don’t know if this makes sense to be part of Temporal server. I do find myself as a developer frequently implementing metrics on:
- Processing lag (delta between message process vs creation)
- Error rates
- Success rate
Would love to hear your thoughts on the subject.