Is there a way to categorize the metrics available from temporal server and client?

Hello Experts,

I am new to Temporal. In my POC, I’ve deployed Temporal cluster on my Kubernetes cluster using helm charts from https://github.com/temporalio/helm-charts.git. Also, I have enabled scrape endpoints from my client code(both starter and worker in java),

Is there a way to categorize between metrics generated from Temporal server and Client? Also, is there a list of all the available metrics along with a brief description to indicate what the metric indicates?

Regards,
Ruchir

Is there a way to categorize between metrics generated from Temporal server and Client?

SDK metrics have by default the “temporal_” prefix for both service and worker metrics.

is there a list of all the available metrics along with a brief description to indicate what the metric indicates?

SDK metrics: Java, Go. Also see docs page on SDK metrics. Here are all server metrics, as well as a docs page.

1 Like

SDK metrics have by default the “temporal_” prefix for both service and worker metrics.

@tihomir but in some cases, the temporal prefix is not present, for eg: workflow_success, workflow_failed, service_requests. Hence, asked for a list.

Will definitely go through the links you have provided. Thank you so much for your quick response!

@tihomir Is there any one liner description for these metrics? Need to understand which of them might be useful for us to monitor as the list is huge. Thanks.

That’s something that we should be adding to our docs soon.
To get a feel of which ones might be useful for you (including grafana queries) see this sdk and server dashboard defs.
This dashboard is also set up out of the box in the background checks learning path demo.

1 Like

Hi @tihomir,

Thanks for letting us know that this is coming to the docs soon. Several of my peers have been chatting about this and hunting for metrics meaning in the code, etc. I think having a simple page listing all metrics and their meanings (and units) on the docs would be WONDERFUL. Please take this as a gentle “bump” that many Temporal users would love this documentation! (We can find what we need by hunting in the code today, but it is always a hunt)

Thanks,
Dan

3 Likes

@dpincas totally agree, team is looking into that and making sure the SDKs are aligned as well.
Also looking into providing better out-of-box dashboards for SDK and server metrics as well.
Will make sure to post updates here.

1 Like