Latency Metric that excludes sleep or idle time cases

peeyushchawla · February 24, 2023, 7:28am

Hi,
We’re looking forward to measure latency in a manner that reflects only the area where a client sends a request and its processing is started or it is being put in task queue. I suppose, anything of sort where end to end latency would be calculated wouldn’t be useful for us since it would also reflect idle time in cases such as long running workflows etcetera.

We used temporal_workflow_task_schedule_to_start_latency_seconds_bucket and service_latency but soon realized that it would not work since there were instances where some requests would fall under infinity bucket in case of temporal_workflow_task_schedule_to_start_latency_seconds_bucket.

My next bet would be on activity_schedule_to_start_latency. Any thoughts or other suggestions ?

Thanks,
PC

tihomir · February 25, 2023, 6:23am

We’re looking forward to measure latency in a manner that reflects only the area where a client sends a request and its processing is started or it is being put in task queue.

You could look at service latencies (server metrics):

histogram_quantile(0.95, sum(rate(service_latency_bucket{service_type="frontend"}[5m])) by (operation, le))

and can add additional filter for specific operations such as “StartWorkflowExecution” and worker poll specific “RespondActivityTaskCompleted”, “RespondWorkflowTaskCompleted”.

SDK schedule to start latencies measure the time from when a workflow/activity task is placed on your matching host task queue until one of your workers successfully polls it to process.

peeyushchawla · February 27, 2023, 7:08am

Is there any latency metric which would just get the time
From - client starts
and
To - its placed to task queue.

Thanks

tihomir · February 27, 2023, 5:09pm

On the server metrics side, service_latency_bucket latencies are measured from when the Temporal frontend service receives the client request. Latencies of client requests to when frontend service receives this are not included as Temporal server does not control that part.

For communication latencies between your SDK apis and service, SDK metrics expose:
temporal_request_failure
temporal_request_latency
temporal_long_request_failure
temporal_long_request_latency

if that helps.

Topic		Replies	Views
Metric to measure async workflow invocation time Community Support java-sdk , general-impl	4	548	June 21, 2022
Temporal Server Metrics Community Support	4	714	September 19, 2023
Question about getting some metrics using the SDK Community Support java-sdk	10	1730	September 27, 2022
Temporal metric for task queue size/backlog, or schedule to start latency for task queue Community Support metrics	3	3019	December 7, 2021
Matching service GetTaskQueue latency metrics is very large Community Support metrics	1	687	December 11, 2023

Latency Metric that excludes sleep or idle time cases

Related topics