What are the best metrics to autoscale each cluster service on?

Is there a list anywhere that shows which metrics we should be watching/using to scale each individual service?

TIA!

Hi, don’t think we have such docs yet, would start with metrics mentioned in post here. For service errors and latencies you can change the service_name if needed.

1 Like

What about these, are they also worth scaling the individual cluster services on?

Sorry, I get confused between “Temporal workers” and the “worker” service.

Yeah those are good as well.

I get confused between “Temporal workers” and the “worker” service.

Workers are worker processes that you deploy and that execute your code. Worker service is one of Temporal server services (frontend, history, matching, worker). It’s mainly responsible to run background system workflow on the temporal-system namespace.

1 Like

Thanks, I think I am slowly getting it :joy:

Which cluster service do those “worker processes” run inside? Or does it depend a lot on the exact code?

Your workers independent of the service (temporal server). You can deploy them any way you like. Workers poll service (specifically frontend service) for workflow and activity tasks.
Maybe see this video that might help with understanding.