We are testing a scaling strategy for Temporal workers, using metrics such as temporal_activity_schedule_to_start_latency and workflow_task_schedule_to_start_latency as the scaling criteria. We can see that both scale-up and scale-down are triggered by these metrics.
However, when scaling down, we sometimes see activities get killed if they do not finish within the cooldown period. Does Temporal automatically detect that those activities failed and reschedule them on other worker nodes?
Thank you