How to manage a number of different workers running on different task queues? Is a CRD a suitable solution?

Background

  1. To control concurrency, lots of algorithm services each need to run on a separate task queue
  2. I’ve already implemented our own DSL to support the fusion of algorithmic flows
  3. In the go-sdk, a worker can only run on one task queue
  4. All services run on a k8s cluster
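To make point 3 concrete: since a Go worker polls exactly one task queue, isolating N algorithm services means starting N workers. Below is a minimal stdlib model of that fan-out; the `Worker` type and queue names are illustrative stand-ins for what `go.temporal.io/sdk/worker` provides, not the SDK itself.

```go
package main

import "fmt"

// Worker is a stand-in for an SDK worker: it polls exactly one task queue.
type Worker struct {
	TaskQueue string
	Handler   func(input string) string
}

// startWorkers builds one worker per task queue — the fan-out that
// otherwise means one configuration and deployment manifest per service.
func startWorkers(queues []string) map[string]*Worker {
	workers := map[string]*Worker{}
	for _, q := range queues {
		q := q // capture loop variable for the closure
		workers[q] = &Worker{
			TaskQueue: q,
			Handler:   func(in string) string { return q + " handled " + in },
		}
	}
	return workers
}

func main() {
	workers := startWorkers([]string{"algo0-queue", "algo1-queue"})
	fmt.Println(workers["algo0-queue"].Handler("req-1")) // algo0-queue handled req-1
	fmt.Println(len(workers), "workers running")         // 2 workers running
}
```

Each entry in the map corresponds to a worker process you would otherwise have to describe in its own Kubernetes manifest, which is exactly the management burden described below.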

Difficulty

  1. One algorithm service per task queue means starting lots of different workers. It’s too much trouble to manage a dozen configurations and deployment manifests
  2. Similar to the above point, many predefined DSL workflows also need to run on their own unique task queues

My design

Implement our own CRD controller in k8s.
Combine the workflow-related configuration and the DSL into a CRD.
Now, after we submit a DSL or activity CRD, the controller automatically creates a worker deployment running on the assigned task queue.

CRD template:

name: TestWorkflow
workflowOptions:
  taskqueue: test
  concurrency: 3
  replicas: 3
inputs:
  - name: userid
template:
  sequence:
    - activity:
        name: algo0
    - activity:
        name: algo1
        arguments:
          - userid
        result: output1
    - parallel:
        - activity:
            name: algo1
            arguments:
              - output1
            result: output2
        - activity:
            expect:
              op: and
              val:
                - key: output1
                  value: a
                - key: output2
                  value: b
            name: algo1
            arguments:
              - output1
              - output2

Question

  1. Does anyone have a similar problem? How did you solve it?
  2. Is a CRD a suitable solution? Do you have any suggestions?
  1. To control concurrency, lots of algorithm services each need to run on a separate task queue

Can you give more info on your use case and concurrency needs? What is the rate of execution and what are you trying to limit? A single worker can handle many concurrent workflow executions. Just trying to understand your needs/limitations better.

Most of the algorithm services run on GPUs. Take one for example:
a request can take anywhere from 100 milliseconds to 1 minute.
In the old message queue system, the concurrency was limited to 5.
If all services shared the same task queue, I couldn’t limit the concurrency of each service separately.

If you need to guarantee concurrency then use a single task queue.

What was the original reason for using multiple task queues in your case?

The original reason is to maximize the use of these GPU algorithm services.

I don’t know how to get there using a single task queue.

Would you provide more context? I cannot make any recommendation without understanding what “maximize the use of these GPU algorithm services” means.