Can I scale request queuing and worker differently?

emailtowalter · February 22, 2021, 4:28am

We are interested in using Temporal as the workflow engine to serve the online customer request. We just start the learning, so bear with me for sec.

Request & reply use case: Our online portal calls bunch of services behind, the front-end will wait until the orchestration complete . then show the result to user. The elapse time is around 5 sec. How to design the system using Temporal for this case? I am worried about the resource being held on the web portal if there are too many request. Is there a way to listen to the completion of the workflow from the requesting side?
Can queuing API and worker API be deployed separately in order to scale them differently? From the architecture diagram (https://docs.temporal.io/docs/server-architecture), it seems API is held together. Our thought is that queuing part should be super fast and reliable, but worker part can be very heavy and slow. By the way, where can I find the info on how to scale Temporal?
security: is there a way to encrypt the data in the database?

Thanks,

maxim · February 22, 2021, 8:30pm

I’m not sure that this is a good use case for Temporal. What is expected behaviour if any of the orchestrated downstream services is not available? Is it OK to exceed 5 second timeframe? Is it OK to return error in this case to the client?
Temporal already has multiple roles that can scale independently. So it is possible to scale frontends (that expose API) and matching (that deals with queueing) and history (that maintains workflow state) separately.
We recommend encrypting data on the client. All SDKs expose DataConverter API that can be used to encrypt all the payloads.

emailtowalter · February 22, 2021, 9:59pm

Hi Max… thanks for the feedback. Your answer makes me to think the request-reply probably should be within a single microservice boundary. behind the scene this service might use message bus/event stream to clone the data for the performance reason.

With that, It would be great to see more architecture guidance/design pattern using temporal.

maxim · February 22, 2021, 10:05pm

I’m not sure we are on the same page. If you are planning to use queue/message bus then Temporal could be a better fit. But as you didn’t answer my question (1) I’m not able to give any concrete recommendations.

emailtowalter · February 22, 2021, 10:21pm

My apology for the confusion. To your question, if downstream service is not available, I would say retry 3 times. if it still fails, then I guess we have to tell customer " the process is not complete somehow , and we will get back to him/her." If it is over 5 sec, the front end has to assume the request failed somewhere, and show the same issue message to customer.

My thought is that from user experience angle, we cannot let user wait for even 5 sec. Once the request is submitted, user should be able to navigate to other pages. Once the workflow is done, we need to notify user, either through web UI, or email. Hence asked if notification exists in Temporal.

Thanks,

maxim · February 22, 2021, 10:28pm

Then Temporal is a good fit for your use case. Start the workflow when the user initiates the process and let the workflow notify the user through an activity about its state changes.

Topic		Replies	Views
Temporal and concurrency Community Support mysql , scaling , performance	4	2287	July 10, 2020
Seeking Guidance on Temporal Application Design, scaling workflow queues as per load Community Support java-sdk	0	37	March 1, 2025
Is it possible to achieve org-based request processing and concurrency control in Temporal? Community Support general-impl	3	271	September 25, 2023
Question about Temporal worker starvation + scalability Community Support java-sdk	4	2185	January 26, 2022
Strategies for Scaling AWS Services Community Support scaling	9	2221	October 1, 2021

Can I scale request queuing and worker differently?

Related topics