Abstract
A workflow management system (WfMS) in the cloud typically operates as a platform as a service that manages customized business processes for a large number of enterprises. In the big data era, the non‐functional guarantees of such systems become critical when they face many users and concurrent requests. Supporting multi‐tenancy while maintaining high availability is nontrivial, because traditional architectures cannot simultaneously satisfy the requirements of data isolation and runtime efficiency. This paper proposes a modularized distributed workflow management system that addresses both multi‐tenancy and high availability in the storage and engine parts of the system. A multiple‐worker‐with‐separate‐schema mechanism is defined to jointly manage tenant data, and a proactive strategy is presented to intelligently dispatch large numbers of concurrent user requests to engine workers. Extensive case studies and experiments in practical scenarios show that our system, deployed on modest machines, supports tens of thousands of tenants, response times on the order of seconds at 10 K concurrent requests, and failure recovery without human intervention for a fail‐stop system node.