AbstractThe allocation of resources is a foremost demanding task in cloud computing. Scholars are yet finding it difficult to allocate appropriate resources to the set of user tasks. Our objective is to provide a platform that optimizes a dynamic resource allocation scheme. Multi‐agent deep reinforcement learning‐based greedy adaptive firefly algorithm (MAD‐GAF) has been proposed herein includes both the resource management and allocation techniques. This chooses the best Quality of Service (QoS) measured host for a group of tasks efficiently and subsequently minimize the task execution time. The proposed cloud brokering architecture comprises a multi‐agent system, the cloud provider and the user. Initially, deep reinforcement learning has been built to recreate the request of cloud customers by forecasting the value of unused resources. Then the recreated customer request is forwarded to the global broker agent, which maps the virtual machine (VM) to the most appropriate cluster of physical machine (PM). The virtual machine monitor (VMM) selects VMs by managing and accessing the physical resources. The global utility agent allocates VMs using the GAF optimization algorithm, which specifies the best QoS measured host to decrease the whole tasks' average response time, thus optimizing resource allocation compared to the current approaches.