Cloud Computing has been a trending technology for a few years supporting computational services over internet. But ever since its adoption, cloud’s consistent challenge is in its dynamic resource allocation. The existing cloud model details the online and offline algorithms used to decide the dynamic resource allocation. The goal is to have a dynamic resource allocation framework that aligns to cloud data management’s objective of maximizing revenue with minimum cost. This encourages both consumers and cloud providers not only with energy-efficient power usage but also high CPU utilization. This article discusses the impediments of migrating to Public Cloud, what is dynamic resource allocation, HPC workloads with complex communication path on cloud platform, and the benefits of bare metal platform for latency-sensitive applications. We shed light on trade- offs (compute balance) between Private and Public Cloud, how existing resources can be leveraged, Random forest (RF) solutions including a study on hybrid cloud computing capacity optimization framework. Understanding RL architecture, problem solving approach, learning structure and Hybrid Cloud Management Architecture framework are also explored. Also given are a few RL implemented gaming examples on how it makes an impact. Lastly, we shall do the comparisons of RL with other Machine Learning (ML) approaches.