Jane Jacob’s concepts of urban vitality and diversity have become prevailing urban planning philosophies in most countries for making cities more livable. Recent changes in demographics and the impacts of COVID-19 have exacerbated the economic and social challenges that cities commonly face, particularly spatiotemporal heterogeneities. Being able to understand these heterogeneities in scalable approaches is fundamental to tackling these challenges in cities. Therefore, this article aims to provide a new form of scalable estimation of urban vitality by using the de facto population. Instead of merely adopting static statistical information such as morphological characteristics in areas, we leverage dynamic factors such as internal mobility flows and energy use intensity as proxies for the spatiotemporal dynamics of indoor and outdoor behaviors of crowds. In this way, we combine dynamic attributes and static features to describe the patterns of urban vitality, which are directly related to spatiotemporal dynamics in urban places. We utilize GNSS-based mobile data and building energy usage intensity as dynamic proxies along with static data such as land use mix and age distribution. To better capture spatial heterogeneity, we use a Multiscale Geographically Weighted Regression (MGWR) model to identify the relationships between the de facto population and the dynamic and static factors. Drawing from the factors determining urban vitality, this article provides policy implications for alleviating spatiotemporal urban imbalances. These data-driven implications can fill the technical knowledge gaps in establishing planning strategies for achieving urban sustainability while enhancing overall subjective livability.