Building energy baseline models, particularly machine learning-based models, are a core aspect in the evaluation of building energy performance to identify inefficient energy consumption behavior. In smart city design, energy planners and decision makers require comprehensive information on energy consumption across diverse building types as well as comparisons between different types of buildings. However, there is no comprehensive study of baseline modeling across the main building types to help identify factors that influence the performance of different machine learning algorithms for baseline modeling. Therefore, the goal of this paper is to review and analyze energy consumption behavior and evaluate the prediction performance and interpretability of machine learning-based baseline modeling techniques across major building types. The results have shown that the Extreme Gradient Boosting Machine (XGBoost) model is the most accurate baseline modeling method for all building types. Time-related factors, especially the week of the year and the day of the week, have the most impact on energy consumption across all building types. This study is presented as a useful resource for smart city energy managers to help in choosing and setting up appropriate methodologies for better operational effectiveness and efficiencies when designing and planning smart energy systems.