Cluster analysis has often been used to obtain reference models. However, using different methods in each analysis step may yield different results. Furthermore, the validation of the results is often neglected. This study aims to develop a cluster analysis method to obtain reference models for use in thermal performance studies of buildings from different clustering configurations. The procedure proposed is composed of six steps: (1) initial database composition, (2) data matrices formation, (3) cluster analysis, (4) determination of reference models, (5) methods validation and (6) results interpretation. Different data treatments, similarity measures and partitioning algorithms were combined in the cluster analysis to define the method with the best cluster formation. The weighting factor was the data treatment that most contributed to obtaining suitable clustering solutions. City-block, Euclidean Distance and Squared Euclidean Distance similarity measures resulted in suitable formations as well as Complete Linkage, Ward and K-means algorithms. Two clusters were obtained from the cluster methodology, for which two reference models were determined. Hypothesis tests showed that clusters differ for most performance indicators. It was concluded that the procedure proposed could identify the combination of methods that results in the best application of cluster analysis. The main contribution of this paper is to introduce a procedure for validating clusters. This procedure is an applicable technique to obtain reference models.