Abstract

This paper proposes a methodology for using machine learning regression models to create price indices. In our study we developed six commercial real estate price indeces for the city of New York from year 2000 to 2019. The regression models used in this study are eXtreme Gradient Boosting Tree (XGBT), Support Vector Regression (SVR) and averaged Neural Networks (avNNet). The benchmark for comparison of the results is Ordinary Least Squares (OLS). There are two main index methodologies, a chained, where the index is constructed using out-of-sample data, and a pooled, where the index is constructed using in-sample data. The two main index methodologies can be divided into sub categories that utilizes a Paasche and Laspeyres like index formulations. Another important factor included in this study is the size, in years, of the optimal training window for building the index. The results show that the machine learning approach produced, overall, lower estimation errors. Nevertheless, these lower estimation errors not always constitutes a more stable index. Additionally, all models, including the benchmark, are sensitive to the training window size regarding the out-of-sample estimation error.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.