Abstract

This paper discusses how to apply the random forest algorithm in building a mass appraisal system for residential property and analyzes issues related to modelling process. The paper investigates the relationship between the random forest model and the complexities of housing market. Based on the findings, various qualitative analyses has been attempted for effective model design. The findings are summarized as followed;-. First, the random forest model is performative in capturing the non-linearity from sub-market and locational effects. The random forest model has significantly low average percentage error (approx. 4%) compared to with a linear Hedonic model (approx. 11%). Second, the random forest model can efficiently capture the locational effects only with locational information (coordinates), without proxy variables. Third, using dummy variables may reduce explanatory power of the model, compared to label indexes because the number of variables included in an operation increases. Fourth, the advantage from model complexity seems overwhelm the disadvantage from overfitting. Fifth, modellers are not required to ensure consistency in the time periods contained in a dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.