Si-Hao Du, Yi. Gu, Yuewei Zhu
Nov 1, 2021
2021 International Conference on Signal Processing and Machine Learning (CONF-SPML)
Zillow’s is a real estate company that relies on the estimated costs of a house to set its price. The log error of prediction is calculated by the log difference between the prediction and the actual sale price. Thusly, the goal of this work is trying to minimize this error in order to improve accuracy. Due to the fact that real estate dataset has multiple feature blanks, preprocessing methods of the data show large significance in this work. On the other hand, particularly important features are selected, and several machine learning models— Decision Tree, Random Forest, Linear Regression— are applied to predict. In conclusion, Linear Regression performs better than the other two models. Some future work, like feature engineering methods, can be done to further improve the accuracy.