Mass appraisal is the standardized procedure of valuing a large number of properties at the same time and is commonly used to compute real estate tax.
While a hedonic pricing model based on the ordinary least squares (OLS) linear regression has been employed as the traditional method in this process, the stability and accuracy of
the model remain questionable. This paper investigates the features of a house price predictor based on the Random Forest (RF) method by comparing it with that of a conventional
hedonic pricing model. We used apartment transaction data from the period of 2006 to 2017 in the district of Gangnam, one of the most developed areas in South Korea. Using a data
set covering 40% of all transactions in the sample area, we demonstrate that the accuracy of a machine learning-based predictor can be surprisingly high. The average of percentage
deviations between the predicted and the actual market price was found to be only around 5.5% in the RF predictor, whereas it was almost 20% in the OLS-based predictor. With the RF
predictor, the probability of the predicted price being within 5% of its actual market price was 72%, while only about 17.5% of the regression-based predictions fell within the same
range. These results show that, in the practice of mass appraisal, the RF method may be a useful complement to the hedonic models, as it more adequately captures the complexity or
non-linearity of actual housing markets.
In this study, we analyze the case of induced seismicity in Pohang, South Korea, in 2017 to investigate the effect of seismic risk perception on the local residential property market. Based on a hedonic pricing model with a difference-in-differences method, we examine the geographic distribution of the effects of unexpected earthquake hazards. Our results indicate an overall reduction in local property values, but the magnitudes of negative externality for housing prices decrease with respect to the distance from the epicenter. In areas within 3 km of the epicenter, the asset value reduced by approximately 20% after an earthquake event, but if the distance from the epicenter was higher than 12 km, the negative effect on the price was not significant. In addition, we examine how the experience of seismic events affect the preference on the anti-seismic building structure. The results show that the market valuation on the anti-seismic system significantly escalated after the earthquake.
This paper compares the predictive power of a conventional hedonic pricing model and three machine learning algorithm (XGBoost, LightGBM, CatBoost) based models by using 620,617 apartment data in Seoul from 2009 to 2019. The results are summarised as follows; First, the predictive power of the machine learning models are significantly high not only in the comparison to the conventional model but also in the absolute accuracy related to its practical usefulness. The mean percentage error of XGBoost, LightGBM, and CatBoost were only, respectively, 3.7%, 3.8%, and 3.6% while those of the hedonic model was around 11%. Second, we found that CatBoost algorithm is slightely more performative to the other two algorithms in terms of overall predictive power and frequency of outlier occurrences. Third, this paper show that an ensemble model of the three algorithms can raise the predictive power further.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.