DOI: https://doi.org/10.18517/ijods.4.2.97-106.2023

Utilizing Model Residuals to Identify Rental Properties of Interest: The Price Anomaly Score (PAS) and Its Application to Real-time Data in Manhattan

Youssef Sultan (1) , Jackson Rafter (2) , Huyen Nguyen (3)
(1) Georgia Institute of Technology, North Avenue Atlanta, GA 30332, United States
(2) Georgia Institute of Technology, North Avenue Atlanta, GA 30332, United States
(3) Georgia Institute of Technology, North Avenue Atlanta, GA 30332, United States
Fulltext View | Download

Abstract

Understanding whether a property is priced fairly hinders buyers and sellers since they usually do not have an objective viewpoint of the price distribution for the overall market of their interest. Drawing from data collected of all possible available properties for rent in Manhattan as of September 2023, this paper aims to strengthen our understanding of model residuals; specifically on machine learning models which generalize for a majority of the distribution of a well-proportioned dataset. Most models generally perceive deviations from predicted values as mere inaccuracies, however this paper proposes a different vantage point: when generalizing to at least 75% of the dataset, the remaining deviations reveal significant insights. To harness these insights, we introduce the Price Anomaly Score (PAS), a metric capable of capturing boundaries between irregularly predicted prices. By combining relative pricing discrepancies with statistical significance, the Price Anomaly Score (PAS) offers a multifaceted view of rental valuations. This metric allows experts to identify overpriced or underpriced properties within a dataset by aggregating PAS values, then fine-tuning upper and lower boundaries to any threshold to set indicators of choice.

Article Details

How to Cite
[1]
Y. Sultan, J. Rafter, and H. Nguyen, “Utilizing Model Residuals to Identify Rental Properties of Interest: The Price Anomaly Score (PAS) and Its Application to Real-time Data in Manhattan”, Int. J. Data. Science., vol. 4, no. 2, pp. 97-106, Dec. 2023.
Section
Articles

References

Census reporter: Manhattan borough, new york county, ny.

Quickfacts: New york city, new york.

Here are the most affordable nyc neighborhoods for recent college grads, Jun 2023.

Nyc residential rental market report: October 2023, Oct 2023.

Y. Chen, X. Liu, X. Li, Y. Liu, and X. Xu. Mapping the fine-scale spatial pattern of housing rent in the metropolitan area by using online rental listings and ensemble learning. Applied Geography, 75:200–212, 2016.

A. Costa, V. Sass, I. Kennedy, R. Roy, R. J. Walter, A. Acolin, K. Crowder, C. Hess, A. Ramiller, and S. Chasins. Toward a crossplatform framework: Assessing the comprehensiveness of online rental listings. cityscape. Cityscape, 23(2):327, 2021.

Greg David and Sam Rabiyah. Why is nyc rent so high?, 2023. Accessed: October 14, 2023.

J. Dhillon, N. P. Eluri, D. Kaur, A. Chhipa, A. Gadupudi, R. C. Eravi, and M. Pirouz. Analysis of airbnb prices using machine learning techniques. In 2021 IEEE 11th Annual Computing and Communication Workshop and Conference (CCWC), pages 0297–0303, 2021.

Lisa Holden. How a Listing Makes Its Way to StreetEasy, 2019.

M. Mijatovic. An algorithm for the multidimensional analysis of´ the overestimate and underestimate of property rental value. In Sinteza 2020 - International Scientific Conference on Information Technology and Data Related Research, pages 268–274, 2020.

NBC New York Staff. Manhattan’s population is returning after pandemic plunge, other boroughs not so much, 2023. Accessed: October 14, 2023.

Youssef Sultan. Streeteasy manhattan properties september 2023. https://doi.org/10.5281/zenodo.10207478, Nov 2023.

X. Zhou, W. Tong, and D. Li. Modeling housing rent in the atlanta metropolitan area using textual information and deep learning. ISPRS International Journal of Geo-Information, 8(8):349, 2019.

A. Zhu, R. Li, and Z. Xie. Machine learning prediction of new york airbnb prices. In 2020 Third International Conference on Artificial Intelligence for Industries (AI4I), pages 1–5, 2020.