Identifying household finance heterogeneity via deep clustering

Yoontae Hwang; Yongjae Lee; Frank J Fabozzi

doi:10.1007/s10479-022-04900-3

Ann Oper Res. 2023;325(2):1255-1289. doi: 10.1007/s10479-022-04900-3. Epub 2022 Sep 21.

Authors

Yoontae Hwang¹, Yongjae Lee¹, Frank J Fabozzi²

Affiliations

¹ Department of Industrial Engineering, Ulsan National Institute of Science and Technology (UNIST), 50 UNIST-gil, Ulju-gun, Ulsan, 44919 Republic of Korea.
² EDHEC Business School, 393 Promenade des Anglais, 06202 Nice Cedex 3, France.

Abstract

Households are becoming increasingly heterogeneous. While previous studies have revealed many important insights (e.g., the wealth effect and the income effect), they could incorporate only two or three variables at a time. A more detailed understanding of complex household heterogeneity, however, requires considering more variables simultaneously. In this study, we argue that advanced clustering techniques can be useful for investigating high-dimensional household heterogeneity. A deep learning-based clustering method is used to handle the high-dimensional balance sheet data of approximately 50,000 households. Employing appropriate dimension-reduction techniques is key to incorporating the full joint distribution of high-dimensional data in the clustering step. Our study suggests that various variables should be used together to explain household heterogeneity. Asset variables are found to be crucial for understanding heterogeneity within wealthy households, while debt variables are more important for households that are not wealthy. In addition, relationships with sociodemographic variables (e.g., age, education, and family size) are analyzed. Although the clusters are identified using only financial variables, they are shown to be closely related to most sociodemographic variables.

Keywords: Clustering; Deep learning; Heterogeneous household; High-dimensional data; Household finance; Machine learning.
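
The abstract describes a two-stage idea: reduce the dimension of the household balance sheet data, then cluster in the reduced space. The abstract does not specify the deep learning architecture, so the sketch below is only an illustration of that general pipeline, with PCA standing in for the learned dimension reduction and plain k-means standing in for the clustering step. The data are synthetic, and all function names (`standardize`, `pca_reduce`, `farthest_first_init`, `kmeans`) are hypothetical helpers, not the authors' code.

```python
import numpy as np

def standardize(X):
    """Scale each column to zero mean and unit variance."""
    mu = X.mean(axis=0)
    sigma = X.std(axis=0)
    sigma[sigma == 0] = 1.0  # avoid division by zero for constant columns
    return (X - mu) / sigma

def pca_reduce(X, n_components):
    """Project X onto its top principal components (stand-in for a learned encoder)."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T

def assign(Z, centers):
    """Index of the nearest center for every row of Z."""
    d = ((Z[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def farthest_first_init(Z, k):
    """Deterministic init: start at Z[0], then repeatedly add the farthest point."""
    centers = [Z[0]]
    for _ in range(k - 1):
        d = np.min([((Z - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(Z[d.argmax()])
    return np.array(centers)

def kmeans(Z, k, n_iter=100):
    """Plain Lloyd iterations on the reduced representation Z."""
    centers = farthest_first_init(Z, k)
    for _ in range(n_iter):
        labels = assign(Z, centers)
        new_centers = np.array([
            Z[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return assign(Z, centers), centers

# Synthetic stand-in for household data: two groups, 20 "financial" variables each.
rng = np.random.default_rng(42)
group_a = rng.normal(loc=0.0, scale=1.0, size=(150, 20))
group_b = rng.normal(loc=4.0, scale=1.0, size=(150, 20))
X = np.vstack([group_a, group_b])

Z = pca_reduce(standardize(X), n_components=2)  # dimension reduction
labels, centers = kmeans(Z, k=2)                # clustering in the reduced space
```

With real data, `X` would hold one row per household and one column per balance sheet variable, and the linear projection would be replaced by whatever learned representation the study actually uses.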

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022. Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.