Expanding portfolio diversification through cluster analysis beyond traditional volatility
- 
                        Received December 9, 2024;Accepted January 15, 2025;Published January 23, 2025
- 
	Author(s)Mykhailo KuzhelievLink to ORCID Index: https://orcid.org/0000-0002-7895-7879   , 
	
		Dmytro ZherlitsynLink to ORCID Index: http://orcid.org/0000-0002-2331-8690 , 
	
		Dmytro ZherlitsynLink to ORCID Index: http://orcid.org/0000-0002-2331-8690   , 
	
		Ihor RekunenkoLink to ORCID Index: https://orcid.org/0000-0002-1558-629X , 
	
		Ihor RekunenkoLink to ORCID Index: https://orcid.org/0000-0002-1558-629X   , 
	
		Alina NechyporenkoLink to ORCID Index: https://orcid.org/0000-0003-2494-1465 , 
	
		Alina NechyporenkoLink to ORCID Index: https://orcid.org/0000-0003-2494-1465   , 
	
		Sergii StabiasLink to ORCID Index: https://orcid.org/0000-0003-2758-5662 , 
	
		Sergii StabiasLink to ORCID Index: https://orcid.org/0000-0003-2758-5662    
- 
	DOIhttp://dx.doi.org/10.21511/imfi.22(1).2025.12
- 
	Article InfoVolume 22 2025, Issue #1, pp. 147-159
- TO CITE АНОТАЦІЯ
- 
	Cited by1 articlesJournal title: Scientific bulletin of the International Association of scientists. Series: Economy, management, security, technologiesArticle title: THE ROLE OF THE FINANCIAL SYSTEM IN ENSURING SUSTAINABLE DEVELOPMENT OF UKRAINE IN THE CONTEXT OF POST-WAR RECONSTRUCTIONDOI: 10.56197/2786-5827/2025-4-3-7Volume: / Issue: / First page: / Year: 2025Contributors: Alina Nechyporenko, Mariana Sulyma
- 1844 Views
- 
	677 Downloads
							
								 
							
							
							This work is licensed under a
							
								Creative Commons Attribution 4.0 International License
							
						
The study reviews the application of machine learning tools in financial investment portfolio management, focusing on cluster analysis for asset allocation, diversification, and risk optimization. The paper aims to explore the use of clustering analysis to broaden the concept of portfolio diversification beyond traditional volatility metrics. An open dataset from Yahoo Finance includes a ten-year historical period (2014–2024) of 130 actively traded securities from international stock markets used. Dataset selection prioritizes top liquidity and trading activity. Python analytical tools were employed to clean, process, and analyze the data. The methodology combines classical Markowitz optimization with clustering analysis techniques, highlighting variance-return trade-offs. Various asset characteristics, including annualized return, standard deviation, Sharpe ratio, correlation with indices, skewness, and kurtosis, were incorporated into the clustering models to reveal hidden patterns and groupings among financial assets. Results show that while clustering enhances insights into asset diversity, classical approaches remain historically superior in optimizing risk-adjusted returns. This study concludes that clustering complements, rather than replaces, classical methods by broadening the understanding of diversification and addressing many diversity factors, such as metrics of the technical, graphical, and fundamental analysis. The paper also introduces the diversity rate based on clustering, which measures the variance balance by all features within and between clusters, providing a broader perspective on diversification beyond traditional metrics. Future research should investigate dynamic clustering techniques, integrate fundamental economic indicators, and develop adaptive models for effective portfolio management in evolving financial markets.
- Keywords
- 
	JEL Classification (Paper profile tab)C63, C61, D53, G11, G17
- 
	References30
- 
	Tables3
- 
	Figures3
- 
	- Figure 1. Flowchart of the research
- Figure 2. Efficient frontier and simulated portfolio risk-return trade-off
- Figure 3. Hierarchical clustering dendrogram for portfolio assets using Ward’s method
 
- 
	- Table 1. Performance and diversification metrics for optimized portfolios (classical optimization method) from January 1, 2014 to December 1, 2024
- Table 2. Cluster Centroids: annualized mean and standard deviation
- Table 3. Cluster-based analysis and portfolio comparison of selected assets
 
- 
	- Agudelo Aguirre, A. A., Rojas Medina, R. A., & Duque Méndez, N. D. (2020). Machine learning applied in the stock market through the Moving Average Convergence Divergence (MACD) indicator. Investment Management and Financial Innovations, 17(4), 44-60.
- Aiche, A., Winer, Z., & Cohen, G. (2024). Constructing Cybersecurity Stocks Portfolio Using AI. Forecasting, 6(4), 1065-1077.
- Apalkova, V., Tsyganov, S., Meshko, N., Tsyganova, N., & Apalkov, S. (2022). Evaluation models for the impact of pricing factor on environmental performance in different countries. Problems and Perspectives in Management, 20(2), 135-148.
- Aziz, S., Dowling, M., Hammami, H., & Piepenbrink, A. (2021). Machine learning in finance: A topic modeling approach. European Financial Management.
- Babenko, V., Panchyshyn, A., Zomchak, L., Nehrey, M., Artym-Drohomyretska, Z., & Lahotskyi, T. (2021). Classical machine learning methods in economics research: Macro and micro level examples. WSEAS Transactions on Business and Economics, 18, 209-217.
- Bhama, V. (2024). Does an increase in portfolio volatility create more returns? Evidence from India. Investment Management and Financial Innovations, 21(2), 345-354.
- Clarissa, A., & Koesrindartoto, D. P. (2024). Strategic portfolio rebalancing: Integrating predictive models and adaptive optimization objectives in a dynamic market. Investment Management and Financial Innovations, 21(3), 304-316.
- Derbentsev, V., Datsenko, N., Babenko, V., Pushko, O., & Pursky, O. (2021). Forecasting cryptocurrency prices using ensembles-based machine learning approach. In 2020 IEEE International Conference on Problems of Infocommunications Science and Technology (PIC S&T) Proceedings (pp. 707-712).
- Fantazzini, D., & Zimin, S. (2020). A multivariate approach for the simultaneous modelling of market risk and credit risk for cryptocurrencies. Journal of Industrial and Business Economics, 47(1), 19-69.
- Feng, X., von Mettenheim, H.-J., Sermpinis, G., & Stasinakis, C. (2024). Sustainable portfolio construction via machine learning: ESG, SDG, and sentiment. European Financial Management.
- Gallastegui, L. M. G., Forradellas, R. R., & Alonso, S. L. N. (2024). Applying advanced sentiment analysis for strategic marketing insights: A case study of BBVA using machine learning techniques. Innovative Marketing, 20(2), 100-115.
- Glazunova, O., Saiapina, T., Korolchuk, V., Kasatkina, O., & Voloshyna, T. (2021, May 12-14). Digital intelligence of a modern economist: An exploratory case study. Paper presented at the 2nd International Conference on History, Theory and Methodology of Learning (ICHTML). Kryvyi Rih, Ukraine.
- Heaton, J. B., Polson, N. G., & Witte, J. H. (2017). Deep learning for finance: Deep portfolios. Applied Stochastic Models in Business and Industry, 33(1), 3-12.
- Inani, S. K., Pradhan, H., Kumar, S., & Biswas, B. (2024). Navigating the technical analysis in stock markets: Insights from bibliometric and topic modeling approaches. Investment Management and Financial Innovations, 21(1), 275-288.
- Jain, P., & Jain, S. (2019). Can Machine Learning-Based Portfolios Outperform Traditional Risk-Based Portfolios? The Need to Account for Covariance Misspecification. Risks, 7(3), 74.
- Korstanje, J. (2021). Advanced forecasting with Python: With state-of-the-art models including LSTMs, Facebook’s Prophet, and Amazon’s DeepAR. Apress.
- Kuzheliev, M., Rekunenko, I., Boldova, A., Zhytar, M., & Stabias, S. (2019). Modeling of structural and temporal characteristics in the corporate securities market of Ukraine. Investment Management and Financial Innovations, 16(2), 260-269.
- Kuzheliev, M., Zherlitsyn, D., Rekunenko, I., Nechyporenko, A., & Nemsadze, G. (2020). The impact of inflation targeting on macroeconomic indicators in Ukraine. Banks and Bank Systems, 15(2), 94-104.
- Leung, M.-F., Jawaid, A., Ip, S.-W., Kwok, C.-H., & Yan, S. (2023). A portfolio recommendation system based on machine learning and big data analytics. Data Science in Finance and Economics, 3(2), 152-165.
- Liew, J. K. S., & Mayster, B. (2018). Forecasting ETFs with machine learning algorithms. Journal of Alternative Investments, 20(3), 58-78.
- López de Prado, M. (2016). Building diversified portfolios that outperform out of sample. The Journal of Portfolio Management, 42(4), 59-69.
- Markowitz, H. (1952). Portfolio selection. The Journal of Finance, 7(1), 77-91.
- Mints, A. (2017). Classification of tasks of data mining and data processing in the economy. Baltic Journal of Economic Studies, 3(3), 47-52.
- Owen, S. R. (2023). An analysis of conditional mean-variance portfolio performance using hierarchical clustering. The Journal of Finance and Data Science, 9, 100112.
- Pinelis, M., & Ruppert, D. (2022). Machine learning portfolio allocation. The Journal of Finance and Data Science, 8, 35-54.
- Sang, N.M. (2024). Bibliometric insights into the evolution of digital marketing trends. Innovative Marketing, 20(2), 1-14.
- Viebig, J. (2020). Exuberance in financial markets: Evidence from machine learning algorithms. Journal of Behavioral Finance, 21(2), 128-135.
- Yahoo! (2024). Yahoo! Finance Data.
- Zherlitsyn, D. (2024) Python for Finance: Data analysis, financial modeling, and portfolio management (English Edition) (1st ed.). BPB Publications.
- Zmuk, B., & Josic, H. (2020). Forecasting stock market indices using machine learning algorithms. Interdisciplinary Description of Complex Systems, 18(4), 471-489.
 
- 
	- 
            Data curation
            Mykhailo Kuzheliev
- 
            Investigation
            Mykhailo Kuzheliev, Dmytro Zherlitsyn, Alina Nechyporenko
- 
            Methodology
            Mykhailo Kuzheliev, Dmytro Zherlitsyn, Ihor Rekunenko
- 
            Project administration
            Mykhailo Kuzheliev, Ihor Rekunenko
- 
            Supervision
            Mykhailo Kuzheliev
- 
            Writing – original draft
            Mykhailo Kuzheliev, Dmytro Zherlitsyn, Ihor Rekunenko
- 
            Conceptualization
            Dmytro Zherlitsyn
- 
            Software
            Dmytro Zherlitsyn, Ihor Rekunenko, Sergii Stabias
- 
            Visualization
            Dmytro Zherlitsyn, Alina Nechyporenko
- 
            Formal Analysis
            Ihor Rekunenko, Alina Nechyporenko, Sergii Stabias
- 
            Validation
            Ihor Rekunenko, Sergii Stabias
- 
            Resources
            Alina Nechyporenko, Sergii Stabias
- 
            Writing – review & editing
            Alina Nechyporenko, Sergii Stabias
 
- 
            Data curation
            
- 
                The macroeconomic factors affecting government bond yield in Indonesia, Malaysia, Thailand, and the PhilippinesBenny Budiawan Tjandrasa , 
    Hotlan Siagian , 
    Hotlan Siagian   , 
    Ferry Jie , 
    Ferry Jie doi: http://dx.doi.org/10.21511/imfi.17(3).2020.09 				
                            Investment Management and Financial Innovations Volume 17, 2020 Issue #3 pp. 111-121 Views: 3311 Downloads: 703 TO CITE АНОТАЦІЯ doi: http://dx.doi.org/10.21511/imfi.17(3).2020.09 				
                            Investment Management and Financial Innovations Volume 17, 2020 Issue #3 pp. 111-121 Views: 3311 Downloads: 703 TO CITE АНОТАЦІЯThe government bond (GB) has become the most attractive investment portfolio option, even though many macroeconomic factors affect the bond yield. This paper aims to investigate the determining factor of local currency government bond yield by considering the inflation rate, credit default swap, stock market index, exchange rate, and volatility index. This study used 240 data panel from the Bloomberg stock market in the form of data panel covering Southeast developing countries, namely Indonesia, Thailand, Malaysia, and the Philippines, for five years or sixty months from January 2015 to December 2019. Data analysis used recursive models and multivariate regression techniques using EViews software. The random effect model results revealed that change in the foreign exchange rate and volatility indexes affected, partially and simultaneously, the changes in the stock market index. The result also showed that changes in the stock market index, inflation rate, and credit default swap affected, partially and simultaneously, government bond yield changes. These results suggest that the government bond yield could be managed by controlling volatility index, foreign exchange rate, stock market index, inflation rates, and credit default swaps. This finding could provide an insight into the policymaker and fiscal authority on managing the risk of government bonds under control during high volatility or even making it reasonably lower. This result could contribute to the current research in the field of financial management. Acknowledgment 
 It is the author’s pleasure to thank Muhammad Aulia SE MSc CSA® from the Ministry of Finance of Republic Indonesia, for his invaluable contribution to encourage this study and also to share the data required for this paper. He also delivers essential insights into improving the quality of this work. This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.
- 
                Multi-agent modeling and simulation of a stock marketMohamed Amine Souissi , 
    Khalid Bensaid , 
    Khalid Bensaid , 
    Rachid Ellaia     				
                                                    
					doi: http://dx.doi.org/10.21511/imfi.15(4).2018.10 				
                            Investment Management and Financial Innovations Volume 15, 2018 Issue #4 pp. 123-134 Views: 2924 Downloads: 1593 TO CITE АНОТАЦІЯ , 
    Rachid Ellaia     				
                                                    
					doi: http://dx.doi.org/10.21511/imfi.15(4).2018.10 				
                            Investment Management and Financial Innovations Volume 15, 2018 Issue #4 pp. 123-134 Views: 2924 Downloads: 1593 TO CITE АНОТАЦІЯThe stock market represents complex systems where multiple agents interact. The complexity of the environment in the financial markets in general has encouraged the use of modeling by multi-agent platforms and particularly in the case of the stock market. 
 In this paper, an agent-based simulation model is proposed to study the behavior of the volume of market transactions. The model is based on the case of a single asset and three types of investor agents. Each investor can be a zero intelligent trader, fundamentalist trader or traders using historical information in the decision making process. The goal of the study is to simulate the behavior of a stock market according to the different considered endogenous and exogenous variables.
- 
                Overconfidence bias among retail investors: A systematic review and future research directionsInvestment Management and Financial Innovations Volume 21, 2024 Issue #1 pp. 302-316 Views: 2705 Downloads: 1486 TO CITE АНОТАЦІЯThis paper comprehensively evaluates the literature on retail investor overconfidence using a framework-based systematic approach to understand the various dimensions of overconfidence bias, its effect on investing choices, and market dynamics. A systematic review of 137 publications from the Scopus database have been done to detect the research trend concerning investor overconfidence bias from its inception. An integrated ADO-TCM framework has been employed to present a systematic analysis of the theory, context, and methodologies (TCM) employed in the reviewed studies. The ADO (Antecedents, Decisions, and Outcomes) framework thoroughly examines the antecedents, decisions, and results of investor overconfidence. 
 The study identified four broad sets of factors contributing to investor overconfidence, as found in the existing literature. These factors include demographic characteristics, personality traits of investors, their knowledge and experience, and the features of investments and investor types. The Prospect theory is the most popular theory in the literature, with much research using secondary data and experiment-based analysis. The prospective study directions, based on the gaps in the existing literature, are as follows: further investigation into the decision-making processes of overconfident retail and professional investors is a worthwhile subject. Future research may shift their focus from financial outcome variables to non-financial outcome variables such as the impact of investor overconfidence on individuals’ stress levels, subjective financial well-being, and overall life happiness.

 
					