Analyzing factors linked to happiness through Machine Learning techniques

Authors

  • Roberto León Leyva National University of Engineering image/svg+xml Author

DOI:

https://doi.org/10.71701/5bnqv178

Keywords:

Happiness, machine learning, cleaning and data imputation, classification, decision trees, ensemble models, clustering

Abstract

This study aims to identify the importance of factors associated with happiness and to find patterns in the evolution of the countries that participate in the Annual world happiness report of Gallup using Machine Learning techniques. This is a longitudinal non-experimental and explanatory research of applied method (2008-2022). The work methodology used is CRISP-DM (Cross Industry Standard-Data Mining). For data imputation purposes, time series techniques were used assuming that each factor per country was a univariate time series. To identify the importance of happiness-associated factors a dependent variable level of happiness was generated from the continuous happiness variable and three other levels were proposed: high, medium, and low. During the exploratory data analysis, it was found that countries with a high level of happiness are more sensitive to the study factors. Through the decision tree model there was an average accuracy of 0.80 and three significant variables listed in order of importance: GDP per capita, life expectancy at birth and support networking. These results match with the regression by least squares with fixed effects proposed by Gallup, except in the order of the second and third factor. As for the patterns in the happiness variable evolution, the countries that participated in the last 10 years were selected, a range of techniques were applied to determine the number of clusters, and four groups were established for the clustering analysis. The results show that the first three groups maintain or improve their level of happiness while the fourth group deteriorates it.

Downloads

Download data is not yet available.

References

Real Academia Española. (s. f.). Felicidad. En Diccionario de la lengua española. Recuperado de https://dle.rae.es/felicidad?m=form

Lopez Otin, C. (2019). La vida en cuatro letras. Paidós.

Neira, H. (2012). ¿Qué es república? Fondo Editorial Universidad San Martín de Porres.

Waldinger Robert, S. M. (2023). The Good Life. Simon & Schuster.

Helliwell, J. F. (2023). World Hapiness Report 2023. Recuperado de https://worldhappiness.report/

Chollet, F., y Allaire, J. (2017). Deep Learning wih R. Manning Publications.

Aldas, J., y Uriel, E. (2017). Análisis multivariante aplicado con R. Paraninfo.

Lantz, B. (2019). Machine Learning with R Expert techniques for predictive modeling. Packt.

Zumel, N., y Mount, J. (2020). Practical data science with R. New York. Manning Publications.

Chapman, P. C. (2000). CRISP-DM 1.0 Step-by-step data mining guide. CRISP-DM consortium.

Downloads

Published

2024-10-15

Issue

Section

Artículos

How to Cite

Analyzing factors linked to happiness through Machine Learning techniques. (2024). Revista I+i, 17. https://doi.org/10.71701/5bnqv178