Open Access Journal
4.8
Calculated on 05 May, 2025
0.31
Powered by scimagojr.com
Author information
Author information
Author information
Author information
Food insecurity is a complex and multidimensional issue that requires accurate predictions or classifications to design effective interventions. The availability of large and complex datasets has enabled the use of machine learning approaches. Moreover, because food insecurity data are often hierarchical/clustered, mixed effects modelling is well suited for capturing intergroup variation. This study compared three models: the generalized linear mixed model (GLMM), a parametric model that accounts for random effects but is limited to linear relationships; the generalized mixed effects tree (GMET), a flexible decision tree framework with random effects; and the generalized mixed effects random forest (GMERF), an ensemble of trees with random effects. The analysis used data from 25,890 households in West Java, Indonesia in 2021. Model evaluation showed that GMERF provided the best prediction results compared to the other models. This study concludes that integrating random forests with mixed-effects modelling offers a robust and effective approach for predicting household food insecurity. The primary predictors of food insecurity identified by the random forest in GMERF model were the age of the household head (years) and house size (m2).
[1] K. Essel and K. A. Courts. (2018). In: " H. Kersten, A. Beck, and M. Klein (Eds) Identifying and Addressing Childhood Food Insecurity in Healthcare and Community Settings". Cham: Springer. 10.1007/978-3-319-76048-3_1.
DOI: https://doi.org/10.1007/978-3-319-76048-3_1[2] K. Schroeder, A. Smaldone, and S. Smaldone. (2015). "Food Insecurity: A Concept Analysis". Nursing Forum. 50 (4): 274-284. 10.1111/nuf.12118.
DOI: https://doi.org/10.1111/nuf.12118[3] F. Sporchia, M. Antonelli, A. Aguilar-Martínez, A. Bach-Faig, D. Caro, K. F. Davis, R. Sonnino, and A. Galli. (2024). "Zero Hunger: Future Challenges and the Way Forward Towards the Achievement of Sustainable Development Goal 2". Sustainable Earth Reviews. 7 (1). 10.1186/s42055-024-00078-7.
DOI: https://doi.org/10.1186/s42055-024-00078-7[4] S. W. Raudenbush and A. S. Bryk. (2022). "Hierarchical Linear Models: Applications and Data Analysis Methods". SAGE Publications, Newbury Park, CA.
[5] M. Fokkema, N. Smits, A. Zeileis, T. Hothorn, and H. Kelderman. (2018). "Detecting Treatment–Subgroup Interactions in Clustered Data with Generalized Linear Mixed-Effects Model Trees". Behavior Research Methods. 50 : 2016-2034. 10.3758/s13428-017-0971-x.
DOI: https://doi.org/10.3758/s13428-017-0971-x[6] R. J. Sela and J. S. Simonoff. (2012). "RE-EM Trees: A Data Mining Approach for Longitudinal and Clustered Data". Machine Learning. 86 (2): 169-207. 10.1007/s10994-011-5258-3.
DOI: https://doi.org/10.1007/s10994-011-5258-3[7] H. Dharmawan, B. Sartono, A. Kurnia, A. F. Hadi, and E. Ramadhani. (2022). "A Study of Machine Learning Algorithms to Measure Feature Importance in Class-Imbalance Data of Food Insecurity Cases in Indonesia". Communications in Mathematical Biology and Neuroscience. 2022. 10.28919/cmbn/7636.
DOI: https://doi.org/10.28919/cmbn/7636[8] K. N. Khikmah, B. Sartono, B. Susetyo, and G. A. Dito. (2024). "Performance Comparative Study of Machine Learning Classification Algorithms for Food Insecurity Experience by Households in West Java". Jurnal Online Informatika. 9 (1): 128-137. 10.15575/join.v9i1.1012.
DOI: https://doi.org/10.15575/join.v9i1.1012[9] Z. Yu, M. Guindani, S. F. Grieco, L. Chen, T. C. Holmes, and X. Xu. (2022). "Beyond t Test and ANOVA: Applications of Mixed-Effects Models for More Rigorous Statistical Analysis in Neuroscience Research". Neuron. 110 (1): 21-35. 10.1016/j.neuron.2021.10.030.
DOI: https://doi.org/10.1016/j.neuron.2021.10.030[10] A. Hajjem, F. Bellavance, and D. Larocque. (2011). "Mixed Effects Regression Trees for Clustered Data". Statistics & Probability Letters. 81 (4): 451-459. 10.1016/j.spl.2010.12.003.
DOI: https://doi.org/10.1016/j.spl.2010.12.003[11] A. Hajjem, F. Bellavance, and D. Larocque. (2014). "Mixed-Effects Random Forest for Clustered Data". Journal of Statistical Computation and Simulation. 84 (6): 1313-1328. 10.1080/00949655.2012.741599.
DOI: https://doi.org/10.1080/00949655.2012.741599[12] B. Suseno, K. A. Notodiputro, and B. Sartono. (2023). "GLMMTree for Modelling Poverty in Indonesia". Proceedings of the International Conference on Data Science and Official Statistics. 121-131. 10.34123/icdsos.v2023i1.333
DOI: https://doi.org/10.34123/icdsos.v2023i1.333[13] Sukarna, K. A. Notodiputro, and B. Sartono. (2023). "Comparison Between Binomial GLMM and Binomial GMET for Temporary Unemployment in West Java, Indonesia". Proceedings of the 5th International Conference on Statistics, Mathematics, Teaching, and Research (ICSMTR 2023). 198-209. 10.2991/978-94-6463-332-0_22
DOI: https://doi.org/10.2991/978-94-6463-332-0_22[14] D. Kusumaningrum, H. Wijayanto, A. Kurnia, K. A. Notodiputro, and M. Ardiansyah. (2024). In: "Mathematical and Statistical Methods for Actuarial Sciences and Finance". Cham: Springer. 10.1007/978-3-031-64273-9_35.
DOI: https://doi.org/10.1007/978-3-031-64273-9_35[15] M. Pellagatti, C. Masci, F. Ieva, and A. M. Paganoni. (2021). "Generalized Mixed-Effects Random Forest: A Flexible Approach to Predict University Student Dropout". Statistical Analysis and Data Mining: The ASA Data Science Journal. 14 (3): 241-257. 10.1002/sam.11505.
DOI: https://doi.org/10.1002/sam.11505[16] F. Asadi, R. Homayounfar, Y. Mehrali, C. Masci, S. Talebi, and F. Zayeri. (2024). "Detection of Cardiovascular Disease Cases Using Advanced Tree-Based Machine Learning Algorithms". Scientific Reports. 14 : 22230. 10.1038/s41598-024-72819-9.
DOI: https://doi.org/10.1038/s41598-024-72819-9[17] G. Y. Lee, L. Alzamil, B. Doskenov, and A. Termehchy. (2021). "A Survey on Data Cleaning Methods for Improved Machine Learning Model Performance". arXiv Preprint. 10.48550/arXiv.2109.07127.
[18] P. Agasthi, H. Ashraf, S. H. Pujari, M. Girardo, A. Tseng, F. Mookadam, N. Venepally, M. R. Buras, B. Abraham, B. K. Khetarpal, and M. Allam. (2023). "Prediction of Permanent Pacemaker Implantation After Transcatheter Aortic Valve Replacement: The Role of Machine Learning". World Journal of Cardiology. 15 (3): 95-105. 10.4330/wjc.v15.i3.95.
DOI: https://doi.org/10.4330/wjc.v15.i3.95[19] A. Hajjem, D. Larocque, and F. Bellavance. (2017). "Generalized Mixed Effects Regression Trees". Statistics & Probability Letters. 126 : 114-118. 10.1016/j.spl.2017.02.033.
DOI: https://doi.org/10.1016/j.spl.2017.02.033[20] J. Hu and S. Szymczak. (2023). "A Review on Longitudinal Data Analysis with Random Forest". Briefings in Bioinformatics. 24 (2): 1-11. 10.1093/bib/bbad002.
DOI: https://doi.org/10.1093/bib/bbad002[21] P. Krennmair and T. Schmid. (2022). "Flexible Domain Prediction Using Mixed Effects Random Forests". Journal of the Royal Statistical Society: Series C (Applied Statistics). 71 (5): 1865-1894. 10.1111/rssc.12600.
DOI: https://doi.org/10.1111/rssc.12600[22] J. L. Speiser, B. J. Wolf, D. Chung, C. J. Karvellas, D. G. Koch, and V. L. Durkalski. (2020). "BiMM Tree: A Decision Tree Method for Modeling Clustered and Longitudinal Binary Outcomes". Communications in Statistics – Simulation and Computation. 49 (4): 1004-1023. 10.1080/03610918.2018.1490429.
DOI: https://doi.org/10.1080/03610918.2018.1490429[23] L. Fontana, C. Masci, F. Ieva, and A. M. Paganoni. (2021). "Performing Learning Analytics via Generalized Mixed-Effects Trees". Data. 6 : 74. 10.3390/data6070074.
DOI: https://doi.org/10.3390/data6070074[24] E. Setiawan, K. A. Notodiputro, and B. Sartono. (2024). "Generalized Linear Mixed-Model Tree for Modeling Dengue Fever Cases". COGITO Smart Journal. 10 (2): 380-392. 10.31154/cogito.v10i2.715.380-392.
DOI: https://doi.org/10.31154/cogito.v10i2.715.380-392[25] W. W. Stroup, M. Ptukhina, and J. Garai. (2024). "Generalized Linear Mixed Models". CRC Press, New York. 10.1201/9780429092060.
DOI: https://doi.org/10.1201/9780429092060[26] D. Bates, M. Mächler, B. Bolker, and S. Walker. (2015). "Fitting Linear Mixed-Effects Models Using lme4". Journal of Statistical Software. 67 (1): 1-48. 10.18637/jss.v067.i01.
DOI: https://doi.org/10.18637/jss.v067.i01[27] I. Sriliana, S. Nugroho, W. Agwil, and E. D. Sihombing. (2025). "Evaluation of Multivariate Adaptive Regression Splines on Imbalanced Dataset for Poverty Classification in Bengkulu Province". Barekeng: Jurnal Ilmu Matematika dan Terapan. 19 (2): 1143-1156. 10.30598/barekengvol19iss2pp1143-1156.
DOI: https://doi.org/10.30598/barekengvol19iss2pp1143-1156[28] M. Heydarian, T. E. Doyle, and R. Samavi. (2022). "MLCM: Multi-Label Confusion Matrix". IEEE Access. 10 : 19083-19095. 10.1109/ACCESS.2022.3151048.
DOI: https://doi.org/10.1109/ACCESS.2022.3151048[29] D. Krstinić, M. Braović, L. Šerić, and D. Božić-Štulić. (2020). "Multi-Label Classifier Performance Evaluation with Confusion Matrix". Computer Science & Information Technology. 1 (1): 1-14. 10.5121/csit.2020.100801.
DOI: https://doi.org/10.5121/csit.2020.100801[30] S. C. Kumar and R. J. RamaSree. (2015). "Dimensionality Reduction in Automated Evaluation of Descriptive Answers Through Zero Variance, Near Zero Variance, and Non-Frequent Words Techniques: A Comparison". Proceedings of the 9th IEEE International Conference on Intelligent Systems and Control (ISCO). 1-6. 10.1109/ISCO.2015.7282351.
DOI: https://doi.org/10.1109/ISCO.2015.7282351