Identifying Risk Indicators of Cardiovascular Disease in Fasa Cohort Study (FACS): An Application of Generalized Linear Mixed-Model Tree

Fariba Asadi; Reza Homayounfar; Mojtaba Farjam; Yaser Mehrali; Fatemeh Masaebi; Farid Zayeri

doi:10.34172/aim.2024.35

Arch Iran Med. 2024;27(5): 239-247.
doi: 10.34172/aim.2024.35

PMID: 38690790
PMCID: PMC11097325
Scopus ID: 85192035916

Abstract View: 2198

PDF Download: 1395

Full Text View: 865

Original Article

Identifying Risk Indicators of Cardiovascular Disease in Fasa Cohort Study (FACS): An Application of Generalized Linear Mixed-Model Tree

Fariba Asadi ¹ , Reza Homayounfar ², Mojtaba Farjam ³, Yaser Mehrali ⁴, Fatemeh Masaebi ¹, Farid Zayeri ⁵^*

¹ Department of Biostatistics, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
² National Nutrition and Food Technology Research Institute, Faculty of Nutrition Sciences and Food Technology, Shahid Beheshti University of Medical Sciences, Tehran, Iran
³ Noncommunicable diseases research center, Fasa University of Medical Sciences, Fasa, Iran
⁴ Statistical Center of Iran, Tehran, Iran
⁵ Proteomics Research Center and Department of Biostatistics, School of Allied Medical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran

*Corresponding Author: Farid Zayeri, Email: fzayeri@gmail.com

Abstract

Background: Today, cardiovascular disease (CVD) is the most important cause of death around the world. In this study, our main aim was to predict CVD using some of the most important indicators of this disease and present a tree-based statistical framework for detecting CVD patients according to these indicators.

Methods: We used data from the baseline phase of the Fasa Cohort Study (FACS). The outcome variable was the presence of CVD. The ordinary Tree and generalized linear mixed models (GLMM) were fitted to the data and their predictive power for detecting CVD was compared with the obtained results from the GLMM tree. Statistical analysis was performed using the RStudio software.

Results: Data of 9499 participants aged 35‒70 years were analyzed. The results of the multivariable mixed-effects logistic regression model revealed that participants’ age, total cholesterol, marital status, smoking status, glucose, history of cardiac disease or myocardial infarction (MI) in first- and second-degree relatives, and presence of other diseases (like hypertension, depression, chronic headaches, and thyroid disease) were significantly related to the presence of CVD (P<0.05). Fitting the ordinary tree, GLMM, and GLMM tree resulted in area under the curve (AUC) values of 0.58 (0.56, 0.61), 0.81 (0.77, 0.84), and 0.80 (0.76, 0.83), respectively, among the study population. In addition, the tree model had the best specificity at 81% but the lowest sensitivity at 65% compared to the other models.

Conclusion: Given the superior performance of the GLMM tree compared with the standard tree and the lack of significant difference with the GLMM, using this model is suggested due to its simpler interpretation and fewer assumptions. Using updated statistical models for more accurate CVD prediction can result in more precise frameworks to aid in proactive patient detection planning.

Keywords: Cardiovascular diseases, Fasa Cohort Study, GLMM tree, Mixed-effect model

Cite this article as: Asadi F, Homayounfar R, Farjam M, Mehrali Y, Masaebi F, Zayeri F. Identifying risk indicators of cardiovascular disease in Fasa Cohort Study (FACS): an application of generalized linear mixed-model tree. Arch Iran Med. 2024;27(5):239-247. doi: 10.34172/aim.2024.35