1 Department of Artificial Intelligence and Data Science, Kumaraguru College of Technology, Coimbatore, Tamil Nadu, India.
2 Department of MCA, Kumaraguru College of Technology, Coimbatore, Tamil Nadu, India.
3 Department of CSE, Kumaraguru College of Technology, Coimbatore, Tamil Nadu, India.
International Journal of Science and Research Archive, 2026, 19(01), 827-834
Article DOI: 10.30574/ijsra.2026.19.1.0824
Received on 02 March 2026; revised on 18 April 2026; accepted on 20 April 2026
Accurate prediction of patient health risk is important for early diagnosis and clinical decision-making. However, healthcare data is often complex and noisy, which limits the performance of individual machine learning models. This study proposes a stacked ensemble framework for healthcare risk prediction. The approach combines Random Forest, LightGBM, XGBoost, and CatBoost as base learners, with Logistic Regression as a meta-learner. The model is trained on 49,942 patient records with 15 clinical and lifestyle features. Class imbalance is handled using SMOTE applied within a stratified 5-fold cross-validation framework. SHAP-based explainability is used to improve interpretability of model predictions along with preprocessing steps such as feature scaling and encoding. Experimental results show strong performance across all evaluation metrics, outperforming individual models.
Stacked ensemble; Healthcare risk prediction; Machine learning; SMOTE; Clinical decision support
Preview Article PDF
Sudharson D, Sri Dharaneesh P, Preethi K N, Rakshitha S, Vijilesh V and Vimal EA. A stacked ensemble machine learning framework for healthcare risk prediction International Journal of Science and Research Archive, 2026, 19(01), 827-834. Article DOI: https://doi.org/10.30574/ijsra.2026.19.1.0824






