An Improved Framework for Cardiovascular Disease Prediction using Hybrid Ensemble Learning Soft-Voting Model
DOI:
https://doi.org/10.22399/ijcesen.2348Keywords:
Cardiovascular disease, Artificial Intelligence, Machine Learning, Hybrid Ensemble Learning, Soft-Voting ModelAbstract
Cardiovascular diseases (CVDs) remain the leading cause of death globally, accounting for approximately 17.9 million deaths each year. While diagnostic technologies have advanced, traditional methods often fail to detect early-stage heart disease, particularly in asymptomatic individuals. This study presents an intelligent, interpretable machine learning framework designed to enhance early CVD prediction. guided by domain-specific medical knowledge. Two well-established datasets the Kaggle Cardiovascular Disease Dataset and the Framingham Heart Study Dataset were used for model training and evaluation. Machine learning models including Random Forest, Support Vector Machine, and neural networks were applied, along with SMOTE-Tomek Links to address class imbalance. A score-specific weighted soft voting mechanism was used to improve prediction across varying risk categories. The final model achieved high performance: Score 1 (low risk) reached 97.4% accuracy and 0.99% AUC; Score 2 (moderate risk) achieved 83.6% accuracy and 0.90% AUC; Score 3 (high risk) yielded 93% accuracy and 0.97% AUC. These results demonstrate the model's high precision, recall, and generalizability across diverse patient profiles. By combining ensemble learning, medical knowledge, and advanced risk stratification, the framework offers a clinically relevant tool for early detection and personalized intervention, supporting proactive cardiovascular healthcare.
References
[1] World Health Organization. Cardiovascular diseases. https://www.who.int/health-topics/cardiovascular-diseases/#tab=tab_1
[2] Patidar, S., Kumar, D., & Rukwal, D. (2022). Comparative Analysis of Machine Learning Algorithms for Heart Disease Prediction. Advances in Transdisciplinary Engineering. 64-69. https://doi.org/10.3233/ATDE220723
[3] Mansoor, C., Chettri, S. K., & Naleer, H. (2023). Efficient Prediction Model for Cardiovascular Disease Using Deep Learning Techniques. Migration Letters. (S13);449-459. https://doi.org/10.59670/ml.v20iS13.6475
[4] Hammadi, J. F., Latif, A. B. A., & Cob, Z. B. C. (2025). Artificial intelligence approaches for cardiovascular disease prediction: a systematic review. Indonesian Journal of Electrical Engineering and Computer Science. 38(2);1208-1218. https://doi.org/10.11591/ijeecs.v38.i2.pp1208-1218
[5] Hasanah, U., Soleh, A. M., & Sadik, K. (2024). Effect of Random Under sampling, Oversampling, and SMOTE on the Performance of Cardiovascular Disease Prediction Models. Jurnal Matematika, Statistika dan Komputasi. 21(1);88-102. https://doi.org/10.20956/j.v21i1.35552
[6] Khurana, P., Sharma, S., & Goyal, A. (2021). Heart Disease Diagnosis: Performance Evaluation of Supervised Machine Learning and Feature Selection Techniques. Proceedings of the 8th International Conference on Signal Processing and Integrated Networks, SPIN 2021. 510-515. https://doi.org/10.1109/SPIN52536.2021.9565963
[7] Mohammad, M. A., Koul, S., Rylance, R., Fröbert, O., Alfredsson, J., Akhtar, Z., Buchebner, D., Erlinge, D., James, S., Varenhorst, C., Szummer, K., Jernberg, T., & Spaak, J. (2022). Development and validation of an artificial neural network algorithm to predict mortality and admission to hospital for heart failure after myocardial infarction: a nationwide population-based study. Lancet Digital Health. 4(1);e37-e45. https://doi.org/10.1016/S2589-7500(21)00228-4
[8] Nayak, D. S. K., et al. (2023). Enhancing Cardiovascular Disease Prediction Based on AI and IoT Concepts. International Journal on Recent and Innovation Trends in Computing and Communication. 11(10);218-228. https://doi.org/10.17762/IJRITCC.V11I10.8483
[9] Modak, S., Abdel-Raheem, E., & Rueda, L. (2022). Heart Disease Prediction Using Adaptive Infinite Feature Selection and Deep Neural Networks. 4th International Conference on Artificial Intelligence in Information and Communication, ICAIIC 2022 - Proceedings. 235-240. https://doi.org/10.1109/ICAIIC54071.2022.9722652
[10] Yilmaz, R., & Yağin, F. H. (2022). Early Detection of Coronary Heart Disease Based on Machine Learning Methods. Medical Records. 4(1);1-6. https://doi.org/10.37990/medr.1011924
[11] García-Ordás, M. T., Bayón-Gutiérrez, M., Benavides, C., Aveleira-Mata, J., & Benítez-Andrades, J. A. (2023). Heart disease risk prediction using deep learning techniques with feature augmentation. Multimedia Tools and Applications. 82(20);31759-31773. https://doi.org/10.1007/s11042-023-14817-z
[12] K. C, L., & C, A. B. (2023). Performance Analysis by Using the Knime Analytical Platform to Forecast Heart Failure Using Several Machine Learning Methods. International Journal of Research in Applied Science and Engineering Technology. 11(3);205-211. https://doi.org/10.22214/ijraset.2023.49376
[13] Sureja, N., Chawda, B., & Vasant, A. (2022). A novel salp swarm clustering algorithm for prediction of the heart diseases. Indonesian Journal of Electrical Engineering and Computer Science. 25(1);265-272. https://doi.org/10.11591/ijeecs.v25.i1.pp265-272
[14] Reis, H. C., & Turk, V. (2025). DSCIMABNet: A novel multi-head attention depthwise separable CNN model for skin cancer detection. Pattern Recognition. 159. https://doi.org/10.1016/j.patcog.2024.111182
[15] Doval Amiri, P. A., & Pierre, S. (2023). A Soft Voting Classification Model for Network Traffic Prediction in VANET/V2X. International Conference on Wireless and Mobile Computing, Networking and Communications. 231-236. https://doi.org/10.1109/WiMob58348.2023.10187731
[16] Rafi, R., & Anusha Bamini, A. M. (2024). An Enhanced Ensemble-based Approach for Network Anomaly Detection using Voting Classifier. 8th International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud), I-SMAC 2024 - Proceedings. 905-910. https://doi.org/10.1109/I-SMAC61858.2024.10714902
[17] Kablan, E. B. (2024). An Ensemble Voting Approach for Shoulder Implant Classification from X-Ray Images. 2024 47th International Conference on Telecommunications and Signal Processing, TSP 2024. 92-95. https://doi.org/10.1109/TSP63128.2024.10605940
[18] Mallampati, B., Ishaq, A., Rustam, F., Kuthala, V., Alfarhood, S., & Ashraf, I. (2023). Brain Tumor Detection Using 3D-UNet Segmentation Features and Hybrid Machine Learning Model. IEEE Access. 11;135020-135034. https://doi.org/10.1109/ACCESS.2023.3337363
[19] Tanveer, M. U., Munir, K., Amjad, M., Jafar Zaidi, S. A., Bermak, A., & Rehman, A. U. (2024). Ensemble-Guard IoT: A Lightweight Ensemble Model for Real-Time Attack Detection on Imbalanced Dataset. IEEE Access. 12;168938-168952. https://doi.org/10.1109/ACCESS.2024.3495708
[20] Mulya, A. D., & Alamsyah, A. (2024). Exploring Creditworthiness Using Social Media Data and Ensemble Soft Voting. 2024 12th International Conference on Information and Communication Technology (ICoICT). 29-35. https://doi.org/10.1109/ICoICT61617.2024.10698312
[21] Yang, D., & Xiao, B. (2024). Feature Enhanced Ensemble Modeling With Voting Optimization for Credit Risk Assessment. IEEE Access. 12;115124-115136. https://doi.org/10.1109/ACCESS.2024.3445499
[22] Badathala, R. K., Vajapeyayajula, V. R., & Junnuri, R. K. (2024). Hybrid-Ensemble Learning Based Fault Detection and Diagnosis of Chillers. 10th 2024 International Conference on Control, Decision and Information Technologies, CoDIT 2024. 1613-1618. https://doi.org/10.1109/CoDIT62066.2024.10708526
[23] Jamil, H., Qayyum, F., Iqbal, N., Jamil, F., & Kim, D. H. (2023). Optimal Ensemble Scheme for Human Activity Recognition and Floor Detection Based on AutoML and Weighted Soft Voting Using Smartphone Sensors. IEEE Sensors Journal. 23(3);2878-2890. https://doi.org/10.1109/JSEN.2022.3228120
[24] Rahman, A., Parvej, S., Alam, K. S., & Fattah, H. M. A. (2024). Optimizing SMS Spam Detection: Comparative Analysis of Hybrid Voting Ensembles and Bi-LSTM Networks with Stratified Cross-Validation. Proceedings of the 5th International Conference on Data Intelligence and Cognitive Informatics, ICDICI 2024. 1030-1035. https://doi.org/10.1109/ICDICI62993.2024.10810777
[25] Remadna, I., Terrissa, L. S., Al Masry, Z., & Zerhouni, N. (2023). RUL Prediction Using a Fusion of Attention-Based Convolutional Variational AutoEncoder and Ensemble Learning Classifier. IEEE Transactions on Reliability. 72(1);106-124. https://doi.org/10.1109/TR.2022.3190639
[26] Mohan, D., Amritha, R., & Raj, C. C. (2024). URL based Phishing Detection System using Ensemble of LSD Model. Proceedings of the 5th International Conference on Data Intelligence and Cognitive Informatics, ICDICI 2024. 68-73. https://doi.org/10.1109/ICDICI62993.2024.10810953
[27] Mohapatra, S., et al. (2023). A stacking classifiers model for detecting heart irregularities and predicting Cardiovascular Disease. Healthcare Analytics. 3;100133. https://doi.org/10.1016/J.HEALTH.2022.100133
[28] Das, T., & Sinha, B. B. (2023). A Comprehensive Study on Machine Learning Methods for Predicting Heart Disease: A Comparative Analysis. IET Conference Proceedings. 2023(5);205-210. https://doi.org/10.1049/ICP.2023.1491
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal of Computational and Experimental Science and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.