Design and implementation of an educational data warehouse with advanced mining techniques for data-driven decision making: A Comprehensive ReviewECISION MAKING: A COMPREHENSIVE REVIEW
DOI:
https://doi.org/10.22399/ijcesen.5190Keywords:
Design educational data , implementation educational data , advanced mining techniques, data-driven decision makingAbstract
In the contemporary educational landscape, the exponential growth of data from Learning Management Systems (LMS), student information systems, and assessment portals presents both a challenge and an opportunity. This review paper critically examines the integration of Educational Data Warehousing (EDW) with Educational Data Mining (EDM) to foster data-driven decision-making in higher education. We explore the architectural requirements for a scalable EDW, focusing on hybrid cloud infrastructures that ensure security and accessibility. Furthermore, the review analyzes the efficacy of advanced data mining techniques—specifically clustering, classification, and predictive modeling using algorithms like Decision Trees, Neural Networks, and Naïve Bayes—in identifying at-risk students and optimizing curriculum effectiveness. By synthesizing findings from recent high-impact literature (2021–2025), we identify a critical gap in the real-time processing capabilities of current systems and propose a unified framework that couples robust warehousing with intelligent AI-driven dashboards. This research contributes to the strategic alignment of institutional goals with student outcomes, offering a roadmap for administrators to implement precision education.
References
1. Ismaili, B., & Besimi, A. (2024). A Data Warehousing Framework for Predictive Analytics in Higher Education: A Focus on Student at-Risk Identification. SEEU Review, 19(2), 43-56.
2. Saraswat, P., & Raj, S. (2022). Educational Data Mining and Data Warehouse Design Using Business Intelligence. International Journal of Innovative Research in Computer Science & Technology, 10(1), 97-101.
3. Kartik, & Manhas, S. (2024). Analysing Data Warehousing and Data Mining Applications in the Education Sector with Realism. International Journal of Engineering Research & Technology, 12(3).
4. Kimball, R., & Ross, M. (2013). The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling (3rd ed.). Wiley.
5. Inmon, W. H. (2005). Building the Data Warehouse (4th ed.). Wiley.
6. Triayudi, A., Aldisa, R. T., & Sumiati, S. (2024). New Framework of Educational Data Mining to Predict Student Learning Performance. Journal of Wireless Mobile Networks, Ubiquitous Computing, and Dependable Applications, 15(1), 115-132.
7. Ahmed, A. (2024). Predictive Model for Student's Academic Performance Using Machine Learning Techniques. International Journal for Science Technology and Engineering.
8. Almotiry, O. N., Sha, M., Rahamathulla, M. P., & Omer, O. S. D. (2021). Hybrid Cloud Architecture for Higher Education System. Computer Systems Science & Engineering, 36(1).
9. El Mahmoudi, A., Chaoui, N. E. H., & Chaoui, H. (2025). Predictive Analytics Leveraging a Machine Learning Approach to Identify Students’ Reasons for Dropping out of University. Applied Sciences, 15, 8496.
10. Gupta, A., Garg, D., & Kumar, P. (2022). An ensembling model for early identification of at-risk students in higher education.
11. Vasconcelos, A. N., Freires, L. A., Loureto, G. D. L., & others. (2023). Advancing school dropout early warning systems: the IAFREE relational model for identifying at-risk students.
12. Owolabi, & Obadaki. (2024). Decision Tree implementation for educational data mining. Journal of Educational Computing Research.
13. Nowakowski, P., Zórawski, P., Cabaj, K., & Mazurczyk, W. (2021). Detecting Network Covert Channels using Machine Learning, Data Mining and Hierarchical Organisation of Frequent Sets. Journal of Wireless Mobile Networks, Ubiquitous Computing, and Dependable Applications, 12(1), 20-43.
14. Rosda, R., Wali, M., & Imilda, I. (2023). Evaluation of the Successful Implementation of the SIMARDI Using the Technology Acceptance Model (TAM). SAGA: Journal of Technology and Information System, 1(2), 56-63.
15. Shahzad, F., Saeed, A., Asim, G. A., Qureshi, F., Rehman, I. U., & Qureshi, S. (2021). Political connections and firm performance: Further evidence using a generalised quantile regression approach. IIMB Management Review, 33(3), 205-213.
16. Sharma, S., & Shiwani, S. (2014). Data mining based accuracy enhancement of ANN using Swarm intelligence. International Journal of Communication and Computer Technologies, 2(1), 43-46.
17. Sreenivasulu, G. (2024). A Hybrid Optical-Acoustic Modem Based on Mimo Ofdm for Reliable Data Transmission in Green Underwater Wireless Communication. Journal of VLSI Circuits and Systems, 6(1), 36-42.
18. Talakua, M. A. (2023). The Role of Christian Religious Education Teachers in Shaping Student Character Through Peace Education and Conflicts Resolution Among Students. Athena: Journal of Social, Culture and Society, 1(4), 257-261.
19. Triayudi, A., & Fitri, I. (2018). ALG clustering to analyze the behavioural patterns of online learning students. Journal of Theoretical & Applied Information Technology, 96(16), 5327-5337.
20. Triayudi, A., & Fitri, I. (2019). A new agglomerative hierarchical clustering to model student activity in online learning. TELKOMNIKA, 17(3), 1226-1235.
21. Varshavardhini, S., & Rajesh, A. (2023). An Efficient Feature Subset Selection with Fuzzy Wavelet Neural Network for Data Mining in Big Data Environment. Journal of Internet Services and Information Security, 13(2), 233-248.
22. Wijayanto, A., Suprapto, Y. K., & Wulandari, D. P. (2019). Clustering on Multidimensional Poverty Data using PAM and K-prototypes Algorithm. IEEE International Seminar on Intelligent Technology and Its Applications.
23. Yusupa, A., Manullang, J., Marbun, N., & Ginting, S. B. F. (2023). Decision Support System for Determining the Best PAUD Teacher Using the MOORA Method. SAGA: Journal of Technology and Information System, 1(2), 50-55.
24. Ahmed, S. T., Al-Hamdani, R., & Croock, M. S. (2020). Developed third iterative dichotomizer based on feature decisive values for educational data mining. Indonesian Journal of Electrical Engineering and Computer Science, 18(1), 209-217.
25. Alejandro, A., Vaisman, F., Besteiro, M., & Valverde. (2019). Modelling and Querying Star and Snowflake Warehouses Using Graph Databases.
26. Aman, F., Rauf, A., Ali, R., Iqbal, F., & Khan, M. (2019). A predictive model for predicting students' academic performance.
27. Amani, K., Fatma, B., Yessine, H. K., & Zouhaier, J. (2023). At-Risk Students Identification based on Machine Learning Approach: A Case Study.
28. Amita, D., Kamna, S., Sandeep, D., & Omdev. (2021). Predicting Students Performance Using Educational Data Mining and Learning Analytics: A Systematic Literature Review.
29. Andres, G. N., Julieta, N., Luis, N., Robledo-Rella, V., & García-Castelán, R. M. G. (2023). Predictive analytics study to determine undergraduate students at risk of dropout. Frontiers in Education.
30. Asiah, M., Zulkarnaen, K. N., Safaai, D., & Shahbudin, H. M. (2019). A review on predictive modeling technique for student academic performance monitoring.
31. Akid, H., Frey, G., Ayed, M., & Lachiche, N. (2022). Performance of NoSQL graph implementations of star vs. snowflake schemas. IEEE Access.
32. Duarte, R., Ramos-Pires, A., & others. (2014). Identifying at-risk students in higher education.
33. Nimy, E. N. M., Mosia, M., & Chibaya, C. (2023). Identifying At-Risk Students for Early Intervention - a Probabilistic Machine Learning Approach. Social Science Research Network.
34. Gaftandzhieva, S., Hussain, S., Hilcenko, S., Doneva, R., & Boykova, K. (2023). Data-driven decision making in higher education institutions: state-of-play.
35. He, J., Bailey, J., Rubinstein, B., & Zhang, R. (2015). Identifying at-risk students in massive open online courses.
36. Helal, S., Li, J., Liu, L., Ebrahimie, E., Dawson, S., Murray, D., & Long, Q. (2018). Predicting academic performance by considering student heterogeneity.
37. Hellas, A., Ihantola, P., Petersen, A., Ajanovski, V., Gutica, M., Hynninen, T., Knutas, A., Leinonen, J., Messom, C., & Liao, S. (2018). Predicting academic performance: a systematic literature review. ACM Conference.
38. Iqbal, M., Mustafa, G., Sarwar, N., Wajid, S., Nasir, J., & Siddque, S. (2019). A Review of Star Schema and Snowflakes Schema.
39. Ogundele, I. M., Taiwo, O., Babalola, A. E., & Ayeni, O. C. (2024). Prediction of Student Academic Performance Based on Machine Learning Model.
40. Osborne, J. B., & Lang, A. S. (2023). Predictive Identification of At-Risk Students: Using Learning Management System Data.
41. Kam, C. L., Billy, W., & Hon, T. C. (2023). Prediction of At-Risk Students Using Learning Analytics: A Literature Review. Communications in Computer and Information Science.
42. Manish, A. B., Srinivas, K. M., & Sriram, P. (2011). Star and snowflake schemas in extract, transform, load processes.
43. Ofori, R., & Charlton, J. P. (2002). A path model of factors influencing the academic performance of nursing students.
44. Ouatik, F., Erritali, M., Ouatik, F., & Jourhmane, M. (2022). Predicting Student Success Using Big Data and Machine Learning Algorithms. Int. J. Emerg. Technol. Learn.
45. Pardo, A., Mirriahi, N., Martinez-Maldonado, R., Yacef, K., & Kay, J. (2016). Generating actionable predictive models of academic performance.
46. Pei, B., & Xing, W. (2022). An interpretable pipeline for identifying at-risk students. Journal of Educational Computing Research.
47. Preet, K., & Sachin, A. (2017). A Review on Prediction of Academic Performance of Students At-Risk Using Data Mining Techniques.
48. Shen, F. (2007). An explorer of the student crisis forecast and support system in higher education based on data warehouse. Journal of Chaohu College.
49. Teng, Y., Zhang, J., & Sun, T. (2022). Data-driven decision-making model based on artificial intelligence in higher education system.
50. Tucker, L., & McKnight, O. (2019). Assessing the Validity of College Success Indicators for the At-Risk Student. Journal of College Student Retention.
51. Attaran, M. M., Attaran, S., & Celik, G. B. (2017). Promises and challenges of cloud computing in higher education. Journal of Higher Education Theory and Practice, 17(6), 20-38.
52. Pardeshi, V. H. (2013). Cloud computing for higher education institutes: Architecture, strategy and recommendations. SIMSARC.
53. El Mhouti, A., & Erradi, M. (2019). Harnessing cloud computing services for e-learning systems in higher education. International Journal of Information and Communication Technology Education.
54. Moh Noor, A. S. B., Atoom, M. N. Y., & Mamat, R. (2019). A review of cloud oriented mobile learning platform and frameworks. International Journal of Electrical and Computer Engineering.
55. Riahi, G. (2015). E-learning systems based on cloud computing: A review. Procedia Computer Science.
56. Baldassarre, M. T., Caivano, D., Dimauro, G., Gentile, E., & Visaggio, G. (2018). Cloud computing for education: A systematic mapping study. IEEE Transactions on Education.
57. Elgelany, A., & Alghabban, W. G. (2017). Cloud computing: Empirical studies in higher education. International Journal of Advanced Computer Science and Applications.
58. Nan Cenka, B. A., & Hasibuan, Z. A. (2013). Enhancing educational services using cloud technology. ICoICT.
59. Bogdanovic, Z., Milic, A., & Labus, L. (2017). Model of e-education infrastructure based on cloud computing. IGI Global.
60. Mehdi, W. A. (2015). A proposed architecture of cloud computing for teaching and education. GSTF Journal on Computing.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal of Computational and Experimental Science and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.