International Journal of

ADVANCED AND APPLIED SCIENCES

EISSN: 2313-3724, Print ISSN: 2313-626X

Frequency: 12

line decor
  
line decor

 Volume 11, Issue 5 (May 2024), Pages: 140-150

----------------------------------------------

 Original Research Paper

A novel approach to mitigate academic underachievement in higher education: Feature selection, classifier performance, and interpretability in predicting student performance

 Author(s): 

 Safira Begum *, M. V. Ashok

 Affiliation(s):

 Department of Computer Applications, HKBKDC, Bangalore, India

 Full text

  Full Text - PDF

 * Corresponding Author. 

  Corresponding author's ORCID profile: https://orcid.org/0000-0003-2883-9994

 Digital Object Identifier (DOI)

 https://doi.org/10.21833/ijaas.2024.05.015

 Abstract

The main goal of this study is to address the ongoing problem of low academic performance in higher education by using machine learning techniques. We use a dataset from a higher education institution that includes various information available at student enrollment, such as academic history, demographics, and socio-economic factors. To address this issue, we introduce a new method that combines the Slime Mould Algorithm (SMA) for efficient feature selection with a Forest-Optimized Neural Network (FO-NN) Classifier. Our method aims to identify students at risk of academic failure early. Using the SMA, we simplify the feature selection process, identifying important attributes for accurate predictions. The Forest Optimization technique improves the classification process by optimizing the neural network model. The experimental results of this study show that our proposed method is effective, with significant improvements in feature selection accuracy and notable enhancements in the predictive performance of the neural network classifier. By selecting a subset of relevant features, our approach deals with high-dimensional datasets and greatly improves the quality and interpretability of predictive models. The innovative combination of the SMA and the FO-NN classifier increases accuracy, interpretability, and the ability to generalize in predicting student performance. This work contributes to a more effective strategy for reducing academic underachievement in higher education.

 © 2024 The Authors. Published by IASE.

 This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

 Keywords

 Academic underachievement, Machine learning, Slime mould algorithm, Feature selection, Predictive performance

 Article history

 Received 24 August 2023, Received in revised form 10 February 2024, Accepted 1 May 2024

 Acknowledgment 

No Acknowledgment.

 Compliance with ethical standards

 Conflict of interest: The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

 Citation:

 Begum S and Ashok MV (2024). A novel approach to mitigate academic underachievement in higher education: Feature selection, classifier performance, and interpretability in predicting student performance. International Journal of Advanced and Applied Sciences, 11(5): 140-150

 Permanent Link to this page

 Figures

 Fig. 1 Fig. 2 Fig. 3 Fig. 4 Fig. 5 Fig. 6 Fig. 7 Fig. 8 Fig. 9 Fig. 10 

 Tables

 Table 1 Table 2 Table 3 Table 4 Table 5 

----------------------------------------------   

 References (21)

  1. Alex SA, Jhanjhi NZ, Humayun M, Ibrahim AO, and Abulfaraj AW (2022). Deep LSTM model for diabetes prediction with class balancing by SMOTE. Electronics, 11(17): 2737. https://doi.org/10.3390/electronics11172737   [Google Scholar]
  2. Andrade TLD, Rigo SJ, and Barbosa JLV (2021). Active methodology, educational data mining and learning analytics: A systematic mapping study. Informatics in Education, 20(2): 171-204.   [Google Scholar]
  3. Batool S, Rashid J, Nisar MW, Kim J, Kwon HY, and Hussain A (2023). Educational data mining to predict students' academic performance: A survey study. Education and Information Technologies, 28(1): 905-971. https://doi.org/10.1007/s10639-022-11152-y   [Google Scholar]
  4. Hall MM, Worsham RE, and Reavis G (2021). The effects of offering proactive student-success coaching on community college students’ academic performance and persistence. Community College Review, 49(2): 202-237. https://doi.org/10.1177/0091552120982030   [Google Scholar]
  5. Hamoud A (2016). Selection of best decision tree algorithm for prediction and classification of students’ action. American International Journal of Research in Science, Technology, Engineering and Mathematics, 16(1): 26-32.   [Google Scholar]
  6. Khan A and Ghosh SK (2021). Student performance analysis and prediction in classroom learning: A review of educational data mining studies. Education and Information Technologies, 26: 205-240. https://doi.org/10.1007/s10639-020-10230-3   [Google Scholar]
  7. Lampropoulos G (2023). Educational data mining and learning analytics in the 21st century. In: Wang J (Ed.), Encyclopedia of data science and machine learning: 1642-1651. IGI Global, Pennsylvania, USA. https://doi.org/10.4018/978-1-7998-9220-5.ch098   [Google Scholar]
  8. Li S, Chen H, Wang M, Heidari AA, and Mirjalili S (2020). Slime mould algorithm: A new method for stochastic optimization. Future Generation Computer Systems, 111: 300-323. https://doi.org/10.1016/j.future.2020.03.055   [Google Scholar]
  9. Mangina E and Psyrra G (2021). Review of learning analytics and educational data mining applications. In the 13th International Conference on Education and New Learning Technologies: 949-954. https://doi.org/10.21125/edulearn.2021.0250   [Google Scholar]
  10. Martins MV, Tolledo D, Machado J, Baptista LM, and Realinho V (2021). Early prediction of student’s performance in higher education: A case study. In: Rocha Á, Adeli H, Dzemyda G, Moreira F, and Ramalho Correia AM (Eds.), Trends and applications in information systems and technologies: 166-175. Volume 19, Springer International Publishing, Cham, Switzerland. https://doi.org/10.1007/978-3-030-72657-7_16   [Google Scholar]
  11. Mduma N (2023). Data balancing techniques for predicting student dropout using machine learning. Data, 8(3): 49. https://doi.org/10.3390/data8030049   [Google Scholar]
  12. Miguéis VL, Freitas A, Garcia PJ, and Silva A (2018). Early segmentation of students according to their academic performance: A predictive modelling approach. Decision Support Systems, 115: 36-51. https://doi.org/10.1016/j.dss.2018.09.001   [Google Scholar]
  13. Mutrofin S, Mu'alif A, Ginardi RVH, and Fatichah C (2019). Solution of class imbalance of k-nearest neighbor for data of new student admission selection. International Journal of Artificial Intelligence Research, 3(2): 47-55. https://doi.org/10.29099/ijair.v3i2.92   [Google Scholar]
  14. Orujpour M, Feizi-Derakhshi MR, and Rahkar-Farshi T (2020). Multi-modal forest optimization algorithm. Neural Computing and Applications, 32(10): 6159-6173. https://doi.org/10.1007/s00521-019-04113-z   [Google Scholar]
  15. Phan M, De Caigny A, and Coussement K (2023). A decision support framework to incorporate textual data for early student dropout prediction in higher education. Decision Support Systems, 168: 113940. https://doi.org/10.1016/j.dss.2023.113940   [Google Scholar]
  16. Pojon M (2017). Using machine learning to predict student performance. M.Sc. Thesis, University of Tampere, Tampere, Finland.   [Google Scholar]
  17. Romero C and Ventura S (2020). Educational data mining and learning analytics: An updated survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10(3): e1355. https://doi.org/10.1002/widm.1355   [Google Scholar]
  18. Sha L, Raković M, Das A, Gašević D, and Chen G (2022). Leveraging class balancing techniques to alleviate algorithmic bias for predictive tasks in education. IEEE Transactions on Learning Technologies, 15(4): 481-492. https://doi.org/10.1109/TLT.2022.3196278   [Google Scholar]
  19. Smadi A, Al-Qerem A, Nabot A, Jebreen I, Aldweesh A, Alauthman M, and Alzghoul MB (2023). Unlocking the potential of competency exam data with machine learning: Improving higher education evaluation. Sustainability, 15(6): 5267. https://doi.org/10.3390/su15065267   [Google Scholar]
  20. Tsiakmaki M, Kostopoulos G, Kotsiantis S, and Ragos O (2020). Transfer learning from deep neural networks for predicting student performance. Applied Sciences, 10(6): 2145. https://doi.org/10.3390/app10062145   [Google Scholar]
  21. Xiao W, Ji P, and Hu J (2022). A survey on educational data mining methods used for predicting students' performance. Engineering Reports, 4(5): e12482. https://doi.org/10.1002/eng2.12482   [Google Scholar]