Graduates Employability Prediction through Rule-Based Classification Techniques with SMOTE in Imbalanced Data Sets

Authors

  • Paranya Palwisut Department of Data Science, Faculty of Science and Technology, Nakhon Pathom Rajabhat University
  • Apinan Junkorn Department of Data Science, Faculty of Science and Technology, Nakhon Pathom Rajabhat University
  • Mongkol Rodjan Department of Computer Technology, Faculty of Science and Technology, Nakhon Pathom Rajabhat University
  • Prapapan Pienchob Department of Food Science and Technology, Faculty of Science and Technology, Nakhon Pathom Rajabhat University

Keywords:

graduates employability, rule-based classification, SMOTE

Abstract

This research aims to develop graduates’ employability prediction models through rule-based classification techniques with SMOTE in imbalanced data sets. After analyzing the data, a class imbalance problem was found. In order to improve the quality of the data, SMOTE was used to increase the minority class. Then rules-based classification techniques (RIPPER, PART, and PRISM) were used to build the prediction models. Moreover, 5-fold cross-validation was utilized to split the data into the training and test sets. This research has measured performance models with accuracy, precision, recall, and f-measure. The experimental results demonstrated that the PRISM algorithm combined with SMOTE had the highest efficiency, with an accuracy of 85.69%, precision of 85.60%, recall of 85.70%, and f-measure of 85.60%, respectively.

References

Amphawan, K. (2022). Data mining. Retrieved from https://staff.informatics.buu.ac.th/~komate/886464/%5B6%5D-Classification.pdf. (in Thai)

Chawla, N. V., Bowyer, K. W., Hall, L. O., & Kegelmayer, W. P., (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligent Research, 16(2002), 321-357. https://doi.org/10.1613/jair.953

Chinjoho, P., Jabjone S., & Jongmuanwai, B. (2021). Developing information technology model into future careers forecast of graduate information technology major in Nakhonratchasima Rajabhat University. Journal of Technology Management Rajabhat Maha Sarakham University, 8(2), 20–32. (in Thai)

Chiroma, F., Liu, H., & Cocea, M. (2018). Suiciderelated text classification with prism algorithm. 2018 International Conference on Machine Learning and Cybernetics (ICMLC), Chengdu, China, 2018, (pp. 575-580). China: IEEE. doi: 10.1109/ICMLC.2018.8527032.

Hevo Data Inc. (2024). Rule based data mining classifier: A comprehensive guide 101. Retrieved from https://hevodata.com/learn/rule-based-data-mining/

Li, X., & Liu, B. (2014). Rule-based classification. Retrieved from https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=8557fde1e865f0dcc209468ccaf94f12a04b7835.

Kijsirikul, B. (2002). Data mining algorithm. Bangkok: Department of Computer Engineering, Faculty of Engineering, Chulalongkorn University, Thailand. (in Thai)

Kladkaew, S. (2014). Decision support systems for selecting careers in accordance with the ability of the graduates (Master’s thesis). Ramkhamhaeng University. Bangkok (in Thai)

Markoulidakis, J., Rallis, I., Georgoulas, I., Kopsiaftis, G., Doulamis, A., & Doulamis, N. (2021). Multiclass confusion matrix reduction method and Its application on net promoter score classification problem. Technologies 2021, 9, 81. doi: 10.3390/technologies9040081

Mohamed, W. N. H. W., Salleh, M. N. M., & Omar, A. H. (2012). A comparative study of reduced error pruning method in decision tree algorithms. 2012 IEEE International Conference on Control System,Computing and Engineering, Penang, Malaysia, 2012 (pp. 392-397). Malaysia: IEEE. doi: 10.1109/ICCSCE.2012.6487177.

Thakur, S. (2020). Detection of malicious URLs in big data using RIPPER algorithm. International Journal of Creative Research Thoughts (IJCRT), 8(4), 767-771. https://ijcrt.org/papers/IJCRT2004096.pdf

Vilailuck, S., Jaroenpuntaruk, V., & Wichadakul, D. (2015). Utilizing data mining techniques to forecast student academic achievement of Kasetsart University Laboratory School Kamphaeng Saen Campus Educational Research and Development Center. Veridian E-Journal, Science and Technology Silpakorn University, 2(2), 1-17. (in Thai)

Wanon, S., Arreerard T., & Sanrach, C. (2018). A study of techniques in predicting career counseling for undergraduate students of the computer program by using data mining technique. Journal of Technology Management Rajabhat Maha Sarakham University, 5(1), 164–171. (in Thai)

Downloads

Published

2024-04-26

How to Cite

Palwisut, P. ., Junkorn, A. ., Rodjan, M. ., & Pienchob, P. . (2024). Graduates Employability Prediction through Rule-Based Classification Techniques with SMOTE in Imbalanced Data Sets. EAU Heritage Journal Science and Technology (Online), 18(1), 145–160. retrieved from https://he01.tci-thaijo.org/index.php/EAUHJSci/article/view/264157

Issue

Section

Research Articles