K-Means and J48 Algorithms to Categorize Student Research Abstracts
DOI:
https://doi.org/10.34306/ijcitsm.v3i1.125Keywords:
Text Mining, K-Means Algorithm, J48 Algorithm, Classification Method, Research AbstractsAbstract
Text mining is a rapidly growing field in computer science that is used to extract meaningful information from text data. This information can be used for various applications, such as categorizing research abstracts based on their content. This study focuses on the use of text mining techniques. The goal was to determine which algorithm was more accurate in categorizing the research abstracts. The results of the study indicated that the J48 algorithm outperformed the K-Means algorithm in terms of accuracy. This suggests that the J48 algorithm is a more effective method for categorizing research abstracts based on their content. Additionally, the findings provide insight into the use of text mining techniques for categorizing research abstracts in specific fields, such as computer science. Overall, the study demonstrates the potential of text mining techniques for analyzing and categorizing large volumes of text data. As the field of text mining continues to grow, it is likely that more applications will emerge, making it easier to extract valuable information from unstructured text data. The findings of this study can be used to improve the efficiency and accuracy of text mining techniques, particularly for categorizing research abstracts in specific fields.
References
Ahmed, M., Seraj, R., & Islam, S. M. S. (2020). The k-means algorithm: A comprehensive survey and performance evaluation. Electronics, 9(8), 1295.
Sinaga, K. P., & Yang, M.-S. (2020). Unsupervised K-means clustering algorithm. IEEE Access: Practical Innovations, Open Solutions, 8, 80716–80727.
Tian, K., Li, J., Zeng, J., Evans, A., & Zhang, L. (2019). Segmentation of tomato leaf images based on adaptive clustering number of K-means algorithm. Computers and Electronics in Agriculture, 165(104962), 104962.
Zhu, E., Zhang, Y., Wen, P., & Liu, F. (2019). Fast and stable clustering analysis based on Grid-mapping K-means algorithm and new clustering validity index. Neurocomputing, 363, 149–170.
Song, K., Yao, X., Nie, F., Li, X., & Xu, M. (2021). Weighted bilateral K-means algorithm for fast co-clustering and fast spectral clustering. Pattern Recognition, 109(107560), 107560.
Rahardja, U., Harahap, E. P., & Dewi, S. R. (2019). The strategy of enhancing article citation and H-index on SINTA to improve tertiary reputation. TELKOMNIKA (Telecommunication Computing Electronics and Control), 17(2), 683.
Zarlis, M., Harahap, E. P., & Husna, L. N. (2019). Test appraisal system application based on YII Framework as media input student value final project and thesis session at higher education. Aptisi Transactions On Technopreneurship (ATT), 1(1), 73–81.
Kartini, Santoso, S., Harahap, E. P., Khoirunisa, A., & Zelina, K. (2021). A systematic review through intellectual based blockchain-intermediary. 2021 9th International Conference on Cyber and IT Service Management (CITSM), 1–7.
Krishnakumar, S., & Manivannan, K. (2021). RETRACTED ARTICLE: Effective segmentation and classification of brain tumor using rough K means algorithm and multi kernel SVM in MR images. Journal of Ambient Intelligence and Humanized Computing, 12(6), 6751–6760.
Bienvenido-Huertas, D., Nieto-Julián, J. E., Moyano, J. J., Macías-Bernal, J. M., & Castro, J. (2020). Implementing artificial intelligence in H-BIM using the J48 algorithm to manage historic buildings. International Journal of Architectural Heritage: Conservation, Analysis, and Restoration, 14(8), 1148–1160.
Adnan, M., Sarno, R., & Sungkono, K. R. (2019). Sentiment analysis of restaurant review with classification approach in the decision tree-J48 algorithm. 2019 International Seminar on Application for Technology of Information and Communication (ISemantic), 121–126.
Azizah, N. N., & Mariyanti, T. (2022). Education and technology management policies and practices in madarasah. International Transactions on Education Technology, 1(1), 29–34.
Hermawan, D. R., Fahrio Ghanial Fatihah, M., Kurniawati, L., & Helen, A. (2021). Comparative study of J48 decision tree classification algorithm, random tree, and random forest on in-vehicle CouponRecommendation data. 2021 International Conference on Artificial Intelligence and Big Data Analytics, 1–6.
Moodi, F., & Saadatfar, H. (2022). An improved K‐means algorithm for big data. IET Software, 16(1), 48–59.
Nandapala, E. Y. L., & Jayasena, K. P. N. (2020). The practical approach in Customers segmentation by using the K-Means Algorithm. 2020 IEEE 15th International Conference on Industrial and Information Systems (ICIIS), 344–349.
Razdan, S., Gupta, H., & Seth, A. (2021). Performance analysis of network intrusion detection systems using J48 and naive Bayes algorithms. 2021 6th International Conference for Convergence in Technology (I2CT), 1–7.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Lee Kyung Choi, Kim Beom Rii, Han Woo Park

This work is licensed under a Creative Commons Attribution 4.0 International License.