ejecvnlp Open Access Journal

European Journals of Emerging Computer Vision and Natural Language Processing

eISSN: Applied
Publication Frequency : 2 Issues per year.

  • Peer Reviewed & International Journal
Table of Content
Issues (Year-wise)
Loading…

Open Access iconOpen Access

ARTICLE

Enhancing Indonesian Scientific Article Management through Machine Learning and NLP

1 Department of Communication, University of Nevada, Las Vegas (UNLV), Las Vegas, NV, USA
2 Department of Political Science, University of Arkansas, Fayetteville, AR, USA

Citations: Loading…
ABSTRACT VIEWS: 5   |   FILE VIEWS: 3   |   PDF: 3   HTML: 0   OTHER: 0   |   TOTAL: 8
Views + Downloads (Last 90 days)
Cumulative % included

Abstract

The exponential growth of scientific literature in Indonesia necessitates efficient automated systems for organizing, retrieving, and assessing the originality of scholarly articles. This paper explores the application of computational methods, specifically machine learning algorithms for classification and similarity measures, to enhance the management of Indonesian scientific journal articles. We investigate the effectiveness of Naive Bayes and Support Vector Machine (SVM) algorithms for thematic categorization and employ Cosine Similarity for identifying content proximity. The proposed framework includes data preprocessing, feature extraction using TF-IDF, and rigorous evaluation of the models. The findings demonstrate the viability of these approaches in improving the accessibility, discoverability, and integrity of the burgeoning volume of Indonesian academic publications. The Naive Bayes method, when applied to a balanced dataset, achieved an impressive F1-score of 98%, indicating high classification accuracy, with the classification process taking less than 60 minutes. Article similarity detection using the Cosine Similarity method accurately reflected the degree of similarity between concatenated titles and abstracts. This research offers a robust framework for enhancing the classification and search capabilities within national aggregator services like Garba Rujukan Digital (GARUDA).


Keywords

Similarity Detection,, Indonesian Scientific Journals, Natural Language Processing, Support Vector Machine

References

[1] L. Lukman et al., “Proposal of the s-score for measuring the performance of researchers, institutions, and journals in Indonesia,” Science Editing, vol. 5, no. 2, pp. 135–141, 2018, doi: 10.6087/KCSE.138.

[2] M. M. Saritas and A. Yasar, “Performance analysis of ANN and naive Bayes classification algorithm for data classification,” International Journal of Intelligent Systems and Applications in Engineering, vol. 7, no. 2, pp. 88–91, 2019, doi: 10.18201//ijisae.2019252786.

[3] A. S. Osman, “Data mining techniques: review,” International Journal of Data Science Research, vol. 2, no. 1, pp. 1–4, 2019.

[4] F. R. Lumbanraja, E. Fitri, Ardiansyah, A. Junaidi, and R. Prabowo, “Abstract classification using support vector machine algorithm (case study: abstract in a computer science journal),” Journal of Physics: Conference Series, vol. 1751, no. 1. 2021, doi: 10.1088/1742-6596/1751/1/012042.

[5] S. Latif, U. Suwardoyo, and E. A. W. Sanadi, “Content abstract classification using naive Bayes,” Journal of Physics: Conference Series, vol. 979, no. 1, 2018, doi: 10.1088/1742-6596/979/1/012036.


How to Cite

Enhancing Indonesian Scientific Article Management through Machine Learning and NLP. (2025). European Journals of Emerging Computer Vision and Natural Language Processing, 2(02), 29-44. https://parthenonfrontiers.com/index.php/ejecvnlp/article/view/451

Share Link