SMOTE to Improve the Performance of Naïve Bayes and Random Forest in Sentiment Analysis of Digitalent applications
DOI:
https://doi.org/10.31316/jdi.v14i2.347Keywords:
Sentiment Analysis, Naïve Bayes, Random Forest, Smote Implementation, DigitalentAbstract
Sentiment analysis is critical to understanding how an app, such as a digital training app like Digitalent, is viewed by users. User reviews available on app distribution platforms provide ample data for this analysis. However, in sentiment analysis, data imbalance is a common problem; positive reviews tend to outnumber negative and neutral reviews. This imbalance can impact machine learning models, which can lead to inaccurate predictions of the majority class. The purpose of this research is to solve this problem by using SMOTE (Synthetic Minority Selection Technique) technique in sentiment analysis of Digitalent app reviews and comparing the performance of two machine learning algorithms, Naive Bayes and Random Forest. The research data was collected from Indonesian user reviews from the Digitalent platform. Before being processed for analysis, the data went through pre-processing processes such as cleaning, tokenization, and normalization. SMOTE technique was applied to balance the number of reviews for each sentiment class. Furthermore, Naive Bayes and Random Forest algorithms are used to categorize the sentiment. The results of the SMOTE application research successfully increased the proportion of negative and neutral classes, so that the distribution of the dataset became balanced. The test results show that the accuracy of Naïve Bayes increased from 68.25% to 92.16%, while Random Forest increased from 68.25% to 92.16%.Keywords: K-Means Clustering, education level, clustering, village education, RapidMiner
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Yusril Muhamad Izha Mahendra

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlike 4.0 International License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
