ADDITIONAL MENU
Analyzing Opinion Polarization on Joko Widodo's Diploma Using Machine Learning
Abstract
The issue of the authenticity of former President Joko Widodo's diploma has become a hot topic of debate in the digital space, especially in the comments section of Kompas TV's YouTube channel. The wide diversity of opinions reflects a polarization of public opinion that is interesting to analyze further. Given the large amount of text data from public comments, manual analysis is ineffective, so a technology-based approach is needed to systematically group opinions. Therefore, this study was conducted to analyze the polarization of public opinion using a machine learning approach. Two classification algorithms, Naive Bayes and Random Forest, were used to distinguish between pro and con public comments on the issue. Data was obtained through an automated collection process (web scraping), followed by text pre-processing and word weighting using the TF-IDF (Term Frequency–Inverse Document Frequency) method. The test results showed that the Random Forest algorithm performed best with an accuracy of 91%, while Naïve Bayes only achieved 74%. This shows that the Random Forest method is more effective than the Naïve Bayes approach in detecting unstructured text patterns. This study concludes that machine learning can be used effectively to identify trends in public opinion on social media and can serve as a basis for further research using word embedding and deep learning models
Keywords
Machine Learning; Naïve Bayes; Public Opinion Polarization; Random Forest; Sentiment Analysis
References
P. Studi et al., "Sentiment Analysis of YouTube TvOne Comments on Ustadz Abdul Somad's Deportation from Singapore Using the SVM Algorithm," vol. 13, no. 1, pp. 72–80, 2023, doi: 10.37859/jf.v13i01.4920.
A. A. Chamid, R. Nindyasari, N. Azizah, and A. Hariyadi, “Analysis of public opinion on the governor candidate debate using LDA and IndoBERT,” vol. 4, no. 3, 2025, doi: 10.22219/kinetik.v10i3.2221.
A. Azzam and S. Mahardiningtyas, “Integrating Sentiment Analysis and Quality Function Deployment for Product Development,” vol. 7, no. 2, pp. 1–9, 2023, doi: 10.29099/ijair.v7i2.1100.
N. Bayes, C. Dan, and R. Forest, “Sentiment Analysis Of The Sirekap 2024 Playstore Application Post-Presidential Election With A Comparison Of Support Vector Machine (Svm), Naïve Bayes Classifier, And Random Forest Methods,” Vol. 11, No. 3, 2025.-Presidential Election With Comparison Of Support Vector Machine (Svm), Naïve Bayes Classifier, And Random Forest Methods,” vol. 11, no. 3, pp. 661–670, 2025, doi: 10.25126/jtiik.2025129608.
S. Khomsah, P. Studi, S. Data, J. Tengah, and I. Artikel, “Sentiment Analysis on YouTube Comments Using Word2Vec and Random Forest,” vol. 18, no. 1, pp. 61–72, 2021, doi: 10.31515/telematika.v18i1.4493.
K. Di, Y. Tentang, I. Afdhal, R. Kurniawan, I. Iskandar, and R. Salambue, “Application of Random Forest Algorithm for Sentiment Analysis,” vol. 5, no. 1, pp. 122–130, 2022, doi: 10.32672/jnkti.v5i1.4004.
M. R. Ramadhan and K. Budiman, “METHOD Sentiment Analysis of Presidential Candidates in 2024: A Comparison of the Performance of Support Vector Machine and Random Forest with N-Gram Method," vol. 3, no. 1, pp. 34–42, 2025, doi: 10.15294/rji.v3i1.8385.
S. A. Penggalih, S. Mujilahwati, and A. A. Bettaliyah, “Twitter Sentiment Analysis to Assess Public Opinion on Jokowi ’ s Performance Over Two Periods using the Recurrent Neural Network ( RNN ) Method,” vol. 5, no. 1, 2025, doi: 10.59934/jaiea.v5i1.1736.
X. S. Media, “Sentiment Analysis on Application X on the Use of Red Oil Using the Naïve Bayes Method,” vol. 8, no. 1, pp. 149–160, 2025, doi: 10.24014/ijaidm.v8i1.35422.
A. M. Putri, R. Yanuarti, M. Dasuki, and A. M. Susetyo, “Indonesian Journal of Information Systems and Technology Sentiment Analysis of Digital Population Identity Using the Naïve Bayes Multinomial Algorithm Sentiment Analysis of Digital Population Identity Using the Naïve Bayes Multinomial Algorithm data disk,” vol. 10, no. 2, pp. 85–95, 2025, doi: 10.32528/justindo.v10i2.2777.
D. Hardiansyah, R. Z. Abdul, and M. Said, “The Classification Method is Used for Sentiment Analysis in My Telkomsel,” vol. 8, no. 2, 2024, doi: 10.29099/ijair.v8i2.1229.
A. R. Isnain, N. S. Marga, and D. Alita, “Sentiment Analysis of Government Policy on Corona Cases Using the Naive Bayes Algorithm,” vol. 15, no. 1, pp. 55–64, 2021, doi: 10.22146/ijccs.60718.
V. B. Adiguna, M. Aqqad, J. S. Pt, and H. R. Dp, “Improving Performance Sentiment Analysis Movie Review Film using Random Forest with Feature Selection Information Gain,” vol. 8, no. 1, 2024.
R. Sistem, M. Lestandy, A. Abdurrahim, and L. Syafa, “Analysis of COVID-19 Vaccine Tweet Sentiment Using Recurrent,” vol. 5, no. 10, pp. 802–808, 2021, doi: 10.29207/resti.v5i4.3308.
R. D. Kurniawan, A. Yohannis, and W. T. Atmojo, “Sentiment Analysis of Getcontact Application Reviews on Google Play Store Using Naive Bayes Algorithm,” vol. 6, no. 4, pp. 2848–2858, 2025, doi: 10.52436/1.jutif.2025.6.4.5248.
H. Mustakim and S. Priyanta, “Aspect-Based Sentiment Analysis of KAI Access Reviews Using NBC and SVM,” vol. 16, no. 2, pp. 113–124, 2022, doi: 10.22146/ijccs.68903.
A. Muhaddisi, B. N. Prastowo, D. Utami, and K. Putri, “Sentiment Analysis With Sarcasm Detection On Politicians’ Instagram,” vol. 15, no. 4, pp. 349–358, 2021, doi: 10.22146/ijccs.66375.
N. Anggraini, S. J. Putra, L. K. Wardhani, F. Dhiya, and U. Arif, “A Comparative Analysis of Random Forest, XGboost, and LightGBM Algorithms for Emotion Classification in Reddit Comments,” vol. 17, no. 1, pp. 88–97, 2024, doi: 10.15408/jti.v17i1.38651.
H. Herlinda, M. I. Mazdadi, D. Kartini, and I. Budiman, “Implementation of Particle Swarm Optimization on Sentiment Analysis of Cyberbullying using Random Forest,” vol. 11, no. 2, pp. 301–309, 2023, doi: 10.30595/juita.v11i2.17920.
C. Ayunda et al., “Comparative Analysis of Machine Learning Algorithms for Sentiment Analysis,” vol. 7, no. 2, pp. 80–84, 2022, doi: 10.30591/jpit.v7i2.3547.
A. Mardiah, S. Dillah, D. F. Surianto, N. Fadilah, and S. G. Zain, “Classification of Livin ’ by Mandiri customer satisfaction using MLP with BM25 and TF-IDF feature weighting,” vol. 4, no. 3, 2025, doi: 10.22219/kinetik.v10i3.2248.
A. N. Syafia, M. F. Hidayattullah, and W. Suteddy, “Comparative Study of SVM and Random Forest Algorithms in Analyzing Sentiment in BTS YouTube Comments,” vol. 8, no. 3, pp. 207–212, 2023, doi: 10.30591/jpit.v8i3.5064.
DOI: http://dx.doi.org/10.24014/ijaidm.v9i1.38616
Refbacks
- There are currently no refbacks.
Office and Secretariat:
Big Data Research Centre
Puzzle Research Data Technology (Predatech)
Laboratory Building 1st Floor of Faculty of Science and Technology
UIN Sultan Syarif Kasim Riau
Jl. HR. Soebrantas KM. 18.5 No. 155 Pekanbaru Riau – 28293
Website: http://predatech.uin-suska.ac.id/ijaidm
Email: ijaidm@uin-suska.ac.id
e-Journal: http://ejournal.uin-suska.ac.id/index.php/ijaidm
Phone: 085275359942
Journal Indexing:
Google Scholar | ROAD | PKP Index | BASE | ESJI | General Impact Factor | Garuda | Moraref | One Search | Cite Factor | Crossref | WorldCat | Neliti | SINTA | Dimensions | ICI Index Copernicus
IJAIDM Stats










