Sentiment Analysis on IMDB Movie Reviews using BERT

Rani Puspita, Cindy Rahayu

Abstract


Before technology existed, opinions could only be obtained from acquaintances, friends, or experts who were experts in certain fields. However, as technology develops, it turns out that opinions can be expressed through social media so that they can influence everyone who sees them. One of them is movie reviews. Human opinion about something is often not valid. So, this study aims to investigate the sentiment analysis related to IMDB Movie Reviews. The approach used is BERT. BERT is a deep learning approach. The data used in this study is the IMDB Movie Review of 50,000 data. The existing data is divided into three parts, namely training data, validation data, and testing data. The results obtained from the BERT model are 91.69% for training accuracy 0.187 for training loss, 91.85% for validation accuracy, 0.212 for validation loss, 91.78% for testing accuracy, and 0.207 for testing loss. It can be seen, that BERT is a very effective approach for sentiment analysis of IMDB Movie Review so that the research problem regarding the invalidity of one's opinion can be handled properly.

Keywords


Sentiment Analysis; IMDB; Evaluation; Deep Learning; Movie Reviews

Full Text:

PDF

References


Z. Shaukat, A. A. Zulfiqar, C. Xiao, M. Azeem, and T. Mahmood, “Sentiment analysis on IMDB using lexicon and neural networks,” SN Appl. Sci., vol. 2, no. 2, Jan. 2020, doi: 10.1007/s42452-019-1926-x.

M. Yasen and S. Tedmori, “Movies reviews sentiment analysis and classification,” 2019 IEEE Jordan Int. Jt. Conf. Electr. Eng. Inf. Technol. JEEIT 2019 - Proc., pp. 860–865, May. 2019, doi: 10.1109/JEEIT.2019.8717422.

T. P. Sahu and S. Ahuja, “Sentiment analysis of movie reviews: A study on feature selection and classification algorithms,” Int. Conf. Microelectron. Comput. Commun. MicroCom 2016, Jul. 2016, doi: 10.1109/MicroCom.2016.7522583.

N. G. Ramadhan and T. I. Ramadhan, “Analysis Sentiment Based on IMDB Aspects from Movie Reviews using SVM,” Sinkron, vol. 7, no. 1, pp. 39–45, Jan. 2022, doi: 10.33395/sinkron.v7i1.11204.

S. M. Qaisar, “Sentiment Analysis of IMDb Movie Reviews Using Long Short-Term Memory,” 2020 2nd Int. Conf. Comput. Inf. Sci. ICCIS 2020, pp. 12–15, Nov. 2020, doi: 10.1109/ICCIS49240.2020.9257657.

J. D. Bodapati, N. Veeranjaneyulu, and S. Shaik, “Sentiment analysis from movie reviews using LSTMs,” Ing. des Syst. d’Information, vol. 24, no. 1, pp. 125–129, Jan. 2019, doi: 10.18280/isi.240119.

M. R. Haque, S. Akter Lima, and S. Z. Mishu, “Performance Analysis of Different Neural Networks for Sentiment Analysis on IMDb Movie Reviews,” 3rd Int. Conf. Electr. Comput. Telecommun. Eng. ICECTE 2019, pp. 161–164, Dec. 2020, doi: 10.1109/ICECTE48615.2019.9303573.

D. M. Qaseem, N. Ali, W. Akram, A. Ullah, and K. Polat, “Movie Success-Rate Prediction System through Optimal Sentiment Analysis,” pp. 15–33, Oct. 2022, doi: 10.33969/JIEC.2022.41002.

P. H. Gunawan, T. D. Alhafidh, and B. A. Wahyudi, “The Sentiment Analysis of Spider-Man: No Way Home Film Based on IMDb Reviews,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 6, no. 1, pp. 177–182, Feb. 2022, doi: 10.29207/resti.v6i1.3851.

M. Hoang, O. Alija Bihorac, and J. Rouces, “Aspect-Based Sentiment Analysis Using BERT,” Proc. 22nd Nord. Conf. Comput. Linguist., pp. 187–196, Oct. 2019.

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers for language understanding,” NAACL HLT 2019 - 2019 Conf. North Am. Chapter Assoc. Comput. Linguist. Hum. Lang. Technol. - Proc. Conf., vol. 1, no. Mlm, pp. 4171–4186, May. 2019, doi: 10.48550/arXiv.1810.04805.

M. M. Mutlu and A. Özgür, “A Dataset and BERT-based Models for Targeted Sentiment Analysis on Turkish Texts,” Proc. Annu. Meet. Assoc. Comput. Linguist., pp. 467–472, May. 2022, doi: 10.48550/arXiv.2205.04185.

C. H. Kumar and R. S. Kumar, “Natural Language Processing of Movie Reviews to Detect the Sentiments using Novel Bidirectional Encoder Representation-BERT for Transformers over Support Vector Machine,” J. Pharm. Negat. Results, vol. 13, no. 4, pp. 619–628, Sep. 2022, doi: 10.47750/pnr.2022.13.S04.069.

U. M. Dahir and F. K. Alkindy, “Utilizing Machine Learning for Sentiment Analysis of IMDB Movie Review Data,” vol. 71, no. 5, pp. 18–26, May. 2023, doi: 10.14445/22315381/IJETT-V71I5P203.

V. D. Derbentsev, V. S. Bezkorovainyi, and A. V Matviychuk, “Sentiment Analysis of Electronic Social Media Based on Deep Learning,” no. M3e2 2022, pp. 163–175, 2023, doi: : 10.5220/0011932300003432.

L. Mathew and V. R. Bindu, “A Review of Natural Language Processing Techniques for Sentiment Analysis using Pre-trained Models,” Proc. 4th Int. Conf. Comput. Methodol. Commun. ICCMC 2020, no. Iccmc, pp. 340–345, Apr. 2020, doi: 10.1109/ICCMC48092.2020.ICCMC-00064.

S. Tao, “Parameter Optimization of Educational Network Ecosystem Based on BERT Deep Learning Model,” Math. Probl. Eng., vol. 2022, Sep. 2022, doi: 10.1155/2022/3119014.

P. Balaji and D. Haritha, “An Ensemble Multi-layered Sentiment Analysis Model (EMLSA) for Classifying the Complex Datasets,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 3, pp. 185–190, 2023, doi: 10.14569/IJACSA.2023.0140320.

E. Çano, “AlbMoRe: A Corpus of Movie Reviews for Sentiment Analysis in Albanian,” pp. 3–6, Jun. 2023, doi: 10.48550/arXiv.2306.08526.

I. Steinke, J. Wier, L. Simon, and R. Seetan, “Sentiment Analysis of Online Movie Reviews using Machine Learning,” Int. J. Adv. Comput. Sci. Appl., vol. 13, no. 9, pp. 618–624, 2022, doi: 10.14569/IJACSA.2022.0130973.

D. D. Tran, T. T. S. Nguyen, and T. H. C. Dao, “Sentiment Analysis of Movie Reviews Using Machine Learning Techniques,” Lect. Notes Networks Syst., vol. 235, no. 12, pp. 361–369, 2022.

A. U. Rehman, A. K. Malik, B. Raza, and W. Ali, “A Hybrid CNN-LSTM Model for Improving Accuracy of Movie Reviews Sentiment Analysis,” Multimed. Tools Appl., vol. 78, no. 18, pp. 26597–26613, Jun. 2019, doi: 10.1007/s11042-019-07788-7.

U. D. Gandhi, P. Malarvizhi Kumar, G. Chandra Babu, and G. Karthick, “Sentiment Analysis on Twitter Data by Using Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM),” Wirel. Pers. Commun., no. 0123456789, May. 2021, doi: 10.1007/s11277-021-08580-3.

E. Savini and C. Caragea, “Intermediate-Task Transfer Learning with BERT for Sarcasm Detection,” Mathematics, vol. 10, no. 5, Mar. 2022, doi: 10.3390/math10050844.

M. Ogunleye and A. Mbakwe, “Investigating the robustness of Language Models to Non-native English text,” no. January, May. 2022, doi: 10.13140/RG.2.2.11357.28649.

Z. Bowen, “A BERT-CNN Based Approach on Movie Review Sentiment Analysis,” vol. 04007, pp. 1–6, 2023, doi: 10.1051/shsconf/202316304007.

M. Singh, A. K. Jakhar, and S. Pandey, “Sentiment analysis on the impact of coronavirus in social life using the BERT model,” Soc. Netw. Anal. Min., vol. 11, no. 1, pp. 1–11, Feb. 2021, doi: 10.1007/s13278-021-00737-z.

A. Chiorrini, C. Diamantini, A. Mircoli, and D. Potena, “Emotion and sentiment analysis of tweets using BERT,” CEUR Workshop Proc., vol. 2841, 2021.

Z. Gao, A. Feng, X. Song, and X. Wu, “Target-dependent sentiment classification with BERT,” IEEE Access, vol. 7, pp. 154290–154299, 2019, doi: 10.1109/ACCESS. Oct. 2019.2946594.

I. Tenney, D. Das, and E. Pavlick, “BERT rediscovers the classical NLP pipeline,” ACL 2019 - 57th Annu. Meet. Assoc. Comput. Linguist. Proc. Conf., pp. 4593–4601, Aug. 2019, doi: 10.48550/arXiv.1905.05950.




DOI: http://dx.doi.org/10.24014/ijaidm.v6i2.24239

Refbacks

  • There are currently no refbacks.


Office and Secretariat:

Big Data Research Centre
Puzzle Research Data Technology (Predatech)
Laboratory Building 1st Floor of Faculty of Science and Technology
UIN Sultan Syarif Kasim Riau

Jl. HR. Soebrantas KM. 18.5 No. 155 Pekanbaru Riau – 28293
Website: http://predatech.uin-suska.ac.id/ijaidm
Email: ijaidm@uin-suska.ac.id
e-Journal: http://ejournal.uin-suska.ac.id/index.php/ijaidm
Phone: 085275359942

Click Here for Information


Journal Indexing:

Google Scholar | ROAD | PKP Index | BASE | ESJI | General Impact Factor | Garuda | Moraref | One Search | Cite Factor | Crossref | WorldCat | Neliti  | SINTA | Dimensions | ICI Index Copernicus 

IJAIDM Stats