IMPLEMENTATION OF DEEP LEARNING MODELS IN HATE SPEECH DETECTION ON TWITTER USING AN NATURAL LANGUAGE PROCESSING APPROACH
Abstract
In the digital era, the misuse of the freedom to communicate on the internet often leads to problems such as the spread of hate speech, which can harm individuals based on race, religion, and other characteristics. This issue requires effective solutions for content moderation, particularly on social media platforms like Twitter. This research develops a deep learning model utilizing Natural Language Processing (NLP) to detect hate speech and aims to improve existing content moderation mechanisms. The methods used include data collection, preprocessing through techniques such as case folding, tokenization, lemmatization, and model creation using TensorFlow Extended (TFX) involving embedding, dense, and global pooling layers. The model is trained to optimize accuracy by minimizing the loss function and closely monitoring evaluation metrics. The results show that this model achieves a prediction accuracy of 84%, an AUC value of 0.796, and a binary accuracy of 76%. The conclusion of this research is that the use of deep learning and NLP in detecting hate speech offers a highly potential approach to enhancing digital content moderation, providing a solution that is not only efficient but also accurate.
Downloads
References
A. P. J. Dwitama and S. Hidayat, “Identifikasi Ujaran Kebencian Multilabel Pada Teks Twitter Berbahasa Indonesia Menggunakan Convolution Neural Network,” Jurnal Sistem Komputer dan Informatika (JSON), vol. 3, no. 2, p. 117, Dec. 2021, doi: 10.30865/json.v3i2.3610.
I. Riadi, A. Fadlil, and Murni, “Identifying Hate Speech in Tweets with Sentiment Analysis on Indonesian Twitter Utilizing Support Vector Machine Algorithm,” Jurnal Ilmu Komputer dan Informatika, vol. 9, no. 2, pp. 179–191, 2023.
C. Erico, R. Salim, and D. Suhartono, “A Systematic Literature Review of Different Machine Learning Methods on Hate Speech Detection,” International Journal On Informatics Visualization, vol. 4, no. 4, pp. 213–218, 2020.
H. Xie, M. Namvar, and M. Risius, “A Review of Hate Speech Detection: Challenges and Innovations,” in AIS Electronic Library (AISeL), Haiderabad, Dec. 2023, pp. 1–9.
C. Kumar, R. Kumar Yadav, and V. Namdeo, “A Review on Hate Speech Recognition on Social Media,” International Journal of Innovative Research in Technology and Management, vol. 4, no. 4, pp. 50–57, 2020, [Online]. Available: www.ijirtm.com
M. Amien Ibrahim et al., “An Explainable AI Model for Hate Speech Detection on Indonesian Twitter,” CommIT Journal, vol. 16, no. 2, pp. 175–182, 2022.
A. Zozan Miran and A. Mohsin Abdulazeez, “Detection of Hate-Speech Tweets Based on Deep Learning: A Review,” JISA (Jurnal Informatika dan Sains), vol. 6, no. 2, pp. 174–188, 2023.
S. Rohmah and H. Kapi, “Analysis of Hate Speech Detection based on Obstacles and Solutions using Deeplearning Methods,” International Journal of Informatics Technology (INJIT), vol. 1, no. 3, pp. 108–117, 2023.
Preethi and R. Dodmane, “Hate and Sarcasm Classification Using Machine Learning and Deep Learning Techniques: A Survey,” International Journal of Research in Engineering, Science and Management, vol. 5, no. 7, pp. 24–30, Jul. 2022.
S. Kurniawan and I. Budi, “Utilizing Translation to Enhance NLP Models in Offensive Language and Hate Speech Identification,” Journal of Engineering Sciences (Improsci), vol. 1, no. 4, pp. 182–197, Feb. 2024.
A. Z. Miran and H. S. Yahia, “Hate Speech Detection in Social Media (Twitter) Using Neural Network,” Journal of Mobile Multimedia, vol. 19, no. 3, pp. 765–798, 2023, doi: 10.13052/jmm1550-4646.1936.
J. N. Saeed and A. M. Abdulazeez, “Facial Beauty Prediction and Analysis Based on Deep Convolutional Neural Network: A Review,” Journal of Soft Computing and Data Mining, vol. 2, no. 1, pp. 1–12, Apr. 2021, doi: 10.30880/jscdm.2021.02.01.001.
B. Li, Y. Hou, and W. Che, “Data augmentation approaches in natural language processing: A survey,” AI Open, vol. 3, pp. 71–90, Jan. 2022, doi: 10.1016/j.aiopen.2022.03.001.
A. Majid, D. Nugraha, and F. Dharma Adhinata, “Sentiment Analysis on Tiktok Application Reviews Using Natural Language Processing Approach,” 2023, doi: 10.26858/jessi.v4i1.41897.
A. P. R. Nababan, A. S. M. Lumenta, Y. D. Y. Rindengan, F. J. Pontoh, and Y. V. Akay, “Analisis Sentimen Twitter Pasca Pengumuman Hasil Pilpres 2019 Menggunakan Metode Lexicon Analysis,” Jurnal Teknik Informatika, vol. 15, no. 1, pp. 33–34, Jan. 2020.
A. Tabassum and R. R. Patil, “A Survey on Text Pre-Processing & Feature Extraction Techniques in Natural Language Processing,” International Research Journal of Engineering and Technology, vol. 7, no. 6, pp. 4864–4867, 2020, [Online]. Available: www.irjet.net
P. K. Roy, A. K. Tripathy, T. K. Das, and X. Z. Gao, “A framework for hate speech detection using deep convolutional neural network,” IEEE Access, vol. 8, pp. 204951–204962, 2020, doi: 10.1109/ACCESS.2020.3037073.
M. Z. Bimantoro and A. W. R. Emanuel, “Sheep Face Classification using Convolutional Neural Network,” in 3rd 2021 East Indonesia Conference on Computer and Information Technology, EIConCIT 2021, Institute of Electrical and Electronics Engineers Inc., Apr. 2021, pp. 111–115. doi: 10.1109/EIConCIT50028.2021.9431933.
D. Xin, H. Miao, A. Parameswaran, and N. Polyzotis, “Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities,” in Proceedings of the ACM SIGMOD International Conference on Management of Data, Association for Computing Machinery, 2021, pp. 2639–2652. doi: 10.1145/3448016.3457566.
Y. Zhou, Y. Yu, and B. Ding, “Towards MLOps: A Case Study of ML Pipeline Platform,” in Proceedings - 2020 International Conference on Artificial Intelligence and Computer Engineering, ICAICE 2020, Institute of Electrical and Electronics Engineers Inc., Oct. 2020, pp. 494–500. doi: 10.1109/ICAICE51518.2020.00102.
Copyright (c) 2024 Muhammad Arifin, Deni Mahdiana
This work is licensed under a Creative Commons Attribution 4.0 International License.