IDENTIFYING POSSIBLE RUMOR SPREADERS ON TWITTER USING THE SVM AND FEATURE LEVEL EXTRACTION
Abstract
In everyday life, many events occur and give rise to various kinds of information, which are also rumors. Rumors can cause fear and influence public opinion about the event in question. Identifying possible rumor spreaders is extremely helpful in preventing the spread of rumors. Feature extraction can be done to expand the feature set, which consists of conversational features in the form of social networks formed from user replies, user features such as following, tweet count, verified, etc., and tweet features with text analysis such as punctuation and sentiment values. These features become instances used for classification. This study aims to identify possible spreaders of rumors on Twitter with the SVM classification model. This instance-based classification algorithm is good for linear and non-linear classification. In the non-linear classification, additional kernels are used, such as linear, RBF, and sigmoid. The research focuses on getting the best model with high performance values from all the models and kernel functions that have been defined. It was found that the SVM classification model with the RBF kernel has a high overall performance value for each data combination with a ratio of the amount of data is 1:1 or the difference is very large. This model gives accurate results with an average of 97.02%. With a wide distribution of data, the SVM classification model with the RBF kernel is able to map the data properly.
Downloads
References
L. (Monroe) Meng, T. Li, X. Huang, dan S. (Kevin) Li, “Lift the veil of rumors: the impact of the characteristics of information sources on the effectiveness of rumors spreading,” Internet Res., vol. 32, no. 1, 2022, doi: 10.1108/INTR-11-2020-0620.
I. C. Hsu dan C. C. Chang, “Integrating machine learning and open data into social Chatbot for filtering information rumor,” J. Ambient Intell. Humaniz. Comput., vol. 12, no. 1, 2021, doi: 10.1007/s12652-020-02119-3.
R. Dekker, G. Engbersen, J. Klaver, dan H. Vonk, “Smart Refugees: How Syrian Asylum Migrants Use Social Media Information in Migration Decision-Making,” Soc. Media Soc., vol. 4, no. 1, 2018, doi: 10.1177/2056305118764439.
S. Sharma dan R. Sharma, “Identifying Possible Rumor Spreaders on Twitter: A Weak Supervised Learning Approach,” Proc. Int. Jt. Conf. Neural Networks, vol. 2021-July, 2021, doi: 10.1109/IJCNN52387.2021.9534185.
M. Maan, M. K. Jain, S. Trivedi, dan R. Sharma, “Machine Learning Based Rumor Detection on Twitter Data,” Emerg. Technol. Comput. Eng. Cogn. Comput. Intell. IoT. ICETCE 2022. Commun. Comput. Inf. Sci., vol. 1591, 2022, doi: https://doi.org/10.1007/978-3-031-07012-9_23.
A. Kaur dan A. Sinha, “Multi-contextual spammer detection for online social networks,” J. Discret. Math. Sci. Cryptogr., hal. 777–786, 2021, doi: https://doi.org/10.1080/09720529.2020.1794517.
B. Dixon, Social Media for School Leaders: A Comprehensive Guide to Getting the Most Out of Facebook, Twitter, and Other Essential Web Tools. 2012.
M. Waskale dan P. Jain, “Rumors Detection on Twitter Using Machine Learning Techniques,” 2019.
D. Koggalahewa, Y. Xu, dan E. Foo, An unsupervised method for social network spammer detection based on user information interests, vol. 9, no. 1. Springer International Publishing, 2022. doi: 10.1186/s40537-021-00552-5.
A. Ramalingaiah, S. Hussaini, dan S. Chaudhari, “Twitter bot detection using supervised machine learning,” J. Phys. Conf. Ser., vol. 1950, no. 1, 2021, doi: 10.1088/1742-6596/1950/1/012006.
Yuliant sibaroni dan Sri Suryani Prasetiyowati, “Buzzer Detection on Indonesian Twitter using SVM and Account Property Feature Extension,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 6, no. 4, hal. 663–669, 2022, doi: 10.29207/resti.v6i4.4338.
M. Cardaioli, S. Cecconello, M. Conti, L. Pajola, dan F. Turrin, “Fake News Spreaders Profiling through Behavioural Analysis Notebook for PAN at CLEF 2020,” CEUR Workshop Proc., vol. 2696, no. September, hal. 22–25, 2020.
A. Bodaghi dan J. Oliveira, “No The characteristics of rumor spreaders on Twitter: A quantitative analysis on real data,” Comput. Commun., vol. 160, hal. 674–687, 2020, doi: https://doi.org/10.1016/j.comcom.2020.07.017.
B. Rath, W. Gao, J. Ma, dan E. Al., “Utilizing computational trust to identify rumor spreaders on Twitter,” Soc. Netw. Anal. Min., 2018, doi: https://doi.org/10.1007/s13278-018-0540-z.
V. Piccialli dan M. Sciandrone, “Nonlinear optimization and support vector machines,” Ann. Oper. Res., vol. 314, no. 1, hal. 15–47, 2022, doi: 10.1007/s10479-022-04655-x.
V. Vapnik, The Nature of Statistical Learning Theory. 1995. doi: http://dx.doi.org/10.1007/978-1-4757-2440-0.
S. Džeroski, Data Mining. 2008. doi: 10.1016/B978-008045405-4.00153-1.
A. Zubiaga, M. Liakata, dan R. Procter, “Learning Reporting Dynamics during Breaking News for Rumour Detection in Social Media,” 2016.
S. Agarwal, Data mining: Data mining concepts and techniques. 2014. doi: 10.1109/ICMIRA.2013.45.
D. Valero-Carreras, J. Alcaraz, dan M. Landete, “Comparing two SVM models through different metrics based on the confusion matrix,” Comput. Oper. Res., vol. 152, no. December 2022, hal. 106131, 2023, doi: 10.1016/j.cor.2022.106131.
Copyright (c) 2023 Claudia Mei Serin Sitio, Yuliant Sibaroni, Sri Suryani Prasetiyowati, Sri Suryani Prasetiyowati
This work is licensed under a Creative Commons Attribution 4.0 International License.