Comparative Analysis of Explainable AI Methods LIME, SHAP, and ELI5 on Random Forest Based Indonesian E-Commerce Sentiment Classification

Haditya Pandu Winanta; Muhammad Yusril Hana; Firman Noor Hasan

doi:10.52436/1.jutif.2026.7.2.5642

Authors

Haditya Pandu Winanta Informatics Engineering, Universitas Muhammadiyah Prof. Dr. Hamka, Indonesia
Muhammad Yusril Hana Master of Business Administration, Carnegie Mellon University, United States
Firman Noor Hasan Informatics Engineering, Universitas Muhammadiyah Prof. Dr. Hamka, Indonesia

DOI:

https://doi.org/10.52436/1.jutif.2026.7.2.5642

Keywords:

Explainable AI, Product Reviews, Random Forest, Sentiment Analysis

Abstract

The rapid growth of e-commerce platforms in Indonesia has generated a massive volume of product reviews, making sentiment classification essential for understanding customer perceptions and supporting data-driven decision making. This study aims to develop a sentiment classification model for Indonesia e-commerce product reviews while enhancing model transparency through Explainable Artificial Intelligence (XAI). The proposed approach employs a Random Forest classifier eith Term Frequency-Inverse Document Frequency (TF-IDF) for feature extraction. The dataset consists of 23,194 product reviews from the fashion and electronics categories, classified into positive, negative, and neutral sentiment. Model performance is evaluated using accuracy, precision, recall, and F1-Score metrics. Experimental results show taht the Random Forest model achieves an accuracy of 93.74%, with the best performance observed in the postive sentiment class. To improve interpretability, three XAI methods-LIME, SHAP, and ELI5-are applied. The analysis indicates that LIME is effective for local explanations, SHAP provides consistent global and local feature importence, and ELI5 offers concise and computationally efficient global explanations. This study contributes to the field of computer science by demostrating how comparative XAI analysis can bridge the gap between high-performing black-box models and interpretable sentiment classification in high-dimensional extual data, thereby supporting transparent and accountavle AI system in e-commerce applications.

Downloads

Download data is not yet available.

Author Biography

Haditya Pandu Winanta, Informatics Engineering, Universitas Muhammadiyah Prof. Dr. Hamka, Indonesia

Informatics Engineering

References

G. Maulani et al., “Machine Learning,” 2025.

Sugiarto et al., Fenomena Bisnis AI. 2024.

D. Pakpahan, V. Siallagan, and S. Siregar, “Classification of E-Commerce Product Descriptions with The Tf-Idf and Svm Methods,” sinkron, vol. 8, no. 4, pp. 2130–2137, Oct. 2023, doi: 10.33395/sinkron.v8i4.12779.

L. Gomes, R. da Silva Torres, and M. L. Côrtes, “BERT- and TF-IDF-based feature extraction for long-lived bug prediction in FLOSS: A comparative study,” Inf. Softw. Technol., vol. 160, p. 107217, Aug. 2023, doi: 10.1016/j.infsof.2023.107217.

L.-C. Chen, “An extended TF-IDF method for improving keyword extraction in traditional corpus-based research: An example of a climate change corpus,” Data Knowl. Eng., vol. 153, p. 102322, Sep. 2024, doi: 10.1016/j.datak.2024.102322.

K. M. Suryaningrum, “Comparison of the TF-IDF Method with the Count Vectorizer to Classify Hate Speech,” Engineering, MAthematics and Computer Science (EMACS) Journal, vol. 5, no. 2, pp. 79–83, May 2023, doi: 10.21512/emacsjournal.v5i2.9978.

J. Lu, “Enhancing Chatbot User Satisfaction: A Machine Learning Approach Integrating Decision Tree, TF-IDF, and BERTopic,” in 2024 IEEE 6th International Conference on Power, Intelligent Computing and Systems (ICPICS), IEEE, Jul. 2024, pp. 823–828. doi: 10.1109/ICPICS62053.2024.10796445.

D. F. Surianto and D. F. Surianto, “Enhancing K-Means Clustering for Journal Articles using TF-IDF and LDA Feature Extraction,” Brilliance: Research of Artificial Intelligence, vol. 4, no. 2, pp. 964–972, Mar. 2025, doi: 10.47709/brilliance.v4i2.5547.

M. D. Rizkiyanto, M. D. Purbolaksono, and W. Astuti, “Sentiment Analysis Classification on PLN Mobile Application Reviews using Random Forest Method and TF-IDF Feature Extraction,” INTEK: Jurnal Penelitian, vol. 11, no. 1, pp. 37–43, Apr. 2024, doi: 10.31963/intek.v11i1.4774.

T. A. U. Azmi, L. Hakim, D. C. R. Novitasari, and W. D. U. D. Utami, “Application Random Forest Method for Sentiment Analysis in Jamsostek Mobile Review,” Telematika, vol. 20, no. 1, p. 117, Mar. 2023, doi: 10.31315/telematika.v20i1.8868.

M. Rusdi Rahman, A. Febri Diansyah, and H. Hanafi, “Sentiment Analysis on the Shopee Application on Playstore Using the Random Forest Classification Method,” Inform : Jurnal Ilmiah Bidang Teknologi Informasi dan Komunikasi, vol. 9, no. 1, pp. 20–24, Nov. 2023, doi: 10.25139/inform.v9i1.5465.

C. R. Hassolthine, T. Haryanto, F. Adline Twince Tobing, and M. Ikhwani Saputra, “E-Commerce Product Review Sentiment Analysis: A Comparative Study of Naïve Bayes Classifier and Random Forest Algorithms on Marketplace Platforms,” IJNMT (International Journal of New Media Technology), vol. 12, no. 1, pp. 55–60, Jul. 2025, doi: 10.31937/ijnmt.v12i1.4246.

“Deciphering Digital Social Dynamics: A Comparative Study of Logistic Regression and Random Forest in Predicting E-Commerce Customer Behavior,” Journal of Applied Data Sciences, vol. 5, no. 1, pp. 100–113, Jan. 2024, doi: 10.47738/jads.v5i1.155.

C. AVCI, M. BUDAK, N. YAĞMUR, and F. BALÇIK, “Comparison between random forest and support vector machine algorithms for LULC classification,” International Journal of Engineering and Geosciences, vol. 8, no. 1, pp. 1–10, Feb. 2023, doi: 10.26833/ijeg.987605.

N. Istiqamah and M. Rijal, “Klasifikasi Ulasan Konsumen Menggunakan Random Forest dan SMOTE,” Journal of System and Computer Engineering (JSCE), vol. 5, no. 1, pp. 66–77, Jan. 2024, doi: 10.61628/jsce.v5i1.1061.

T. Becker, A.-J. Rousseau, M. Geubbelmans, T. Burzykowski, and D. Valkenborg, “Decision trees and random forests,” American Journal of Orthodontics and Dentofacial Orthopedics, vol. 164, no. 6, pp. 894–897, Dec. 2023, doi: 10.1016/j.ajodo.2023.09.011.

R. Iranzad and X. Liu, “A review of random forest-based feature selection methods for data science education and applications,” Int. J. Data Sci. Anal., vol. 20, no. 2, pp. 197–211, Aug. 2025, doi: 10.1007/s41060-024-00509-w.

H. A. Salman, A. Kalakech, and A. Steiti, “Random Forest Algorithm Overview,” Babylonian Journal of Machine Learning, vol. 2024, pp. 69–79, Jun. 2024, doi: 10.58496/BJML/2024/007.

Alfandi Safira and F. N. Hasan, “ANALISIS SENTIMEN MASYARAKAT TERHADAP PAYLATER MENGGUNAKAN METODE NAIVE BAYES CLASSIFIER,” ZONAsi: Jurnal Sistem Informasi, vol. 5, no. 1, pp. 59–70, Jan. 2023, doi: 10.31849/zn.v5i1.12856.

A. R. Abdillah and F. N. Hasan, “Sentiment Analysis of Presidential Candidates Based on Tweets on Social Media Using the Naive Bayes Classifier,” STIKI Informatika Jurnal, vol. 13, 2023.

H. Ammar, F. Al Gani, M. Rifansyah, and F. N. Hasan, “Perbandingan Tingkat Akurasi Algoritma Naïve Bayes dan Support Vector Machine Dalam Analisis Sentimen Pengguna Aplikasi ShopeePay Pada Google Play Store,” Proceeding of TEKNOKA National Seminar - 9, vol. 9, 2024.

Meliyawati and F. N. Hasan, “Analisis Sentimen Pengguna Aplikasi CapCut Pada Ulasan di Play Store Menggunakan Metode Naïve Bayes,” KLIK; KAJIAN ILMIAH INFORMATIKA DAN KOMPUTER, vol. 4, 2024.

B. Gezici Geçer and A. Kolukısa Tarhan, “Explainable AI Framework for Software Defect Prediction,” Journal of Software: Evolution and Process, vol. 37, no. 4, Apr. 2025, doi: 10.1002/smr.70018.

C. Molnar, Interpretable Machine Learning. 2022.

N. Uddin, M. S. Mia, S. Rana, P. Mahmud, and Md. J. Islam, “An Explainable AI-Driven Machine Learning Approach for Student Depression Detection,” in 2025 International Conference on Electrical, Computer and Communication Engineering (ECCE), IEEE, Feb. 2025, pp. 1–6. doi: 10.1109/ECCE64574.2025.11013941.

M. Kulkarni and M. Stamp, “XAI and Android Malware Models,” in Machine Learning, Deep Learning and AI for Cybersecurity, Cham: Springer Nature Switzerland, 2025, pp. 327–355. doi: 10.1007/978-3-031-83157-7_12.

B. P. Bhuyan and S. Srivastava, “Feature Importance in Explainable AI for Expounding Black Box Models,” 2023, pp. 815–824. doi: 10.1007/978-981-19-6634-7_58.

Ninda Rizky Nuraeda, Muhaza Liebenlito, and Taufik Edy Sutanto, “Explainable Sentiment Analysis pada Ulasan Aplikasi Shopee Menggunakan Local Interpretable Model-agnostic Explanations,” Indonesian Journal of Computer Science, vol. 13, no. 3, Jun. 2024, doi: 10.33022/ijcs.v13i3.3870.

I. F. Rosyid and H. Pramaditya, “Visual Interpretation of Machine Learning Models (Random Forest) for Lung Cancer Risk Classification Using Explainable Artificial Intelligence (SHAP & LIME),” Jurnal Teknik Informatika (Jutif), vol. 6, no. 4, pp. 2187–2206, Aug. 2025, doi: 10.52436/1.jutif.2025.6.4.4925.

H. Rathore, H. K. Meena, and P. Jain, “Prediction of EV Energy consumption Using Random Forest And XGBoost,” in 2023 International Conference on Power Electronics and Energy (ICPEE), IEEE, Jan. 2023, pp. 1–6. doi: 10.1109/ICPEE54198.2023.10060798.

Comparative Analysis of Explainable AI Methods LIME, SHAP, and ELI5 on Random Forest Based Indonesian E-Commerce Sentiment Classification

Authors

DOI:

Keywords:

Abstract

Downloads

Author Biography

Haditya Pandu Winanta, Informatics Engineering, Universitas Muhammadiyah Prof. Dr. Hamka, Indonesia

References

Additional Files

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Make a Submission

sidebar

Information