Evaluating Classification Models for Predicting Product Success in Indonesian E-Commerce
DOI:
https://doi.org/10.52436/1.jutif.2025.6.4.5071Keywords:
Classification, Comparative Analysis, E-commerce, Machine Learning, Product Success Prediction, Random ForestAbstract
The intense competition within the Indonesian e-commerce landscape presents a significant challenge for sellers in forecasting product performance. This study offers a unique contribution by systematically comparing seven machine learning classification algorithms to predict product success across Indonesia's three largest platforms: Shopee, Tokopedia, and Lazada. The primary objective is to identify the most effective algorithm for predicting whether a product's sales will surpass the market median. The methodology involved aggregating and preprocessing a dataset of 3,673 product listings. Product success was defined as a binary variable based on sales volume exceeding the dataset's median. Seven models, including Logistic Regression, KNN, SVM, and tree-based ensembles like Random Forest, XGBoost, and LightGBM, were trained and optimized using a 5-fold cross-validated GridSearchCV. Evaluation was based on accuracy, ROC AUC, and F1-score. The results demonstrate a clear performance hierarchy, with tree-based ensemble models achieving superior results. Random Forest emerged as the premier model, attaining an accuracy of 83.2% and an AUC of 0.907. A subsequent feature importance analysis revealed that shop_followers and price were the most significant predictors of success. This finding has crucial practical implications, particularly for Micro, Small, and Medium Enterprises (MSMEs), by providing a data-driven framework for decision-making. The model enables them to focus resources on actionable strategies—building seller reputation and optimizing pricing—to enhance their competitiveness effectively.
Downloads
References
C. W. Prasetyandari, “E-Commerce as Indonesia’s Economic Development Effort,” Indonesian Journal of Economics and Management, vol. 3, no. 1, pp. 70–78, 2022, doi: 10.35313/ijem.v3i1.4456.
K. D. Pratiwi, “E-Commerce and Economic Growth in Indonesia: Analysis of Panel Data Regression,” Gema Publica, vol. 7, no. 1, pp. 171–186, 2022, doi: 10.14710/gp.7.1.2022.171-186.
L. Judijanto, “Strategy to Increase Tax Revenue to Capture Digital Economy Growth Opportunities in Indonesia,” Maneggio, vol. 1, no. 4, pp. 95–106, 2024, doi: 10.62872/05ft0874.
A. Wibasuri, C. Pratisti, and I. I. Nurjanah, “A Model of Indonesian Consumers’ Online Shopping Behavior, an Extension of TAM,” International Journal of Artificial Intelegence Research, vol. 7, no. 1, p. 2023, 2023, [Online]. Available: http://dx.doi.org/10.29099/ijair.v7i1.1.1115http://ijair.id
P. Putra Dharma, M. S. Purwanegara, and S. A. Wibowo, “Proposed Marketing Strategy to Increase Brand Loyalty: Study Case of Lazada Indonesia,” International Journal of Current Science Research and Review, vol. 07, no. 07, pp. 4701–4716, 2024, doi: 10.47191/ijcsrr/v7-i7-15.
S. Y. Frantika and N. I. Riwajanti, “The Effect of the Pandemic on E-Commerce Competition in Indonesia,” International Journal of Innovative Science and Research Technology, vol. 7, no. 1, 2022, doi: https://doi.org/10.5281/zenodo.5905305.
Abdan Sifa, Isti Masruroh, Muhamad Abdan Zulfa, Sirfi Nur Fitriani, and Naerul Edwin Kiky Aprianto, “Transformasi Digital E-Commerce Dalam Menguasai Kosentrasi Pasar di Indonesia,” Jurnal Ilmiah Ekonomi Dan Manajemen, vol. 2, no. 12, pp. 405–413, 2024, doi: 10.61722/jiem.v2i12.3239.
J. Ghuntesa, S. O. Gunawan, and Mulyono, “Modernized business: Live streaming feature adoption impact on business competitive advantage of Indonesia’s Gen Z perceptions,” Edelweiss Applied Science and Technology, vol. 9, no. 5, pp. 2432–2453, 2025, doi: 10.55214/25768484.v9i5.7484.
K. J. S. -, S. S. -, and B. J. S. -, “Micro, Small, and Medium-Sized Enterprises (MSMEs): The Significant Role and Challenges in Indonesia’s Economy,” International Journal For Multidisciplinary Research, vol. 6, no. 3, pp. 0–10, 2024, doi: 10.36948/ijfmr.2024.v06i03.20824.
Q. Hossain, F. Yasmin, T. R. Biswas, and N. B. Asha, “Data-Driven Business Strategies: A Comparative Analysis of Data Science Techniques in Decision-Making,” Scholars Journal of Economics, Business and Management, vol. 11, no. 09, pp. 257–263, 2024, doi: 10.36347/sjebm.2024.v11i09.002.
E. Tim and S. V. Goriparthi, “Optimizing Decision-Making Processes with Data-Driven Strategies in Modern Business Administration and Information Analysis,” International Journal of Research Publication and Reviews, vol. 6, no. 1, pp. 1986–1986, 2025, doi: 10.55248/gengpi.6.0125.0421.
E. Martins and N. V. Galegale, “Sales forecasting using machine learning algorithms,” Revista de Gestão e Secretariado (Management and Administrative Professional Review), vol. 14, no. 7, pp. 11294–11308, 2023, doi: 10.7769/gesec.v14i7.1670.
Mutia, Ibrahim Iskandar, and Sunilfa Maharani Tanjung, “Double Moving Average and Double Exponential Smoothing Method in Sales Forecasting,” Journal of Computers and Digital Business, vol. 3, no. 3, pp. 112–120, 2024, doi: 10.56427/jcbd.v3i3.599.
Olamide Raimat Amosu, Praveen Kumar, Yewande Mariam Ogunsuji, Segun Oni, and Oladapo Faworaja, “AI-driven demand forecasting: Enhancing inventory management and customer satisfaction,” World Journal of Advanced Research and Reviews, vol. 23, no. 2, pp. 708–719, 2024, doi: 10.30574/wjarr.2024.23.2.2394.
P. Irwandi, R. Oktavianto, A. Adetya, and S. Sopian, “Predicting Agricultural Product E-Commerce Usage Behavior In Indonesia Predicting Agricultural Product E-Commerce Usage Behavior In Indonesia With Machine Learning Algorithms,” Nominal Barometer Riset Akuntansi dan Manajemen, vol. 14, no. April, pp. 101–117, 2025, doi: 10.21831/nominal.v14i1.71565.
D. Witro, M. I. Nurjaman, D. Ayu, and C. S. Al-Rasyid, “Kontestasi Marketplace Di Indonesia Pada Era Pandemi: Analisis Strategi Promosi Tokopedia, Shopee, Bukalapak, Lazada, Dan Blibli Dalam Pemulihan Ekonomi Nasional,” OIKONOMIKA : Jurnal Kajian Ekonomi dan Keuangan Syariah, vol. 2, no. 2, pp. 33–42, 2022, doi: 10.53491/oikonomika.v2i2.178.
O. Arisinta, N. Canda Sakti, and W. T. Subroto, “Increasing the Competitiveness of Global MSMEs Amid Inflation Rate 2024-2025: Government Policy of the Government of Indonesia,” Formosa Journal of Science and Technology (FJST), vol. 3, no. 12, pp. 2581–2596, 2024, doi: https://doi.org/10.55927/fjst.v3i12.12694.
R. Ramli, A. A. Bakar, and F. A. Rahim, “What Influences Customer’s Trust on Online Social Network Sites (SNSs) Sellers?,” International Journal of Advanced Computer Science and Applications, vol. 13, no. 1, pp. 480–489, 2022, doi: 10.14569/IJACSA.2022.0130160.
M. A. Al Rahib, N. Saha, R. Mia, and A. Sattar, “Customer data prediction and analysis in e-commerce using machine learning,” Bulletin of Electrical Engineering and Informatics, vol. 13, no. 4, pp. 2624–2633, 2024, doi: 10.11591/eei.v13i4.6420.
N. Z. B. Jannah and K. Kusnawi, “Comparison of Naïve Bayes and SVM in Sentiment Analysis of Product Reviews on Marketplaces,” Sinkron, vol. 8, no. 2, pp. 727–733, 2024, doi: 10.33395/sinkron.v8i2.13559.
M. Febima and L. Magdalena, “Predictive Analytics on Shopee for Optimizing Product Demand Prediction through K-Means Clustering and KNN Algorithm Fusion,” Journal of Information Systems and Informatics, vol. 6, no. 2, pp. 751–765, 2024, doi: 10.51519/journalisi.v6i2.720.
Novianda Adisti, Puji Isyanto, and Neni Sumarni, “Analisis Pengaruh Program Flash Sale dan Discount Terhadap Impulsive Buying pada Generasi Milenial Pengguna Shopee,” Al-Kharaj: Jurnal Ekonomi, Keuangan & Bisnis Syariah, vol. 6, no. 10, pp. 6616–6635, 2024, doi: 10.47467/alkharaj.v6i10.3022.
I. N. Futri, F. Afisah, and N. Saputro, “How did Tokopedia assist economic recovery through the MSMEs during the Covid-19 pandemic?,” Sebelas Maret Business Review, vol. 6, no. 1, p. 1, 2021, doi: 10.20961/smbr.v6i1.55644.
M. P. Andriani and E. Prihantoro, “Analysis of Business Communication Strategy for Information Technology Lazada Indonesia,” Jurnal Pewarta Indonesia, vol. 4, no. 1, pp. 41–59, 2022, doi: 10.25008/jpi.v4i1.102.
L. O. Joel, W. Doorsamy, and B. Sena Paul, “A Review of Missing Data Handling Techniques for Machine Learning,” International Journal of Innovative Technology and Interdisciplinary Sciences www.IJITIS.org, vol. 5, no. 3, pp. 971–1005, 2022, doi: 10.15157/IJITIS.2022.5.3.971-1005.
L. M. Neto, S. R. da Silva Neto, and P. T. Endo, “A comparative analysis of converters of tabular data into image for the classification of Arboviruses using Convolutional Neural Networks,” PLoS ONE, vol. 18, no. 12 December, pp. 1–23, 2023, doi: 10.1371/journal.pone.0295598.
B. B. Acharya, “Comparative Analysis of Machine Learning Algorithms: KNN, SVM, Decision Tree and Logistic Regression for Efficiency and Performance,” International Journal for Research in Applied Science and Engineering Technology, vol. 12, no. 11, pp. 614–619, 2024, doi: 10.22214/ijraset.2024.65138.
T. F. Sterkenburg and P. D. Grünwald, “The no-free-lunch theorems of supervised learning,” Synthese, vol. 199, no. 3–4, pp. 9979–10015, 2021, doi: 10.1007/s11229-021-03233-1.
I. H. Sarker, “Machine Learning: Algorithms, Real-World Applications and Research Directions,” SN Computer Science, vol. 2, no. 3, p. 160, May 2021, doi: 10.1007/s42979-021-00592-x.
R. Nurdiyanti and E. Utami, “Benchmarking Multiple Machine Learning Algorithms for Sentiment Analysis on Sexual Violence,” 2024 6th International Conference on Cybernetics and Intelligent System, ICORIS 2024, pp. 1–6, 2024, doi: 10.1109/ICORIS63540.2024.10903985.
R. Wahyusari and A. Fadlil, “Comparative Analysis Of Ant Lion Optimization And Jaya Algorithm For Feature Selection In K-Nearest Neighbor ( Knn ) Based Electricity Consumption Prediction,” Jurnal Teknik Informatika (JUTF), vol. 6, no. 3, pp. 1373–1388, 2025, doi: https://doi.org/10.52436/1.jutif.2025.6.3.4692.
M. Shi, J. Wu, and Y. Peng, “Research on Decision Tree-Based Risk Control Strategies in E-Commerce Platforms,” Journal of Combinatorial Mathematics and Combinatorial Computing, vol. 127a, no. 3, pp. 2861–2877, 2025, doi: 10.61091/jcmcc127a-162.
X. Yuan, S. Liu, W. Feng, and G. Dauphin, “Feature Importance Ranking of Random Forest-Based End-to-End Learning Algorithm,” Remote Sensing, vol. 15, no. 21, pp. 1–20, 2023, doi: 10.3390/rs15215203.
J. Zhang, “Enhancing Predictive Models in E-Commerce: A Comparative Study Using XGBoost Across Diverse Scenarios,” ITM Web of Conferences, vol. 70, p. 02014, 2025, doi: 10.1051/itmconf/20257002014.
P. A. Sunarya, U. Rahardja, S. C. Chen, Y. M. Li, and M. Hardini, “Deciphering Digital Social Dynamics: A Comparative Study of Logistic Regression and Random Forest in Predicting E-Commerce Customer Behavior,” Journal of Applied Data Sciences, vol. 5, no. 1, pp. 100–113, 2024, doi: 10.47738/jads.v5i1.155.
L. Yang and A. Shami, “On hyperparameter optimization of machine learning algorithms: Theory and practice,” Neurocomputing, vol. 415, no. 6, pp. 295–316, Nov. 2020, doi: 10.1016/J.NEUCOM.2020.07.061.
I. P. Kamila, C. A. Sari, E. H. Rachmawanto, and N. R. D. Cahyo, “A Good Evaluation Based on Confusion Matrix for Lung Diseases Classification using Convolutional Neural Networks,” Advance Sustainable Science, Engineering and Technology, vol. 6, no. 1, pp. 1–8, 2024, doi: 10.26877/asset.v6i1.17330.
N. E. W. Nugroho and A. Harjoko, “Transliteration of Hiragana and Katakana Handwritten Characters Using CNN-SVM,” IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 15, no. 3, p. 221, 2021, doi: 10.22146/ijccs.66062.
Dewi Widyawati and Amaliah Faradibah, “Comparison Analysis of Classification Model Performance in Lung Cancer Prediction Using Decision Tree, Naive Bayes, and Support Vector Machine,” Indonesian Journal of Data and Science, vol. 4, no. 2, pp. 80–89, 2023, doi: 10.56705/ijodas.v4i2.76.
B. Kandow, N. P. Legi, S. S. Tumbo, and P. N. Manado, “APPLICATION OF MACHINE LEARNING IN PREDICTING E-COMMERCE MARKET TRENDS,” Syntax Adminisration, vol. 6, no. 2, pp. 1315–1323, 2025, doi: https://doi.org/10.46799/jsa.v6i2.2133.
Toluwalase Vanessa Iyelolu and Patience Okpeke Paul, “Implementing machine learning models in business analytics: challenges, solutions, and impact on decision-making,” World Journal of Advanced Research and Reviews, vol. 22, no. 3, pp. 1906–1916, 2024, doi: 10.30574/wjarr.2024.22.3.1959.
A. M. Haddadi and H. Hamidi, “A hybrid model for improving customer lifetime value prediction using stacking ensemble learning algorithm,” Computers in Human Behavior Reports, vol. 18, no. 2, p. 100616, 2025, doi: 10.1016/j.chbr.2025.100616.
W. Saeed and C. Omlin, “Explainable AI (XAI): A systematic meta-survey of current challenges and future opportunities,” Knowledge-Based Systems, vol. 263, no. 24, p. 110273, 2023, doi: 10.1016/j.knosys.2023.110273.
Octario Emirat Azrie Saputra, Mahir Pradana, Mahendra Fakhri, “Customers’ Quality Perseption, Sensitivity, and Purchasing Decisions: A Case of a Fast-Fashion Product,” Journal of Information Systems Engineering and Management, vol. 10, no. 2, pp. 322–334, 2025, doi: 10.52783/jisem.v10i2.1913.
I. Pratiwi, R. Elizabeth, Fatari, Basrowi, and U. W. Nuryanto, “Assessing pricing, distribution, and warehousing strategies in influencing consumer demand and purchase decisions: Evidence from Indonesia,” Social Sciences and Humanities Open, vol. 12, no. 2, p. 101688, 2025, doi: 10.1016/j.ssaho.2025.101688.
N. V. Aryoko and Y. Dirgiatmo, “The Influence of Customer Trust as a Mediator Between Uncertainty Avoidance and Purchase Intention in Shopee,” International Journal of Economics, Business and Management Research, vol. 09, no. 01, pp. 151–167, 2025, doi: 10.51505/ijebmr.2025.9111.
Z. Arif Ali, Z. H. Abduljabbar, H. A. Tahir, A. Bibo Sallow, and S. M. Almufti, “eXtreme Gradient Boosting Algorithm with Machine Learning: a Review,” Academic Journal of Nawroz University, vol. 12, no. 2, pp. 320–334, 2023, doi: 10.25007/ajnu.v12n2a1612.
S. F. N. Hadju and R. Jayadi, “Sentiment analysis of Indonesian e-commerce product reviews using support vector machine based term frequency inverse document frequency,” Journal of Theoretical and Applied Information Technology, vol. 99, no. 17, pp. 4316–4325, 2021.
Additional Files
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Fiola Utri Aulya, Kusnawi

This work is licensed under a Creative Commons Attribution 4.0 International License.