UNRAVELING OF MEN'S FRAGRANCE PREFERENCES ON ONLINE MARKETPLACES: A MACHINE LEARNING STUDY USING DBSCAN CLUSTERING AND LINEAR REGRESSION

  • Zakiyyan Zain Alkaf Industrial Engineering, Faculty of Engineering, Universitas Jenderal Soedirman, Indonesia
  • Muhammad Ihsan Fawzi Informatics Engineering, Faculty of Engineering, Universitas Jenderal Soedirman, Indonesia
  • Murti Wisnu Ragil Sastyawan Industrial Engineering, Faculty of Engineering, Universitas Jenderal Soedirman, Indonesia
  • Radita Dwi Putera Industrial Engineering, Faculty of Engineering, Universitas Jenderal Soedirman, Indonesia
Keywords: Clustering Analysis, Customer Behaviour, Customer Preferences, Perfume, Regression Analysis

Abstract

The perfume industry is undergoing significant growth, driving the need to understand consumer preferences, particularly in men’s fragrances, to optimize business strategies. This study aims to analyze and uncover men’s fragrance preferences, using machine learning techniques. A dataset of approximately 1,000 men's perfume records from Kaggle was utilized, where systematic methodologies were employed. Data preprocessing involved handling missing values, removing duplicates, standardizing categorical entries, and performing feature engineering by extracting geographic information from item locations. Exploratory Data Analysis (EDA) was conducted to uncover data distribution. Clustering analysis using DBSCAN revealed consumer segments. Additionally, regression analysis was used to predict sales based on price and location, employing a linear regression model evaluated with metrics like Mean Squared Error (MSE). The findings indicate that price exhibits a complex relationship with sales; while affordable products drive higher sales volumes, premium-priced items cater to a niche yet impactful market segment. Geographic location plays a pivotal role in sales patterns. Clustering analysis reveals two distinct consumer segments: one driven by price sensitivity and another oriented towards premium preferences, influenced by regional factors. Regression analysis demonstrated a negative correlation between price and sales volume, with a coefficient of -1.81, while availability positively influenced sales with a coefficient of 8.36. Despite a moderate model fit (R² = 0.17), the analysis highlights key market dynamics. These insights emphasize the importance of leveraging data-driven strategies to develop targeted marketing campaigns, optimize inventory management and refine market segmentation.

Downloads

Download data is not yet available.

References

C. Ardelet, B. Slavich, and G. de Kerviler, “Self-referencing narratives to predict consumers’ preferences in the luxury industry: A longitudinal study,” J. Bus. Res., vol. 68, no. 9, pp. 2037–2044, 2015, doi: https://doi.org/10.1016/j.jbusres.2015.02.017

R. N. Joshi and D. Y. C. Joshi, “Construction and Validation of Indian Consumer Ethnocentrism Scale: The Ice-Scale,” Gadjah Mada Int. J. Bus., vol. 23, no. 2, p. 173, May 2021, doi: 10.22146/gamaijb.58039.

F. Begnaud, “Mass Spectrometry: An Essential Tool for the Flavor and Fragrance Industry,” Chimia (Aarau)., vol. 76, no. 1–2, p. 68, Feb. 2022, doi: 10.2533/chimia.2022.68.

K. Ponsree and P. Naruetharadhol, “Unveiling the determinants of alternative payment adoption: exploring the factors shaping generation Z’s intentions in Thailand,” Int. Entrep. Manag. J., vol. 21, no. 1, p. 45, Dec. 2025, doi: 10.1007/s11365-024-01057-2.

K. Adalarasu, K. Ghousiya Begum, M. Vishnu Priyan, C. Devendranath, and G. V. Sriram, “Neuro-signaling techniques in advertisement endorsements: Unveiling consumer responses and behavioral trends,” J. Retail. Consum. Serv., vol. 84, p. 104175, May 2025, doi: 10.1016/j.jretconser.2024.104175.

B. C. L. Rodrigues, V. V Santana, L. P. Queiroz, C. M. Rebello, and I. B․ R․ Nogueira, “Harnessing graph neural networks to craft fragrances based on consumer feedback,” Comput. Chem. Eng., vol. 185, p. 108674, Jun. 2024, doi: 10.1016/j.compchemeng.2024.108674.

S. Bogacki, P. Rymarczyk, T. Smutek, M. Rutkowski, and A. Chmielowska-Marmucka, “Advanced methods for target audience identification: enhancing marketing strategies through machine learning and data analytics,” J. Mod. Sci., vol. 57, no. 3, pp. 417–435, Aug. 2024, doi: 10.13166/jms/191180.

Y. Cai, Q. Shu, Z. Zhou, and H. Meng, “Incomplete multi-view clustering based on information fusion with self-supervised learning,” Inf. Fusion, vol. 117, p. 102849, May 2025, doi: 10.1016/j.inffus.2024.102849.

S. Sobantu and O. E. Isafiade, “Cascaded RFM-Based Fuzzy Clustering Model for Dynamic Customer Segmentation in the Retail Sector,” 2025, pp. 53–73. doi: 10.1007/978-3-031-78255-8_4.

N. M. Nhat, “Applied Density-Based Clustering Techniques for Classifying High-Risk Customers: A Case Study of Commercial Banks in Vietnam,” J. Appl. Data Sci., vol. 5, no. 4, pp. 1639–1653, Dec. 2024, doi: 10.47738/jads.v5i4.344.

P. Castricone and L. Long, “Deriving consumer insights with segmentation for identity and consent,” Appl. Mark. Anal. Peer-Reviewed J., vol. 10, no. 3, p. 245, Dec. 2024, doi: 10.69554/JZEK9481.

K. P. Suhaas, B. G. Deepa, D. Shashank, and M. Narender, “Millets Industry Dynamics: Leveraging Sales Projection and Customer Segmentation,” SN Comput. Sci., vol. 5, no. 8, p. 1063, Nov. 2024, doi: 10.1007/s42979-024-03437-5.

A. R. Oskooei and T. E. Adak, “B2B Customer Engagement Customer Behaviour Forecast Application,” Orclever Proc. Res. Dev., vol. 3, no. 1, pp. 188–196, Dec. 2023, doi: 10.56038/oprd.v3i1.323.

I. Maulana, A. M. Siregar, S. A. P. Lestari, and S. Faisal, “OPTIMAL STUDY OF REAL-ESTATE PRICE PREDICTION MODELS USING MACHINE LEARNING,” J. Tek. Inform., vol. 5, no. 4, pp. 1149–1164, 2024, doi: https://doi.org/10.52436/1.jutif.2024.5.4.2565.

C. W. Cheah, M. N. A Karim, Y. S. Lee, and M. D. i Md Ghazal, “The Accuracy and Error of Ground Penetrating Radar System with Machine Learning Support Vector Regression Technique,” J. Adv. Res. Appl. Sci. Eng. Technol., vol. 50, no. 1, pp. 191–202, Aug. 2024, doi: 10.37934/araset.50.1.191202.

M. Wan, M. A. Alshahrani, N. M. Aloraini, A. A. Alkhathami, and H. Alqahtani, “On predictive modeling of the twitter-based sales data using a new probabilistic model and machine learning methods,” Alexandria Eng. J., vol. 113, pp. 661–671, Feb. 2025, doi: 10.1016/j.aej.2024.11.041.

M. K. S. Rajalakshmi, A. Sriraman, and M. Krishnamoorthy, “Regression analysis for software development and management using machine learning.” Google Patents, Jun. 15, 2021.

R. Cerezo, J.-A. Lara, R. Azevedo, and C. Romero, “Reviewing the differences between learning analytics and educational data mining: Towards educational data science,” Comput. Human Behav., vol. 154, p. 108155, 2024, doi: https://doi.org/10.1016/j.chb.2024.108155.

C. Fan, M. Chen, X. Wang, J. Wang, and B. Huang, “A Review on Data Preprocessing Techniques Toward Efficient and Reliable Knowledge Discovery From Building Operational Data,” Front. Energy Res., vol. 9, p. 652801, Mar. 2021, doi: 10.3389/fenrg.2021.652801.

S. Galli, Python feature engineering cookbook. Packt Publishing Ltd, 2024.

T. Milo and A. Somech, “Automating Exploratory Data Analysis via Machine Learning: An Overview,” in Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, New York, NY, USA: ACM, Jun. 2020, pp. 2617–2622. doi: 10.1145/3318464.3383126.

J. Kim, H. Lee, and Y. M. Ko, “Constrained Density-Based Spatial Clustering of Applications with Noise (DBSCAN) using hyperparameter optimization,” Knowledge-Based Syst., vol. 303, p. 112436, 2024, doi: https://doi.org/10.1016/j.knosys.2024.112436.

C. Chen et al., “Aroma-sensory properties of Gouda cheeses based on young Chinese consumers’ preferences,” J. Dairy Sci., vol. 107, no. 7, pp. 4333–4343, 2024, doi: https://doi.org/10.3168/jds.2023-23525.

S. G. H. Meyerding and B. H. Trinh, “A cross-cultural study: How product attributes and cultural values influence chocolate preferences,” J. Agric. Food Res., vol. 19, p. 101579, Mar. 2025, doi: 10.1016/j.jafr.2024.101579.

J. van der Plas, M. G. Dekimpe, and I. Geyskens, “Mind the gap: National brands’ sensitivity to price and line-length differentials at hard discounters versus conventional retailers,” J. Retail., vol. 100, no. 2, pp. 199–216, 2024, doi: https://doi.org/10.1016/j.jretai.2024.05.005.

R. Anguzu and F. Aila, “Sources of Supply Chain Volatility: A Literature Review,” Int. J. Supply Chain Manag., vol. 9, no. 2, pp. 20–36, Feb. 2024, doi: 10.47604/ijscm.2370.

C. R. Kholibrina and A. Aswandi, “The consumer preferences for new styrax based perfume products using a conjoint analysis approach,” IOP Conf. Ser. Mater. Sci. Eng., vol. 935, no. 1, p. 012016, Sep. 2020, doi: 10.1088/1757-899X/935/1/012016.

K. Pentus, M. Ruusu, A. Kuusik, L. Dorokhova, and K. Ploom, “How Sexualised Images in Advertisements Influence the Attention and Preference of Consumers with a Modern View,” Organ. Mark. Emerg. Econ., vol. 14, no. 2(28), pp. 366–385, Jul. 2023, doi: 10.15388/omee.2023.14.97.

C. G. Kim, “The Fragrance of Cyber World 2080: A Perfume Forecast,” in Proceedings of the 24th International RAIS Conference on Social Sciences and Humanities, Scientia Moralitas Research Institute, 2021, pp. 125–132.

C.-L. Hung et al., “Enhancing healthcare services and brand engagement through social media marketing: Integration of Kotler’s 5A framework with IDEA process,” Inf. Process. Manag., vol. 60, no. 4, p. 103379, 2023, doi: https://doi.org/10.1016/j.ipm.2023.103379.

L. Nikitina, “Fragrance lexicon for analysis of consumer-generated perfume reviews in Russian and English,” MethodsX, vol. 12, p. 102627, 2024, doi: https://doi.org/10.1016/j.mex.2024.102627.

Y. Dwijayanty and N. A. W. S.A., MBA., Ph.D., “Perfume Influencers Impact on Customers’ Purchase Decision of HMNS Perfumery,” Int. J. Curr. Sci. Res. Rev., vol. 05, no. 12, Dec. 2022, doi: 10.47191/ijcsrr/V5-i12-31.

N. F. Isamudin and I. B. Islam, “Factors Influencing Young Consumer Trust in Online Purchase Decision of Luxury Perfume in Malaysia: A Preliminary Qualitative Study,” Int. J. Econ. Manag. Account., vol. 31, no. 2, pp. 299–323, Dec. 2023, doi: 10.31436/ijema.v31i2.1062.

H. Hussien, M. A. Saleh, R. Al Saed, and N. Sweidan, “The Impact of Perfume Packaging on Consumer Buying Behavior of Jordanian Female,” Int. J. Bus. Excell., vol. 1, no. 1, p. 1, 2020, doi: 10.1504/IJBEX.2020.10030847.

L. Udayanga, N. Subashini, M. Udugama, P. Silva, and T. Ranathunge, “Knowledge, perceptions, and consumption behaviour of cosmetics among undergraduates of Sri Lanka: a descriptive cross-sectional study,” Front. Public Heal., vol. 11, Jan. 2024, doi: 10.3389/fpubh.2023.1184398.

Published
2025-01-17
How to Cite
[1]
Z. Z. Alkaf, M. I. Fawzi, M. W. R. Sastyawan, and R. D. Putera, “UNRAVELING OF MEN’S FRAGRANCE PREFERENCES ON ONLINE MARKETPLACES: A MACHINE LEARNING STUDY USING DBSCAN CLUSTERING AND LINEAR REGRESSION”, J. Tek. Inform. (JUTIF), vol. 5, no. 6, pp. 1913-1920, Jan. 2025.