A Comparative Study of Machine Learning Models for Fashion Product Demand Prediction: Exploring Algorithms, Data Splitting, and Feature Engineering
DOI:
https://doi.org/10.15408/aism.v8i1.45600Keywords:
Data splitting, demand prediction, fashion product, feature engineering, machine learningAbstract
The fashion industry faces challenges in accurately predicting demand due to inherent uncertainty, leading to suboptimal inventory and financial losses. Machine learning (ML) offers a robust solution by analyzing large and complex data, identifying non-linear patterns, and providing more accurate predictions than conventional methods that rely on limited factors. This research aims to compare and evaluate the performance of six different ML models—XGBoost, SVM, RF, GBM, KNN, and NN, considering the influence of feature engineering and various data split ratios on predicting fashion product demand. KNN and NN were included due to distinct modeling approaches and competitive capabilities in identifying local and non-linear patterns across numerical, categorical, and time series data. Techniques such as feature extraction and selection and various data split ratios (70:30, 80:20, 90:10) were used. Using Adidas sales data, the models were evaluated based on Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). The results indicate that the XGBoost-based model with feature engineering consistently outperforms the other models across all data split ratios. Particularly, XGBoost with feature engineering at a data split ratio of 90:10 achieved the best performance with an RMSE of 4.46 and an MAE of 1.51. Analyzing model performance shows that the predictive ability of ML models is influenced by the implementation of feature engineering and the selection of the data split ratio. These results demonstrate the potential of using feature-engineered XGBoost models and optimized data ratios to mitigate the risk of stockouts or overstocks, and reduce financial losses and environmental waste.
Downloads
References
O.-E. Ørebæk and M. Geitle, "Exploring the Hyperparameters of XGBoost Through 3D Visualizations," in AAAI Spring Symposium: Combining Machine Learning with Knowledge Engineering, 2021.
M. Koren and M. Shnaiderman, "Forecasting in the fashion industry: a model for minimising supply-chain costs," J International Journal of Fashion Design, Technology Education, vol. 16, no. 3, pp. 308-318, 2023. doi: https://doi.org/10.1080/17543266.2023.2201508
K. Swaminathan and R. Venkitasubramony, "Demand forecasting for fashion products: A systematic review," J International Journal of Forecasting, vol. 40, no. 1, pp. 247-267, 2024. doi: https://doi.org/10.1016/j.ijforecast.2023.02.005
R. Szabó-Geletóczki, E. Szabó, and I. Rudnák, "STOCKPILE MANAGEMENT THROUGH THE EVERYDAY OPERATION OF A PHARMACEUTICAL COMPANY," J Management/Vadyba, vol. 38, no. 1, 2022. doi: 10.38104/vadyba.2022.1.06
X. Long and L. Gui, "Waste Not Want Not? The Environmental Implications of Quick Response and Upcycling," J The Environmental Implications of Quick Response Upcycling, 2023. doi: https://dx.doi.org/10.2139/ssrn.4013877
A. R. Chowdhury, A. M. Mithu, S. Ahmad, and A. A. Malek, "A Data-Driven Approach to Inventory Control under Uncertain Demand for Pharmaceutical Products using Continuous Review Policy," J GPH-International Journal of Business Management, vol. 7, no. 01, pp. 32-51, 2024. doi: https://doi.org/10.5281/zenodo.10700678
A. K. Das, M. F. Hossain, B. U. Khan, M. M. Rahman, M. Asad, and M. Akter, "Circular economy: A sustainable model for waste reduction and wealth creation in the textile supply chain," J SPE Polymers, vol. 6, no. 1, p. e10171, 2025. doi: https://doi.org/10.1002/pls2.10171
Y. Kim and S. Wu, "Circular Economy Meets the Fashion Industry: Challenges and Opportunities in New York City," J Smart Sustainable Planning for Cities Regions: Results of SSPCR 3, pp. 293-312, 2021. doi: https://doi.org/10.1007/978-3-030-57332-4_21
W. Leal Filho et al., "An overview of the contribution of the textiles sector to climate change," J Frontiers in environmental science, vol. 10, p. 973102, 2022. doi: https://doi.org/10.3389/fenvs.2022.973102
A. Shah, R. M. Ellahi, U. Nazir, and M. A. Soomro, "Forecasting Practices in Textile and Apparel Export Industry: A Systematic Review," J International Journal of Circular Economy Waste Management, vol. 2, no. 1, pp. 1-17, 2022. doi: 10.4018/IJCEWM.288501
C. Giri and Y. Chen, "Deep learning for demand forecasting in the fashion and apparel retail industry," J Forecasting, vol. 4, no. 2, pp. 565-581, 2022. doi: https://doi.org/10.3390/forecast4020031
M. Kharfan, V. W. K. Chan, and T. Firdolas Efendigil, "A data-driven forecasting approach for newly launched seasonal products by leveraging machine-learning approaches," J Annals of Operations Research, vol. 303, no. 1-2, pp. 159-174, 2021. doi: https://doi.org/10.1007/s10479-020-03666-w
Y. Ledmaoui, A. El Maghraoui, M. El Aroussi, R. Saadane, A. Chebak, and A. Chehri, "Forecasting solar energy production: A comparative study of machine learning algorithms," J Energy Reports, vol. 10, pp. 1004-1012, 2023. doi: https://doi.org/10.1016/j.egyr.2023.07.042
R. Rai, M. K. Tiwari, D. Ivanov, and A. Dolgui, "Machine learning in manufacturing and industry 4.0 applications," J International Journal of Production Research, vol. 59, no. 16, pp. 4773-4778, 2021. doi: https://doi.org/10.1080/00207543.2021.1956675
U. Mehmood, A. K. Bashir, K. Rabie, J. Broderick, and S. Davies, "Vending Machine Product Demand Prediction Using Machine Learning Algorithms," in 2023 International Symposium on Networks, Computers and Communications (ISNCC), 2023, pp. 1-6: IEEE. doi: https://doi.org/10.1109/ISNCC58260.2023.10323888
İ. GÜVEN, Ö. UYGUN, and F. ŞİMŞİR, "Machine Learning Algorithms with Intermittent Demand Forecasting: An Application in Retail Apparel with Plenty of Predictors," J Textile Apparel, vol. 31, no. 2, pp. 99-110, 2021. doi: https://doi.org/10.32710/tekstilvekonfeksiyon.809867
S. Hwang, G. Yoon, E. Baek, and B.-K. Jeon, "A Sales Forecasting Model for New-Released and Short-Term Product: A Case Study of Mobile Phones," J Electronics, vol. 12, no. 15, p. 3256, 2023. doi: https://doi.org/10.3390/electronics12153256
L. P. E. Yani and A. Aamer, "Demand forecasting accuracy in the pharmaceutical supply chain: a machine learning approach," J International Journal of Pharmaceutical Healthcare Marketing, vol. 17, no. 1, pp. 1-23, 2023. doi: http://dx.doi.org/10.1108/IJPHM-05-2021-0056
A. Mitra, A. Jain, A. Kishore, and P. Kumar, "A comparative study of demand forecasting models for a multi-channel retail company: a novel hybrid machine learning approach," in Operations research forum, 2022, vol. 3, no. 4, p. 58: Springer. doi: https://doi.org/10.1007/s43069-022-00166-4
M. Saglam, C. Spataru, and O. A. Karaman, "Forecasting Electricity Demand in Turkey Using Optimization and Machine Learning Algorithms," J Energies, vol. 16, no. 11, p. 4499, 2023. doi: https://doi.org/10.3390/en16114499
A. Brüggen, I. Grabner, and K. L. Sedatole, "The folly of forecasting: The effects of a disaggregated demand forecasting system on forecast error, forecast positive bias, and inventory levels," J The Accounting Review, vol. 96, no. 2, pp. 127-152, 2021. doi: https://doi.org/10.2308/tar-2018-0559
F. Yiğit, Ş. ESNAF, and B. Y. KAVUŞ, "A Poisson-Regression, Support Vector Machine and Grey Prediction Based Combined Forecasting Model Proposal: A Case Study in Distribution Business," J Turkish Journal of Forecasting, vol. 5, no. 2, pp. 23-35, 2021. doi: https://doi.org/10.34110/forecasting.957494
I. Amellal, A. Amellal, H. Seghiouer, and M. Ech-Charrat, "An integrated approach for modern supply chain management: Utilizing advanced machine learning models for sentiment analysis, demand forecasting, and probabilistic price prediction," J Decision Science Letters, vol. 13, no. 1, pp. 237-248, 2024.
M. Rodrigues, V. Miguéis, S. Freitas, and T. Machado, "Machine learning models for short-term demand forecasting in food catering services: A solution to reduce food waste," J Journal of Cleaner Production, vol. 435, p. 140265, 2024. doi: https://doi.org/10.1016/j.jclepro.2023.140265
D. Chung, C. G. Lee, and S. Yang, "A Hybrid Machine Learning Model for Demand Forecasting: Combination of K-means, Elastic-Net, and Gaussian Process Regression," J International Journal of Intelligent Systems Applications in Engineering, vol. 11, no. 6s, pp. 325-336, 2023.
N. Son and Y. Shin, "Short-and Medium-Term Electricity Consumption Forecasting Using Prophet and GRU," J Sustainability, vol. 15, no. 22, p. 15860, 2023. doi: https://doi.org/10.3390/su152215860
I.-F. Chen and C.-J. Lu, "Demand forecasting for multichannel fashion retailers by integrating clustering and machine learning algorithms," J Processes, vol. 9, no. 9, p. 1578, 2021. doi: https://doi.org/10.3390/pr9091578
Q. H. Nguyen et al., "Influence of data splitting on performance of machine learning models in prediction of shear strength of soil," J Mathematical Problems in Engineering, vol. 2021, no. 1, p. 4832864, 2021. doi: https://doi.org/10.1155/2021/4832864
J. Kamiri and G. Mariga, "Research methods in machine learning: A content analysis," J International Journal of Computer Information Technology, vol. 10, no. 2, 2021. doi: https://doi.org/10.24203/ijcit.v10i2.79
I. Vallés-Pérez, E. Soria-Olivas, M. Martínez-Sober, A. J. Serrano-López, J. Gómez-Sanchís, and F. Mateo, "Approaching sales forecasting using recurrent neural networks and transformers," J Expert Systems with Applications, vol. 201, p. 116993, 2022. doi: https://doi.org/10.1016/j.eswa.2022.116993
J. Yang, X. Tan, and S. Rahardja, "Outlier detection: How to select k for k-nearest-neighbors-based outlier detectors," J Pattern Recognition Letters, vol. 174, pp. 112-117, 2023. doi: https://doi.org/10.1016/j.patrec.2023.08.020
M. K. Dahouda and I. Joe, "A deep-learned embedding technique for categorical features encoding," J IEEE Access, vol. 9, pp. 114381-114391, 2021. doi: https://doi.org/10.1109/ACCESS.2021.3104357
A. Alabrah, "An improved CCF detector to handle the problem of class imbalance with outlier normalization using IQR method," J Sensors, vol. 23, no. 9, p. 4406, 2023. doi: https://doi.org/10.3390/s23094406
Q. H. Nguyen et al., "Influence of data splitting on performance of machine learning models in prediction of shear strength of soil," J Mathematical Problems in Engineering, vol. 2021, pp. 1-15, 2021. doi: https://doi.org/10.1155/2021/4832864
A. Grigorev, Machine Learning Bookcamp: Build a Portfolio of Real-life Projects. Shelter Island: Manning Publications co, 2021.
N. Shirzadi, A. Nizami, M. Khazen, and M. Nik-Bakht, "Medium-term regional electricity load forecasting through machine learning and deep learning," J Designs, vol. 5, no. 2, p. 27, 2021. doi: https://doi.org/10.3390/designs5020027
H. Darmawan, M. Yuliana, M. Hadi, and Z. Samsono, "GRU and XGBoost Performance with Hyperparameter Tuning Using GridSearchCV and Bayesian Optimization on an IoT-Based Weather Prediction System," J International Journal on Advanced Science, Engineering Information Technology, vol. 13, no. 3, p. 851, 2023. doi: https://doi.org/10.18517/ijaseit.13.3.18377
D. D. M. C. Maceda and J. C. D. Cruz, "Rainfall Classification Model for the Philippines using Optimized K-nearest Neighbor Algorithm with GridSearchCV Hyperparameter Tuning," in 2023 IEEE 13th International Conference on Control System, Computing and Engineering (ICCSCE), 2023, pp. 51-55: IEEE. doi: https://doi.org/10.1109/ICCSCE58721.2023.10237156
D. Chicco, M. J. Warrens, and G. Jurman, "The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation," J PeerJ Computer Science, vol. 7, p. e623, 2021. doi: https://doi.org/10.7717/peerj-cs.623
P. Dhal and C. Azad, "A comprehensive survey on feature selection in the various fields of machine learning," J Applied Intelligence, vol. 52, no. 4, pp. 4543-4581, 2022. doi: https://doi.org/10.1007/s10489-021-02550-9
S. Demir and E. K. Sahin, "An investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoost," J Neural Computing Applications, vol. 35, no. 4, pp. 3173-3190, 2023. doi: https://doi.org/10.1007/s00521-022-07856-4
C. Qin, Y. Zhang, F. Bao, C. Zhang, P. Liu, and P. Liu, "XGBoost optimized by adaptive particle swarm optimization for credit scoring," J Mathematical Problems in Engineering, vol. 2021, pp. 1-18, 2021. doi: https://doi.org/10.1155/2021/6655510
J. Wang and S. Zhou, "Particle swarm optimization‐XGBoost‐based modeling of radio‐frequency power amplifier under different temperatures," J International Journal of Numerical Modelling: Electronic Networks, Devices Fields, vol. 37, no. 2, p. e3168, 2024. doi: https://doi.org/10.1002/jnm.3168
L. Barreñada, P. Dhiman, D. Timmerman, A.-L. Boulesteix, and B. Van Calster, "Understanding overfitting in random forest for probability estimation: a visualization and simulation study," J Diagnostic Prognostic Research, vol. 8, no. 1, p. 14, 2024. doi: https://doi.org/10.1186/s41512-024-00177-1
J. Oh, K.-J. Ha, and Y.-H. Jo, "A predictive model of seasonal clothing demand with weather factors," J Asia-Pacific Journal of Atmospheric Sciences, vol. 58, no. 5, pp. 667-678, 2022. doi: https://doi.org/10.1007/s13143-022-00284-3
S. M. Robeson and C. J. Willmott, "Decomposition of the mean absolute error (MAE) into systematic and unsystematic components," J PloS one, vol. 18, no. 2, p. e0279774, 2023. doi: https://doi.org/10.1371/journal.pone.0279774
T. O. Hodson, "Root mean square error (RMSE) or mean absolute error (MAE): When to use them or not," J Geoscientific Model Development Discussions, vol. 2022, pp. 1-10, 2022. doi: https://doi.org/10.5194/gmd-15-5481-2022, 2022.
J. Wu, Y. Li, and Y. Ma, "Comparison of XGBoost and the neural network model on the class-balanced datasets," in 2021 IEEE 3rd international conference on frontiers technology of information and computer (ICFTIC), 2021, pp. 457-461: IEEE. doi: https://doi.org/10.1109/ICFTIC54370.2021.9647373
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Reviana Siti Mardiah, Fitrianingsih Fitrianingsih

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.