A forecasting method of multi-category product sales: analysis and application

Management System Engineering - Tập 2 - Trang 1-8 - 2023
Jing Wang1,2, Ling Luo3
1College of Information Engineering, Wuchang Institute of Technology, Wuhan, China
2School of Civil Engineering, Wuhan University, Wuhan, China
3School of Management, Wuhan University of Technology, Wuhan, China

Tóm tắt

To solve the problems of high prediction costs and difficult practices in multi-category product classification in the retail industry, optimize the inventory, and improve resilience, this work introduces a forecasting method for multi-category product sales. The forecasting method divides the data into a category set and a numerical set, uses the stacking strategy, and combines it with catboost, decision tree, and extreme gradient boosting. During the feature engineering process, the ratio and classification features are added to the category feature set; the sales at t are used for training to obtain the prediction at (t + 1); and the features used in the prediction at time (t + 1) are generated by the prediction results at t. The update processes of the two sets are combined to form a joint feature update mechanism, and multiple features of k categories are jointly updated. Using this method, data of all categories of retail stores can be linked so that historical data of different categories of goods can provide support for the prediction of each category of goods and solve the problem of insufficient product data and features. The method is verified on the retail sales data obtained from the Kaggle platform, and the mean absolute error and weighted mean absolute percentage error are adopted to analyze the performance of the model. The results reveal that the forecasting method can provide a useful reference to decision-makers.

Tài liệu tham khảo

Amin, P., L. Cherkasova, R. Aitken, et al. 2019. Analysis and Demand Forecasting of Residential Energy Consumption at Multiple Time Scales. IFIP/IEEE Symposium on Integrated Network and Service Management 494–499. Box, G.E.P., G.M. Jenkins, and G.C. Reinsel. 1970. Time series analysis: forecasting and control. New Jersey: Wiley. De Livera, A.M., R.J. Hyndman, and R.D. Snyder. 2011. Forecasting time series with complex seasonal patterns using exponential smoothing. Journal of the American Statistical Association 106 (496): 1513–1527. https://doi.org/10.1198/jasa.2011.tm09771. Heydon, A., and M. Najork. 1999. Mercator: A scalable, extensible web crawler. World Wide Web 2 (4): 219–229. https://doi.org/10.1023/A:1019213109274. Islamoglu, E., A. Islamoglu, and H. Bulut. 2015. Comparison of forecasting methods for interval-valued time series. International Journal of Statistics and Applications 5 (6): 317–337. https://doi.org/10.5923/j.statistics.20150506.07. Kim, K.J. 2003. Financial time series forecasting using support vector machines. Neurocomputing 55 (1–2): 307–319. https://doi.org/10.1016/S0925-2312(03)00372-2. Lu, C. 2014. Sales forecasting of computer products based on variable selection scheme and support vector regression. Neurocomputing 128: 491–499. https://doi.org/10.1016/j.neucom.2013.08.012. Polak, Y. 2017. Computer Technology in Russia and in the Former Soviet Union. 2017 4th International Conference Computer Technology in Russia and in the Former Soviet Union SoRuCom 172–175. Ramos, P., N. Santos, and R. Rebelo. 2015. Performance of state space and ARIMA models for consumer retail sales forecasting. Robotics and Computer-Integrated Manufacturing 34: 151–163. https://doi.org/10.1016/j.rcim.2014.12.015. Rubio, G., H. Pomares, I. Rojas, et al. 2011. A heuristic method for parameter selection in LS-SVM: Application to time series prediction. International Journal of Forecasting 27 (3): 725–739. https://doi.org/10.1016/j.ijforecast.2010.02.007. Silahtarogu, G., H. Donertasli. 2015. Analysis and Prediction of E-Customers’ Behavior by Mining Clickstream Data. IEEE International Conference on Big Data. Santa Clara, CA, USA: IEEE 1466–1472. https://doi.org/10.1109/BigData.2015.7363908. Tang, Z., C.D. Almeida, and P.A. Fishwick. 1991. Time series forecasting using neural networks vs. Box-Jenkins methodology. SIMULATION 57 (5): 303–310. https://doi.org/10.1177/003754979105700508. Terui, N., and Y. Li. 2019. Measuring large-scale market responses and forecasting aggregated sales: Regression for sparse high-dimensional data. Journal of Forecasting 38 (5): 440–458. https://doi.org/10.1002/for.2574. Wang, J., C. Shu. 2022. A joint feature updating training method based on time-series prediction. CIBDA 2022–3rd International Conference on Computer Information and Big Data Applications 134–140. Wang, H., and C. Li. 2020. Application of Stacking integrated learning method in sales forecasting. Computer Applications and Software 37 (8): 85–90. https://doi.org/10.3969/j.issn.1000-386x.2020.08.016. Ye, Q., H. Rao, and M. Ji. 2017. Sales prediction of stores based on xgboost algorithm. Journal of Nanchang University (natural Science) 3 (41): 275–281. https://doi.org/10.3969/j.issn.1006-0464.2017.03.015. Yule, G.U. 1927. On a Method of Investigating Periodicities in Disturbed Series, with Special Reference to Wolfer’s Sunspot Numbers. Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character: 267–298. http://www.jstor.org/stable/91170. Zhang, G.P. 2003. Time series forecasting using a hybrid ARIMA and neural network model. Neurocomputing 50: 159–175. https://doi.org/10.1016/S0925-2312(01)00702-0. Zhang, L. 2020. Research and application of sales forecasting technology for spandex products. Shen Yang: University of Chinese Academy of Sciences. Zhao, H., and P. Li. 2020. Feature selection optimization method of machine learning based on neighbor gradient. Computer Simulation 37 (11): 289–293. https://doi.org/10.3969/j.issn.1006-9348.2020.11.060. Zhou, G., and F. Guo. 2020. Sampling diversity ensemble learning method for Maximizing minimum Margin. Computer Applications and Software 8: 300–306. https://doi.org/10.3969/j.issn.1000-386x.2020.08.051.