Extreme Gradient Boosting Model-based Forecasting of Big Data Online Sales Record
Keywords:
Big Data, E-Commerce, Extreme Gradient Boosting, Forecasting, PySpark.
Abstract
Big data now plays a crucial role in helping online e-commerce businesses generate more sales. Organizations use these very large collections of data to forecast which products, costs, and advertisements are most likely to maximize their profits. This paper applies an extreme gradient boosting (XGBoost) based model to forecast the sales growth of online products, specifically books and magazines, from the massive datasets generated by online shopping. PySpark, chosen as a suitable and compatible framework, is used for data analysis. The results show that the proposed model achieves higher forecasting accuracy and a lower error rate than the other models compared. A comparative visualization and conclusions are presented in terms of the proposed system's prediction accuracy, error rate, and efficiency.
Published
2022-03-25
Section
Research Articles
Copyright (c) 2022 SAMRIDDHI : A Journal of Physical Sciences, Engineering and Technology
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.