Demand forecasting using statistical and machine learning algorithms

Authors

Vasudev, Shekhar Ramesh

Issue Date

2019

Degree

MSc in Data Analytics

Publisher

Dublin Business School

Rights

Items in eSource are protected by copyright. Previously published items are made available in accordance with the copyright policy of the publisher/copyright holder.

Abstract

This research titled “Demand forecasting using statistical and machine learning algorithms” aims to compare the performance of traditional statistical and machine learning algorithms to forecast the demand for 50 products. Demand forecasting is a crucial part of a firm’s operations, it aims at predicting and estimating the future demand of products to aid the decision making process. The current research conducted compares the time series models with one another to identify the better model, this research focuses on implementing different algorithms to identify the variation in performance for each product. Traditional statistical models such as ARIMA and Theta method are implemented alongside Machine learning algorithms such as MLP and a new technology by H2O called Driverless AI. The accuracy of each algorithm is evaluated using the Back-testing technique by splitting the existing data into a train and test set. The models are built using the train set and the demand of the products are forecasted for the existing year, the forecasted values are compared with the values of the test set to compute the MAPE. After computing the errors of the models, the entire data is used to forecast the future demand for the products using each of these algorithms. The results show that, ARIMA accurately forecasts the demand for 10 out of the 50 products, Theta accurately forecasts the demand for 25 out of the 50 products and MLP accurately forecasts for the remaining 15 products. ARIMA couldn’t handle the products with a strong pattern and returned a generic model, Theta method and MLP are able to decompose the data and forecast for products with a strong pattern. The Driverless AI was out performed by ARIMA and Theta for all the products and the MLP for majority of the products. The conclusion of this research is that, different statistical and machine learning algorithms needs to be implemented when forecasting the demand of a set of products to identify the best performing algorithm for each product.