Favorita Grocery Sales Forecasting
Corporación Favorita Grocery Sales Forecasting
Description of Problem
This notebook attempts to solve for a Kaggle problem given here: https://www.kaggle.com/competitions/favorita-grocery-sales-forecasting/. We are trying to solve for a delicate situation that brick and motar grocery stores face with overstocking goods and perishable goods. Items that are popular are quickly sold out while the other goods are left overstocked. As the retailer add more variables like locations, new product demands or seasonal products, to the problem, the issue becomes more complex. This data is derived from a large Ecuadorian-based grocery retailer, Corporación Favorita, they operate on a large scale with hundreds of locations and with more than 200,000 different products on their shelves. The question that we are trying to answer here is how to predict an accurate sales forcast. For this we will be using various machine learning models to do a predictive analysis. Python libraries like sklearn come in handy for this operation.
Import Python Libraries
Start with importing the libraries
!pip install jovian --upgrade --quiet
!pip install opendatasets --upgrade --quiet
!pip install plotly.express --upgrade --quiet
!pip install py7zr --upgrade --quiet
!pip install xgboost --upgrade --quiet
!pip install "dask[dataframe]" --upgrade --quiet
!pip install pandas ----upgrade --quiet
WARNING: You are using pip version 22.0.4; however, version 22.3 is available.
You should consider upgrading via the '/home/ec2-user/anaconda3/envs/python3/bin/python -m pip install --upgrade pip' command.
WARNING: You are using pip version 22.0.4; however, version 22.3 is available.
You should consider upgrading via the '/home/ec2-user/anaconda3/envs/python3/bin/python -m pip install --upgrade pip' command.
WARNING: You are using pip version 22.0.4; however, version 22.3 is available.
You should consider upgrading via the '/home/ec2-user/anaconda3/envs/python3/bin/python -m pip install --upgrade pip' command.
WARNING: You are using pip version 22.0.4; however, version 22.3 is available.
You should consider upgrading via the '/home/ec2-user/anaconda3/envs/python3/bin/python -m pip install --upgrade pip' command.
WARNING: You are using pip version 22.0.4; however, version 22.3 is available.
You should consider upgrading via the '/home/ec2-user/anaconda3/envs/python3/bin/python -m pip install --upgrade pip' command.
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
distributed 2021.11.2 requires dask==2021.11.2, but you have dask 2022.10.0 which is incompatible.
WARNING: You are using pip version 22.0.4; however, version 22.3 is available.
You should consider upgrading via the '/home/ec2-user/anaconda3/envs/python3/bin/python -m pip install --upgrade pip' command.
Usage:
pip install [options] <requirement specifier> [package-index-options] ...
pip install [options] -r <requirements file> [package-index-options] ...
pip install [options] [-e] <vcs project url> ...
pip install [options] [-e] <local project path> ...
pip install [options] <archive url/path> ...
no such option: ----upgrade
import jovian