Learn practical skills, build real-world projects, and advance your career

EXPLORATORY DATA ANALYSIS ON VIDEO GAMES SALES

INTRODUCTION

Exploratory analysis carried out on video games sales from the year 1980 until 2020. The dataset has 11 columns which are Index, Rank, Video game name, Gaming platform, Year of release, Genre, Publisher, European sales, Global sales,Japan sales and sales from Other countries. There are 16598 rows in total. The dataset includes giants in the gaming wall. The aim of this analysis is to see what Genre is the most popular among gamers and one platform is the most popular to publish on. The insights from the analysis will enable a game start-up firm launch a product that would sell.

DATASET DESCRIPTION

This dataset contains a list of video games with sales greater than 100,000 copies. It was generated by a scrape of vgchartz.com.

Fields include

  • Rank: Ranking of overall sales
  • Name: The games name
  • Platform: Platform of the games release (i.e. PC,PS4, etc.)
  • Year: Year of the game's release
    -Genre: Genre of the game
    -Publisher: Publisher of the game
    -NA_Sales: Sales in North America (in millions)
    -EU_Sales: Sales in Europe (in millions)
    -JP_Sales: Sales in Japan (in millions)
    -Other_Sales: Sales in the rest of the world (in millions)
    -Global_Sales: Total worldwide sales.

The script to scrape the data is available at https://github.com/GregorUT/vgchartzScrape.
It is based on BeautifulSoup using Python.
There are 16,598 records. 2 records were dropped due to incomplete information.

LOAD PYTHON LIBRARIES
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
%matplotlib inline 
sns.set_style('darkgrid')