Sign In

Formula One Visualized


Formula 1 is the highest class of international racing for open-wheel single-seater cars. The first driver to cross the finish line after a predefined amount of laps is declared the winner of the Grand Prix (Great Prize). Formula One often get's called the world's biggest moving circus, and not for nothing. In 2021 races were held in 22 countries across 5 different continents. With an average of 70.3 million viewers worldwide, the interest in the sport keeps on growing, and for good reason.

Teams use all sorts of data to predict and perfect strategies during the entire season. Formula One analyses in real-time during races and feeds this information in the form of polished graphs to the viewers. Not every graph made by other people can satisfy your own questions (in Formula One but really on every subject). Since there is much data floating around about the previous races, I will try to analyse some myself.

The dataset used is called Formula 1 World Championship (1950 - 2021) and is available for free on Kaggle. Kaggle is a webservice that supplies a broad variation of datasets for free. The Formula 1 dataset consists of 13 different csv files. There is a seperate file for drivers, constructors, lap times, pit stops, ... Some files can be linked together using id's.

This analysis is part of the course Data Analysis with Python: Zero to Pandas. This course starts with entry level Python programming and is thus available for everyone. The lessons are clearly explained and there is a wide community of people to help answer questions. If you are looking to get into Python I can highly recommend any of the courses listed here. Great thanks to Jovian for the (free) high quality lessons.

Ruben Debbaudt6 months ago