Exploratory Data Analysis Project
An Exploratory Data Analysis on e-Commerce events
This dataset contains
900K
detailed users events from an electronics store e-Commerce website
Image credits - rupixen.com from Unsplash
About the data
In this analysis we will use the "e-Commerce events history in electronics store"
data by Michael Kechinov on Kaggle from the REES46 Marketing Platform.
The Kaggle dataset is in
.csv
file format having900K
row and9
columns.
Each row in the file represents an event. All events are related to products and users. Each event is like many-to-many relation between products and users.
Below are description of 9
columns
event_time
: Time when the event occured (in UTC) in the format "2019-10-01 00:00:04+00:00"event_type
: A single event either "view", "cart" or "purchase"product_id
: A unique code for each productcategory_id
: A unique code for each product categorycategory_code
: A '.' separated text with names of categories and subcategories "computers.network.router"brand
: Brand name of the product "sony"price
: Price of the product in decimals "31.90"user_id
: A permanent unique ID per shopper "1515915625519388267"user_session
: A temporary ID per session "LJuJVLEjPT"
Events in the column event_type
can be:
view
- a user viewed a productcart
- a user added a product to shopping cartremove_from_cart
- a user removed a product from shopping cartpurchase
- a user purchased a product
What does each column tell us
At time event_time
and during session user_session
, user user_id
purchased
(event_type is "purchase") a product product_id
of brand brand
in category category_code
of price price
.