New York City Airbnb Analysis

This project analyzes Airbnb listings and activity metrics in New York City for 2019. The dataset contains information about hosts, geographical availability, price, minimum nights, number of reviews, etc. The dataset was obtained from Kaggle, and we will try to answer questions such as, but not limited to:

  • What is the most popular room type in NYC?
  • What is the average listing price?
  • Which host has the highest number of reviews?

In the first phase of this analysis, I will prepare and clean the data using the Pandas and NumPy libraries, perform numerical computations, and visualize results using matplotlib and seaborn. I will then ask and answer questions about the data, summarize my inferences, and write a conclusion.

Downloading the Dataset

I downloaded the dataset from Kaggle with the following steps:

  • I installed the 'opendatasets' library
  • I imported the 'opendatasets' library and downloaded the dataset to my Jupyter notebook using the 'od.download' function
!pip install jovian opendatasets --upgrade --quiet

Let's begin by downloading the data, and listing the files within the dataset.

dataset_url = 'https://www.kaggle.com/datasets/dgomonov/new-york-city-airbnb-open-data'
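The download and file-listing step described above could be sketched roughly as follows. This is only an illustration: the helper names `dataset_dir_from_url` and `download_and_list` are hypothetical, and the sketch assumes that `od.download` saves the files into a folder in the current directory named after the last segment of the dataset URL. The download itself prompts for a Kaggle username and API key.

```python
import os

dataset_url = 'https://www.kaggle.com/datasets/dgomonov/new-york-city-airbnb-open-data'


def dataset_dir_from_url(url):
    """Return the last path segment of the dataset URL.

    Assumption: opendatasets uses this segment as the download folder name.
    """
    return url.rstrip('/').split('/')[-1]


def download_and_list(url):
    """Download the Kaggle dataset and return the names of the downloaded files."""
    # Imported here so the helper above works even without opendatasets installed.
    import opendatasets as od

    od.download(url)  # prompts for Kaggle credentials on first use
    data_dir = os.path.join('.', dataset_dir_from_url(url))
    return os.listdir(data_dir)
```

In a notebook cell, calling `download_and_list(dataset_url)` would then download the data and show which files are available to load.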