Jovian
Sign In

Project 1 Web Scraping With Python

Zero to Data Science - Project 1: Disney-Wiki Movie Data Scraper

alt

Project Summary

This project will use Requests and Beautiful Soup to scrape the list of movies from Disney's list of movies Wikipedia page (https://en.wikipedia.org/wiki/List_of_Walt_Disney_Pictures_films) and pull from these links (see image below) the data contained in the information box (see image below) from each movie's individual Wikipedia page. This data will be organized in to a JSON and slightly cleaned before saving into a JSON that will be loaded via a Pandas DataFrame to be accessed for further cleaning and data analysis.

List of Movie Links Example
alt
Information Box Example
alt

Import Necessary Libraries

wanderduck
wanderduck6 months ago