Jovian
Sign In

Instagram Webscraper

Web Scraping the most hired Instagam Influencers of GCC countries from an instagram marketing platform - Star N Gage using Python

bannerstarngageinfluencer

Star N Gage is a popular instagram marketing platform for sharing a directory of the most hired infuencers from Instagram in a particulary country. For Example, this directory showcases a list of the most hired influencers in UAE. The list showcases the username of Instagram handle of the influencer, the gcc "Country" they represent and the "no of followers" they have in the instagram and also many engagement indicators like "likes", "comments", "engagement rate".

In this project, we'll retrieve information from this page using web scraping: The process of extracting information from a website in an automated fashion using code. We'll use the python libraries Requests and Beautiful Soup to scrape data from this page.

We will use the python libraries requests and beautifulsoup4 to scrape data from this page. We use

Here is an outline of the steps we'll follow:

  1. Download the webpage using requests.
  2. Parse the HTML source code using beautifulsoup.
  3. Extract the usernames from influential instagram users, their instagram profile links,
    and the GCC country they represent.
  4. Compile extracted information into Python Lists and Dictionaries.
  5. Extract and combine data from multiple pages.
  6. Save the extracted information to a CSV file.

By the end of the project, we'll create a csv file in the following format:

Instagram Handle,Instagram Profile Link,Country
@zeefu,https://instagram.com/zeefu,United Arab Emirates
@aakash,https://instagram.com/aakash,Kuwait
...

How to Run the Code

You can execute the code using the "Run" button at the top of this page and selecting " Run on Binder ". You can make changes and save your own version of the notebook in Jovian by executing the following cells:

!pip install jovian --upgrade --quiet

Download the webpage using requests.

We'll use the requests library to download the webpage.

The library can be installed using pip

!pip install requests --upgrade --quiet
zeefu
zeefu6 months ago