Scraping Diabetologists on Credihealth
Scraping Diabetologists on Credihealth using Python
Data source: Credihealth website (https://www.credihealth.com/doctors/india/diabetes)
About Credihealth: Credihealth is an online solution for our healthcare needs. Credihealth's team of medical experts is there for its customers at every step of the way, from finding the right doctor and hospital to any other kind of assistance. Credihealth educates and empowers families to ensure that the right healthcare decisions are made.
Motive of Project
For this project, we'll retrieve the list of diabetologists from the Credihealth webpage using web scraping.
Web scraping: a technique for automatically extracting large amounts of data from websites and saving it to a file or database. The scraped data is usually stored in a tabular or spreadsheet format (e.g., a CSV file).
We'll use the Python libraries requests and beautifulsoup4 to scrape the information from the web page.
Below are the steps we're going to follow to complete the project:
- Download the webpage using the requests library
- Parse the HTML source code using beautifulsoup4
- Extract the doctor name, hospital name, degree, designation, and fee discount
- Compile the extracted information into dictionaries and lists and create a dataframe using pandas
- Create a CSV file
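The steps above can be sketched end to end as follows. This is only a minimal illustration, not the notebook's actual code: the real page's markup is not shown here, so the sample HTML, the CSS class names (doctor-card, doctor-name, and so on), and the helper names download_page and parse_doctors are all assumptions made for the example. The parsing runs on the inline sample so that it works without network access.

```python
import pandas as pd
import requests
from bs4 import BeautifulSoup

URL = "https://www.credihealth.com/doctors/india/diabetes"

# ASSUMPTION: hypothetical markup standing in for the real page, whose
# tags and class names are not known here and will differ.
SAMPLE_HTML = """
<div class="doctor-card">
  <h2 class="doctor-name">Dr. Example One</h2>
  <p class="hospital">Example Hospital, Delhi</p>
  <p class="degree">MBBS, MD</p>
  <p class="designation">Consultant - Diabetology</p>
  <p class="discount">10% off</p>
</div>
<div class="doctor-card">
  <h2 class="doctor-name">Dr. Example Two</h2>
  <p class="hospital">Sample Clinic, Mumbai</p>
  <p class="degree">MBBS</p>
  <p class="designation">Senior Consultant</p>
</div>
"""

# Output column name -> CSS selector inside one doctor card (hypothetical).
FIELDS = {
    "Doctor Name": ".doctor-name",
    "Hospital Name": ".hospital",
    "Degree": ".degree",
    "Designation": ".designation",
    "Discount": ".discount",
}


def download_page(url):
    """Step 1: download the page and return its HTML source as text."""
    response = requests.get(url, timeout=30)
    response.raise_for_status()  # raise an error on a 4xx/5xx response
    return response.text


def parse_doctors(html):
    """Steps 2-3: parse the HTML and extract one dictionary per doctor."""
    soup = BeautifulSoup(html, "html.parser")
    doctors = []
    for card in soup.select("div.doctor-card"):
        row = {}
        for column, selector in FIELDS.items():
            tag = card.select_one(selector)
            # A missing field (e.g. no discount listed) is stored as None.
            row[column] = tag.get_text(strip=True) if tag else None
        doctors.append(row)
    return doctors


# Step 4: compile the list of dictionaries into a pandas DataFrame.
doctors = parse_doctors(SAMPLE_HTML)
doctors_df = pd.DataFrame(doctors, columns=list(FIELDS))

# Step 5: serialise as CSV. With no path, to_csv returns the CSV text;
# pass a file name (e.g. "diabetologists.csv") to write a file instead.
csv_text = doctors_df.to_csv(index=False)
print(csv_text)
```

To run this against the live site, one would call parse_doctors(download_page(URL)) after replacing the selectors in FIELDS with ones that match the page's actual HTML, which can be found with the browser's "Inspect" tool.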
How to run the code
This tutorial is an executable Jupyter notebook hosted on Jovian. You can run this tutorial and experiment with the code examples in a couple of ways: using free online resources (recommended) or on your computer.