Learn practical skills, build real-world projects, and advance your career

Web Scraping Project- Amazon top rated Books

alt

alt

Web Scraping

Web scraping is the process of extracting and parsing data from websites in an automated fashion using a computer program. It's a useful technique for creating datasets for research and learning. Mostly it is unstructured html data which is then converted into structured data and stored in spreadsheet or in database format.

The steps we'll follow:

  • We're going to scrape https://www.amazon.in/gp/bestsellers/books/
  • We'll get a list of topics.
  • For each topic, we'll get topic title, topic page URL
  • For each topic, we'll get the top 50 books in the topic from the topic page
  • For each book, we'll grab the book name, book URL, author name,book price,star rating and No of customer rated as rating.
  • Save the information data to CSV file Using Pandas library

The output will look like this:

title, url ,book_name ,author name ,book price ,star rating , rating, book_url.

alt