Sign In
Learn practical skills, build real-world projects, and advance your career

Learn how to implement machine learning algorithms using Python with this student project blog. Discover tips and tricks for successful ML projects.


ML Course Project - ZERO TO GBMs : Health Insurance Cross sell Prediction 🏠 🏥

Sushil Rajeeva Bhandary

Welcome to my Machine Learning project for the course Zero to GBMs from I'd like to take a moment to thank the instructor Aakash N S and the whole team for this awesome course, I got to learn a whole lot of ML, Linear Regression using Scikit Learn, Logestic Reasinging for Classification, Decision Trees and Random Forests, intuitives behind the Gradient Boosting with XGBoost and I appreciate them greatly for that.

Health Insurance Cross Sell Prediction 🏠 🏥

Predict Health Insurance Owners' who will be interested in Vehicle Insurance

Problem Statement -

Our client is an Insurance company that has provided Health Insurance to its customers now they need your help in building a model to predict whether the policyholders (customers) from past year will also be interested in Vehicle Insurance provided by the company.

An insurance policy is an arrangement by which a company undertakes to provide a guarantee of compensation for specified loss, damage, illness, or death in return for the payment of a specified premium. A premium is a sum of money that the customer needs to pay regularly to an insurance company for this guarantee.

For example, you may pay a premium of Rs. 5000 each year for a health insurance cover of Rs. 200,000/- so that if, God forbid, you fall ill and need to be hospitalised in that year, the insurance provider company will bear the cost of hospitalisation etc. for upto Rs. 200,000. Now if you are wondering how can company bear such high hospitalisation cost when it charges a premium of only Rs. 5000/-, that is where the concept of probabilities comes in picture. For example, like you, there may be 100 customers who would be paying a premium of Rs. 5000 every year, but only a few of them (say 2-3) would get hospitalised that year and not everyone. This way everyone shares the risk of everyone else.

Just like medical insurance, there is vehicle insurance where every year customer needs to pay a premium of certain amount to insurance provider company so that in case of unfortunate accident by the vehicle, the insurance provider company will provide a compensation (called ‘sum assured’) to the customer.

Building a model to predict whether a customer would be interested in Vehicle Insurance is extremely helpful for the company because it can then accordingly plan its communication strategy to reach out to those customers and optimise its business model and revenue.

Now, in order to predict, whether the customer would be interested in Vehicle insurance, you have information about demographics (gender, age, region code type), Vehicles (Vehicle Age, Damage), Policy (Premium, sourcing channel) etc.

Understanding of Problem Statement and Data -

Predicting whether user who has a health insurance will be interested in getting a vehicle insurance or not based on the following data -

  • Gender
  • Age
  • Driving Licencse Status
  • Region Code
  • Previously Vehicle Insurance Status
  • Vehicle Age
  • Vehicle Damage
  • Annual Premium
  • Sales Channel
  • No. of Days of Customer Association (Vintage)
!pip install jovian --upgrade --quiet
import jovian