Learn practical skills, build real-world projects, and advance your career

Evaluation of Diagnostic Models

Welcome to the second assignment of course 1. In this assignment, we will be working with the results of the X-ray classification model we developed in the previous assignment. In order to make the data processing a bit more manageable, we will be working with a subset of our training, and validation datasets. We will also use our manually labeled test dataset of 420 X-rays.

As a reminder, our dataset contains X-rays from 14 different conditions diagnosable from an X-ray. We'll evaluate our performance on each of these classes using the classification metrics we learned in lecture.

By the end of this assignment you will learn about:

  1. Accuracy
  2. Prevalence
  3. Specificity & Sensitivity
  4. PPV and NPV
  5. ROC curve and AUCROC (c-statistic)
  6. Confidence Intervals

1. Packages

In this assignment, we'll make use of the following packages:

  • numpy is a popular library for scientific computing
  • matplotlib is a plotting library compatible with numpy
  • pandas is what we'll use to manipulate our data
  • sklearn will be used to measure the performance of our model

Run the next cell to import all the necessary packages as well as custom util functions.

import numpy as np
import matplotlib.pyplot as plt 
import pandas as pd  

import util