Evaluation of Diagnostic Models
Welcome to the second assignment of course 1. In this assignment, we will be working with the results of the X-ray classification model we developed in the previous assignment. In order to make the data processing a bit more manageable, we will be working with a subset of our training, and validation datasets. We will also use our manually labeled test dataset of 420 X-rays.
As a reminder, our dataset contains X-rays from 14 different conditions diagnosable from an X-ray. We'll evaluate our performance on each of these classes using the classification metrics we learned in lecture.
Outline
Click on these links to jump to a particular section of this assignment!
By the end of this assignment you will learn about:
- Accuracy
- Prevalence
- Specificity & Sensitivity
- PPV and NPV
- ROC curve and AUCROC (c-statistic)
- Confidence Intervals
1. Packages
In this assignment, we'll make use of the following packages:
- numpy is a popular library for scientific computing
- matplotlib is a plotting library compatible with numpy
- pandas is what we'll use to manipulate our data
- sklearn will be used to measure the performance of our model
Run the next cell to import all the necessary packages as well as custom util functions.
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
import util