Learn practical skills, build real-world projects, and advance your career

Dataset II - Census

  • Using sklearn and XGboost for classification
  • Decision trees for classification
  • Random forest
  • AdaBoost classifier

Source: UCI Machine Learning Repository Census dataset

import jovian
jovian.commit(project='dataset2-classification', filename='dataset2-classification.ipynb')
[jovian] Attempting to save notebook.. [jovian] Updating notebook "patxigad/dataset2-classification" on https://jovian.ai/ [jovian] Uploading notebook.. [jovian] Capturing environment.. [jovian] Committed successfully! https://jovian.ai/patxigad/dataset2-classification

Getting the data

# main imports
import pandas as pd
import datetime as dt
import numpy as np

import seaborn as sns
import matplotlib
import matplotlib.pyplot as plt
%matplotlib inline

sns.set_style('darkgrid')
matplotlib.rcParams['font.size'] = 14
matplotlib.rcParams['figure.figsize'] = (9, 5)
matplotlib.rcParams['figure.facecolor'] = '#00000000'

# silence warnings
import warnings
warnings.filterwarnings('ignore')
# upload 'census.data' to DF and display first few rows
df = pd.read_csv('http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data', header=None)
df.head()