Updated 4 years ago
Dataset II - Census
- Using sklearn and XGboost for classification
- Decision trees for classification
- Random forest
- AdaBoost classifier
Source: UCI Machine Learning Repository Census dataset
import jovian
jovian.commit(project='dataset2-classification', filename='dataset2-classification.ipynb')
[jovian] Attempting to save notebook..
[jovian] Updating notebook "patxigad/dataset2-classification" on https://jovian.ai/
[jovian] Uploading notebook..
[jovian] Capturing environment..
[jovian] Committed successfully! https://jovian.ai/patxigad/dataset2-classification
Getting the data
# main imports
import pandas as pd
import datetime as dt
import numpy as np
import seaborn as sns
import matplotlib
import matplotlib.pyplot as plt
%matplotlib inline
sns.set_style('darkgrid')
matplotlib.rcParams['font.size'] = 14
matplotlib.rcParams['figure.figsize'] = (9, 5)
matplotlib.rcParams['figure.facecolor'] = '#00000000'
# silence warnings
import warnings
warnings.filterwarnings('ignore')
# upload 'census.data' to DF and display first few rows
df = pd.read_csv('http://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data', header=None)
df.head()