Learn practical skills, build real-world projects, and advance your career

Dataset IV - Exoplanets

  • Using sklearn and XGboost for classification
  • XGBoost with imbalanced data

Source: Kaggle Exoplanets dataset

import jovian
jovian.commit(project='dataset4-classification', filename='dataset4-classification.ipynb')
[jovian] Attempting to save notebook.. [jovian] Updating notebook "patxigad/dataset4-classification" on https://jovian.ai/ [jovian] Uploading notebook.. [jovian] Capturing environment.. [jovian] Committed successfully! https://jovian.ai/patxigad/dataset4-classification

Getting the data

# main imports
import pandas as pd
import datetime as dt
import numpy as np

import seaborn as sns
import matplotlib
import matplotlib.pyplot as plt
%matplotlib inline

sns.set()
matplotlib.rcParams['font.size'] = 14
matplotlib.rcParams['figure.figsize'] = (9, 5)
matplotlib.rcParams['figure.facecolor'] = '#00000000'

# silence warnings
import warnings
warnings.filterwarnings('ignore')
# upload 'exoplanets' to DF and display first few rows
df = pd.read_csv('exoTrain.csv', nrows=400)
df.head()