Classification Trees

Decision trees can overfit

How to split nodes?

Feature selection

Missing values in a feature

  1. Put the most common value for that class → Mean/Median
  2. Find another column with high correlation, use regression with that column to find value