2015년 7월 31일 금요일

[editing] Ensemble learning for trees





Problem: Trees can grow too noisy. (= high variance)
Solutions: Bagging, Boosting, Random Forests

General performance: Boosting > Random Forests > Bagging > Single Tree

1. Bagging (= bootstrap aggregation)

  • Random samples of equal size -> generate a fitted tree for each
  • Average them!


2. Random Forests

  • De-correlate! How? 
  • At each point of split, pick sqrt(number of features) random features as candidates


3. Boosting (= stage-wise additive modeling)

  • Adaboost
  • Gradient Boosting Machine


References:

댓글 없음:

댓글 쓰기