How can it be possible for tree h to fit the training examples better than h', but for it to perform
more poorly over subsequent examples?
1. Overfitting can occur when the training examples contain random errors or noise
2. When small numbers of examples are associated with leaf nodes.
Noisy Training Example ‘See
[9+,5-]
Example 15: <Sunny, Hot, Normal, Strong, -
Ed z 1,2,8,9, 11 Sunny am Rain 4,5,6,10,14
+ Example is noisy because the correct label is + [2+,3-] [3+,2-]
+ Previously constructed tree misclassifies it ¡ara 12,13
High Normal [4+,0-] Strong Light
Yes
128 91115 6,14 4,5,10
19437 [2411 Hot” mita Coo [+21 [3+,0]
15 Cae a> CD.
[0+ 1] 114,0]
1120.
Deepak D, Asst. Prof, Dept