sources of error in machine learning mlb

seshathirid 13 views 13 slides Sep 11, 2024

Slide 1 of 13

About This Presentation

sources of error

Size: 868.06 KB

Language: en

Added: Sep 11, 2024

Slides: 13 pages

Slide Content

Sources of error CS771: Introduction to Machine Learning Nisheeth

Understanding error in machine learning Cross-validation Learning with Decision Trees Plan for today 2

Every ML model has some hyperparameters that need to be tuned, e.g., K in KNN or in -NN Choice of distance to use in LwP or nearest neighbors Would like to choose h.p. values that would give best performance on test data Hyperparameter Selection 3

Generalization How well does a learned model generalize from the data it was trained on to a new test set? Training set (labels known) Test set (labels unknown) Slide credit: L. Lazebnik

Generalization Components of generalization error Bias: how much the average model over all training sets differ from the true model? Error due to inaccurate assumptions/simplifications made by the model Variance: how much models estimated from different training sets differ from each other Underfitting : model is too “simple” to represent all the relevant class characteristics High bias and low variance High training error and high test error Overfitting: model is too “complex” and fits irrelevant characteristics (noise) in the data Low bias and high variance Low training error and high test error Slide credit: L. Lazebnik

No Free Lunch Theorem Slide credit: D. Hoiem

Bias-Variance Trade-off Models with too few parameters are inaccurate because of a large bias (not enough flexibility). Models with too many parameters are inaccurate because of a large variance (too much sensitivity to the sample). Slide credit: D. Hoiem

Bias-Variance Trade-off E(MSE) = noise 2 + bias 2 + variance See the following for explanations of bias-variance (also Bishop’s “Neural Networks” book): http://www.inf.ed.ac.uk/teaching/courses/mlsc/Notes/Lecture4/BiasVariance.pdf Unavoidable error Error due to incorrect assumptions Error due to variance of training samples Image credit: geeksforgeeks.com Slide credit: D. Hoiem

Bias-variance tradeoff Training error Test error Underfitting Overfitting Complexity Low Bias High Variance High Bias Low Variance Error Slide credit: D. Hoiem

Bias-variance tradeoff Many training examples Few training examples Complexity Low Bias High Variance High Bias Low Variance Test Error Slide credit: D. Hoiem

Effect of Training Size Testing Training Generalization Error Number of Training Examples Error Fixed prediction model Slide credit: D. Hoiem

The perfect classification algorithm Objective function: encodes the right loss for the problem Parameterization: makes assumptions that fit the problem Regularization: right level of regularization for amount of training data Training algorithm: can find parameters that maximize objective on training set Inference algorithm: can solve for objective function in evaluation Slide credit: D. Hoiem

Remember… No classifier is inherently better than any other: you need to make assumptions to generalize Three kinds of error Inherent: unavoidable Bias: due to over-simplifications Variance: due to inability to perfectly estimate parameters from limited data Slide credit: D. Hoiem

sources of error in machine learning mlb

About This Presentation

Slide Content

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

sources of error in machine learning mlb

About This Presentation

Slide Content

Slide 1

Slide 2

Slide 3

Slide 4

Slide 5

Slide 6

Slide 7

Slide 8

Slide 9

Slide 10

Slide 11

Slide 12

Slide 13

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

8-top-ai-courses-for-customer-support-representatives-in-2025.pptx

7-essential-ai-courses-for-call-center-supervisors-in-2025.pptx

25-essential-ai-courses-for-user-support-specialists-in-2025.pptx

8-essential-ai-courses-for-insurance-customer-service-representatives-in-2025.pptx

Know for Certain

PPT OPD LES 3ertt4t4tqqqe23e3e3rq2qq232.pptx