Calories Burnt
Prediction using Ridge
regression and XGBOOST
Size: 1.05 MB
Language: en
Added: Sep 22, 2024
Slides: 25 pages
Slide Content
Calories Burnt
Prediction using
Machine Learning
with Python
Sports and new
Technologies
TABLE OF CONTENTS
01
02 What Happens when you
exercise ?
Problematic
Introduction
Working method
03
04
Conclusion
RESULT
CRISP-DM
2
INTRODUCTION
Today’s Topic
01
3
“How can they measure burnt calories
while exercising ?”
PROBLEMATIC
02
4
Correlations between
calories burnt, exercise
and body type
SOLUTION
5
CRISP-DM
CRoss Industry Standard Process for Data Mining
03
6
Regression is a modeling task
that involves predicting a
numeric value given an input.
Linear
regression
Relationship between
dependent variable (Y) and one
or more independent variables
(X) using a best fit straight line
.
16
Ridge
regression
Technique used when the data
suffers from multicollinearity .
?
Ridge regression solves the
multicollinearity problem
through shrinkage the
coefficients for those input
variables that do not
contribute much to the
prediction task .
Including all parameters leads
to a high level of overfitting.
Find the coefficients that
minimize the sum of error
squares by applying a penalty
to these coefficients.
17
Ridge
regression
Parameter λ penalty term :
If λ is zero, this gives us the classical
regression equation.
The higher the Alpha values, the
greater the penalty ,therefore, the
size of the coefficients is reduced.
Better than the Least
Squares method when
there are too many
parameters.
18
Penalty term λ
19
Modeling
XGBoost
XGBoost is an efficient algorithm that can
be used for regression predictive
modeling.
20
Modeling
Testing set
Training set
70 %
30 %
21
CRISP-Dm
22
Evaluation
R Square
Mean Absolute Error
Mean Square Error
n = the number of
errors,
Σ = summation symbol
|xi – x| = the absolute
errors.
mae = metrics.mean_absolute_error(Y_test,
test_data_prediction)
Mean Absolute Error = 2.7159012502233186
23