It will give idea about Simple linear Regression and Multiple Regression
Size: 3.48 MB
Language: en
Added: Jan 16, 2021
Slides: 27 pages
Slide Content
Lecture 10
Regression
Dr. Ashish. C. Patel
Assistant Professor,
Dept. of Animal Genetics & Breeding,
Veterinary College, Anand
STAT-531
Data Analysis using Statistical Packages
•It is sometime very important to determine that
how the change in one variable brings about the
change in another variable. For examples:
Experiment Dependent/Response
Variable (Y)
Independent/Predictor
variable (X)
How alteration of air
temperature affects feed
intake?
Feed intake Air temperature
How increasing the protein
level in a feed intake affects
daily gain?
Daily gain Protein level
How the quality of concentrate
affects the milk yield in cow?
Milk yield Concentrate quality
•Thesimplelinearregressionisoftheformy=a+bx,
where‘b’representstheslopeoftheline(alsocalled
asregressioncoefficient)and'a'theinterceptofthe
line.
•Inastudywheredataonageandweightofanimalsare
available,agecouldbeconsideredastheindependent
variable,whileweightasthedependentvariable.
•Itmeansthatweightregressesonage.
•Unlike the two correlation coefficients (r
xy
and r
yx
) that
have the same computational formula, the two
regression coefficients (b
yx
and b
xy
) are obtained by
different formulas.
Twomainapplicationsofregressionanalysisare:
•Estimationoffunctionofdependencybetweenvariables
•Predictionofvaluesofthedependentvariableusingnew
measurementoftheindependentvariables
Thesimpleregressionmodel
•Aregressionthatexplainsthechangeofadependentvariable
basedonchangesofoneindependentvariableiscalledasimple
linearregression
•Asimplelinearregressionmodel:y=a+bx+e,
Where
Y=dependentvariable(orResponsevariable)
X=independentvariable(orPredictorvariable)
e=randomerror,
A=Constant,y-interceptoftheline
B=Regressioncoefficient,slopeoftheline
•In the regression line y = a + bx, a and b are calculated by
•Assumptions of Simple linear regression
model:
•Linearityof the relationship between
dependent and independent variables
•Independenceof the errors (no serial
correlation)
•Homoscedasticity(constant variance) of the
errors
•Normalityof the error distribution
•Test of significance of regression coefficient (b
yx)
•We can test whether a sample regression coefficient
is significantly different from zero either by using a
“t” test or by F-test as described below:
•H
0
: b = 0
•H
a
: b ≠ 0
Y = -514.14 + 5.41 X
•For X = 230
•Ŷ = -514.14 + 5.41 (230)
= -514.14+1244.3
=730.16 kg
Regression Analysis: How Do I Interpret R-
squared and Assess the Goodness-of-Fit?
•Linearregressioncalculates
anequationthatminimizes
thedistancebetweenthe
fittedlineandallofthe
datapoints.
•Technically,ordinaryleast
squares(OLS)regression
minimizesthesumofthe
squaredresiduals.
Residual = Observed value -Fitted value
What is R-squared value?
•R-squared is a statistical measure of how close the data are
to the fitted regression line. It is also known as the
coefficient of determination, or the coefficient of multiple
determination for multiple regression.
•R-squared = Explained variation / Total variation
•R-squared is always between 0 and 100%:
•Higher the R-squared, the better the model fits your data.