Statistical Reasoning in education- Multiple linear regression

Yuvarani749017 27 views 53 slides Jul 10, 2024
Slide 1
Slide 1 of 53
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53

About This Presentation

PMC500


Slide Content

PMC 500 STATISTICAL REASONING IN EDUCATION Lecture 9: Multiple Linear Regression Dr. Nor Asniza Ishak Dr. AhmadZamri Khairani

1. Multiple Linear Regression Multiple linear regression is used to predict scores on one variable using scores from two or more different variables based on a multiple regression plane.  The variable that is being predicted is known as the dependent or criterion variable (Y )  The variables that are used to predict scores are known as the independent or predictor variables (X1 , X2 , ... Xk )

2. Multiple Regression Plane The multiple regression plane is a plane that best represents the linear relationship between two or more predictor variables (X1 , X2 , ... Xk ) and one criterion variable (Y ) based on the method of least squares

2. Multiple Regression Plane The method of least squares fits the multiple regression plane in such a way that the sum of squares of the difference between the actual value of Y and the predicted value of Y (Y’ ) is a minimum Or in symbol,

2. Multiple Regression Plane To build the multiple regression plane, the equation of the multiple regression plane ( multiple regression equation ) needs to be determined.

3. Multiple Regression Equation

3. Multiple Regression Equation

3. Multiple Regression Equation Example: To determine the multiple regression equation involving the linear relationship between two predictor variables (X1 and X2 ) and one criterion variable (Y )

3. Multiple Regression Equation

3. Multiple Regression Equation

Example

Obtaining a 3-D Scatterplot using SPSS Create a data file Graphs > Chart Builder ... > Scatter/Dot Move Simple 3-D Scatter to Chart preview box Move the criterion variable, Y to the Y-axis: box Move the predictor variables X1 to the X-axis box and X2 to the Z- axis: box Click OK Double-click on the 3-D scatterplot Click Options > Show Grid Lines > Close Click on any point Click Elements > Show Data Labels > Close Close Chart Editor

Example Cost of advertisement (X1) and number of students (X2) are the predictor variables Sale of books (Y) is the criterion variable Plot graph Y against X1 and X2 to produce a 3-D Scatterplot by using SPSS

3-D Scatterplot

Obtaining the multiple regression equation

4. Obtaining the Multiple Regression Equation Using SPSS  Create a data file Click Analyze > Regression > Linear When the Linear Regression box is displayed, move the criterion variable (Y ) to the Dependent: box and move the predictor variables (X1 and X2) to the Independent(s): box Click OK

4. Obtaining the Multiple Regression Equation Using SPSS

Meaning of a The value of a = 6.40 means that the predicted value of sale of books (Y’) is 6.40 thousand Ringgit Malaysia (or RM 6 400) when the advertising cost of advertisement (X1) is zero and the number of students (X2) is zero

Meaning of b 1 The value of b1 = 20.49 means that the predicted value of sale of books (Y’) increases by 20.49 thousand ringgit Malaysia (i.e., RM 20 490) for a 1-unit change (i.e., RM 1 000) in the cost of advertisement (X1) when the number of students (X2) remains constant.

Meaning of b 2 The value of b2 = 0.28 means that the predicted value of sale of books (Y’) increases by 0.28 thousand Ringgit Malaysia (i.e., RM 280) for a 1-unit change (i.e., 1000) in the number of students (X2) when the cost of advertisement (X1) remains constant.

5. Errors of Prediction

6. Standard Error of the Estimate

6. Standard Error of the Estimate

6. Standard Error of the Estimate n multiple linear regression, the predictor variables should have a strong linear relationship with the criterion variable, but have a weak linear relationship between themselves The smaller the standard error of the estimate, the better is the prediction of the value of the criterion variable from the values of the predictor variables

7. Obtaining the Standard Error of the Estimate Using SPSS

Standard error of the estimate

Effect Size R Square (R2 ) is the effect size for multiple linear regression Shows the percentage of variance in the criterion variable (Y) that is explained by the linear combination of the predictor variables.

Adjusted R Square

8. Testing the Multiple Correlation Coefficient for Statistical Significance

8. Testing the Multiple Correlation Coefficient for Statistical Significance

8. Testing the Multiple Correlation Coefficient for Statistical Significance Steps for the significance test: State the null and alternative hypotheses Set the significance level for rejecting the null hypothesis Analyse the data using SPSS Make a decision about the null hypothesis Draw a conclusion

Example 1 A researcher wanted to predict the sale of wood from the price of wood and thickness of wood. Table 1 shows the sale of wood, price of wood and thickness of wood for 30 pieces of wood that were sold by a hardware shop. Test the hypothesis of whether the price of wood and thickness of wood are significant predictors of the sale of wood at the 0.05 level of significance. Which predictor variable is the better predictor?

Step 1: State the null and alternative hypotheses

Step 2: Set the significance level for rejecting the null hypothesis Example: Set α = .05, If p ≤ .05, reject the null hypothesis. If p > .05, fail to reject the null hypothesis.

Step 3: Analyse the data using SPSS Create a data file Analyze > Regression > Linear Move the criterion variable (Y ) to the Dependent: box Move the predictor variables (X1 and X2) to the Independent(s): box Statistics... > Confidence intervals Level (95 %): > Descriptives > Casewise diagnostics Continue Plots... Move *ZRESID to Y box Move *ZPRED to X box Choose Histogram and Normal probability plot Continue > OK

Step 4: Make a decision about the null hypothesis

Step 5: Draw a Conclusion

Step 5: Draw a Conclusion Which predictor variable is a significant predictor of the sale of wood? Price of wood or thickness of wood or both?

Step 5: Draw a Conclusion

Step 5: Draw a Conclusion

Step 5: Draw a Conclusion Which predictor variable is a better predictor of the sale of wood? The price of wood or the thickness of wood?

Step 5: Draw a Conclusion

THANK YOU #STAYSAFE #STAYATHOME
Tags