What is a one way anova?

1,855 views 237 slides Sep 16, 2014
Slide 1
Slide 1 of 375
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59
Slide 60
60
Slide 61
61
Slide 62
62
Slide 63
63
Slide 64
64
Slide 65
65
Slide 66
66
Slide 67
67
Slide 68
68
Slide 69
69
Slide 70
70
Slide 71
71
Slide 72
72
Slide 73
73
Slide 74
74
Slide 75
75
Slide 76
76
Slide 77
77
Slide 78
78
Slide 79
79
Slide 80
80
Slide 81
81
Slide 82
82
Slide 83
83
Slide 84
84
Slide 85
85
Slide 86
86
Slide 87
87
Slide 88
88
Slide 89
89
Slide 90
90
Slide 91
91
Slide 92
92
Slide 93
93
Slide 94
94
Slide 95
95
Slide 96
96
Slide 97
97
Slide 98
98
Slide 99
99
Slide 100
100
Slide 101
101
Slide 102
102
Slide 103
103
Slide 104
104
Slide 105
105
Slide 106
106
Slide 107
107
Slide 108
108
Slide 109
109
Slide 110
110
Slide 111
111
Slide 112
112
Slide 113
113
Slide 114
114
Slide 115
115
Slide 116
116
Slide 117
117
Slide 118
118
Slide 119
119
Slide 120
120
Slide 121
121
Slide 122
122
Slide 123
123
Slide 124
124
Slide 125
125
Slide 126
126
Slide 127
127
Slide 128
128
Slide 129
129
Slide 130
130
Slide 131
131
Slide 132
132
Slide 133
133
Slide 134
134
Slide 135
135
Slide 136
136
Slide 137
137
Slide 138
138
Slide 139
139
Slide 140
140
Slide 141
141
Slide 142
142
Slide 143
143
Slide 144
144
Slide 145
145
Slide 146
146
Slide 147
147
Slide 148
148
Slide 149
149
Slide 150
150
Slide 151
151
Slide 152
152
Slide 153
153
Slide 154
154
Slide 155
155
Slide 156
156
Slide 157
157
Slide 158
158
Slide 159
159
Slide 160
160
Slide 161
161
Slide 162
162
Slide 163
163
Slide 164
164
Slide 165
165
Slide 166
166
Slide 167
167
Slide 168
168
Slide 169
169
Slide 170
170
Slide 171
171
Slide 172
172
Slide 173
173
Slide 174
174
Slide 175
175
Slide 176
176
Slide 177
177
Slide 178
178
Slide 179
179
Slide 180
180
Slide 181
181
Slide 182
182
Slide 183
183
Slide 184
184
Slide 185
185
Slide 186
186
Slide 187
187
Slide 188
188
Slide 189
189
Slide 190
190
Slide 191
191
Slide 192
192
Slide 193
193
Slide 194
194
Slide 195
195
Slide 196
196
Slide 197
197
Slide 198
198
Slide 199
199
Slide 200
200
Slide 201
201
Slide 202
202
Slide 203
203
Slide 204
204
Slide 205
205
Slide 206
206
Slide 207
207
Slide 208
208
Slide 209
209
Slide 210
210
Slide 211
211
Slide 212
212
Slide 213
213
Slide 214
214
Slide 215
215
Slide 216
216
Slide 217
217
Slide 218
218
Slide 219
219
Slide 220
220
Slide 221
221
Slide 222
222
Slide 223
223
Slide 224
224
Slide 225
225
Slide 226
226
Slide 227
227
Slide 228
228
Slide 229
229
Slide 230
230
Slide 231
231
Slide 232
232
Slide 233
233
Slide 234
234
Slide 235
235
Slide 236
236
Slide 237
237
Slide 238
238
Slide 239
239
Slide 240
240
Slide 241
241
Slide 242
242
Slide 243
243
Slide 244
244
Slide 245
245
Slide 246
246
Slide 247
247
Slide 248
248
Slide 249
249
Slide 250
250
Slide 251
251
Slide 252
252
Slide 253
253
Slide 254
254
Slide 255
255
Slide 256
256
Slide 257
257
Slide 258
258
Slide 259
259
Slide 260
260
Slide 261
261
Slide 262
262
Slide 263
263
Slide 264
264
Slide 265
265
Slide 266
266
Slide 267
267
Slide 268
268
Slide 269
269
Slide 270
270
Slide 271
271
Slide 272
272
Slide 273
273
Slide 274
274
Slide 275
275
Slide 276
276
Slide 277
277
Slide 278
278
Slide 279
279
Slide 280
280
Slide 281
281
Slide 282
282
Slide 283
283
Slide 284
284
Slide 285
285
Slide 286
286
Slide 287
287
Slide 288
288
Slide 289
289
Slide 290
290
Slide 291
291
Slide 292
292
Slide 293
293
Slide 294
294
Slide 295
295
Slide 296
296
Slide 297
297
Slide 298
298
Slide 299
299
Slide 300
300
Slide 301
301
Slide 302
302
Slide 303
303
Slide 304
304
Slide 305
305
Slide 306
306
Slide 307
307
Slide 308
308
Slide 309
309
Slide 310
310
Slide 311
311
Slide 312
312
Slide 313
313
Slide 314
314
Slide 315
315
Slide 316
316
Slide 317
317
Slide 318
318
Slide 319
319
Slide 320
320
Slide 321
321
Slide 322
322
Slide 323
323
Slide 324
324
Slide 325
325
Slide 326
326
Slide 327
327
Slide 328
328
Slide 329
329
Slide 330
330
Slide 331
331
Slide 332
332
Slide 333
333
Slide 334
334
Slide 335
335
Slide 336
336
Slide 337
337
Slide 338
338
Slide 339
339
Slide 340
340
Slide 341
341
Slide 342
342
Slide 343
343
Slide 344
344
Slide 345
345
Slide 346
346
Slide 347
347
Slide 348
348
Slide 349
349
Slide 350
350
Slide 351
351
Slide 352
352
Slide 353
353
Slide 354
354
Slide 355
355
Slide 356
356
Slide 357
357
Slide 358
358
Slide 359
359
Slide 360
360
Slide 361
361
Slide 362
362
Slide 363
363
Slide 364
364
Slide 365
365
Slide 366
366
Slide 367
367
Slide 368
368
Slide 369
369
Slide 370
370
Slide 371
371
Slide 372
372
Slide 373
373
Slide 374
374
Slide 375
375

About This Presentation

What is a one way anova?


Slide Content

One-Way Analysis of Variance

We will now venture into the world of One-way Analysis of Variance.

How did we get here?

How did we get here? We will begin with a real-world problem to consider how we got to a One-Way ANOVA.

How did we get here? We will begin with a real-world problem to consider how we got to a One-Way ANOVA . We want to determine if there is a statistically significant difference between English comprehension test scores

How did we get here? We will begin with a real-world problem to consider how we got to a One-Way ANOVA . We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor

How did we get here? We will begin with a real-world problem to consider how we got to a One-Way ANOVA . We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German ,

How did we get here? We will begin with a real-world problem to consider how we got to a One-Way ANOVA . We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French ,

How did we get here? We will begin with a real-world problem to consider how we got to a One-Way ANOVA . We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French, or Dutch Speaker.

How did we get here? First, we have to determine if the problem we are solving poses an inferential or descriptive question.

How did we get here? First, we have to determine if the problem we are solving poses an inferential or descriptive question. Problem

How did we get here? First, we have to determine if the problem we are solving poses an inferential or descriptive question. Problem Inferential Descriptive

How did we get here? First, we have to determine if the problem we are solving poses an inferential or descriptive question. Problem Inferential Descriptive We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? First, we have to determine if the problem we are solving poses an inferential or descriptive question. Problem Inferential Descriptive We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? First, we have to determine if the problem we are solving poses an inferential or descriptive question. Problem Inferential Descriptive We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker. Statistical Significance means we are dealing with the probability that something has or has not occurred which means we are dealing with INFERENTIAL STATISTICS

How did we get here? First, we have to determine if the problem we are solving poses an inferential or descriptive question. Problem Inferential Descriptive We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Second, we determine if the inferential problem poses a difference , relationship, independence or goodness of fit question.

How did we get here? Second, we determine if the inferential problem poses a difference , relationship, independence or goodness of fit question. Problem Inferential Descriptive Difference Relationship Goodness of Fit Independence

How did we get here? Second, we determine if the inferential problem poses a difference , relationship, independence or goodness of fit question. Problem Inferential Descriptive Difference Relationship Goodness of Fit Independence We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Second, we determine if the inferential problem poses a difference , relationship, independence or goodness of fit question. Problem Inferential Descriptive Difference Relationship Goodness of Fit Independence We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker. Notice the word DIFFERENCE here.

How did we get here? Second, we determine if the inferential problem poses a difference , relationship, independence or goodness of fit question. Problem Inferential Descriptive Difference Relationship Goodness of Fit Independence We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Third, we determine if we will use parametric or nonparametric analytical methods by determining if the distributions are normal , skewed or kurtotic .

How did we get here? Third, we determine if we will use parametric or nonparametric analytical methods by determining if the distributions are normal , skewed or kurtotic . Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence

How did we get here? Third, we determine if we will use parametric or nonparametric analytical methods by determining if the distributions are normal , skewed or kurtotic . Let’s say we find that the distribution is NORMAL Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence

How did we get here? Third, we determine if we will use parametric or nonparametric analytical methods by determining if the distributions are normal , skewed or kurtotic . Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Let’s say we find that the distribution is NORMAL

How did we get here? Third, we determine if we will use parametric or nonparametric analytical methods by determining if the distributions are normal , skewed or kurtotic . Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Let’s say we find that the distribution is NORMAL

How did we get here? Fourth, we determine if we will use parametric or nonparametric analytical methods by determining if the data are ratio/interval , ordinal , or nominal .

How did we get here? Fourth, we determine if we will use parametric or nonparametric analytical methods by determining if the data are ratio/interval , ordinal , or nominal . Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal

How did we get here? Fourth, we determine if we will use parametric or nonparametric analytical methods by determining if the data are ratio/interval , ordinal , or nominal . Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal Let’s say we find that the data are Ratio/Interval

How did we get here? Fourth, we determine if we will use parametric or nonparametric analytical methods by determining if the data are ratio/interval , ordinal , or nominal . Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal Let’s say we find that the data are Ratio/Interval

How did we get here? Fifth, we determine the number of dependent variables.

How did we get here? Fifth, we determine the number of dependent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables

How did we get here? Fifth, we determine the number of dependent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Fifth, we determine the number of dependent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Fifth, we determine the number of dependent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Sixth, we determine the number of independent variables.

How did we get here? Sixth, we determine the number of independent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables

How did we get here? Sixth, we determine the number of independent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables

How did we get here? Sixth, we determine the number of independent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Sixth, we determine the number of independent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Sixth, we determine the number of independent variables. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Seventh, we determine the number of levels of the independent variable.

How did we get here? Seventh, we determine the number of levels of the independent variable. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables 2 levels 3 or more levels

How did we get here? Seventh, we determine the number of levels of the independent variable. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables 2 levels 3 or more levels We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German, French or Dutch Speaker.

How did we get here? Seventh, we determine the number of levels of the independent variable. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables 2 levels 3 or more levels We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German , French or Dutch Speaker.

How did we get here? Seventh, we determine the number of levels of the independent variable. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables 2 levels 3 or more levels We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German , French or Dutch Speaker.

How did we get here? Seventh, we determine the number of levels of the independent variable. Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables 2 levels 3 or more levels We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German , French or Dutch Speaker. 3

Problem Inferential Descriptive Difference Relationship Distributions Normal Distributions Skewed or Kurtotic Goodness of Fit Independence Data: Ratio/Interval Data: Ordinal Data: Nominal 1 Dependent Variable 2 or more Dependent Variables 1 Independent Variable 2 or more Independent Variables 2 levels 3 or more levels How did we get here? Seventh, we determine the number of levels of the independent variable. These elements point to a one-way ANOVA as an appropriate method to answer this question. We want to determine if there is a statistically significant difference between English comprehension test scores of students who have an English speaking instructor who is a native German , French or Dutch Speaker.

Now let’s consider the concepts that support the use of a One-Way Analysis of Variance

Once we have made the leap from simple means-differences embedded in t-tests to the logic of sums of squares, a wide range of procedures opens up to us.

Once we have made the leap from simple means-differences embedded in t-tests to the logic of sums of squares, a wide range of procedures opens up to us. mean 1 mean 2 Simple Mean Difference

Once we have made the leap from simple means-differences embedded in t-tests to the logic of sums of squares, a wide range of procedures opens up to us. mean 2 mean 3 mean 1 Sums of Squares Logic mean 1 mean 2 Simple Mean Difference

Once we have made the leap from simple means-differences embedded in t-tests to the logic of sums of squares, a wide range of procedures opens up to us. mean 2 mean 3 mean 1 Sums of Squares Logic mean 1 mean 2 Simple Mean Difference The possibilities are endless !

Once we have made the leap from simple means-differences embedded in t-tests to the logic of sums of squares, a wide range of procedures opens up to us. mean 2 mean 3 mean 1 Sums of Squares Logic mean 1 mean 2 Simple Mean Difference The possibilities are endless ! (well, almost … )

As reminder, t -tests can compare means only between two levels of one independent variable.

As reminder, t -tests can compare means only between two levels of one independent variable . First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player

As reminder, t -tests can compare means only between two levels of one independent variable . First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player Second example : One Independent variable: Religious Affiliation Two levels : Catholic and Protestant

As reminder, t -tests can compare means only between two levels of one independent variable . First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player Second example : One Independent variable: Religious Affiliation Two levels : Catholic and Protestant Third example : One Independent variable: Age Two levels : 19-45 yrs old and 31-64 yrs old

Fourth example : One Independent variable: Test takers in a stats class Two levels : Pre-test and posttest

Fourth example : One Independent variable: Test takers in a stats class Two levels : Pre-test and posttest Fifth example : One Independent variable: People from Texas Two levels : Population of Texans and a sample of Texans who ate raw fish

… across one dependent variable.

… across one dependent variable. First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player

… across one dependent variable. First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player One Dependent variable : degree of flexibility

… across one dependent variable. First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player One Dependent variable : degree of flexibility Second example : One Independent variable: Religious Affiliation Two levels : Catholic and Protestant

… across one dependent variable. First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player One Dependent variable : degree of flexibility Second example : One Independent variable: Religious Affiliation Two levels : Catholic and Protestant One Dependent variable : level of education

… across one dependent variable. First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player One Dependent variable : degree of flexibility Second example : One Independent variable: Religious Affiliation Two levels : Catholic and Protestant One Dependent variable : level of education Third example : One Independent variable: Age Two levels : 19-45 yrs old and 31-64 yrs old

… across one dependent variable. First example : One Independent variable: Athlete Two levels : Soccer player and Basketball player One Dependent variable : degree of flexibility Second example : One Independent variable: Religious Affiliation Two levels : Catholic and Protestant One Dependent variable : level of education Third example : One Independent variable: Age Two levels : 19-45 yrs old and 31-64 yrs old One Dependent variable : level of comfort with technology

Fourth example : One Independent variable: Test takers in a stats class Two levels : Pre-test and posttest

Fourth example : One Independent variable: Test takers in a stats class Two levels : Pre-test and posttest One Dependent variable : scores on a basic statistics test

Fourth example : One Independent variable: Test takers in a stats class Two levels : Pre-test and posttest One Dependent variable : scores on a basic statistics test Fifth example : One Independent variable: People from Texas Two levels : Population of Texans and a sample of Texans who ate raw fish

Fourth example : One Independent variable: Test takers in a stats class Two levels : Pre-test and posttest One Dependent variable : scores on a basic statistics test Fifth example : One Independent variable: People from Texas Two levels : Population of Texans and a sample of Texans who ate raw fish One Dependent variable : IQ scores

With a one way analysis of variance, on the other hand, we are generally comparing more than two levels of an independent variable.

With a one way analysis of variance, on the other hand, we are generally comparing more than two levels of an independent variable. First example : One Independent variable: Athlete Two levels : Soccer players and Basketball players One Dependent variable: degree of flexibility

With a one way analysis of variance, on the other hand, we are generally comparing more than two levels of an independent variable. First example : One Independent variable: Athlete Three levels : Soccer players and Basketball players and Football players One Dependent variable: degree of flexibility

With a one way analysis of variance, on the other hand, we are generally comparing more than two levels of an independent variable. First example : One Independent variable: Athlete Three levels : Soccer players and Basketball players and Football Players One Dependent variable: degree of flexibility Second example : One Independent variable : Religious Affiliation Two levels: Catholic and Protestant One Dependent variable: level of education

With a one way analysis of variance, on the other hand, we are generally comparing more than two levels of an independent variable. First example : One Independent variable: Athlete Three levels : Soccer players and Basketball players and Football players One Dependent variable: degree of flexibility Second example : One Independent variable : Religious Affiliation Four levels: Catholic and Protestant and Mormon and Muslim One Dependent variable : level of education

With a one way analysis of variance, on the other hand, we are generally comparing more than two levels of an independent variable. First example : One Independent variable: Athlete Three levels : Soccer players and Basketball players and Football players One Dependent variable: degree of flexibility Second example : One Independent variable : Religious Affiliation Four levels: Catholic and Protestant and Mormon and Muslim One Dependent variable : level of education Third example : One Independent variable: Age Two levels : 19-30 yrs old and 31-64 yrs old One Dependent variable: level of comfort with technology

With a one way analysis of variance, on the other hand, we are generally comparing more than two levels of an independent variable. First example : One Independent variable: Athlete Three levels : Soccer players and Basketball players and Football players One Dependent variable: degree of flexibility Second example : One Independent variable : Religious Affiliation Four levels: Catholic and Protestant and Mormon and Muslim One Dependent variable : level of education Third example : One Independent variable: Age Three levels: 19-30 yrs old and 31-64 yrs old and 65-79 yrs old One Dependent variable: level of comfort with technology

Fourth example : One Independent variable: Test takers in a stats class Two levels : Pre-test and posttest One Dependent variable: scores on a basic statistics test

Fourth example : One Independent variable: Test takers in a stats class Three levels : Pre-test and midterm test and posttest One Dependent variable: scores on a basic statistics test

Fourth example : One Independent variable: Test takers in a stats class Three levels : Pre-test and midterm test and posttest One Dependent variable: scores on a basic statistics test Fifth example : One Independent variable: People from Texas Two levels: Population of Texans and a sample of Texans who ate raw fish One Dependent variable: IQ scores

Fourth example : One Independent variable: Test takers in a stats class Three levels : Pre-test and midterm test and posttest One Dependent variable: scores on a basic statistics test Fifth example : One Independent variable: People from Texas Three levels: Population of Texans and a sample of Texans who ate raw fish and a sample of Texans who ate cooked fish One Dependent variable: IQ scores

While there is a whole family of procedures under the umbrella of Analysis of Variance we will begin with the most basic: One-Way Analysis of Variance .

While there is a whole family of procedures under the umbrella of Analysis of Variance we will begin with the most basic: One-Way Analysis of Variance . First, let’s remind ourselves what variance is before we do an “analysis” of it.

While there is a whole family of procedures under the umbrella of Analysis of Variance we will begin with the most basic: One-Way Analysis of Variance . First, let’s remind ourselves what variance is before we do an “analysis” of it. Technically, the variance is the average squared distance of the scores around the mean.

Variance Conceptually, the variance represents the average squared deviations of scores from the mean .

Variance Conceptually, the variance represents the average squared deviations of scores from the mean . So, what does that mean?

Variance Conceptually, the variance represents the average squared deviations of scores from the mean . So, what does that mean? Here is a visual depiction of the variance:

Variance Conceptually, the variance represents the average squared deviations of scores from the mean . So, what does that mean? Here is a visual depiction of the variance: 1 2 3 4 5 6 7 8 9 1 2 3 4 5

Variance Conceptually, the variance represents the average squared deviations of scores from the mean . So, what does that mean? Here is a visual depiction of the variance: 1 2 3 4 5 6 7 8 9 1 2 3 4 5 So, the mean of this distribution is 5

Variance Conceptually, the variance represents the average squared deviations of scores from the mean . So, what does that mean? Here is a visual depiction of the variance: 1 2 3 4 5 6 7 8 9 1 2 3 4 5 And this observation is 9 So we subtract this observation “9” from the mean “5” and we get +4 This means that this observation is 4 units from the mean. And this observation is 3 units from the mean. And this observation is 2 units from the mean. And this observation is 1 unit from the mean. And this observation is 1 unit from the mean. And this observation is units from the mean. And this observation is -1 units from the mean. And this observation is 2 units from the mean. And this observation is 1 unit from the mean. And this observation is units from the mean. And this observation is units from the mean. And this observation is units from the mean. And this observation is -1 units from the mean. And this observation is -1 units from the mean. And this observation is -2 units from the mean. And this observation is -2 units from the mean. And this observation is -3 units from the mean. And this observation is -4 units from the mean.

Variance Conceptually, the variance represents the average squared deviations of scores from the mean . You might think that the variance is just the average of all of the deviations (the average distance between each score from the mean), right? 1 2 3 4 5 6 7 8 9 1 2 3 4 5 But, because half of the deviations are positive and the other half are negative, if you take the average it will come to zero 1 2 3 4 1 2 1 2 1 3 4 So to avoid this from happening, we square each deviation ( -4 2 = 16 , -3 2 = 9, - 2 2 = 4, -1 2 = 1, 1 2 = 1, 2 2 = 4 , 3 2 = 9, 4 2 = 16). Then we add up all of the squared deviations and divide that number by the number of observations 4 units above the mean squared = 16 2 units above the mean squared = 4 1 unit below the mean squared = 1 4 units above the mean squared = 16 Now we sum all of the squared deviations 16 + 9 + 4 + 4 + 1 + 1 + 1 + 0 + 0 + 0 + 0 + 1+ 1 + 1 + 4 + 4 + 9 + 16 = 116 Remember – these values represent the distance or deviation from the mean, squared. This is Sum of Squared Deviation The average of the sum of squared deviations = = 116 SS /18 observations = 6.4 Variance

The “one way” in one way analysis of variance indicates that we have only one dependent variable

The “one way” in one way analysis of variance indicates that we have only one dependent variable Dependent Variable: amount of pizza eaten

The “one way” in one way analysis of variance indicates that we have only one dependent variable and only one independent variable

The “one way” in one way analysis of variance indicates that we have only one dependent variable and only one independent variable Independent Variable: athletes

The “one way” in one way analysis of variance indicates that we have only one dependent variable and only one independent variable with at least two levels.

The “one way” in one way analysis of variance indicates that we have only one dependent variable and only one independent variable with at least two levels. Level one: football player

Level two: basketball player The “one way” in one way analysis of variance indicates that we have only one dependent variable and only one independent variable with at least two levels. Level one: football player

Note that an independent sample t -test could be run as well . The results will be the same, but one-way ANOVA can also analyze at least three levels.

Note that an independent sample t -test could be run as well . The results will be the same, but one-way ANOVA can also analyze at least three levels. Level one: football player

Note that an independent sample t -test could be run as well . The results will be the same, but one-way ANOVA can also analyze at least three levels. Level two: basketball player Level one: football player

Note that an independent sample t -test could be run as well . The results will be the same, but one-way ANOVA can also analyze at least three levels. Level two: basketball player Level one: football player Level three: soccer player

We begin with a question:

We begin with a question: Who eats more slices of pizza in one sitting, football players, basketball players or soccer players ?

We begin with a question: Who eats more slices of pizza in one sitting, football players, basketball players or soccer players? How would you convert this question into a null hypothesis ?

N ull hypothesis:

N ull hypothesis: There is no statistically significant difference in the quantity of pizza consumed in one sitting among football, basketball, and soccer players .

N ull hypothesis: There is no statistically significant difference in the quantity of pizza consumed in one sitting among football, basketball, and soccer players . To test whether the null hypothesis will be retained or rejected we compare a value called the F statistic or F ratio with what we call the F critical value .

We will compute the F ratio from the following data set:

We will compute the F ratio from the following data set: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

We will compute the F ratio from the following data set: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

We will compute the F ratio from the following data set : And then check to see if it is larger than the F-critical. If it is, then we will reject our null hypothesis: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

We will compute the F ratio from the following data set : And then check to see if it is larger than the F-critical. If it is, then we will reject our null hypothesis: There is no statistically significant difference in the quantity of pizza consumed in one sitting among football, basketball, and soccer players . football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

We will compute the F ratio from the following data set : And then check to see if it is larger than the F-critical. If it is, then we will reject our null hypothesis: There is no statistically significant difference in the quantity of pizza consumed in one sitting among football, basketball, and soccer players . football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

Otherwise, we will retain the null hypothesis:

Otherwise, we will retain the null hypothesis: There is no statistically significant difference in the quantity of pizza consumed in one sitting among football, basketball, and soccer players .

So, let’s say after doing our Analysis of Variance calculation we have an F ratio or F value of 22.17 (nice round number, right? ).

So, let’s say after doing our Analysis of Variance calculation we have an F ratio or F value of 22.17 (nice round number, right? ). We would then compare it with the F critical value.

So, let’s say after doing our Analysis of Variance calculation we have an F ratio or F value of 22.17 (nice round number, right? ). We would then compare it with the F critical value. If the F ratio is larger than the F critical value then we would consider it to be a rare occurrence and reject the null hypothesis .

So, let’s say after doing our Analysis of Variance calculation we have an F ratio or F value of 22.17 (nice round number, right? ). We would then compare it with the F critical value. If the F ratio is larger than the F critical value then we would consider it to be a rare occurrence and reject the null hypothesis . So, let’s see if it is

So, let’s say after doing our Analysis of Variance calculation we have an F ratio or F value of 22.17 (nice round number, right? ). We would then compare it with the F critical value. If the F ratio is larger than the F critical value then we would consider it to be a rare occurrence and reject the null hypothesis . So, let’s see if it is by looking at what is called the Table of Probabilities for the F Distribution .

So, let’s say after doing our Analysis of Variance calculation we have an F ratio or F value of 22.17 (nice round number, right? ). We would then compare it with the F critical value. If the F ratio is larger than the F critical value then we would consider it to be a rare occurrence and reject the null hypothesis . So, let’s see if it is by looking at what is called the Table of Probabilities for the F Distribution . This table will help us locate the F critical for our data set.

Let’s say that the F critical value turns out to be 5.14 (we’ll show you how we got that value later).

Let’s say that the F critical value turns out to be 5.14 (we’ll show you how we got that value later).

Let’s say that the F critical value turns out to be 5.14 (we’ll show you how we got that value later).

Let’s say that the F critical value turns out to be 5.14 (we’ll show you how we got that value later) . So, with an F critical of 5.14 and an F ratio of 22.17, (we will show you how to calculate this F ratio from the data set below ),

Let’s say that the F critical value turns out to be 5.14 (we’ll show you how we got that value later) . So, with an F critical of 5.14 and an F ratio of 22.17, (we will show you how to calculate this F ratio from the data set below ), football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

Let’s say that the F critical value turns out to be 5.14 (we’ll show you how we got that value later) . So, with an F critical of 5.14 and an F ratio of 22.17, (we will show you how to calculate this F ratio from the data set below ), we would have to say that that is a rare occurrence at the .05 alpha level and we would reject the null hypothesis. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

Obviously, if the critical F value had been, say, 9.55,

Obviously, if the critical F value had been, say, 9.55,

Obviously, if the critical F value had been, say, 9.55,

Obviously, if the critical F value had been, say, 9.55, we would have to say that that is a not a rare occurrence at the .05 alpha level and we would retain the null hypothesis.

Obviously, if the critical F value had been, say, 9.55, we would have to say that that is a not a rare occurrence at the .05 alpha level and we would retain the null hypothesis. So how did we calculate the 22.17 F ratio in the first place ? And how was the F critical determined so we could reject or retain our null- hypothesis?

Here are the steps we follow to determine to reject or retain the null-hypothesis: (Note – there will be new terminology. Don’t be too concerned about it. At a certain point each concept below will be explained to you )

Here are the steps we follow to determine to reject or retain the null-hypothesis: (Note – there will be new terminology. Don’t be too concerned about it. At a certain point each concept below will be explained to you ) Step 1 – calculate the sums of squares between groups

Here are the steps we follow to determine to reject or retain the null-hypothesis: (Note – there will be new terminology. Don’t be too concerned about it. At a certain point each concept below will be explained to you ) Step 1 – calculate the sums of squares between groups Step 2 – calculate the sums of squares within groups

Here are the steps we follow to determine to reject or retain the null-hypothesis: (Note – there will be new terminology. Don’t be too concerned about it. At a certain point each concept below will be explained to you ) Step 1 – calculate the sums of squares between groups Step 2 – calculate the sums of squares within groups Step 3 – place the sums of squares values in an ANOVA table

Here are the steps we follow to determine to reject or retain the null-hypothesis: (Note – there will be new terminology. Don’t be too concerned about it. At a certain point each concept below will be explained to you ) Step 1 – calculate the sums of squares between groups Step 2 – calculate the sums of squares within groups Step 3 – place the sums of squares values in an ANOVA table Step 4 - determine the degrees of freedom

Here are the steps we follow to determine to reject or retain the null-hypothesis: (Note – there will be new terminology. Don’t be too concerned about it. At a certain point each concept below will be explained to you ) Step 1 – calculate the sums of squares between groups Step 2 – calculate the sums of squares within groups Step 3 – place the sums of squares values in an ANOVA table Step 4 - determine the degrees of freedom Step 5 – divide the between and the within sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups.

Step 6 – Divide the between groups means square by the within groups mean square to get the F-ratio

Step 6 – Divide the between groups means square by the within groups mean square to get the F-ratio Step 7 – locate the F critical on the F Distribution Table

Step 6 – Divide the between groups means square by the within groups mean square to get the F-ratio Step 7 – locate the F critical on the F Distribution Table Step 8 – determine which is bigger

Step 6 – Divide the between groups means square by the within groups mean square to get the F-ratio Step 7 – locate the F critical on the F Distribution Table Step 8 – determine which is bigger Step 9 – retain or reject the null hypothesis

Step 6 – Divide the between groups means square by the within groups mean square to get the F-ratio Step 7 – locate the F critical on the F Distribution Table Step 8 – determine which is bigger Step 9 – retain or reject the null hypothesis Step 10 – if the null is rejected conduct a posthoc test to see where the differences lie (this will be shown in another presentation)

Step 1 - calculate the sums of squares between groups

Step 1 - calculate the sums of squares between groups Let’s illustrate this statement with a data set :

Step 1 - calculate the sums of squares between groups Let’s illustrate this statement with a data set : football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5 player group means football 18 b-ball 9 soccer 5 average

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5 player group means football 18 b-ball 9 soccer 5 average c c c

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means Another way to state Average of Averages is Mean of Means. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5 player group means football 18 b-ball 9 soccer 5 average

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means Another way to state Average of Averages is Mean of Means. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5 player group means football 18 b-ball 9 soccer 5 mean of means 10.7

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means Another way to state Average of Averages is Mean of Means . Average of Averages or Mean of Means is also called the Grand Mean . football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5 player group means football 18 b-ball 9 soccer 5 mean of means 10.7

Step 1 - calculate the sums of squares between groups First we calculate the mean for each group . Then we create a new column of group means Another way to state Average of Averages is Mean of Means . Average of Averages or Mean of Means is also called the Grand Mean . football player pizza eaten 1 17 slices 2 18 slices 3 19 slices average 18 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices average 9 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices average 5 player group means football 18 b-ball 9 soccer 5 Grand Mean 10.7

Step 1 - calculate the sums of squares between groups Now we will compute the sum of squared deviations like we’ve done before with variance but this time between group means and the grand mean .

Step 1 - calculate the sums of squares between groups Now we will compute the sum of squared deviations like we’ve done before with variance but this time between group means and the grand mean . player group means football 18 b-ball 9 soccer 5 Grand Mean 10.7

Step 1 - calculate the sums of squares between groups Now we will compute the sum of squared deviations like we’ve done before with variance but this time between group means and the grand mean . player group means football 18 b-ball 9 soccer 5 Grand Mean 10.7

Step 1 - calculate the sums of squares between groups Now we will compute the sum of squared deviations like we’ve done before with variance but this time between group means and the grand mean . player group means football 18 b-ball 9 soccer 5 Grand Mean 10.7

Step 1 - calculate the sums of squares between groups Now we will compute the sum of squared deviations like we’ve done before with variance but this time between group means and the grand mean .

Step 1 - calculate the sums of squares between groups Now we will compute the sum of squared deviations like we’ve done before with variance but this time between group means and the grand mean . group means 18 9 5 football b-ball soccer

Step 1 - calculate the sums of squares between groups Now we will compute the sum of squared deviations like we’ve done before with variance but this time between group means and the grand mean . group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer

Step 1 - calculate the sums of squares between groups So, here is the deviation between each of the group means and grand mean. group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer

Step 1 - calculate the sums of squares between groups So, here is the deviation between each of the group means and grand mean. group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = =

Step 1 - calculate the sums of squares between groups Now we square the deviations between groups and grand mean. If we didn’t, when we try to sum the deviations they will come to zero. group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = =

Step 1 - calculate the sums of squares between groups Now we square the deviations between groups and grand mean. If we didn’t, when we try to sum the deviations they will come to zero. group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 =

Step 1 - calculate the sums of squares between groups Finally, we multiply these squared deviations by the number of students in each group. group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 =

Step 1 - calculate the sums of squares between groups Finally, we multiply these squared deviations by the number of students in each group. group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = # in each group 3 3 3 x x x

Step 1 - calculate the sums of squares between groups Why do we do this?

Step 1 - calculate the sums of squares between groups Why do we do this ? We do this so as to provide greater weight to those groups with more students.

Step 1 - calculate the sums of squares between groups Why do we do this ? We do this so as to provide greater weight to those groups with more students. As an example, if there were 6 soccer players, then we would do the following :

Step 1 - calculate the sums of squares between groups Why do we do this ? We do this so as to provide greater weight to those groups with more students. As an example, if there were 6 soccer players, then we would do the following : group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = # in each group 3 3 6 x x x

Step 1 - calculate the sums of squares between groups Why do we do this ? We do this so as to provide greater weight to those groups with more students. As an example, if there were 6 soccer players, then we would do the following : group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = x x x # in each group 3 3 6

Step 1 - calculate the sums of squares between groups Let’s go back to our original data set:

Step 1 - calculate the sums of squares between groups Let’s go back to our original data set: group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = x x x # in each group 3 3 3

Step 1 - calculate the sums of squares between groups Let’s go back to our original data set: group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = x x x # in each group 3 3 3 w eighted sq. dev . 161.3 8.3 96.3 = = =

Step 1 - calculate the sums of squares between groups So, the Sum of the Weighted Squared Deviations between groups and grand mean is: group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = x x x # in each group 3 3 3 w eighted sq. dev . 161.3 8.3 96.3 = = =

Step 1 - calculate the sums of squares between groups So, the Sum of the Weighted Squared Deviations between groups and grand mean is: group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = x x x # in each group 3 3 3 w eighted sq. dev . 161.3 8.3 96.3 266.0 = = = sum

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer.

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 football b-ball soccer

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 5.3 football b-ball soccer grand mean

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 grand mean 5.3 5.3 5.3 – – – football b-ball soccer

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 grand mean 5.3 5.3 5.3 – – – football b-ball soccer deviation - 1.3 - 0.3 1.7 = = =

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 grand mean 5.3 5.3 5.3 – – – football b-ball soccer deviation - 1.3 - 0.3 1.7 = = = squared deviation 1.8 0.1 2.8 2 = 2 = 2 =

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 grand mean 5.3 5.3 5.3 – – – football b-ball soccer deviation - 1.3 - 0.3 1.7 = = = squared deviation 1.8 0.1 2.8 2 = 2 = 2 = x x x # in each group 3 3 3

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 grand mean 5.3 5.3 5.3 – – – football b-ball soccer deviation - 1.3 - 0.3 1.7 = = = squared deviation 1.8 0.1 2.8 2 = 2 = 2 = x x x # in each group 3 3 3 w eighted sq. dev . 5.3 0.3 8.3 = = =

Watch what happens to the sum of squared deviations between group and grand mean when the group means are closer. So let’s say: Football players ate 4 slices; B-ball players ate 5 slices; Soccer players ate 7 slices group means 4 5 7 grand mean 5.3 5.3 5.3 – – – football b-ball soccer deviation - 1.3 - 0.3 1.7 = = = squared deviation 1.8 0.1 2.8 2 = 2 = 2 = x x x # in each group 3 3 3 w eighted sq. dev . 5.3 0.3 8.3 14.0 = = = sum

Notice how when the group means are closer, their sum of squares between groups is much smaller than when they are further apart like our original analysis. (See below)

Notice how when the group means are closer, their sum of squares between groups is much smaller than when they are further apart like our original analysis. (See below) group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = x x x # in each group 3 3 3 w eighted sq. dev . 161.3 8.3 96.3 266.0 = = = sum

The weighted sums of squares between the groups represents how spread apart the levels

The weighted sums of squares between the groups represents how spread apart the levels ( football, basketball, and soccer players )

The weighted sums of squares between the groups represents how spread apart the levels ( football, basketball, and soccer players ) of the independent variable

The weighted sums of squares between the groups represents how spread apart the levels ( football, basketball, and soccer players ) of the independent variable ( athletes )

The weighted sums of squares between the groups represents how spread apart the levels ( football, basketball, and soccer players ) of the independent variable ( athletes ) are from one another.

The weighted sums of squares between the groups represents how spread apart the levels ( football, basketball, and soccer players ) of the independent variable ( athletes ) are from one another . We believe we know the source of the differences among the groups.

The weighted sums of squares between the groups represents how spread apart the levels ( football, basketball, and soccer players ) of the independent variable ( athletes ) are from one another . We believe we know the source of the differences among the groups. In this case we believe that the variance in the amount of pizza eaten is a function of whether an athlete plays football, basketball or soccer. This is our alternative hypothesis .

The weighted sums of squares between the groups represents how spread apart the levels ( football, basketball, and soccer players ) of the independent variable ( athletes ) are from one another . We believe we know the source of the differences among the groups. In this case we believe that the variance in the amount of pizza eaten is a function of whether an athlete plays football, basketball or soccer. This is our alternative hypothesis .

Important note: To simplify the phrase “sum of weighted squares deviations between the group means and grand mean” we simply state: Between Groups Sum of Squares.

Important note: To simplify the phrase “sum of weighted squares deviations between the group means and grand mean” we simply state: Between Groups Sum of Squares .

Important note: To simplify the phrase “sum of weighted squares deviations between the group means and grand mean” we simply state: Between Groups Sum of Squares . Now on to Step 2 – calculate the sums of squares within groups

Step 2 – calculate the sums of squares within groups

Step 2 – calculate the sums of squares within groups On the other hand, the sum of squares within the groups represents variance for which we have not accounted. We don’t know the source of the variance within the groups.

Step 2 – calculate the sums of squares within groups On the other hand, the sum of squares within the groups represents variance for which we have not accounted. We don’t know the source of the variance within the groups. So, we compute the within groups sum of squares.

Step 2 – calculate the sums of squares within groups On the other hand, the sum of squares within the groups represents variance for which we have not accounted. We don’t know the source of the variance within the groups. So, we compute the within groups sum of squares. Let’s think about why do we do this .

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group.

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this :

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this : mean = 5 mean = 9 mean = 18

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this : mean = 5 mean = 9 mean = 18 grand mean = 10.67

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this : (Hint : if this were the case then there most likely would be a significant difference between the means .) mean = 5 mean = 9 mean = 18 grand mean = 10.67

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this : Or, they could be clustered around their means like this (very spread out and overlapping) :

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this : Or, they could be clustered around their means like this (very spread out and overlapping) : mean = 5 mean = 9 mean = 18

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this : Or, they could be clustered around their means like this (very spread out and overlapping) : mean = 5 mean = 9 mean = 18 grand mean = 10.67

Step 2 – calculate the sums of squares within groups Based on the group means alone (2, 5, 9) we still do not know how much variability there is within each group. For all we know, the scores are clustered around their means like this : Or, they could be clustered around their means like this (very spread out and overlapping) : (Hint: if this were the case then there would most likely NOT be a significant difference between the means, because there is too much overlap between the distributions .) mean = 5 mean = 9 mean = 18 grand mean = 10.67

Step 2 – calculate the sums of squares within groups By computing the within groups sums of squares we will be able to consider how narrow or wide these sample distributions are.

Step 2 – calculate the sums of squares within groups By computing the within groups sums of squares we will be able to consider how narrow or wide these sample distributions are.

Step 2 – calculate the sums of squares within groups By computing the within groups sums of squares we will be able to consider how narrow or wide these sample distributions are.

Step 2 – calculate the sums of squares within groups As you recall, our between groups sums of squares value is 266 :

Step 2 – calculate the sums of squares within groups As you recall, our between groups sums of squares value is 266 : group means 18 9 5 grand mean 10.7 10.7 10.7 – – – football b-ball soccer deviation 7.3 - 1.7 - 5.7 = = = squared deviation 53.8 2.8 32.1 2 = 2 = 2 = x x x # in each group 3 3 3 w eighted sq. dev . 161.3 8.3 96.3 266.0 = = = sum

Step 2 – calculate the sums of squares within groups As you recall, our between groups sums of squares value is 266 : There are two ways to compute the within groups sums of squares :

Step 2 – calculate the sums of squares within groups As you recall, our between groups sums of squares value is 266 : There are two ways to compute the within groups sums of squares : The short way

Step 2 – calculate the sums of squares within groups As you recall, our between groups sums of squares value is 266 : There are two ways to compute the within groups sums of squares : The short way The long way

We will begin with the long way.

We will begin with the long way. The long way is calculated by computing the within sums of squares for football players plus the sums of squares for basketball players plus the sums of squares with soccer players.

We will begin with the long way. The long way is calculated by computing the within sums of squares for football players plus the sums of squares for basketball players plus the sums of squares with soccer players.

Now, we compute the sums of squares within football players:

Now, we compute the sums of squares within football players: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices

Now, we compute the sums of squares within football players : then within basketball players: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices

Now, we compute the sums of squares within football players : then within basketball players: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices

Now, we compute the sums of squares within football players : then within basketball players: finally, within soccer players: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices

Now, we compute the sums of squares within football players : then within basketball players: finally, within soccer players: football player pizza eaten 1 17 slices 2 18 slices 3 19 slices basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

We begin with our football players.

We begin with our football players. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices

We begin with our football players. Compute the mean. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices

We begin with our football players. Compute the mean. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. sample mean 18 18 18 football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. Subtract the mean from each player’s slices. sample mean 18 18 18 – – – = = = football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. Subtract the mean from each player’s slices. sample mean 18 18 18 – – – deviation - 1 1 = = = football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. Subtract the mean from each player’s slices. Square each deviation score. sample mean 18 18 18 – – – deviation - 1 1 = = = 2 = 2 = 2 = football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. Subtract the mean from each player’s slices. Square each deviation score. sample mean 18 18 18 – – – deviation - 1 1 = = = squared deviation 1 1 2 = 2 = 2 = football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. Subtract the mean from each player’s slices. Square each deviation score . The sum of the squared deviations for the football player group … sample mean 18 18 18 – – – deviation - 1 1 = = = squared deviation 1 1 2 = 2 = 2 = football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices

We begin with our football players. Compute the mean. Subtract the mean from each player’s slices. Square each deviation score . The sum of the squared deviations for the football player group … sample mean 18 18 18 – – – deviation - 1 1 = = = squared deviation 1 1 2 = 2 = 2 = = football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices sum

We begin with our football players. Compute the mean. Subtract the mean from each player’s slices. Square each deviation score . The sum of the squared deviations for the football player group is 2 . sample mean 18 18 18 – – – deviation - 1 1 = = = squared deviation 1 1 2 2 = 2 = 2 = = football player pizza eaten 1 17 slices 2 18 slices 3 19 slices mean 18 slices sum

Now we will do the same for the basketball player group.

Now we will do the same for the basketball player group. basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices

Now we will do the same for the basketball player group. basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices

Now we will do the same for the basketball player group. sample mean 9 9 9 basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices

Now we will do the same for the basketball player group. sample mean 9 9 9 – – – = = = basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices

Now we will do the same for the basketball player group. sample mean 9 9 9 – – – deviation - 1 1 = = = basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices

Now we will do the same for the basketball player group. sample mean 9 9 9 – – – deviation - 1 1 = = = 2 = 2 = 2 = basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices

Now we will do the same for the basketball player group. sample mean 9 9 9 – – – deviation - 1 1 = = = squared deviation 1 1 2 = 2 = 2 = basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices

Now we will do the same for the basketball player group. sample mean 9 9 9 – – – deviation - 1 1 = = = squared deviation 1 1 2 = 2 = 2 = = basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices sum

Now we will do the same for the basketball player group. sample mean 9 9 9 – – – deviation - 1 1 = = = squared deviation 1 1 2 2 = 2 = 2 = = basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices sum

Now we will do the same for the basketball player group. So the sum of squares within the basketball player group is 2 . sample mean 9 9 9 – – – deviation - 1 1 = = = squared deviation 1 1 2 2 = 2 = 2 = = basketball player pizza eaten 1 8 slices 2 9 slices 3 10 slices mean 9 slices sum

Now we will do the same for the soccer player group.

Now we will do the same for the soccer player group. soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices

Now we will do the same for the soccer player group. soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices

Now we will do the same for the soccer player group. sample mean 5 5 5 soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices

Now we will do the same for the soccer player group. sample mean 5 5 5 – – – = = = soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices

Now we will do the same for the soccer player group. sample mean 5 5 5 – – – deviation - 4 4 = = = soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices

Now we will do the same for the soccer player group. sample mean 5 5 5 – – – deviation - 4 4 = = = 2 = 2 = 2 = soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices

Now we will do the same for the soccer player group. sample mean 5 5 5 – – – deviation - 4 4 = = = squared deviation 16 16 2 = 2 = 2 = soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices

Now we will do the same for the soccer player group. sample mean 5 5 5 – – – deviation - 4 4 = = = squared deviation 16 16 2 = 2 = 2 = = soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices sum

Now we will do the same for the soccer player group. sample mean 5 5 5 – – – deviation - 4 4 = = = squared deviation 16 16 32 2 = 2 = 2 = = soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices sum

Now we will do the same for the soccer player group. So the sum of squares within the soccer player group is 32 . sample mean 5 5 5 – – – deviation - 4 4 = = = squared deviation 16 16 32 2 = 2 = 2 = = soccer player pizza eaten 1 1 slices 2 5 slices 3 9 slices mean 5 slices sum

Let’s summarize the sum of squares within groups:

Let’s summarize the sum of squares within groups: Sum of squares within the football player group = 2 .

Let’s summarize the sum of squares within groups: Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 .

Let’s summarize the sum of squares within groups: Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 .

Let’s summarize the sum of squares within groups: Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 . Now lets’ sum up these within groups sum of squares

Let’s summarize the sum of squares within groups: Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 . Now lets’ sum up these within groups sum of squares Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 .

Let’s summarize the sum of squares within groups: Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 . Now lets’ sum up these within groups sum of squares Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 .

Let’s summarize the sum of squares within groups: Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 . Now lets’ sum up these within groups sum of squares Sum of squares within the football player group = 2 . Sum of squares within the basketball player group = 2 . Sum of squares within the soccer player group = 32 . 36 .

Another way to calculate the total sums of squares is to put all of the scores in a column:

Another way to calculate the total sums of squares is to put all of the scores in a column: players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9

Calculate the grand mean: players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9

Calculate the grand mean: players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7

And subtract players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7

And subtract the grand mean from them: players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7 grand mean 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 – – – – – – – – –

Which equals players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7 grand mean 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 – – – – – – – – –

Which equals the deviation players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7 grand mean 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 – – – – – – – – – deviation 6.3 7.3 8.3 - 2.7 - 1.7 - 0.7 - 9.7 - 5.7 - 1.7 = = = = = = = = =

Then square those deviations players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7 grand mean 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 – – – – – – – – – deviation 6.3 7.3 8.3 - 2.7 - 1.7 - 0.7 - 9.7 - 5.7 - 1.7 = = = = = = = = =

Then square those deviations players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7 grand mean 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 – – – – – – – – – deviation 6.3 7.3 8.3 - 2.7 - 1.7 - 0.7 - 9.7 - 5.7 - 1.7 = = = = = = = = = deviation 39.7 53.3 68.9 7.3 2.9 0.5 94.1 32.5 2.9 2 = 2 = 2 = 2 = 2 = 2 = 2 = 2 = 2 =

Then sum up the squared deviations from the mean and you get the total sums of squares players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7 grand mean 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 – – – – – – – – – deviation 6.3 7.3 8.3 - 2.7 - 1.7 - 0.7 - 9.7 - 5.7 - 1.7 = = = = = = = = = deviation 39.7 53.3 68.9 7.3 2.9 0.5 94.1 32.5 2.9 2 = 2 = 2 = 2 = 2 = 2 = 2 = 2 = 2 =

Then sum up the squared deviations from the mean and you get the total sums of squares players slices F1 17 F2 18 F3 19 B1 8 B2 9 B3 10 S1 1 S2 5 S3 9 grand mean 10.7 grand mean 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 10.7 – – – – – – – – – deviation 6.3 7.3 8.3 - 2.7 - 1.7 - 0.7 - 9.7 - 5.7 - 1.7 = = = = = = = = = deviation 39.7 53.3 68.9 7.3 2.9 0.5 94.1 32.5 2.9 302.0 2 = 2 = 2 = 2 = 2 = 2 = 2 = 2 = 2 = 2 = sum

You have probably figured out the short way to compute the within groups sums of squares:

You have probably figured out the short way to compute the within groups sums of squares:   Compute the between group sum of squares (266.0)

You have probably figured out the short way to compute the within groups sums of squares:   Compute the between group sum of squares (266.0) Subtract it from the total sums of squares (302.0 – 266.0 ), which equals the within group sums of squares (36.0)

Step 3 – place the sums of squares values in an ANOVA table As you recall

Step 3 – place the sums of squares values in an ANOVA table To calculate total sums of squares we simply add the between groups and within groups sums of squares.

Step 3 – place the sums of squares values in an ANOVA table To calculate total sums of squares we simply add the between groups and within groups sums of squares. Group Sum of Squares Between groups 266.0 Within groups 36.0 total 302.0

Step 3 – place the sums of squares values in an ANOVA table To calculate total sums of squares we simply add the between groups and within groups sums of squares. Group Sum of Squares Between groups 266.0 Within groups 36.0 total 302.0

Step 4 – determine the degrees of freedom To calcul

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups.

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom :

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom # of groups minus one equals

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals # of persons minus # of groups equals

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 # of persons minus # of groups equals 3

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – # of persons minus # of groups equals 3

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – 1 # of persons minus # of groups equals 3

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – 1 = # of persons minus # of groups equals 3

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – 1 = 2 # of persons minus # of groups equals 3

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – 1 = 2 # of persons minus # of groups equals 9 3

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – 1 = 2 # of persons minus # of groups equals 9 – 3

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – 1 = 2 # of persons minus # of groups equals 9 – 3 =

Step 4 – determine the degrees of freedom We then divide the sum of squares by their degrees of freedom to get the mean square value for both groups . Here is how we calculate the number of degrees of freedom : Sums of Squares BETWEEN Groups Degrees of Freedom Sums of Squares WITHIN Groups Degrees of Freedom # of groups minus one equals 3 – 1 = 2 # of persons minus # of groups equals 9 – 3 = 6

Step 4 – determine the degrees of freedom Back to the table:

Step 4 – determine the degrees of freedom Back to the table: Group Sum of Squares Deg. of F. Between groups 266.0 Within groups 36.0

Step 4 – determine the degrees of freedom Back to the table: Group Sum of Squares Deg. of F. Between groups 266.0 2 Within groups 36.0 6

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. Back to the table:

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. The mean square is like an average. So dividing the sum of squares by the degrees freedom is like dividing a data set total (1, 2, 3, 4, 5 = 15) by the number of data points (5). (15/5 = 3). But in this case, the data points are the squared deviations. Back to the table:

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. For example, you may remember that the weighted squared deviations between groups were: 161.3 8.3 96.3 Back to the table:

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. For example, you may remember that the weighted squared deviations between groups were: 161.3 8.3 96.3 To take the average of this you would divide it by three; however, in this case we divide it by the degrees of freedom (why we do this is explained in another presentation) .

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. So 161.3 + 8.3 + 96.3 divided by 2 degrees of freedom =

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. So 161.3 + 8.3 + 96.3 divided by 2 degrees of freedom = 132.95

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. So 161.3 + 8.3 + 96.3 divided by 2 degrees of freedom = 132.95 As you may also remember, the variance is the average of the sums of squared deviations. Well, guess what? The mean square is essentially the variance of the between group.

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. Group Sum of Squares Deg. of F. Mean Square Between groups 266.0 2 Within groups 36.0 6

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. Group Sum of Squares Deg. of F. Mean Square Between groups 266.0 2 133.0 Within groups 36.0 6

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. Group Sum of Squares Deg. of F. Mean Square Between groups 266.0 2 133.0 Within groups 36.0 6 6.0

Step 5 – divide the between and the within groups sums of squares by their corresponding degrees of freedom to get the means square values for both the between and within groups. Group Sum of Squares Deg. of F. Mean Square Between groups 266.0 2 133.0 Within groups 36.0 6 6.0 Also known as the variance

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio Finally, we divide the Between Groups Mean Square (133.0) by the Within Groups Mean Square (6.0) to get the F-ratio.

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio Finally, we divide the Between Groups Mean Square (133.0) by the Within Groups Mean Square (6.0) to get the F-ratio. Group Sum of Squares Deg. of F. Mean Square F-ratio Between groups 266.0 2 133.0 Within groups 36.0 6 6.0

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio Finally, we divide the Between Groups Mean Square (133.0) by the Within Groups Mean Square (6.0) to get the F-ratio. Group Sum of Squares Deg. of F. Mean Square F-ratio Between groups 266.0 2 133.0 Within groups 36.0 6 6.0

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio Finally, we divide the Between Groups Mean Square (133.0) by the Within Groups Mean Square (6.0) to get the F-ratio. Group Sum of Squares Deg. of F. Mean Square F-ratio Between groups 266.0 2 133.0 22.17 Within groups 36.0 6 6.0

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio What is the F-ratio?

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio What is the F-ratio? As just explained, the ratio of the mean sum of squares between groups to the mean sum of squares within groups generates an F-statistic.

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio What is the F-ratio? As just explained, the ratio of the mean sum of squares between groups to the mean sum of squares within groups generates an F-statistic. It is this F-statistic that we will use to test our null hypothesis :

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio What is the F-ratio? As just explained, the ratio of the mean sum of squares between groups to the mean sum of squares within groups generates an F-statistic. It is this F-statistic that we will use to test our null hypothesis : There is no statistically significant difference in the quantity of pizza consumed in one sitting among football, basketball, and soccer players . If

Step 6 – divide the between groups means square by the within groups mean square to get the F-ratio What is the F-ratio? As just explained, the ratio of the mean sum of squares between groups to the mean sum of squares within groups generates an F-statistic. It is this F-statistic that we will use to test our null hypothesis : There is no statistically significant difference in the quantity of pizza consumed in one sitting among football, basketball, and soccer players . If the F critical value is greater than our F-statistic of 22.17

Step 7 – locate the F critical on the F Distribution Table value

Step 7 – locate the F critical on the F Distribution Table This is done by using the number of degrees of freedom in the denominator (within groups = 6) and the degrees of freedom in the numerator (between groups =2) and determining where these two intersect .

Step 7 – locate the F critical on the F Distribution Table This

Step 7 – locate the F critical on the F Distribution Table This

Step 7 – locate the F critical on the F Distribution Table This

Step 7 – locate the F critical on the F Distribution Table This

Step 7 – locate the F critical on the F Distribution Table This And we find the F critical value, which is 5.14

Step 8 – determine which is bigger This

Step 8 – determine which is bigger It just so happens that our F-ratio is 22.17, which means it is bigger than the F critical value.

Step 9 – retain or reject the null hypothesis It

Step 9 – retain or reject the null hypothesis We will therefore reject the null hypothesis at the .05 alpha level, or in other words, with a probability of Type-I error less than .05.

Step 9 – retain or reject the null hypothesis We will therefore reject the null hypothesis at the .05 alpha level, or in other words, with a probability of Type-I error less than .05 . If the F-ratio had been 4.38, then we would fail to reject or in other words, retain the null hypothesis .

Do you see what makes a small or large F-ratio value?

Do you see what makes a small or large F-ratio value ? Here is what makes the difference:

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together,

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, mean = 5 mean = 9 mean = 18

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller mean = 5 mean = 9 mean = 18

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller Group Sum of Squares Deg. of F. Mean Square F-ratio Between groups 12.0 2 6.0 Within groups ?

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller . And to compound the situation, if the within groups are large (meaning there is a lot of difference or variability within groups ),

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller . And to compound the situation, if the within groups are large (meaning there is a lot of difference or variability within groups ), mean = 5 mean = 9 mean = 18

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller . And to compound the situation, if the within groups are large (meaning there is a lot of difference or variability within groups), then the within groups mean square will be smaller mean = 5 mean = 9 mean = 18

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller . And to compound the situation, if the within groups are large (meaning there is a lot of difference or variability within groups), then the within groups mean square will be smaller, Group Sum of Squares Deg. of F. Mean Square F-ratio Between groups 12.0 2 6.0 Within groups 120.0 6 20.0

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller . And to compound the situation, if the within groups are large (meaning there is a lot of difference or variability within groups), then the within groups mean square will be smaller, then the F ratio will be extremely smaller than what we found with the pizza eating athlete groups : Group Sum of Squares Deg. of F. Mean Square F-ratio Between groups 12.0 2 6.0 Within groups 120.0 6 20.0

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller . And to compound the situation, if the within groups are large (meaning there is a lot of difference or variability within groups), then the within groups mean square will be smaller, then the F ratio will be extremely smaller than what we found with the pizza eating athlete groups : Group Sum of Squares Deg. of F. Mean Square F-ratio Between groups 12.0 2 6.0 0.3 Within groups 120.0 6 20.0

Do you see what makes a small or large F-ratio value ? Here is what makes the difference : When the groups or their means are closer together, then the between groups mean square will be smaller . And to compound the situation, if the within groups are large (meaning there is a lot of difference or variability within groups), then the within groups mean square will be smaller, then the F ratio will be extremely smaller than what we found with the pizza eating athlete groups : With an F critical still at 5.14 and an F ratio of 0.3, we would retain the null hypothesis .

Note: if there are really no differences among three groups in terms of some dependent variable, the mean sum of squares between groups will be very similar to the mean sum of squares within groups.

Note: if there are really no differences among three groups in terms of some dependent variable, the mean sum of squares between groups will be very similar to the mean sum of squares within groups. The ratio of such mean squares will be close to 1. As the differences among the groups increases, the ratio of mean sums of squares will increase above 1.

Note: if there are really no differences among three groups in terms of some dependent variable, the mean sum of squares between groups will be very similar to the mean sum of squares within groups. The ratio of such mean squares will be close to 1. As the differences among the groups increases, the ratio of mean sums of squares will increase above 1. At some point, the ratio of mean sums of squares (F) will be large enough that we will conclude that there are probably systematic differences among the groups.

Note: if there are really no differences among three groups in terms of some dependent variable, the mean sum of squares between groups will be very similar to the mean sum of squares within groups. The ratio of such mean squares will be close to 1. As the differences among the groups increases, the ratio of mean sums of squares will increase above 1. At some point, the ratio of mean sums of squares (F) will be large enough that we will conclude that there are probably systematic differences among the groups. We will reject the null hypothesis of no differences and investigate where the differences occur.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). We will

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). Is the difference between football and basketball players or is it between football and soccer players or basketball and soccer players or all three? The post hoc will provide this information. This will be shown in another presentation.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). In order to discover what pattern of differences generated the significant F- statistic, we conduct post hoc tests, which use different logics and calculations, some of which are more conservative than others in how they protect against cumulative Type-I error across multiple tests. In essence, each post hoc test is comparing each group to every other group in a series of two-group tests. The results of the series of two-group tests identifies which of the many possible patterns generated the significant F-statistic and where the differences lie.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). In order to discover what pattern of differences generated the significant F- statistic, we conduct post hoc tests, which use different logics and calculations, some of which are more conservative than others in how they protect against cumulative Type-I error across multiple tests. In essence, each post hoc test is comparing each group to every other group in a series of two-group tests. The results of the series of two-group tests identifies which of the many possible patterns generated the significant F-statistic and where the differences lie.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). In order to discover what pattern of differences generated the significant F- statistic, we conduct post hoc tests, which use different logics and calculations, some of which are more conservative than others in how they protect against cumulative Type-I error across multiple tests. In essence, each post hoc test is comparing each group to every other group in a series of two-group tests. The results of the series of two-group tests identifies which of the many possible patterns generated the significant F-statistic and where the differences lie.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). In order to discover what pattern of differences generated the significant F- statistic, we conduct post hoc tests, which use different logics and calculations, some of which are more conservative than others in how they protect against cumulative Type-I error across multiple tests. In essence, each post hoc test is comparing each group to every other group in a series of two-group tests. The results of the series of two-group tests identifies which of the many possible patterns generated the significant F-statistic and where the differences lie.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). In order to discover what pattern of differences generated the significant F- statistic, we conduct post hoc tests, which use different logics and calculations, some of which are more conservative than others in how they protect against cumulative Type-I error across multiple tests. In essence, each post hoc test is comparing each group to every other group in a series of two-group tests. The results of the series of two-group tests identifies which of the many possible patterns generated the significant F-statistic and where the differences lie.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). In order to discover what pattern of differences generated the significant F- statistic, we conduct post hoc tests, which use different logics and calculations, some of which are more conservative than others in how they protect against cumulative Type-I error across multiple tests. In essence, each post hoc test is comparing each group to every other group in a series of two-group tests. The results of the series of two-group tests identifies which of the many possible patterns generated the significant F-statistic and where the differences lie.

Step 10 – if the null is rejected, conduct a post hoc test to see where the differences lie (this will be shown in another presentation ). In order to discover what pattern of differences generated the significant F- statistic, we conduct post hoc tests, which use different logics and calculations, some of which are more conservative than others in how they protect against cumulative Type-I error across multiple tests. In essence, each post hoc test is comparing each group to every other group in a series of two-group tests. The results of the series of two-group tests identifies which of the many possible patterns generated the significant F-statistic and where the differences lie.

In summary, One way Analysis of Variance is a method that will help you determine if there is a statistically significant difference between more than two group means. It does not tell you which groups differ, just that they do. Further testing will determine which groups differ.

End of Presentation
Tags