Variability

8,080 views 30 slides Jul 06, 2017
Slide 1
Slide 1 of 30
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30

About This Presentation

Behavioral Statistics


Slide Content

VARIABILITY
Behavioral Statistics
Summer 2017
Dr. Germano

Variability
•Provides a quantitative measure of the degree to which
scores in a distribution are spread out or clustered
together
•Allows a determination of how well an individual score (or
group of scores) represent the entire distribution
•Describes our distribution
•Usually accompanies a measure of central tendency
•Typically in terms of distance from the mean or from other scores

Which dataset
has more
variability?0
2
4
6
8
10
12
0 1 2 3 4
Sample
Score

Variability Size Matters
•When variability is small
(Experiment A), it is easy to
see a difference between
distributions
•When variability is large
(Experiment B), differences
between distributions may
be obscured
Difference! ?

Which distribution has more variability?
The red one?
The green one?
The blue one?

Measures of Variability
•Variability can be measured with
•Range
•Standard deviation
•Variance
•Variability is determined by measuring distance
•The distancebetween score X
1and the mean of the distribution
•Remember central tendency
•Mean, Median, Mode
•There is no “right” or “wrong” measure of central tendency, but the
shape of the distribution will affect how you interpret these statistics
Related concepts

The Range
•The difference between the largest Xvalue and the
smallest Xvalue
range= X
max-X
min
IfX= 2, 4, 6, 1, 8, 10
Thenrange= ?
(remember the real limits!)
range= 10.5 –0.5 = 10
If we don’t account for real limits: range= 10 –1 + 1
•When scores are whole numbers or discrete variables with numerical
scores, the range tells us the number of measurement categories
1, 2, 3, 4, 5, 6, 7, 8, 9, and 10

Is the range a
reliable measure
of variability?
Scores
40
23
19
23
22
21
18
19.5
20
23
21.5
18.5
17.5
160
5
10
15
20
25
30
35
40
45
0 1 2 3
Sample 1
(score x = 40 included)
range= 40.5 –15.5 = 25
or
range= 40 –16 + 1 = 25
Sample 2
(score x = 40 removed)
range= 23.5 -15.5 = 8
or
range= 23 –16 + 1 = 8
Scores (highlow)
40
23
23
23
22
21.5
21
20
19.5
19
18.5
18
17.5
16

The Standard Deviation (SD)
Standard = average
Deviation* = distance from the mean
•Approximates the average distance of scores from the
mean
•Provides the most information
•Takes into account all scores in the distribution
•Most commonly used measure of variability
*Deviation is calculated as: X-μ

SD: A Conceptual Walkthrough
X X –μ
(calc)
X -μ
2(2 –6) -4
4(4 –6) -2
6(6 –6) 0
8(8 –6) 2
10(10 –6) 4
ΣX–μ= 0
We are interested in the relationship
of each score to the mean. Thus,
sign (+/-) is important.
oA score of 4 is below the mean by 2
points (-2), and a score of 10 is above
the mean by 4 points (+4).
X= 2, 4, 6, 8, 10
N= 5 μ= 6
ΣX–μ= 0. Will this always be the case?
Yes. If the mean is the balance
point, the sum of scores above the
mean will be exactly equal to the
sum of scores below the mean.

SD: A Conceptual Walkthrough (cont’d)
XX –μ(X –μ)
2
2 -416
4 -2 4
6 0 0
8 2 4
10 416
Σ(X–μ)
2
= 40
If we square each individual deviation
score, we get rid of the sign (+/-)
•We now have a value we can take the
average of
X= 2, 4, 6, 8, 10
N= 5 μ= 6
40 ÷5 = 8 variance
Population variance: the mean of the
squareddeviation scores
Remember that we are not interested
in squareddistance.
•To convert back into our original measure
of distance:
Standard deviation: the square root
of variance
√8 = 2.83 SD

The Calculation
of Variance and
Standard
Deviation
Step 1 Step 2
Step 3
Step 4
Voila! Standard
Deviation!
Variance

SD: A Conceptual Walkthrough (cont’d)
XX –μ(X –μ)
2
2 -416
4 -2 4
6 0 0
8 2 4
10 416
Σ(X–μ)
2
= 40
SD provides the measure of average
distance of scores from the mean
•Given this set of scores, does this seem
reasonable?
•What if I obtained a standard deviation
value of 20?
X= 2, 4, 6, 8, 10
N= 5 μ= 6
SD= 2.83

Formulas for Calculating Variance and SD
•First a note on notation:
•You will use your calculator to check
your work
•DO NOT freak out
•All formulas will be given to you for exams
•This is a learning tool –so you “get” the
concepts of variance and SD
Sample
SD: s
Variance:s²
Population
SD: σ
Variance: σ²

Sum of Squares (SS)
•SS= the sum of squared deviation scores
•The definitional formula expresses SSin the conceptual
way explained previously:

•The computational formula provides the same answer, but
is easier to use when the mean is not a whole number
2
)(XSS N
X
XSS
2
2
)(


A Demonstration of Both SSMethods
X X
2
X–μ (X -μ)
2
2 4 (2 –6) = -4 16
4 16 (4 –6) = -2 4
6 36 (6 –6) = 0 0
8 64 (8–6) = 2 4
10 100 (10 –6) =4 16
ΣX= 30 ΣX
2
= 220 Σ(X –μ)= 0 Σ(X –μ)
2
= 40
Definitional
Formula40180220
5
)30(
220
)(
22
2



N
X
XSS
Computational
Formula

Variance and Standard Deviation
Sample
•Note that these explicitly show the definitional SS formula.
You can also solve using the computational SS formula.
Population2
2
2
2
11
)(
.
11
)(
s
n
SS
n
MX
s
DeviationStd
n
SS
n
MX
s
Variance












 Variance
s
2
=
(X-m)
2
å
N
=
SS
N
Std.Deviation
s=
(X-m)
2
å
N
=
SS
N
=s
2

Why are the formulas different?
Sample
•We want our sample statistic to be as representative of
the population parameter as possible.
•However, our samples will often be less variable than the
population
•We adjust for the biased estimate of sample variability by dividing
by n–1 (degrees of freedom) instead of N
PopulationStd.Deviation
s=
SS
n-1 Std.Deviation
s=
SS
N Variance
s
2
=
SS
n-1 Variance
s
2
=
SS
N

Degrees of Freedom (df)
Determine the number of scores in the sample
that are independent and free to vary
•If n= 3 and M = 5
•Then ΣXmust equal what?
•M = ΣX ÷n 5 = ΣX ÷3
•The first two scores have no restrictions
•They are independent values and could be any value
•The third score is restricted
•Can only be one value, based on the sum of the first
two scores (2 + 9 = 11)
•X
3MUST be 4
X
2
9
?
n= 3
M= 5
ΣX= 15
3 ×5 = 15
This score has no “freedom”
ΣX= ?

Understanding
SDGraphically
68.26%
94.46%
99.73%

Why is Knowing Variability Important?
Begin to think about how you might compare two
distributions, and how these factors might play a role in
your conclusions

Comprehension Check
If you have two samples:
•Sample 1: n= 25, M= 80
•Sample 2: n= 25, M= 80
Does a score of 85 mean the same thing?
•Consider if for sample 1: s = 1 and for sample 2: s= 10
•Which situation would a score of 85 be a “better” score?
•A score of 85 in sample one means that this score is 5 standard
deviations above the mean
•A score in sample two means that this score is ½ a standard
deviation above the mean

Comprehension
Check
Sample 1
n= 25, M= 80
s= 1
Sample 2
n= 25, M= 80
s= 10
80
In which situation would a score of 85 be a “better-
than-average” score?
Sample 178 79 81 82 8377
60 70 90 100 11050 Sample 2

Properties of the Standard Deviation
X
1
2
3
4
5
6
3.5
1.87
ForX
n= 6
M= 3.5
SD= 1.87

Properties of the Standard Deviation
X
1
2
3
4
5
6
X+ 1
2
3
4
5
6
7
3.5
1.87
4.5
1.87
ForX
n= 6
M= 3.5
SD= 1.87
ForX + 1
n= 6
M= 4.5
SD= 1.87

Properties of the Standard Deviation
X
1
2
3
4
5
6
X+ 1
2
3
4
5
6
7
•Adding a constant to each score results in
the addition of the same constant to the M,
but no change in the SD (sor σ)
4.5
1.87
ForX
n= 6
M= 3.5
SD= 1.87
ForX + 1
n= 6
M= 4.5
SD= 1.87

Properties of the Standard Deviation
3.5
1.87
ForX
n= 6
M= 3.5
SD= 1.87
X
1
2
3
4
5
6

Properties of the Standard Deviation
3.5
1.87
7
3.74
ForX
n= 6
M= 3.5
SD= 1.87
ForX ×2
n= 6
M= 7
SD= 3.74
X ×2
2
4
6
8
10
12
X
1
2
3
4
5
6

Properties of the Standard Deviation
•Multiplying every score by a constant results
in the multiplication by the same constant of
the M, andmultiplication by the same
constant of the SD (sor σ)
7
3.74
ForX
n= 6
M= 3.5
SD= 1.87
ForX ×2
n= 6
M= 7
SD= 3.74
X ×2
2
4
6
8
10
12
X
1
2
3
4
5
6

Properties of
the Standard
Deviation
Why does the
standard deviation
change when we
multiply, but not when
we add?0
1
2
3
4
5
6
0 2 4 6 8
Adding +1 to every score
Multiplying every score by 20
1
2
3
4
5
6
0 2 4 6 8 10 12 14
XX+ 1X ×2
1 2 2
2 3 4
3 4 6
4 5 8
5 6 10
6 7 12
Tags