Chapter 10: Statistical Inferences About Two Populations 1
Chapter 10
Statistical Inferences about Two Populations
LEARNING OBJECTIVES
The general focus of Chapter 10 is on testing hypotheses and constructing confidence
intervals about parameters from two populations, thereby enabling you to
1. Test hypotheses and construct confidence intervals about the difference in two
population means using the z statistic.
2. Test hypotheses and establish confidence intervals about the difference in two
population means using the t statistic.
3. Test hypotheses and construct confidence intervals about the difference in two
related populations when the differences are normally distributed.
4. Test hypotheses and construct confidence intervals about the difference in two
population proportions.
5. Test hypotheses and construct confidence intervals about two population
variances when the two populations are normally distributed.
CHAPTER TEACHING STRATEGY
The major emphasis of chapter 10 is on analyzing data from two samples. The
student should be ready to deal with this topic given that he/she has tested hypotheses and
computed confidence intervals in previous chapters on single sample data.
The z test for analyzing the differences in two sample means is presented here.
Conceptually, this is not radically different than the z test for a single sample mean shown
initially in Chapter 7. In analyzing the differences in two sample means where the
population variances are unknown, if it can be assumed that the populations are normally
distributed, a t test for independent samples can be used. There are two different
Chapter 10: Statistical Inferences About Two Populations 2
formulas given in the chapter to conduct this t test. One version uses a "pooled" estimate
of the population variance and assumes that the population variances are equal. The
other version does not assume equal population variances and is simpler to compute.
However, the degrees of freedom formula for this version is quite complex.
A t test is also included for related (non independent) samples. It is important that
the student be able to recognize when two samples are related and when they are
independent. The first portion of section 10.3 addresses this issue. To underscore the
potential difference in the outcome of the two techniques, it is sometimes valuable to
analyze some related measures data with both techniques and demonstrate that the results
and conclusions are usually quite different. You can have your students work problems
like this using both techniques to help them understand the differences between the two
tests (independent and dependent t tests) and the different outcomes they will obtain.
A z test of proportions for two samples is presented here along with an F test for
two population variances. This is a good place to introduce the student to the F
distribution in preparation for analysis of variance in Chapter 11. The student will begin
to understand that the F values have two different degrees of freedom. The F distribution
tables are upper tailed only. For this reason, formula 10.14 is given in the chapter to be
used to compute lower tailed F values for two-tailed tests.
CHAPTER OUTLINE
10.1 Hypothesis Testing and Confidence Intervals about the Difference in Two Means
using the z Statistic (population variances known)
Hypothesis Testing
Confidence Intervals
Using the Computer to Test Hypotheses about the Difference in Two
Population Means Using the z Test
10.2 Hypothesis Testing and Confidence Intervals about the Difference in Two Means:
Independent Samples and Population Variances Unknown
Hypothesis Testing
Using the Computer to Test Hypotheses and Constr uct Confidence
Intervals about the Difference in Two Popu lation Means Using the t
Test
Confidence Intervals
10.3 Statistical Inferences For Two Related Populations
Hypothesis Testing
Using the Computer to Make Statistical Inference s about Two Related
Populations
Confidence Intervals
Chapter 10: Statistical Inferences About Two Populations 3
10.4 Statistical Inferences About Two Population Proportions, p
1 p2
Hypothesis Testing
Confidence Intervals
Using the Computer to Analyze the Difference in Two Proportions
10.5 Testing Hypotheses About Two Population Variances
Using the Computer to Test Hypotheses about Two Population Variances
KEY TERMS
Dependent Samples Independent Samples
F Distribution Matched-Pairs Test
F Value Related Measures
SOLUTIONS TO PROBLEMS IN CHAPTER 10
10.1 Sample 1
Sample 2
x 1 = 51.3 x2 = 53.2
s1
2 = 52 s2
2 = 60
n1 = 32 n2 = 32
a) H
o: µ1 - µ2 = 0
H
a: µ1 - µ2 < 0
For one-tail test, a = .10
z.10 = -1.28
z =
32
60
32
52
)0()2.533.51()()(
2
2
2
1
2
1
21
21
+
--
=
+
---
nn
xx
ss
mm
= -1.02
Since the observed z = -1.02 > z
c = -1.645, the decision is to fail to reject the null
hypothesis
.
Chapter 10: Statistical Inferences About Two Populations 4
b) Critical value method:
z
c =
2
2
2
1
2
1
21
21
)()(
nn
xx
c
ss
mm
+
---
-1.645 =
32
60
32
52
)0()(
21
+
--
cxx
(
x
1 - x
2)c = -3.08
c) The area for z = -1.02 using Table A.5 is .3461.
The p-value is .5000 - .3461 =
.1539
10.2 Sample 1
Sample 2
n
1 = 32 n 2 = 31
x1 = 70.4 x2 = 68.7
s1 = 5.76 s2 = 6.1
For a 90% C.I., z
.05 = 1.645
2
2
2
1
2
1
21
)(
nn
zxx
ss
+±-
(70.4) 68.7) +
1.645
31
1.6
32
76.5
22
+
1.7 ± 2.465
-.76 <
µ
1 - µ
2 < 4.16
Chapter 10: Statistical Inferences About Two Populations 5
10.3 a) Sample 1 Sample 2
x1 = 88.23 x2 = 81.2
s1
2 = 22.74 s2
2 = 26.65
n
1 = 30 n 2 = 30
H
o: µ 1 - µ2 = 0
H
a: µ 1 - µ2 ¹ 0
For two-tail test, use
a/2 = .01 z .01 = +
2.33
z =
30
65.26
30
74.22
)0()2.8123.88()()(
2
2
2
1
2
1
21
21
+
--
=
+
---
nn
xx
ss
mm
= 5.48
Since the observed z = 5.48 > z
.01 = 2.33, the decision is to reject the null
hypothesis.
b)
2
2
2
1
2
1
21
)(
nn
zxx
ss
+±-
(88.23 81.2) +
2.33
30
65.26
30
74.22
+
7.03 + 2.99
4.04 <
mmmm < 10.02
This supports the decision made in a) to reject the null hypothesis because
zero is not in the interval.
10.4 Computers/electronics Food/Beverage
x1 = 1.96 x2 = 3.02
s1
2 = 1.0188 s2
2 = 0.9180
n1 = 50 n2 = 50
H
o: µ1 - µ2 = 0
H
a: µ1 - µ2 ¹ 0
For two-tail test, a/2 = .005
z.005 = ±2.575
Chapter 10: Statistical Inferences About Two Populations 6
z =
50
9180.0
50
0188.1
)0()02.396.1()()(
2
2
2
1
2
1
21
21
+
--
=
+
---
nn
xx
ss
mm
= -5.39
Since the observed z = -5.39 < z
c = -2.575, the decision is to reject the null
hypothesis.
10.5 A
B
n
1 = 40 n 2 = 37
x1 = 5.3 x2 = 6.5
s1
2 = 1.99 s2
2 = 2.36
For a 95% C.I., z
.025 = 1.96
2
2
2
1
2
1
21
)(
nn
zxx
ss
+±-
(5.3 6.5) +
1.96
37
36.2
40
99.1
+
-1.2 ± .66 -1.86 <
mmmm < -.54
The results indicate that we are 95% confident that, on average, Plumber B does
between 0.54 and 1.86 more jobs per day than Plumber A. Since zero does not lie
in this interval, we are confident that there is a difference between Plumber A and
Plumber B.
10.6 Managers Specialty
n1 = 35 n2 = 41
x1 = 1.84 x2 = 1.99
s1 = .38 s2 = .51
for a 98% C.I.,
z.01 = 2.33
2
2
2
1
2
1
21
)(
nn
zxx
ss
+±-
Chapter 10: Statistical Inferences About Two Populations 7
(1.84 - 1.99) ± 2.33
41
51.
35
38.
22
+
-.15 ± .2384
-.3884 <
µ
1 - µ
2 < .0884
Point Estimate = -.15
Hypothesis Test:
1) H
o: µ1 - µ2 = 0
H
a: µ1 - µ2 ¹ 0
2) z =
2
2
2
1
2
1
21
21
)()(
nn
xx
ss
mm
+
---
3) a = .02
4) For a two-tailed test, z
.01 = + 2.33. If the observed z value is greater than 2.33
or less than -2.33, then the decision will be to reject the null hypothesis.
Since the observed t = 4.91 > t
.025,45 = 2.021, the decision is to reject the null
hypothesis.
10.17 Let Boston be group 1
1) H
o: µ1 - µ2 = 0
H
a: µ1 - µ2 > 0
2) t =
2121
2
2
21
2
1
21
2
1
11
2
)1()1(
)()(
nnnn
nsns
xx
+
-+
-+-
---
mm
3) a = .01
4) For a one-tailed test and df = 8 + 9 - 2 = 15, t
.01,15 = 2.602. If the observed value
of t is greater than 2.602, the decision is to reject the null hypothesis.
5) Boston
Dallas
n
1 = 8 n 2 = 9
x1 = 47 x2 = 44
s
1 = 3 s 2 = 3
Chapter 10: Statistical Inferences About Two Populations 14
H
0: s1
2 = s2
2 a = .10 a/2 = .05
H
a: s1
2 ¹ s2
2
Upper tail critical F value = F
.05,9,9 = 3.18
Lower tail critical F value = F
.95,9,9 = 0.314
F =
0023378.
0018989.
2
2
2
1
=
s
s
= 0.81
Since the observed F = 0.81 is greater than the lower tail critical value of 0.314
and less than the upper tail critical value of 3.18, the decision is to
fail
to reject the null hypothesis
.
10.42 Let Houston = group 1 and Chicago = group 2
1) H
0: s1
2 = s2
2
H
a: s1
2 ¹ s2
2
2) F =
2
2
2
1
s
s
3) a = .01
4) df
1 = 12 df2 = 10 This is a two-tailed test
The critical table
F values are: F
.005,12,10 = 5.66 F
.995,10,12 = .177
Chapter 10: Statistical Inferences About Two Populations 28
If the observed value is greater than 5.66 or less than .177, the decision will be
to reject the null hypothesis.
5) s
1
2 = 393.4 s
2
2 = 702.7
6) F =
7.702
4.393
=
0.56
7) Since
F = 0.56 is greater than .177 and less than 5.66,
the decision is to
fail to reject the null hypothesis.
8) There is no significant difference in the variances of
number of days between Houston and Chicag o.
10.43 H
0: s1
2 = s2
2 a = .05 n1 = 12 s1 = 7.52
H
a: s1
2 > s2
2 n2 = 15 s2 = 6.08
df
num = 12 - 1 = 11 dfdenom = 15 - 1 = 14
The critical table
F value is F.05,10,14 = 5.26
F =
2
2
2
2
2
1
)08.6(
)52.7(
=
s
s
=
1.53
Since the observed F = 1.53 < F
.05,10,14 = 2.60, the decision is to fail to reject the
null hypothesis
.
10.44 H
0: s1
2 = s2
2 a = .01 n 1 = 15 s 1
2 = 91.5
H
a: s1
2 ¹ s2
2 n 2 = 15 s 2
2 = 67.3
df
num = 15 - 1 = 14 dfdenom = 15 - 1 = 14
The critical table F values are: F
.005,12,14 = 4.43 F .995,14,12 = .226
F =
3.67
5.91
2
2
2
1
=
s
s
= 1.36
Since the observed F = 1.36 < F
.005,12,14 = 4.43 and > F
.995,14,12 = .226, the decision
is to
fail to reject the null hypothesis.
Chapter 10: Statistical Inferences About Two Populations 29
10.45 H
o: µ 1 - µ2 = 0
H
a: µ 1 - µ2 ¹ 0
For a = .10 and a two-tailed test, a/2 = .05 and z
.05 = +
1.645
Sample 1 Sample 2
1x = 138.4
2
x = 142.5
s1 = 6.71 s2 = 8.92
n
1 = 48 n 2 = 39
z = 39
)92.8(
48
)71.6(
)0()5.1424.138()()(
2
2
2
2
1
2
1
21
21
+
--
=
+
---
nn
xx
ss
mm
= -2.38
Since the observed value of z = -2.38 is less than the critical value of z = -1.645,
the decision is to reject the null hypothesis. There is a significant difference in
the means of the two populations.
Chapter 10: Statistical Inferences About Two Populations 30
10.47 Ho: µ 1 - µ2 = 0
H
a: µ 1 - µ2 > 0
Sample 1
Sample 2
1x= 2.06
2x = 1.93
s
1
2
= .176 s 2
2
= .143
n
1 = 12 n 2 = 15
This is a one-tailed test with df = 12 + 15 - 2 = 25. The critical value is
t
.05,25 = 1.708. If the observed value is greater than 1.708, the decision will be to
reject the null hypothesis.
t =
2121
2
2
21
2
1
21
2
1
11
2
)1()1(
)()(
nnnn
nsns
xx
+
-+
-+-
---
mm
Since the observed value of t = 0.85 is less than the critical value of t = 1.708, the
decision is to fail to reject the null hypothesis. The mean for population one is
not significantly greater than the mean for population two.
10.48 Sample 1 Sample 2
x1 = 74.6 x2 = 70.9
s1
2
= 10.5 s2
2
= 11.4
n1 = 18 n2 = 19
For 95% confidence, a/2 = .025.
Using df = 18 + 19 - 2 = 35,
t
35,.025 = 2.042
Chapter 10: Statistical Inferences About Two Populations 31
10.49 H
o: D = 0 a = .01
H
a: D < 0
n = 21 df = 20
d = -1.16 s
d = 1.01
The critical
t
.01,20 = -2.528. If the observed t is less than -2.528, then the decision
will be to reject the null hypothesis.
t =
21
01.1
016.1--
=
-
n
s
Dd
d
= -5.26
Since the observed value of t = -5.26 is less than the critical t value of -2.528, the
decision is to
reject the null hypothesis. The population difference is less
than zero.
Since the observed value of z = -1.20 is greater than -1.96, the decision is to
fail
to reject the null hypothesis
. There is no significant difference in the
population
proportions.
10.52 Sample 1
Sample 2
n
1 = 409 n 2 = 378
p
1 = .71 p2 = .67
For a 99% Confidence Level,
a/2 = .005 and z .005 = 2.575
10.53 H
0: s1
2 = s2
2 a = .05 n 1 = 8 s 1
2
= 46
H
a: s1
2 ¹ s2
2 n 2 = 10 S 2
2 = 37
df
num = 8 - 1 = 7 dfdenom = 10 - 1 = 9
The critical F values are: F
.025,7,9 = 4.20 F
.975,9,7 = .238
Chapter 10: Statistical Inferences About Two Populations 33
If the observed value of F is greater than 4.20 or less than .238, then the decision
will be to reject the null hypothesis.
F =
37
46
2
2
2
1
=
s
s
= 1.24
Since the observed F = 1.24 is less than F
.025,7,9 =4.20 and greater than
F
.975,9,7 = .238, the decision is to fail to reject the null hypothesis. There is no
significant difference in the variances of the two populations.
10.54 Term
Whole Life
xt = $75,000 xw = $45,000
s
t = $22,000 s w = $15,500
n
t = 27 n w = 29
df = 27 + 29 - 2 = 54
For a 95% Confidence Level,
a/2 = .025 and t .025,40 = 2.021 (used df=40)
Since the observed z = -3.73 is less than z
.05 = -1.645, the decision is to reject the
null hypothesis
. 1997 has a significantly higher proportion.
10.57 Accounting
Data Entry
n
1 = 16 n 2 = 14
x1 = 26,400 x2 = 25,800
s
1 = 1,200 s 2 = 1,050
H
0: s1
2 = s2
2
H
a: s1
2 ¹ s2
2
df
num = 16 1 = 15 df denom = 14 1 = 13
The critical F values are: F
.025,15,13 = 3.05 F .975,15,13 = 0.33
Chapter 10: Statistical Inferences About Two Populations 35
F =
500,102,1
000,440,1
2
2
2
1
=
s
s
= 1.31
Since the observed F = 1.31 is less than F
.025,15,13 = 3.05 and greater than
F
.975,15,13 = 0.33, the decision is to fail to reject the null hypothesis.
10.58 H
0: s1
2 = s2
2 a = .01 n 1 = 8 n 2 = 7
H
a: s1
2 ¹ s2
2 S 1
2 = 72,909 S 2
2 = 129,569
df
num = 6 dfdenom = 7
The critical F values are: F
.005,6,7 = 9.16 F
.995,7,6 = .11
F =
909,72
569,129
2
2
2
1
=
s
s = 1.78
Since F = 1.95 < F
.005,6,7 = 9.16 but also > F
.995,7,6 = .11, the decision is to fail to
reject the null hypothesis
. There is no difference in the variances of the shifts.
10.59 Men
Women
n
1 = 60 n 2 = 41
x1 = 631 x2 = 848
s1 = 100 s2 = 100
For a 95% Confidence Level,
a/2 = .025 and z.025 = 1.96
10.63 H
0: s1
2 = s2
2 a = .05 n 1 = 27 s 1 = 22,000
H
a: s1
2 ¹ s2
2 n 2 = 29 s 2 = 15,500
df
num = 27 - 1 = 26 dfdenom = 29 - 1 = 28
The critical F values are: F
.025,24,28 = 2.17 F .975,28,24 = .46
F =
2
2
2
2
2
1
500,15
000,22
=s
s
= 2.01
Since the observed F = 2.01 < F
.025,24,28 = 2.17 and > than F .975,28,24 = .46, the
decision is to
fail to reject the null hypothesis.
Chapter 10: Statistical Inferences About Two Populations 38
10.64 Name Brand Store Brand d
54 49 5
55 50 5
59 52 7
53 51 2
54 50 4
61 56 5
51 47 4
53 49 4
n = 8 d = 4.5 sd=1.414 df = 8 - 1 = 7
For a 90% Confidence Level, a/2 = .05 and
t.05,7 = 1.895
n
s
td
d
±
4.5 +
1.895
8
414.1
= 4.5 ± .947
3.553 <
D < 5.447
10.65 H
o: µ1 - µ2 = 0 a = .01
H
a: µ1 - µ2 < 0 df = 23 + 19 - 2 = 40
Wisconsin
Tennessee
n
1 = 23 n2 = 19
x1 = 69.652 x2 = 71.7368
s1
2 = 9.9644 s2
2 = 4.6491
For one-tail test,
a = .01 and the critical t.01,40 = -2.423
t =
2121
2
2
21
2
1
21
2
1
11
2
)1()1(
)()(
nnnn
nsns
xx
+
-+
-+-
---
mm
Chapter 10: Statistical Inferences About Two Populations 39
Since the observed t = -2.44 < t .01,40 = -2.423, the decision is to reject the null
hypothesis .
Since the observed
t = 4.52 > t.005,14 = 2.977, the decision is to reject the null
hypothesis .
10.73 A
t test was used to test to determine if Hong Kong has significantly different
rates than Bombay. Let group 1 be Hong Kong.
H
o: µ1 - µ2 = 0
H
a: µ1 - µ2 > 0
n1 = 19 n2 = 23
x1 = 130.4 x2 = 128.4
S
1 = 12.9 S2 = 13.9 a = .01
t = 0.48 with a p-value of .634 which is not significant at of .05. There is not
enough evidence in these data to declare that there is a difference in the average
rental rates of the two cities.
10.74 H
0: D = 0
H
a: D ¹ 0
This is a related measures before and after study. Fourteen people were involved
in the study. Before the treatment, the sample mean was 4.357 and after the
Chapter 10: Statistical Inferences About Two Populations 43
treatment, the mean was 5.214. The higher number after the treatment indicates
that subjects were more likely to blow the whistle after having been through the
treatment. The observed t value was 3.12 which was more extreme than two-
tailed table t value of + 2.16 causing the researcher to reject the null hypothesis.
This is underscored by a p-value of .0081 which is less than a = .05. The study
concludes that there is a significantly higher likelihood of blowing the whistle
after the treatment.
10.75 The point estimates from the sample data indicate that in the northern city the
market share is .3108 and in the southern city the market share is .2701. The
point estimate for the difference in the two proportions of market share are .0407.
Since the 99% confidence interval ranges from -.0394 to +.1207 and zero is in the
interval, any hypothesis testing decision based on this interval would result in
failure to reject the null hypothesis. Alpha is .01 with a two-tailed test. This is
underscored by a calculated z value of 1.31 which has an associated p-value of
.191 which, of course, is not significant for any of the usual values of a.
10.76 A test of differences of the variances of the populations of the two machines is
being computed. The hypotheses are:
H
0: s1
2 = s2
2
H
a: s1
2 ¹ s2
2
Twenty-six pipes were measured for sample one and twenty-six pipes were
measured for sample two. The observed F = 1.79 is not significant at a = .05 for
a two-tailed test since the associated p-value is .0758. There is no significant
difference in the variance of pipe lengths for pipes produced by machine A versus
machine B.