chapter three Sampling_distributions_1.ppt

aschalewshiferaw 59 views 24 slides Jul 01, 2024
Slide 1
Slide 1 of 24
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24

About This Presentation

Sampling and Sampling Distribution teaching materials.


Slide Content

Sampling Distributions

Sampling Distribution
Introduction
•In real life calculating parameters of
populations is prohibitive because
populations are very large.
•Rather than investigating the whole
population, we take a sample, calculate a
statisticrelated to theparameterof interest,
and make an inference.
•The sampling distributionof the statisticis
the tool that tells us how close is the statistic
to the parameter.

•An estimatorof a population parameter is a sample
statistic used to estimate or predict the population
parameter.
•An estimateof a parameter is a particularnumerical
value of a sample statistic obtained through
sampling.
•A point estimateis a single value used as an
estimate of a population parameter.
A population parameter
is a numerical measure of
a summary characteristic
of a population.
Sample Statistics as Estimators
of Population Parameters
•A sample statisticis a
numerical measure of a
summary characteristic
of a sample.

•The sample mean, , is the most common
estimator of the population mean, 
•The sample variance, s
2
, is the most common
estimator of the population variance, 
2
.
•The sample standard deviation, s, is the most
common estimator of the population standard
deviation, .
•The sample proportion, , is the most common
estimator of the population proportion, p.
EstimatorsX pˆ

•The sampling distribution of Xis the
probability distribution of all possible values
the random variable may assume when a
sample of size nis taken from a specified
population.X
Sampling Distribution of X

Sampling Distribution of the Mean
•An example
–A die is thrown infinitely many times. Let X
represent the number of spots showing on
any throw.
–The probability distribution of X is
x 1 2 3 4 5 6
p(x) 1/6 1/6 1/6 1/6 1/6 1/6
E(X) = 1(1/6) +
2(1/6) + 3(1/6)+
………………….= 3.5
V(X) = (1-3.5)
2
(1/6) +
(2-3.5)
2
(1/6) +
…………. …= 2.92

•Suppose we want to estimate 
from the mean of a sample of
size n = 2.
•What is the distribution of ?x
Throwing a dice twice –sampling
distribution of sample meanx

Sample MeanSample MeanSample Mean
1 1,11 13 3,1 2 25 5,1 3
2 1,21.5 14 3,22.5 26 5,2 3.5
3 1,32 15 3,3 3 27 5,3 4
4 1,42.5 16 3,43.5 28 5,4 4.5
5 1,53 17 3,5 4 29 5,5 5
6 1,63.5 18 3,64.5 30 5,6 5.5
7 2,11.5 19 4,12.5 31 6,1 3.5
8 2,22 20 4,2 3 32 6,2 4
9 2,32.5 21 4,33.5 33 6,3 4.5
10 2,43 22 4,4 4 34 6,4 5
11 2,53.5 23 4,54.5 35 6,5 5.5
12 2,64 24 4,6 5 36 6,6 6 Throwing a die twice –sample
mean

x The distribution of when n = 2 Sample MeanSample MeanSample Mean
1 1,11 13 3,1 2 25 5,1 3
2 1,21.5 14 3,22.5 26 5,2 3.5
3 1,32 15 3,3 3 27 5,3 4
4 1,42.5 16 3,43.5 28 5,4 4.5
5 1,53 17 3,5 4 29 5,5 5
6 1,63.5 18 3,64.5 30 5,6 5.5
7 2,11.5 19 4,12.5 31 6,1 3.5
8 2,22 20 4,2 3 32 6,2 4
9 2,32.5 21 4,33.5 33 6,3 4.5
10 2,43 22 4,4 4 34 6,4 5
11 2,53.5 23 4,54.5 35 6,5 5.5
12 2,64 24 4,6 5 36 6,6 6
1 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0 5.5 6.0
6/36
5/36
4/36
3/36
2/36
1/36x
E( ) =1.0(1/36)+
1.5(2/36)+….=3.5
V(X) = (1.0-3.5)
2
(1/36)+
(1.5-3.5)
2
(2/36)... = 1.46x 2
:
2
2 x
xxx
andNote

 

6)
5
(5833.
5.3
5n
2
x2
x
x



 )
10
(2917.
5.3
10n
2
x2
x
x



 )
25
(1167.
5.3
25n
2
x2
x
x




Sampling Distribution of the
Mean

Sampling Distribution of the
Mean)
5
(5833.
5.3
5n
2
x2
x
x



 )
10
(2917.
5.3
10n
2
x2
x
x



 )
25
(1167.
5.3
25n
2
x2
x
x




Notice that is smaller than .
The larger the sample size the
smaller . Therefore, tends
to fall closer to , as the sample
size increases.2
x x 2
x
Notice that is smaller than 
x
.
The larger the sample size the
smaller . Therefore, tends
to fall closer to , as the sample
size increases.2
x x 2
x 2

The expected value of the sample meanis equal to the population mean: EX
X X()
The variance of the sample meanis equal to the population variance divided by
the sample size: VX
n
X
X
()

2
2
The standard deviation of the sample mean, known as the standard error of
the mean, is equal to the population standard deviation divided by the square
root of the sample size:n
XSD
X
X

 )(s.e.
Relationships between Population Parameters and
the Sampling Distribution of the Sample Mean

When sampling from a population
with mean and finite standard
deviation , the sampling
distribution of the sample mean will
tend to be a normal distribution with
mean and standard deviationas
the sample size becomes large
(n>30).
For “large enough” n:
n )/,(~
2
nNX 
P
(X
)
X
0.25
0.20
0.15
0.10
0.05
0.00
n = 5
P
(X
)
0.2
0.1
0.0
X
n = 20
f(X
)
X
-
0.4
0.3
0.2
0.1
0.0

Large n
The Central Limit Theorem

Normal Uniform Skewed
Population
n = 2
n = 30
XXXX
General
The Central Limit Theorem Applies to
Sampling Distributions from AnyPopulation

Mercury makes a 2.4 liter V-6 engine, used in speedboats. The company’s
engineers believe the engine delivers an average horsepower of 220 HP and
that the standard deviation of power delivered is 15 HP. A potential buyer
intends to sample 100 engines. What is the probability that the sample mean
will be less than 217 HP?PX P
X
n n
PZ PZ
PZ
( )
( ).
 













 











 











 
217
217
217220
15
100
217220
15
10
200228




The Central Limit Theorem
(Example)

If the population standard deviation, , isunknown, replace with
the sample standard deviation, s. If the population is normal, the
resulting statistic:
has a t distribution with (n -1) degrees of freedom.
•The tis a family of bell-shaped and
symmetric distributions, one for each
number of degree of freedom.
•The expected value of t is 0.
•The variance of t is greater than 1, but
approaches 1 as the number of degrees of
freedom increases.
•The tdistribution approaches a standard
normal as the number of degrees of
freedom increases.
•When the sample size is small (<30) we use
t distribution.ns
X
t
/


Standard normal
t, df=20
t, df=10


Student’s tDistribution

Sampling Distributions
Finite Population Correction Factor
If the sample sizeis more than 5%of the
population size and the sampling is done
without replacement, then a correction needs
to be made to the standard error of the
means.1


N
nN
n
x

Sampling Distribution of x
Finite PopulationInfinite Population

x
n
Nn
N



()
1 

x
n

• is referred to as the standard error of the
mean.
x
•A finite population is treated as being
infinite if n/N<.05.
• is the finite correction factor.( )/( )NnN 1 x
Standard Deviation of

= 32.2
x = 32
•The amount of soda pop in each bottle is normally
distributed with a mean of 32.2 ounces and a
standard deviation of 0.3 ounces.
•Find the probability that a carton of four bottles will
have a mean of more than 32 ounces of soda per
bottle.
•Solution
–Define the random variable as the mean amount of soda per
bottle.9082.0)33.1z(P
)
43.
2.3232x
(P)32x(P
x





 32x
0.90822.32
x
Sampling Distribution of the Sample Mean

•Example
–Dean’s claim: The average weekly income of
M.B.A graduates one year after graduation is
$600.
–Suppose the distribution of weekly income has a
standard deviation of $100. What is the
probability that 25 randomly selected graduates
have an average weekly income of less than
$550?
–Solution0062.0)5.2(
)
25100
600550
()550(





zP
x
PxP
x


Sampling Distribution of the
Sample Mean

The sample proportionis the percentage of
successes in nbinomial trials. It is the
number of successes,X, divided by the
number of trials, n.p
X
n

As the sample size, n, increases, the sampling
distribution of approaches a normal
distributionwith mean pand standard
deviationp pp
n
()1
Sample proportion:
1514131211109876543210
0.2
0.1
0.0
P
(X
)
n=15,p=0.3
X
14
15
13
15
12
15
11
15
10
15
9
15
8
15
7
15
6
15
5
15
4
15
3
15
2
15
1
15
0
15
15
15
^p
210
0.5
0.4
0.3
0.2
0.1
0.0
X
P
(X
)
n=2,p=0.3
109876543210
0.3
0.2
0.1
0.0
P
(X
)
n=10,p=0.3
X
The Sampling Distribution of the Sample
Proportion,p

Normal approximation to the
Binomial
–Normal approximation to the binomial works
best when
•the number of experiments (sample size) is
large, and
•the probability of success, p, is close to 0.5.
–For the approximation to provide good results
two conditions should be met:
np 5; n(1 -p) 5 

•Example
–A state representative received 52% of the
votes in the last election.
–One year later the representative wanted
to study his popularity.
–If his popularity has not changed, what is
the probability that more than half of a
sample of 300 voters would vote for him?

•Example
–Solution
•The number of respondents who prefer the
representative is binomial with n = 300 and p =
.52. Thus, np = 300(.52) = 156 and
n(1-p) = 300(1-.52) = 144 (both greater than 5)7549.
300)52.1)(52(.
52.50.
)1(
ˆ
)50.ˆ( 














npp
pp
PpP
Tags