PROBABILITY AND PROBABILITY DISTRIBUTIONS.ppt

3,736 views 59 slides Sep 10, 2022
Slide 1
Slide 1 of 59
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59

About This Presentation

PROBABILITYAND PROBABILITY DISTRIBUTIONS
Session 2.2
TEACHING BASIC STATISTICS
Motivation for Studying Chance
Sample StatisticEstimatesPopulation Parameter
e.g. Sample Mean X= 50estimatesPopulation Mean m
Questions:
1.
How do we assess the reliability of our estimate?
2.
What is an adequate sample s...


Slide Content

PROBABILITY
AND PROBABILITY
DISTRIBUTIONS

Session 2.2
TEACHING BASIC STATISTICS
Motivation for Studying Chance
Sample StatisticEstimatesPopulation Parameter
e.g. Sample Mean X= 50estimatesPopulation Mean m
Questions:
1.How do we assess the reliability of our estimate?
2.What is an adequate sample size? [ We would expect a
large sample to give better estimates. Large samples
more costly.]

Session 2.3
TEACHING BASIC STATISTICS
An Approach to Solve the Questions
If sample was chosen through
chance processes, we have to
understand the notion of
probability and sampling
distribution.

Session 2.4
TEACHING BASIC STATISTICS
To introduce probability….
Random experiment
Sample space
Event as subset of sample
space
Likelihood of an event to occur
-probability of an event

Session 2.5
TEACHING BASIC STATISTICS
Features of a Random Experiment
All outcomes are known in
advance.
The outcome of any one
trial is unpredictable.
Trials are repeatable under
identical conditions.

Session 2.6
TEACHING BASIC STATISTICS
EXAMPLES
Rolling a die and
observing the
number of dots on
the upturned face
Tossing a one-peso
coin and observing
the upturned face
Measuring the
height of a student
enrolled this term

Session 2.7
TEACHING BASIC STATISTICS
SAMPLE SPACE
It is a set such that each element
denotes an outcome of a random
experiment.
Any performance of the
experiment results in an outcome
that corresponds to exactly one
and only one element.
It is usually denoted by S.

Session 2.8
TEACHING BASIC STATISTICS
ILLUSTRATION
Rolling a die and observing
the number of dots on the
upturned face
S={ , , , , , }
S={1, 2, 3, 4, 5, 6}

Session 2.9
TEACHING BASIC STATISTICS
EVENT
A subset of the sample space
Usually denoted by capital letters like
E, Aor B
Observance of the elements of the
subset implies the occurrence of the
event
Can either be classified as simple or
compound event

Session 2.10
TEACHING BASIC STATISTICS
ILLUSTRATION
S = {1, 2, 3, 4, 5, 6}
An event of
observing odd-
number of dots
in a roll of a die
E
1= { 1, 3, 5}
An event of
observing even-
number of dots
in a roll of a die
E
2= { 2, 4, 6}

Session 2.11
TEACHING BASIC STATISTICS
Visualizing Events
Contingency Tables
Tree Diagrams
Red 2 24 26
Black 2 24 26
Total 4 48 52
Ace Not Ace Total
Full
Deck
of Cards
Red Cards
Black
Cards
Not an Ace
Ace
Ace
Not an Ace

Session 2.12
TEACHING BASIC STATISTICS
Mutually Exclusive Events
Two events are mutuallyexclusiveif
one and only one of them can occur at a
time.
Example:
Coin toss: either a head or a tail, but not
both. The events head and tail are
mutually exclusive.

Session 2.13
TEACHING BASIC STATISTICS
The numerical measure of
the likelihood that an event
will occur
Between 0 and 1
Note: Sum of the probabilities
of all mutually exclusive and
collective exhaustive events
is 1
Certain
Impossible
0.5
1
0
PROBABILITY

Session 2.14
TEACHING BASIC STATISTICS
Assigning Probabilities
Subjective
confident student views chances of passing
a course to be near 100 %
Logical
symmetry/equally likely: coin, dice, cards etc.
(A PRIORI assignment)
Empirical
chances of rain 75 % since it rained 15 out of
past 20 days (A POSTERIORI)

Session 2.15
TEACHING BASIC STATISTICS
If all possible outcomes can be listed and
are equally likely to occur, we can compute
the Probability of an Event E:OutcomesTotal
OutcomesEventofNumber
EP)(
Example:
P(ace in a deck of cards) = 4/52
since there are 4 aces in a deck of (52) cards.
Computing Probability

Session 2.16
TEACHING BASIC STATISTICS
Computing Joint Probability
The probability of a joint event, A and B:( and ) = ( )
number of outcomes from both A and B
total number of possible outcomes in sample space
P A B P A B
 E.g. (Red Card and Ace)
2 Red Aces 1

52 Total Number of Cards 26
P


Session 2.17
TEACHING BASIC STATISTICS
Rules on Probability
Property 1. The probability of an
event E is any number between 0
and 1 inclusive.
Property 2. The sum of the
probabilities of a set of mutually
exclusive events is 1.

Session 2.18
TEACHING BASIC STATISTICS
Rules on Probability
Property 3. Addition Rule
P(A or B) = P(A) + P(B) -P(A and B)
A
B

Session 2.19
TEACHING BASIC STATISTICS
Computing Probability
P(King or Spade) = P(King) + P(Spade)
-P(King and Spade) =
P(King or Queen) = P(King)+P(Queen) =13
4
52
16
52
1
52
13
52
4
 13
2
52
8
52
4
52
4

since King and Queen are mutually exclusive, i.e. P(King and Queen)=0

Session 2.20
TEACHING BASIC STATISTICS
Marginal Probability
Black
Color
Type Red Total
Ace 2 2 4
Non-Ace 24 24 48
Total 26 26 52
P(Ace) =
4
52
A Deck of 52 Cards

Session 2.21
TEACHING BASIC STATISTICS
Conditional Probability
Black
Color
Type Red Total
Ace 2 2 4
Non-Ace 24 24 48
Total 26 26 52(Ace and Red) 2/52 2
(Ace | Red)
(Red) 26/52 26
P
P
P
  
A Deck of 52 Cards

Session 2.22
TEACHING BASIC STATISTICS
Joint Probability
Multiplication Rule:
The chance that two events will
occur is the chance that the first
event will occur multiplied by the
chance of the second event (given
that the first has happened)

Session 2.23
TEACHING BASIC STATISTICS
Joint Probability
A Deck of 52 Cards
Chance of Red Ace = 2/52 = (26/52) x (2/26)
Black
Color
Type Red Total
Ace 2 2 4
Non-Ace 24 24 48
Total 26 26 52

Session 2.24
TEACHING BASIC STATISTICS
UNEQUALLY LIKELY OUTCOME
ASSUMPTION
The outcomes have different
likelihood to occur.
The probability of an event Eis
then computed as the sum of the
probabilities of the outcomes
found in the event E, that is,
P[E] = sum of p{e}
where eis an element of event E.

Session 2.25
TEACHING BASIC STATISTICS
ILLUSTRATION
S = {1, 2, 3, 4, 5, 6}
Assuming that the probability of each of the
outcomes 1,2, and 3 is 1/12 while each of the
outcomes 4, 5 and 6 has likelihood to occur
equal to 1/4.
The probability of an event of observing odd-
number of dots in a roll of a die is P[E
1] = sum
ofp{1}, p{3} and p{5} = 1/12 + 1/12 + 1/4 =
5/12.

Session 2.26
TEACHING BASIC STATISTICS
A POSTERIORI APPROACH
The random experiment has to
be performed and the event of
interest is observed.
The probability of the event is
the relative frequency of the
occurrence of such event.

Session 2.27
TEACHING BASIC STATISTICS
ILLUSTRATION
Suppose the experiment was done
for 100 times and it was observed
that an odd-number of dots occurred
60 times and even-number of dots
occurred 40 times.
The probability of an event of
observing odd-number of dots in a
roll of a die is the relative frequency
of the event or P[E
1] = 60/100 = 0.6

Session 2.28
TEACHING BASIC STATISTICS
Random Variable
Defined on a random experiment
A rule or a function that maps
each element of the sample to
one and only one real number
The mapping produces mutually
exclusive partitioning on the set
of real numbers

Session 2.29
TEACHING BASIC STATISTICS
ILLUSTRATION
Rolling two dice and observing the
number of dots on the upturned faces.
S={ (1,1), (1,2), (1,3), (1,4), (1,5), (1,6)
(2,1), (2,2), (2,3), (2,4), (2,5), (2,6)
(3,1), (3,2), (3,3), (3,4), (3,5), (3,6)
(4,1), (4,2), (4,3), (4,4), (4,5), (4,6)
(5,1), (5,2), (5,3), (5,4), (5,5), (5,6)
(6,1), (6,2), (6,3), (6,4), (6,5), (6,6)}

Session 2.30
TEACHING BASIC STATISTICS
ILLUSTRATION
We define a random variable as the total number of
dots on the upturned faces.
2
3
4
5
6
7
8
9
10
11
12
(1,1),
(1,2), (2,1),
(1,3), (2,2), (3,1),
(1,4), (2,3), (3,2), (4,1),
(1,5), (2,4), (3,3), (4,2), (5,1),
(1,6), (2,5), (3,4), (4,3), (5,2), (6,1),
(2,6), (3,5), (4,4), (5,3), (6,2),
(3,6), (4,5), (5,4), (6,3),
(4,6), (5,5), (6,4),
(5,6), (6,5),
(6,6)

Session 2.31
TEACHING BASIC STATISTICS
ILLUSTRATION
The random variable takes on the values 2,
3, 4, 5, 6, 7, 8, 9, 10, 11 and 12.
Some of the values had more corresponding
elements in the sample space. For example,
2 corresponds to only one outcome while 3
corresponds to 2 outcomes.
The probability that the random variable will
take a value is equal to the sum of the
probabilities of the corresponding outcomes
in the sample space.

Session 2.32
TEACHING BASIC STATISTICS
ILLUSTRATION
The probability that the random variable will
take the value 4 is equal to the sum of the
probabilities of the corresponding outcomes.
The probability that the total number of dots
on the upturned faces of the dice is 4 is then
equal to the sum of the probabilities of the
outcomes (1,3), (2,2), and (3,1).
Each outcome in the sample space has
probability of 1/36. Thus, the probability that
the total number of dots is 4 is equal to 3/36
or 1/12.

Session 2.33
TEACHING BASIC STATISTICS
PROBABILITY DISTRIBUTION
A table or a curve or a function
that presents the possible values
of the random variable and its
corresponding probabilities.
Some random variables are
better presented as a table while
others as a function or as a
curve or graph.

Session 2.34
TEACHING BASIC STATISTICS
ILLUSTRATION
The probability distribution of the random variable, Xdefined
as the total number of dots on the upturned faces in a roll of
two dice, is presented as a table below:X 23456789101112
P[X=x]1/362/363/364/365/366/365/364/363/362/361/36 0.00
0.05
0.10
0.15
0.20
23456789101112
X = Total Num ber of Dots on the Upturned faces

Session 2.35
TEACHING BASIC STATISTICS
Types of Probability Distributions
Discrete Probability Distributions:
Bernoulli, Binomial, Geometric,
Hypergeometric, Negative Binomial,
Continuous Probability Distributions:
Normal, Exponential, Gamma, Beta,
Uniform,

Session 2.36
TEACHING BASIC STATISTICS
Bernoulli Probability Distribution
Named after Bernoulli
Discrete random variable with
only two possible values; 0 and 1
The value 1 represents success
while the value 0 represents
failure
The parameter pis the probability
of success.

Session 2.37
TEACHING BASIC STATISTICS
Bernoulli Probability Distribution
Its probability
distribution function
is given by:
Graphically, the
distribution is illustrated
as follows: 






01
,1
xp
xp
xXP
0 1
p
1-p

Session 2.38
TEACHING BASIC STATISTICS
Binomial Probability Distribution
Composed of nindependent
Bernoulli trials
The parameter pis the probability of
success remains constant from one
trial to another
Discrete random variable defined as
the number of success out of ntrials
Possible values; 0, 1, 2, .., n

Session 2.39
TEACHING BASIC STATISTICS
Binomial Probability Distribution
Its probability
distribution function is
given by:
Graphically, the
distribution is illustrated
as follows:   nxpp
x
n
xXP
xnx
2, ,1 ,0 ,1 










0 1 2 …. n
and the function is
undefined elsewhere.

Session 2.40
TEACHING BASIC STATISTICS
•‘Bell-Shaped’
•Symmetric
•Range of possible values
is infinite on both
directions. Mean
Median
Mode
X
f(X)
m
Normal Probability Distribution

Session 2.41
TEACHING BASIC STATISTICS
The Mathematical Model
 

 
21
2
2
1
2
: density of random variable
3.14159; 2.71828
: population mean
: population standard deviation
: value of random variable
X
f X e
f X X
e
XX
m



m





   

Session 2.42
TEACHING BASIC STATISTICS
THE NORMAL CURVE0.00
0.05
0.10
0.15
0.20
0.25
-15 -10 -5 0 5 10 15 20
Two normal distributions with the same mean but
different variances.
N(5,4)
N(5,9)

Session 2.43
TEACHING BASIC STATISTICS
Two normal distributions with the different means
but equal variances0.00
0.05
0.10
0.15
0.20
0.25
-5 0 5 10 15 20
N(5,4)
N(10,4)
THE NORMAL CURVE

Session 2.44
TEACHING BASIC STATISTICS
By varying the parameters and m, we obtain
different normal distributions
There are an infinite number of normal curves
Many Normal Distributions

Session 2.45
TEACHING BASIC STATISTICS
Normal Distribution Properties
For a normal curve, the area within:
a)one standard deviation from the
mean is about 68%,
b)two standard deviations from the
mean is about 95%; and
c)three standard deviations from
the mean is about 99.7%.

Session 2.46
TEACHING BASIC STATISTICS
Probability is the area
under the curve!
cd
X
f(X)
PcXd( )?
Areas Normal Distributions

Session 2.47
TEACHING BASIC STATISTICS
Infinitely Many Normal Distributions imply
Infinitely Many Tables to Look Up!
Each distribution
has its own table?
Which Table???

Session 2.48
TEACHING BASIC STATISTICS
Standard Normal Distribution
Since there are many normal curves,
often it is important to standardize,
and refer to a STANDARD NORMAL
DISTRIBUTION (or curve) where the
mean m= 0 and the =1

Session 2.49
TEACHING BASIC STATISTICS
THE Z-TABLE
P[Zz]
Examples:
1.P[Z0] = 0.5
2.P[Z1.25] = 0.8944
3.P[Z1.96] = 0.9750
0 z
This table summarizes the cumulative probability
distribution for Z (i.e. P[Zz])

Session 2.50
TEACHING BASIC STATISTICS
Standardizing Example6.2 5
0.12
10
X
Z
m


  
Shaded Area Exaggerated
Normal Distribution10 5m 6.2 X Standard Normal DistributionZ 0
Zm 0.12 1
Z

Session 2.51
TEACHING BASIC STATISTICS
Solution: The Cumulative
Standardized Normal Curve
Z.00.01
0.0.5000.5040.5080
.5398.5438
0.2.5793.5832.5871
0.3.6179.6217.6255
.5478
.02
0.1 .5478
Cumulative Standard Normal Distribution Table (Portion)
Probabilities
Shaded Area
Exaggerated
Only One Table is Needed0 1
ZZm
Z = 0.120

Session 2.52
TEACHING BASIC STATISTICS
Normal Distribution
Standardized Normal Curve10 1
Z 5m 7.1 X Z 0
Zm 0.21 2.9 5 7.1 5
.21 .21
10 10
XX
ZZ
mm

   
       2.9 0.21 .0832  2.9 7.1 .1664PX   .0832
Shaded Area Exaggerated
Example:

Session 2.53
TEACHING BASIC STATISTICS
Z.00.01
0.0.5000.5040.5080
.5398.5438
0.2.5793.5832.5871
0.3.6179.6217.6255
.5832
.02
0.1 .5478
Cumulative Standard Normal
Distribution Table (Portion)
Shaded Area
Exaggerated0 1
ZZm
Z = 0.21
(continued)0  2.9 7.1 .1664PX  
Example:

Session 2.54
TEACHING BASIC STATISTICS
Z.00.01
-03.3821.3783.3745
.4207.4168
-0.1.4602.4562.4522
0.0.5000.4960.4920
.4168
.02
-02 .4129
Cumulative Standard Normal
Distribution Table (Portion)
Shaded Area
Exaggerated0 1
ZZm
Z = -0.21 2.9 7.1 .1664PX  
(continued)0
Example:

Session 2.55
TEACHING BASIC STATISTICS 8 .3821PX
Normal Distribution
Standard Normal
Distribution
Shaded Area Exaggerated10 1
Z 5m 8 X Z 0
Zm 0.30 85
.30
10
X
Z
m


   .3821
Example:

Session 2.56
TEACHING BASIC STATISTICS
(continued)
Z.00.01
0.0.5000.5040.5080
.5398.5438
0.2.5793.5832.5871
0.3.6179.6217.6255
.6179
.02
0.1 .5478
Cumulative Standard Normal
Distribution Table (Portion)
Shaded Area
Exaggerated0 1
ZZm
Z = 0.300  8 .3821PX
Example:

Session 2.57
TEACHING BASIC STATISTICS
.1217
Finding ZValues for Known Probabilities
Z.00 0.2
0.0.5000.5040.5080
0.1.5398.5438.5478
0.2.5793.5832.5871
.6179 .6255
.01
0.3
Cumulative Standard Normal
Distribution Table (Portion)
What is ZGiven area between
0 and Z is 0.1217 ?
Shaded Area
Exaggerated
.62170 1
ZZm .31Z 0

Session 2.58
TEACHING BASIC STATISTICS
Example
Suppose that women’s heights can be modeled by a
normal curve with a mean of 1620 mm and a
standard deviation of 50 mm
Solution:The 10th percentile of the height distribution
may be obtained by firstly getting the 10th percentile
of the standard normal curve, which can be read off
as -1.282. This means that the 10th percentile of the
height distribution is 1.282 standard deviations below
the mean. This height is
–1.282(50)+1620 =1555.9

Session 2.59
TEACHING BASIC STATISTICS
RULES IN COMPUTING PROBABILITIES
P[Z= a] = 0
P[Za] can be obtained directly
from the Z-table
P[Za] = 1 –P[Za]
P[Z-a] = P[Z+a]
P[Z-a] = P[Z+a]
P[a
1Za
2] = P[Za
2] –P[Za
1]