chapter2-statistics-course-on-datacamp.pdf

MuhammadAteeq47 10 views 61 slides Oct 13, 2024
Slide 1
Slide 1 of 61
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59
Slide 60
60
Slide 61
61

About This Presentation

stas


Slide Content

What are the
chances?
INTRODUCTION TO STATISTICSGeorge Boorman
Curriculum Manager, DataCamp

INTRODUCTION TO STATISTICSMeasuring chance
What's the probability of an event?
P(event)=
Example: a coin .ip
P(heads)= ==50%
total # of possible outcomes
# ways event can happen
2 possible outcomes
1 way to get heads
2
1

INTRODUCTION TO STATISTICSAssigning salespeople
Sampling

INTRODUCTION TO STATISTICS
Assigning salespeople
P(Brian)==25%
4
1

INTRODUCTION TO STATISTICSMorning meeting

INTRODUCTION TO STATISTICS
Afternoon meeting
P(Brian)==25%
Sampling with replacement
4
1

INTRODUCTION TO STATISTICSIndependent probability
Two events are independent if the probability of the second event does not change based on
the outcome of the ,rst event.

INTRODUCTION TO STATISTICSOnline retail sales
Order NumberProduct TypeNet QuantityGross SalesDiscountsReturnsNet Sales
200 Basket 13 3744.0 -316.80 0.00 3427.20
201 Basket 12 3825.0 -201.60 -288.03335.40
202 Basket 17 3035.0 -63.25 0.00 2971.75
203 Art & Sculpture47 2696.8 -44.16 0.00 2652.64
204 Basket 17 2695.0 -52.50 -110.002532.50
Image credit: h2ps://unsplash.com/@rodriguezedm
1

INTRODUCTION TO STATISTICSProbability of an order for a jewelry product
Product TypeOrder Count
Basket 551
Art & Sculpture337
Jewelry 210
Kitchen 161
Home Decor131
... ...
Total 1767

INTRODUCTION TO STATISTICSProbability of an order for a jewelry product
P(Jewelry)=
P(Jewelry)=
P(Jewelry)=11.88%
Sum(Total Order Count)
Order Count(Jewelry)
1767
210

INTRODUCTION TO STATISTICSProbabilities for all product types
Product TypeOrder CountProbability
Basket 551 31.18%
Art & Sculpture337 19.07%
Jewelry 210 11.88%
Kitchen 161 9.11%
Home Decor131 7.41%
... ... ...
Total 1767 100%

Let's practice!
INTRODUCTION TO STATISTICS

Conditional
probability
INTRODUCTION TO STATISTICSGeorge Boorman
Curriculum Manager, DataCamp

INTRODUCTION TO STATISTICSMultiple meetings
Sampling without replacement

INTRODUCTION TO STATISTICS
Multiple meetings
Sampling without replacement
P(Claire)==33%
3
1

INTRODUCTION TO STATISTICSDependent events
Two events are dependent if the probability of
the second event is aFected by the outcome
of the ,rst event.

INTRODUCTION TO STATISTICS
Dependent events
Two events are dependent if the probability of
the second event is aFected by the outcome
of the ,rst event.

INTRODUCTION TO STATISTICS
Dependent events
Two events are dependent if the probability of
the second event is aFected by the outcome
of the ,rst event.

Sampling without replacement = each pick is
dependent

INTRODUCTION TO STATISTICSConditional probability
Conditional probability is used to calculate
the probability of dependent events
The probability of one event is
conditional on the outcome of another

Context, or subject-ma2er expertise, is
critical!
Image credit: h2ps://unsplash.com/@pixeldan
1

INTRODUCTION TO STATISTICSVenn diagrams

INTRODUCTION TO STATISTICSKitchen sales over $150

INTRODUCTION TO STATISTICSKitchen sales over $150
P(Order>150∣Kitchen)=
P(Order>150∣Kitchen)=
1767
161
1767
20
161
20

INTRODUCTION TO STATISTICSThe order of events matters
P(Kitchen∣Order>150)=
P(Kitchen∣Order>150)=
1767
581
1767
20
581
20

INTRODUCTION TO STATISTICSConditional probability formula
P(A∣B)=
Probability of event A, given event B
Probability of event A and event B
divided by the probability of event B
P(B)
P(A and B)

Let's practice!
INTRODUCTION TO STATISTICS

Discrete
distributions
INTRODUCTION TO STATISTICSGeorge Boorman
Curriculum Manager, DataCamp

INTRODUCTION TO STATISTICS

INTRODUCTION TO STATISTICS
Rolling the dice

INTRODUCTION TO STATISTICSChoosing salespeople

INTRODUCTION TO STATISTICSProbability distribution
Describes the probability of each possible outcome in a scenario

Expected value: The mean of a probability distribution
Expected value of a fair die roll =
(1×)+(2×)+(3×)+(4×)+(5×)+(6×)=3.5
6
1
6
1
6
1
6
1
6
1
6
1

INTRODUCTION TO STATISTICSWhy are probability distributions important?
Help us to quantify risk and inform decision
making


Used extensively in hypothesis testing
Probability that the results occurred by
chance
Image credit: h2ps://unsplash.com/@timmossholder
1

INTRODUCTION TO STATISTICSVisualizing a probability distribution

INTRODUCTION TO STATISTICSProbability = area
P(die roll)≤2= ?

INTRODUCTION TO STATISTICS
Probability = area
P(die roll)≤2=1/3

INTRODUCTION TO STATISTICSUneven die
Expected value of uneven die roll =
(1×)+(2×0)+(3×)+(4×)+(5×)+(6×)=3.67
6
1
3
1
6
1
6
1
6
1

INTRODUCTION TO STATISTICSVisualizing uneven probabilities

INTRODUCTION TO STATISTICS
Adding areas
P(uneven die roll)≤2= ?

INTRODUCTION TO STATISTICS
Adding areas
P(uneven die roll)≤2=1/6

INTRODUCTION TO STATISTICSDiscrete probability distributions
Describe probabilities for discrete outcomes
Fair die
Discrete uniform distribution

Uneven die

INTRODUCTION TO STATISTICSSampling from a discrete distribution
RollResult
1 1
2 2
3 3
4 4
5 5
6 6

Mean=3.5
RollResult
1 3
2 1
3 2
4 4
5 6
6 3
7 2
8 2
9 2
105

INTRODUCTION TO STATISTICSVisualizing a sample

INTRODUCTION TO STATISTICSSample distribution vs theoretical distribution
Mean=3.0 Mean=3.5

INTRODUCTION TO STATISTICSA bigger sample
Sample of 100 rolls
Mean=3.33

INTRODUCTION TO STATISTICSAn even bigger sample
Sample of 1000 rolls
Mean=3.52

INTRODUCTION TO STATISTICSLaw of large numbers
As the size of your sample increases, the sample mean will approach the expected value.
Sample sizeMean
10 3.00
100 3.33
1000 3.52

Let's practice!
INTRODUCTION TO STATISTICS

Continuous
distributions
INTRODUCTION TO STATISTICSGeorge Boorman
Curriculum Manager, DataCamp

INTRODUCTION TO STATISTICSWaiting for the bus

INTRODUCTION TO STATISTICSContinuous uniform distribution

INTRODUCTION TO STATISTICSContinuous uniform distribution

INTRODUCTION TO STATISTICSProbability still = area
P(4≤wait time≤7)=  ?

INTRODUCTION TO STATISTICS
Probability still = area
P(4≤wait time≤7)=  ?

INTRODUCTION TO STATISTICS
Probability still = area
P(4≤wait time≤7)=3×1/12=3/12

INTRODUCTION TO STATISTICSWaiting seven minutes or less
P(wait time≤7)=  ?
P(wait time≤7)=
P(wait time≤7)= =58.33%
12
7−0
12
7

INTRODUCTION TO STATISTICSTotal area = 1
P(0≤wait time≤12)=  ?

INTRODUCTION TO STATISTICS
Total area = 1
P(0≤outcome≤12)=12×1/12=1

INTRODUCTION TO STATISTICSProbability of waiting more than seven minutes
P(wait time≥7)=1−
P(wait time≥7)= =41.67%
12
7
12
5

INTRODUCTION TO STATISTICSBimodal distribution

INTRODUCTION TO STATISTICSThe normal distribution

INTRODUCTION TO STATISTICSTotal area still = 1

Let's practice!
INTRODUCTION TO STATISTICS
Tags