Histogram & normal distribution

RachitVerma25 134 views 31 slides Aug 05, 2020
Slide 1
Slide 1 of 31
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31

About This Presentation

A histogram is a plot that lets you discover, and show, the underlying frequency distribution (shape) of a set of continuous data. This allows the inspection of the data for its underlying distribution (e.g., normal distribution), outliers, skewness, etc.


Slide Content

Histogram & Normal
distributionPractice
Visit:
Learnbay.co

Histogram
Visit:
Learnbay.co
A histogram is a plot that lets you discover, and show, the underlyingfrequency
distribution (shape) of a set of continuous data. This allows the inspection of
the data for its underlying distribution (e.g., normal distribution), outliers,
skewness,etc.
36, 25, 38, 46, 55, 68, 72, 55, 36, 38, 67, 45, 22, 48, 91, 46, 52, 61, 58,55

Let’s buildhistogram
Visit:
Learnbay.co
36,25,38,46,55,68,72,55,36,38,67,45,22,48,91,46,52,61,58,55

Approach tobins
Visit:
Learnbay.co
To construct a histogram from a continuous variable you first need to split the
data into intervals, called bins. In the example above, age has been split into
bins, with each bin representing a 10-year period starting at 20 years. Each bin
containsthenumberofoccurrencesofscoresinthedatasetthatarecontained
within that bin. For the above data set, the frequencies in each bin have been
tabulated along with the scores that contributed to the frequency in each bin
(seenext):

Visit:
Learnbay.co

Choosing the rightbins
Visit:
Learnbay.co
There is no right or wrong answer as to how wide a bin should be, but there are
rules of thumb. You need to make sure that the bins are not too small or too
large. Consider the histogram we produced earlier (see above): the following
histograms use the same data, but have either much smaller or largerbins

Visit:
Learnbay.co

BAR
Graph
ABarGraph (alsocalledBarChart)isagraphicaldisplayofdatausingbarsof
differentheights.

Bar graph
example
Visit:
Learnbay.co

Example
1
Visit:
Learnbay.co

Histogram vs Bargraph
Visit:
Learnbay.co

Normaldistribution
Visit:
Learnbay.co

Visit:
Learnbay.co

Normal distribution -continued
Visit:
Learnbay.co
Many things closely follow a NormalDistribution:
●heights ofpeople
●size of things produced by machines
●errors inmeasurements
●blood pressure
●marks on atest

Visit:
Learnbay.co

The Standard Deviation is a measure of how spreadout
numbersare
Visit:
Learnbay.co

Empiricalrule
Visit:
Learnbay.co

Empirical rule -
1
Visit:
Learnbay.co

Empirical value -
2
Visit:
Learnbay.co

Practice problem1
Visit:
Learnbay.co

Visit:
Learnbay.co

It is good to know the standard deviation, because we can say that anyvalue
is:
●likely to be within 1 standard deviation (68 out of 100 shouldbe)
●very likely to be within 2 standard deviations (95 out of 100 shouldbe)
●almost certainly within 3 standard deviations (997 out of 1000 shouldbe)
Visit:
Learnbay.co

Standardscores
Visit:
Learnbay.co
The number of standard deviations from the mean is also calledthe
"Standard Score", "sigma" or"z-score".
What is his Z-score?

Visit:
Learnbay.co

Visit:
Learnbay.co

Practiceproblem
A survey of daily travel time had these results (in minutes): Convert firstthree
values into Z-scores (Assume Normaldistribuion)
26, 33, 65, 28, 34, 55, 25, 44, 50, 36, 26, 37, 43, 62, 35, 38, 45, 32,
28,34
Visit:
Learnbay.co

Visit:
Learnbay.co

Graphically same data inZ-scores
Visit:
Learnbay.co

Formula forZ-score
Visit:
Learnbay.co

Visit:
Learnbay.co

Reference
Visit:
Learnbay.co
https://www.mathsisfun.com/data/standard-normal-distribution.html