Measures of dispersion

4,008 views 26 slides Jul 19, 2020
Slide 1
Slide 1 of 26
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26

About This Presentation

This slide is about the "Measures of dispersion".


Slide Content

PhD Course Work
Paper 1: Research Methodology & Computer Fundamentals
Group – A (Research Methodology)
Unit-IV: Data Processing and Analysis

Dispersion and Measures of Dispersion
Ambarish Kumar Rai
Assistant Professor
PG Dept. of Geography,
VKS University, Ara, Bihar
Email: [email protected]

Dispersion and Measures of Dispersion
What is Dispersion in Statistics?
•Dispersion is the state of getting dispersed or
spread. Statistical dispersion means the extent
to which a numerical data is likely to vary
about an average value.
•In other words, dispersion helps to understand
the distribution of the data.

Dispersion and Measures of Dispersion in Statistics

Measures of Dispersion
In statistics, the measures of dispersion help to
interpret the variability of data i.e. to know
how much homogenous or heterogeneous the
data is.
In simple terms, it shows how squeezed or
scattered the variable is.

–A measure of dispersion should be rigidly defined
–It must be easy to calculate and understand
–Not affected much by the fluctuations of
observations or by the extreme values
–It should be based on all the observations of the
series.
–It should not be unduly affected by sampling
fluctuations.
–It should be capable of further mathematical
treatment and statistical analysis.


Characteristics of Measures of Dispersion

(1) Comparative Study
•Measures of dispersion give a single value indicating the
degree of consistency or uniformity of distribution. This single
value helps us in making comparisons of various distributions.
•The smaller the magnitude (value) of dispersion, higher is the
consistency or uniformity and vice-versa.
(2) Reliability of an Average
•A small value of dispersion means low variation between
observations and average. It means the average is a good
representative of observation and very reliable.
•A higher value of dispersion means greater deviation among
the observations. In this case, the average is not a good
representative and it cannot be considered reliable.
Objectives of Computing Dispersion

Cont…
(3) Control the Variability
•Different measures of dispersion provide us data of
variability from different angles and this knowledge
can prove helpful in controlling the variation.
•Especially, in the financial analysis of business and
Medical, these measures of dispersion can prove
very useful.
(4) Basis for Further Statistical Analysis
•Measures of dispersion provide the basis further
statistical analysis like, computing Correlation,
Regression, Test of hypothesis, etc.

Types of Measures of Dispersion
There are two main types of dispersion methods in statistics which are:
Absolute Measure of Dispersion
•The measures which express the scattering of observation in terms
of distances i.e., range, quartile deviation.
•The measure which expresses the variations in terms of the average
of deviations of observations like mean deviation and standard
deviation.
Relative Measure of Dispersion
•We use a relative measure of dispersion for comparing distributions
of two or more data set and for unit free comparison.
•They are the coefficient of range, the coefficient of mean deviation,
the coefficient of quartile deviation, the coefficient of variation, and
the coefficient of standard deviation.

Absolute Measure of Dispersion
•An absolute measure of dispersion contains the
same unit as the original data set.
•Absolute dispersion method expresses the
variations in terms of the average of deviations
of observations like standard or means
deviations.
•It includes range, standard deviation, quartile
deviation, etc.

Cont…
The types of absolute measures of dispersion are:
1.Range
2.Variance
3.Standard Deviation
4.Quartiles and Quartile Deviation
5.Mean and Mean Deviation

Range
•It is simply the difference between the maximum
value and the minimum value given in a data set.
•A range is the most common and easily
understandable measure of dispersion.
•It is the difference between two extreme
observations of the data set.
•If X
max and X
min are the two extreme observations
then
Range = X
max – X
min

Cont…
Example: 1, 3,5, 6, 7 => Range = 7 -1= 6
Merits of Range
•It is the simplest of the measure of dispersion
•Easy to calculate
•Easy to understand
•Independent of change of origin
Demerits of Range
•It is based on two extreme observations. Hence,
get affected by fluctuations
•A range is not a reliable measure of dispersion
•Dependent on change of scale

Quartiles and Quartile Deviation
•The quartiles are values that divide a list of numbers into
quarters.
•The quartile deviation is half of the distance between the
third and the first quartile.
•The quartiles divide a data set into quarters.
•The first quartile, (Q
1) is the middle number between the
smallest number and the median of the data.
•The second quartile, (Q
2) is the median of the data set.
•The third quartile, (Q
3) is the middle number between the
median and the largest number.
•Quartile deviation or semi-inter-quartile deviation is
Q = ½ × (Q
3 – Q1)

Cont…
Merits of Quartile Deviation
•All the drawbacks of Range are overcome by quartile
deviation
•It uses half of the data
•Independent of change of origin
•The best measure of dispersion for open-end
classification
Demerits of Quartile Deviation
•It ignores 50% of the data
•Dependent on change of scale
•Not a reliable measure of dispersion

Mean and Mean Deviation
•The average of numbers is known as the mean and the arithmetic
mean of the absolute deviations of the observations from a
measure of central tendency is known as the mean deviation (also
called mean absolute deviation).
•Mean deviation is the arithmetic mean of the absolute deviations
of the observations from a measure of central tendency.
•If x
1, x
2, … , x
n are the set of observation, then the mean
deviation of x about the average A (mean, median, or mode) is
Mean deviation from average A = 1⁄n [∑
i|x
i – A|]
For a grouped frequency, it is calculated as:
Mean deviation from average A = 1⁄N [∑
i f
i |x
i – A|], N = ∑f
i
•Here, x
i and f
i are respectively the mid value and the frequency of
the i
th
class interval.

Cont…
Merits of Mean Deviation
•Based on all observations
•It provides a minimum value when the deviations are
taken from the median
•Independent of change of origin
Demerits of Mean Deviation
•Not easily understandable
•Its calculation is not easy and time-consuming
•Dependent on the change of scale
•Ignorance of negative sign creates artificiality and
becomes useless for further mathematical treatment

Standard Deviation
•A standard deviation is the positive square root of
the arithmetic mean of the squares of the
deviations of the given values from their
arithmetic mean.
•It is denoted by a Greek letter sigma, σ.
•It is also referred to as root mean square
deviation. The standard deviation is given as
σ = [(Σ
i (y
i – ȳ) ⁄ n]
½
= [(Σ
i y
i
2
⁄ n) – ȳ
2
]
½

•For a grouped frequency distribution, it is
σ = [(Σ
i f
i (y
i – ȳ) ⁄ N]
½
= [(Σ
i f
i y
i
2
⁄ n) – ȳ
2
]
½

Cont…
•The square root of the variance is known as the
standard deviation i.e. the square of the standard
deviation is the variance.
S.D. = √σ.
•It is also a measure of dispersion.
σ
2
= [(Σ
i (y
i – ȳ

) / n]
½
= [(Σ
i y
i
2
⁄ n) – ȳ
2
]
•For a grouped frequency distribution, it is
σ
2
= [(Σ
i f
i (y
i – ȳ

) ⁄ N]
½
= [(Σ
i f
i x
i
2
⁄ n) – ȳ
2
]
•If instead of a mean, we choose any other arbitrary
number, say A, the standard deviation becomes the
root mean deviation.

Cont…
Merits of Standard Deviation
•Squaring the deviations overcomes the drawback of
ignoring signs in mean deviations
•Suitable for further mathematical treatment
•Least affected by the fluctuation of the observations
•The standard deviation is zero if all the observations
are constant
•Independent of change of origin
Demerits of Standard Deviation
•Not easy to calculate
•Difficult to understand for a layman
•Dependent on the change of scale

Variance
•Deduct the mean from each data in the set
then squaring each of them and adding each
square and finally dividing them by the total
no of values in the data set is the variance.
Variance (σ
2
)=∑(X−μ)
2
/N

Variance of the Combined Series
•If σ
1, σ
2 are two standard deviations of two
series of sizes n
1 and n
2 with means ȳ
1 and ȳ
2.
•The variance of the two series of sizes n
1 +
n
2 is:
σ
2
= (1/ n
1 + n
2) ÷ [n
1 (σ
1
2
+ d
1
2
) + n
2 (σ
2
2
+ d
2
2
)]
where,
d
1 = ȳ

1 − ȳ

, d
2 = ȳ

2 − ȳ


and
ȳ

= (n
1 ȳ

1

+ n
2 ȳ

2) ÷ ( n
1 + n
2)

Relative Measure of Dispersion
•The relative measures of depression are used to
compare the distribution of two or more data sets.
•This measure compares values without units.
Common relative dispersion methods include:
1.Coefficient of Range
2.Coefficient of Variation
3.Coefficient of Standard Deviation
4.Coefficient of Quartile Deviation
5.Coefficient of Mean Deviation

Coefficient of Dispersion
•Whenever we want to compare the variability of the
two series which differ widely in their averages.
•Also, when the unit of measurement is different.
•We need to calculate the coefficients of dispersion
along with the measure of dispersion.
•The coefficients of dispersion are calculated along with
the measure of dispersion when two series are
compared which differ widely in their averages.
•The dispersion coefficient is also used when two series
with different measurement unit are compared.
•It is denoted as C.D.

The common coefficients of dispersion are:
C.D. In Terms of Coefficient of dispersion
Range C.D. = (X
max – X
min) ⁄ (X
max + X
min)
Quartile Deviation C.D. = (Q3 – Q1) ⁄ (Q3 + Q1)
Standard Deviation (S.D.) C.D. = S.D. ⁄ Mean
Mean Deviation C.D. = Mean deviation/Average

Coefficient of Variation
100 times the coefficient of dispersion based on
standard deviation is the coefficient of
variation (C.V.).
C.V. = 100 × (S.D. / Mean) = (σ/ȳ ) × 100

Thanks..!