Quartiles, Deciles and Percentiles
Let us now extend the concept of
partitioning of the frequency
distribution by taking up the concept
of quantiles (i.e. quartiles, deciles and
percentiles)
We have already seen that the
median divides the area under the
frequency polygon into two equal
halves:
50% 50%
X
f
Median
Afurthersplittoproducequarters,tenthsor
hundredthsofthetotalareaunderthefrequencypolygon
isequallypossible,andmaybeextremelyusefulfor
analysis.(Weareofteninterestedinthehighest10%of
somegroupofvaluesorthemiddle50%another.)
QUARTILES
Thequartiles,togetherwiththemedian,achievethe
divisionofthetotalareaintofourequalparts.
Thefirst,secondandthirdquartilesaregivenby
theformulae:
c
n
f
h
lQ
4
1
c
n
f
h
lQ
4
3
3
First quartile
Second quartile (i.e. median)
Third quartile mediancn
f
h
lc
n
f
h
lQ
2
4
2
2
25%
X
f
Q1 Q2 =X
~ Q3
25% 25% 25% Itisclearfromtheformulaofthesecond
quartilethatthesecondquartileisthesameasthe
median.
DECILES & PERCENTILES:
Thedecilesandthepercentilesgivethedivisionofthe
totalareainto10and100equalpartsrespectively.
c
n
f
h
lD
10
1
The formula for the first decile is
Theformulaeforthesubsequentdecilesare
c
n
f
h
lD
10
3
3
and so on. It is easily seen that the 5th decile is the same quantity as
the median.
c
n
f
h
lD
10
2
2
c
n
f
h
lP
100
1 The formula for the first percentile is
Theformulaeforthesubsequentpercentilesare
c
n
f
h
lP
100
2
2
c
n
f
h
lP
100
3
3
and so on.
Again,itiseasilyseenthatthe50thpercentileisthesameas
themedian,the25thpercentileisthesameasthe1stquartile,the75th
percentileisthesameasthe3rdquartile,the40thpercentileisthe
sameasthe4thdecile,andsoon.
Allthesemeasuresi.e.themedian,quartiles,decilesandpercentiles
arecollectivelycalledquantilesorfractiles.
Thequestionis,“Whatisthesignificanceofthisconceptof
partitioning?Whyisitthatwewishtodivideourfrequency
distributionintotwo,four,tenorhundredparts?”
Theanswertotheabovequestionsis:
In certain situations, we may be interested in describing the relative
quantitative locationof a particular measurement within a data set.
Quantiles provide us with an easy way of achieving this. Out
of these various quantiles, one of the mostfrequently used is
percentileranking.
FREQUENCY DISTRIBUTION OF
CHILD-CARE MANAGERS AGE
Class Interval Frequency
20 –29 6
30 –39 18
40 –49 11
50 –59 11
60 –69 3
70 –79 1
Total 50
Suppose we wish to determine:
The 1st quartile
The 6th decile
The 17th percentile
Solution
We begin with the 1st quartile (also known as lower
quartile).
The 1st quartile is given by:
Where, l, hand fpertain to the class that contains
the first quartile.
c
n
f
h
lQ
4
1
In this example,
n = 50, and hence
n/4 = 50/4 = 12.5
Class Boundaries Frequency
f
Cumulative
Frequency
cf
19.5 –29.5 6 6
29.5 –39.5 18 24
39.5 –49.5 11 35
49.5 –59.5 11 46
59.5 –69.5 3 49
69.5 –79.5 1 50
Total 50
Class
containing
Q
1
Hence,
l= 29.5
h= 10
f= 18
and
C = 6
Hence, the 1st quartile is given by:
1
=
4
10
= 29.5 12.5 6
18
= 29.5 3.6
= 33.1
hn
Q l c
f
The 6th Decile is given by6
6
10
hn
D l c
f
In this example,
n = 50, and hence
6n/10 = 6(50)/10 = 30
Class Boundaries Frequency
f
Cumulative
Frequency
cf
19.5 –29.5 6 6
29.5 –39.5 18 24
39.5 –49.5 11 35
49.5 –59.5 11 46
59.5 –69.5 3 49
69.5 –79.5 1 50
Total 50
Class
containing
D
6
Hence,
l= 39.5
h = 10
f = 11
and
C = 24
Hence, 6th decile is given by
6
6
=
10
10
= 39.5 30 24
11
= 29.5 5.45
= 44.95
hn
D l c
f
Interpretation
Six-tenth i.e. 60% of the managers are younger than
age 44.95 years, and four-tenth are older than this age.
The 17th Percentile is given by17
17
100
hn
P l c
f
In this example,
n = 50, and hence
17n/100 = 17(50)/100 = 8.5
Class Boundaries Frequency
f
Cumulative
Frequency
cf
19.5 –29.5 6 6
29.5 –39.5 18 24
39.5 –49.5 11 35
49.5 –59.5 11 46
59.5 –69.5 3 49
69.5 –79.5 1 50
Total 50
Class
containing
P
17
Hence,
l= 29.5
h = 10
f = 18
and
C = 6
Hence, 6th decile is given by
17
17
=
100
10
= 29.5 8.5 6
18
= 29.5 1.4
= 30.9
hn
P l c
f
Interpretation
17% of the managers are younger than age 30.9 years,
and 83% are older than this age.
EXAMPLE:
If oil company ‘A’ reports that its yearly sales are at the
90th percentile of all companies in the industry, the
implication is that 90% of all oil companies have yearly
sales lessthan company A’s, and only 10% have yearly
sales exceeding company A’s:
0.1
0
Relative Frequency
Company A’s sales
(90th percentile)
Yearly Sales
0.9
0
0
5
10
15
20
25
30
35
29.95 32.95 35.95 38.95 41.95 44.95 Cumulative Frequency Polygon
or OGIVE
ThisOgiveenablesustofindthemedianandanyother
quantilethatwemaybeinterestedinveryconveniently.
Andthisprocessisknownasthegraphiclocationof
quantiles.Letusbeginwiththegraphicallocationof
themedian:
Because of the fact that the median is that value
before which half of the data lies, the firststep is to divide
the total number of observations n by 2.
In this example:15
2
30
2
n
Thenextstepistolocatethisnumber15onthey-axis
ofthecumulativefrequencypolygon.
0
5
10
15
20
25
30
35
29.95 32.95 35.95 38.95 41.95 44.95 Cumulative Frequency Polygon
or OGIVE2
n
0
5
10
15
20
25
30
35
29.95 32.95 35.95 38.95 41.95 44.95 Cumulative Frequency Polygon or OGIVE2
n
Next,wedrawahorizontallineperpendiculartothey-
axisstartingfromthepoint15,andextendthislineuptothe
cumulativefrequencypolygon.
0
5
10
15
20
25
30
35
29.95 32.95 35.95 38.95 41.95 44.95 Cumulative Frequency Polygon
or OGIVE2
n
0
5
10
15
20
25
30
35
29.95 32.95 35.95 38.95 41.95 44.95 Cumulative Frequency Polygon
or OGIVE2
n 9.37X
~