QUANTITATIVE METHODS NOTES.pdf

18,913 views 136 slides May 28, 2022
Slide 1
Slide 1 of 136
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58
Slide 59
59
Slide 60
60
Slide 61
61
Slide 62
62
Slide 63
63
Slide 64
64
Slide 65
65
Slide 66
66
Slide 67
67
Slide 68
68
Slide 69
69
Slide 70
70
Slide 71
71
Slide 72
72
Slide 73
73
Slide 74
74
Slide 75
75
Slide 76
76
Slide 77
77
Slide 78
78
Slide 79
79
Slide 80
80
Slide 81
81
Slide 82
82
Slide 83
83
Slide 84
84
Slide 85
85
Slide 86
86
Slide 87
87
Slide 88
88
Slide 89
89
Slide 90
90
Slide 91
91
Slide 92
92
Slide 93
93
Slide 94
94
Slide 95
95
Slide 96
96
Slide 97
97
Slide 98
98
Slide 99
99
Slide 100
100
Slide 101
101
Slide 102
102
Slide 103
103
Slide 104
104
Slide 105
105
Slide 106
106
Slide 107
107
Slide 108
108
Slide 109
109
Slide 110
110
Slide 111
111
Slide 112
112
Slide 113
113
Slide 114
114
Slide 115
115
Slide 116
116
Slide 117
117
Slide 118
118
Slide 119
119
Slide 120
120
Slide 121
121
Slide 122
122
Slide 123
123
Slide 124
124
Slide 125
125
Slide 126
126
Slide 127
127
Slide 128
128
Slide 129
129
Slide 130
130
Slide 131
131
Slide 132
132
Slide 133
133
Slide 134
134
Slide 135
135
Slide 136
136

About This Presentation

BMCU002 QUANTITATIVE METHODS


Slide Content

BMCU002: QUANTITATIVE METHODS NOTES
Page 1 of 3

INTRODUCTION TO STATISTICS
Definition:
It is the science of collecting, organizing, presenting, analyzing and interpreting data to assist in
making more effective decisions.
Types of Statistics
(a) Descriptive statistics: it’s a tabular, graphical and numerical method for organizing and
summarizing information clearly and effectively relating to either a population or sample.
(b) Inferential statistics: are the methods of drawing and measuring the reliability of
conclusions about a statistical population based on information from a sample data set.
 A population is a collection of all possible individuals, objects or measurements of
interest.
 A sample part or sub set of the population of interest.
Variables:
A variable is a measurable characteristic that assumes different values among the subjects.
Types of variables
(a) Independent variables: It is a variable that a researcher manipulates in order to determine its
effect or influence on another variable. They predict the amount of variation that occurs in
other variables.
(b) Dependent variables: It is the variable that is measured, predicted or monitored and is
expected to be affected by manipulation of an independent variable. They attempt to indicate
the total influence arising from the effects of the independent variable. It varies as a function
of the independent variable e.g., influence of hours studied on performance in a statistical test,
influence of distance from the supply center on cost of building materials.
The above variables can either be qualitative or quantitative variables: -
i. Qualitative variables: Are variables that are non-numeric i.e., attributes e.g., Gender,
Religion, Colour, State of birth etc.
ii. Quantitative variables: are numeric variables. They can either be discrete or
continuous.
 Discrete variables: Are variables, which can only assume certain values
i.e., whole numbers. Are always counted.
 Continuous variables: Are variables, which can assume any value within
a specific range. Are always measured e.g., height, temperature, weight,
radius etc.


Levels of measurement
There are four levels of measurement; nominal, ordinal, interval and ratio.
(a) Nominal level. The observations are classified under a common characteristic e.g., sex, race,
marital status, employment status, language, religion etc. helps in sampling.

BMCU002: QUANTITATIVE METHODS NOTES
Page 2 of 3

(b) Ordinal level: items or subjects are not only grouped into categories, but they are ranked into
some order e.g., greater than, less than, superior, happier than, poorer, above etc. helps in
developing a likert scale.
(c) Interval level: numerals are assigned to each measure and ranked. The intervals between
numerals are equal. The numerals used represent meaningful quantities but the zero point is
not meaningful e.g., test scores, temperature.
(d) Ratio level: has all the characteristics of the other levels and in addition the zero point is
meaningful. Mathematical operations can be applied to yield meaningful values e.g., height,
weight, distance, age, area etc.

Characteristics of statistical data
 They are aggregate of facts e.g., total sales of a firm for one year.
 They are affected to a marked extent by a multiplicity of causes e.g., volume of wheat
production depends on rainfall, soil fertility, seeds etc
 They are numerically expressed e.g., population of Kenya increased by 4 million during the
year 2004.
 They are estimated according to a reasonable standard of accuracy e.g., 90% accuracy
 They are collected in a systematic manner.
 They are collected for a predetermined purpose
 They should be placed in relation to each other.

Uses and users of statistics
1. Government:
 Monitoring economic and social trends
 Forecasting
 Policy making
2. Individuals
 Leisure activities
 Community work
 Personal finances
 Gambling
3. Academia
 Testing hypothesis
 Developing new theories
 Consultancy services
4. Businesses
 Planning and control
 Quality control especially for the manufacturers
 Forecasting i.e., planning production schedules, advertising expenditures etc.
 Auditing

BMCU002: QUANTITATIVE METHODS NOTES
Page 3 of 3

 Determining production costs e.g., by using regression and correlation, one can determine
the relationship between two variables like costs and methods of production, advertising and
sales etc.
 It gives relevant information for decision-making.
Limitations of statistics
 Deals with aggregate facts and not individual items.
 Deals mainly with quantitative characteristics and not qualitative characteristics like
honesty, efficiency etc.
 The results are only true on an average and under certain conditions.
 Statistics can be misused i.e., wrong interpretation. It requires experience and skill to draw
sensible conclusions from the data.
 Statistics may not provide the best solution under all circumstances.

BMCU002: QUANTITATIVE METHODS NOTES
Page 1 of 11

DESCRIPTIVE STATISTICS
Descriptive statistics is used to summarize data and make sense out of the raw data
collected during the research.
Data collection
Data can be collected from primary and / or secondary sources.
Secondary data consists of information that already exists somewhere having been
collected for another purpose e.g., in government publications, periodicals, journals, books
etc.
Advantages: Low in cost and Readily available
Disadvantages: The data needed might not exist and The existing data might be
outdated, inaccurate, incomplete and unreliable.

Primary data consists of original information gathered for the specific purpose through
observation, interviews and questionnaires.
Advantages
- It is relevant
- Its accurate
Disadvantages
- It is costly
- It is time consuming
Presentation of data
Presentation of data refers to the classification and tabulation of data. Classification of data
refers to the act of arranging the data in groups or classes according to some resemblance
of the data in each group or class. Tabulation of data is the arrangement of statistical data
in columns and rows.
Frequency distribution
A frequency distribution is a grouping of data into mutually exclusive categories showing
the number of observations in each category.
Steps
 Decide on the number of classes
 Determine the class interval or width
 Set the individual class limits
 Tally the values into the classes

BMCU002: QUANTITATIVE METHODS NOTES
Page 2 of 11

 Count the number of items in each class
A class interval is the difference between the lower limit of the class and the lower limit of
the next class.
A class midpoint / class mark is the middle point between the lower and the upper class
limit.
Graphical representation of a frequency distribution
1. Histogram: It is a graph in which classes are marked on the horizontal axis and the
class frequencies on the horizontal axis and the class frequencies on the vertical axis.
The class frequencies are represented by the heights of the bars and the bars are drawn
adjacent to each other.
2. Frequency polygons: The class midpoints are connected with a line segment.
3. Cumulative frequency polygons
 Less than cumulative frequency polygons
 More than cumulative frequency polygons
4. Line charts: Show the change in a variable over time
5. Bar chart: Make use of rectangles to present the given data. Can be vertical,
horizontal or component.
6. Pie charts: different segments of a circle represent percentage contribution of various
components to the total.
7. Graphs
8. Pictograms: pictures are used to represent data.
Example
(a) The data below indicates the marks attained by students in a statistical test. Construct
a frequency distribution table with 10 classes
12
8
18
5
15
24
25
25
32
40
40
42
44
46
48
50
50
52
53
55
56
59
60
66
68
72
76
83
95
98

(b) From the above: construct a histogram, frequency polygons and curves, cumulative
frequency curves.


MEASURES OF CENTRAL TENDENCY
Central tendency is the tendency of observations to cluster near the central part of the
distribution. Measures of central tendency are the measures of location e.g. mean, mode
and median. They are the most representative value of the distribution.

BMCU002: QUANTITATIVE METHODS NOTES
Page 3 of 11

Qualities of a good average
Should be-
 Rigidly defined
 Based on all values
 Easily understood and calculated
 Least affected by the fluctuations of sampling
 Capable of further algebraic or statistical treatment
 Least affected by extreme values

Types of averages
The following are the most important types of averages
(a) Arithmetic mean or simple average
(b) Median
(c) Mode
(d) Geometric mean
(e) Harmonic mean

THE ARITHMETIC MEAN
It is obtained by summing up the values of all the items of a series and dividing this sum
by the number of items.
Computation of the arithmetic mean for
Individual series:-
Direct method n
X
X


where X = arithmetic mean , n = number of items

Grouped series
Direct method n
xf
X


Where f = frequencies, n = number of items

Properties of the arithmetic mean
 The product of the arithmetic mean and the number of items is equal to the sum of all
given values
 The algebraic sum of the deviations of the various values from the mean is equal to
zero
 The sum of the squares of deviations from arithmetic mean is least.
Advantages of the arithmetic mean
 Can be easily understood
 Takes into account all the items of the series

BMCU002: QUANTITATIVE METHODS NOTES
Page 4 of 11

 It is not necessary to arrange the data before calculating the average
 It is capable of algebraic treatment
 It is a good method of comparison
 It is not indefinite
 It is used frequently.
Disadvantages of the arithmetic mean
 It is affected by extreme values to a great extent
 It may be a figure that does not exist in a series
 It cannot be calculated if all the items of a series are not known
 It cannot be used incase of qualitative data

THE MEDIAN
The median is the middle value of a series arranged in ascending or descending order. If
there are n observations, the median is the value of the th
n






2
1 item.
Computation of the median in discrete series
 Arrange the items in descending or ascending order with their corresponding
frequencies against them.
 Compute the cumulated frequencies and then locate the middle item.
Computation of the median in Continuous series
The median has to be interpolated in the class interval containing the median using the
formula:-
Median = ??????+
(
�
�
)−??????
??????
(??????)

Where:
L= lower class boundary
n= total number of values
B= cumulative frequency of the group before the median group
G= frequency of the median group
W= class width

Properties of the Median
 It is a positional average and is influenced by the position of the items in the series
and not by the size of items
 The sum of the absolute values of deviations is least.
Advantages of the Median
 It is easy to calculate
 It is simple and is understood easily
 It is less affected by the value of extreme items

BMCU002: QUANTITATIVE METHODS NOTES
Page 5 of 11

 It can be calculated by inspection in some cases
 It is useful in the study of phenomenon which are of qualitative nature
Disadvantages of the Median
 It is not a suitable representative of a series in most cases
 It is not suitable for further algebraic treatment
 It is not used frequently like arithmetic mean
 It cannot be determined exactly in the case of continuous series
Quartiles, deciles and percentiles
 Quartiles are the values of the items that divide the series into four equal parts.
 Deciles divide the series into 10 equal parts.
 Percentiles divide the series into 100 equal parts.
The 2
nd
quartile, 5
th
decile and 50
th
percentile are equal to the median.

THE MODE
The mode is the value, which occurs most often in the data. A distribution with one mode
is called unimodal, with two modes bimodal and with many modes, multimodal
distribution. The class mid-point of a modal class is called a crude mode.
Calculation of the mode in a continuous series
Mode = ??????+
??????�−??????�−�
(??????�−??????�−�)+(??????�−??????�+�)
(??????)
Where:
 L is the lower-class boundary of the modal group
 fm-1 is the frequency of the group before the modal group
 fm is the frequency of the modal group
 fm+1 is the frequency of the group after the modal group
 w is the group width
Properties of the mode
 It represents the most typical value of the distribution and it should coincide with
existing items
 It is not affected by the presence of extremely large or small items
Advantages of the Mode
 It is easy to understand
 Extreme items do not affect its value
 It possesses the merit of simplicity
Disadvantages of the Mode
 It is often not clearly defined
 Exact location is often uncertain
 It is unsuitable for further algebraic treatment

BMCU002: QUANTITATIVE METHODS NOTES
Page 6 of 11

 It does not take into account extreme values.

GEOMETRIC MEAN
Geometric Mean is the n
th
root of the product of n values i.e. n
nxxxMG .....*.
21
For ungrouped data
G.M = Antilog of n
Logx
Grouped data
G.M = Antilog of n
fLogx

Merits of the Geometric mean
 It takes into account all the items in the data and condenses them into one
representative value.
 It gives more weight to smaller values than to large values.
 It is amenable to algebraic manipulations
Demerits
 It is difficult to use and compute
 It is determinate for positive values and cannot be used for negative values or zero.

HARMONIC MEAN
It is the reciprocal of the arithmetic mean of the reciprocal of a series of observations.
Ungrouped data
H.M = 
x
n
1

Grouped data
H.M = 
x
f
n

Merits of the Harmonic mean
 It takes into account all the observations in the data
 It gives more weight to smaller items
 It is amenable to algebraic manipulations
 It measures the rates of change
Demerits
 It is difficult to compute when the number of items is large
 It assigns too much weight to smaller items.
Factors to consider in the choice of an average
 The purpose for which the average is being used
 The nature, characteristics and properties of the average
 The nature and characteristics of the data.

MEASURES OF DISPERSION
Definition of dispersion
 It is the degree to which numerical data tends to spread about an average value

BMCU002: QUANTITATIVE METHODS NOTES
Page 7 of 11

 It is the extent of the scattered ness of items around a measure of central tendency
Significance of measuring dispersion
 To determine the reliability of an average
 To serve as a basis for the control of the variability
 To compare two or more series with regard to their variability
 To facilitate the use of other statistical measures
Properties of a good measure of dispersion
It should be: -
 Simple to understand
 Easy to compute
 Rigidly defined
 Based on each and every item in the
distribution
 Amenable to further algebraic
calculations
 Have sampling stability
 Not be unduly affected by extreme
values
Measures of dispersion
 Range
 Quartile deviation
 Mean deviation
 Standard deviation

BMCU002: QUANTITATIVE METHODS NOTES
Page 8 of 11


The Range: it is the difference between the smallest value and the largest value of a series
Advantages of the Range
 It is the simplest to understand and compute
 It takes the minimum time to calculate the value of the range
Limitations
 It is not based on each and every value of the distribution
 It is subject to fluctuations of considerable magnitude from sample to sample
 It cannot be computed in case of open-ended distributions
 It does not explain or indicate anything about the character of the distribution within the two
extreme observations.
Uses of the range
 Quality control
 Fluctuations of prices
 Weather forecast
 Finding the difference between two values e.g. wages earned by different employees.
The standard deviation
It is the square root of the arithmetic average of the squares of the deviations measured from the
mean. It measures how much “spread” or “ Variability” is present in the sample. A small standard
deviation means a high degree of uniformity of the observations as well as the homogeneity of a
series and vice versa.
Ways of computing the standard deviation
Direct method
Ungrouped data n
dx

2

where 
2
dx = sum of squares of the deviations from arithmetic mean
Grouped data n
fdx

2



Advantages of the standard deviation
 It is rigidly defined and is based on all the observations of the series
 It is applied or used in other statistical techniques like correlation and regression analysis and
sampling theory.
 It is possible to calculate the combined standard deviation of two or more groups.
Disadvantages of the standard deviation
 It cannot be used for comparing the dispersion of two or more series of observations given in
different units.
 It gives more weight to extreme values.

BMCU002: QUANTITATIVE METHODS NOTES
Page 9 of 11



SKEWNESS AND KURTOSIS IN STATISTICS
The average and measure of dispersion can describe the distribution but they are not sufficient to
describe the nature of the distribution. For this purpose we use other concepts known as Skewness
and Kurtosis. The symmetrical and skewed distributions are shown by curves as

Skewness
Skewness means lack of symmetry. A distribution is said to be symmetrical when the values are
uniformly distributed around the mean. For example, the following distribution is symmetrical
about its mean 3.
X : 1 2 3 4 5
Frequency (f): 5 9 12 9 5

In a symmetrical distribution the mean, median and mode coincide, that is, mean = median = mode.
Several measures are used to express the direction and extent of skewness of a dispersion. The
important measures are that given by Pearson. The first one is the Coefficient of Skewness:


For a symmetric distribution Sk = 0. If the distribution is negatively skewed then Sk is negative and
if it is positively skewed then Sk is positive. The range for Sk is from -3 to 3.

BMCU002: QUANTITATIVE METHODS NOTES
Page 10 of 11

The other measure uses the b (read ‘beta’) coefficient which is given by, where, m2
and m3 are the second and third central moments. The second central moment m2 is nothing but
the variance. The sample estimate of this coefficient is where m2 and m3 are
the sample central moments given by


For a symmetrical distribution b1 = 0. Skewness is positive or negative depending upon whether
m3 is positive or negative.

Kurtosis
A measure of the peakness or convexity of a curve is known as Kurtosis.

BMCU002: QUANTITATIVE METHODS NOTES
Page 11 of 11


It is clear from the above figure that all the three curves, (1), (2) and (3) are symmetrical about
the mean. Still they are not of the same type. One has different peak as compared to that of
others. Curve (1) is known as mesokurtic (normal curve); Curve (2) is known as leptocurtic
(leading curve) and Curve (3) is known as platykurtic (flat curve). Kurtosis is measured by
Pearson’s coefficient, b2 (read ‘beta - two’).It is given by .
The sample estimate of this coefficient is where, m4 is the fourth central moment
given by m4 =

The distribution is called normal if b2 = 3. When b2 is more than 3 the distribution is said to be
leptokurtic. If b2 is less than 3 the distribution is said to be platykurtic.

BMCU002: QUANTITATIVE METHODS NOTES
Page 1 of 13

MEASURES OF CENTRAL TENDENCY
MODE
Meaning
The mode refers to that value in a distribution, which occur most frequently. It is an actual value,
which has the highest concentration of items in and around it.
Computation of the Mode
1. Ungrouped or Raw Data
For ungrouped data or a series of individual observations, mode is often found by mere inspection.
Example 1:
2 , 7, 10, 15, 10, 17, 8, 10, 2
 Mode = M0 = 10
In some cases the mode may be absent while in some cases there may be more than one mode.
Example 2:
1) 12, 10, 15, 24, 30 (no mode)
2) 7, 10, 15, 12, 7, 14, 24, 10, 7, 20, 10
∴ The modes are 7 and 10
2. Grouped Data
a) Discrete Distribution
For Discrete distribution, see the highest frequency and corresponding value of X is mode. A
discrete variable is the one whose outcomes are measured in fixed numbers.
b) Continuous Distribution
See the highest frequency then the corresponding value of class interval is called the modal
class. Then apply the following formula:

BMCU002: QUANTITATIVE METHODS NOTES
Page 2 of 13

Mode = M0 = l1+
??????1−??????0
(??????
1−??????0 )+(??????
1−??????2 )
x??????
Where: �
1=the lower value of the class in which the lies
�
1=the frequency of the class in which the mode lies
�
0=the frequency of the class preceding the modal class
�
2=the frequency of the class succeeding the modal class
??????=the class interval of the modal classs
NOTE: While applying the above formula, we should ensure that the class-intervals are uniform
throughout. If the class-intervals are not uniform, then they should be made uniform on the
assumption that the frequencies are evenly distributed throughout the class.
Example 3:
Let us take the following frequency distribution:
Class Intervals Frequency
30−40 4
40−50 6
50−60 8
60−70 12
70−80 9
80−90 7
90−100 4
Required:
Calculate the mode in respect of this series.
Solution
Mode = M0 = 60+
12−8
(12−8)+(12−9)
x10
=60+
4
4+3
??????10=65.7 approx.

BMCU002: QUANTITATIVE METHODS NOTES
Page 3 of 13

3. Determination of Modal Class
For a frequency distribution modal class corresponds to the maximum frequency. But it is not
possible to identify by inspection the class where the mode lies in any one (or more) of the
following cases:
i. If the maximum frequency is repeated.
ii. If the maximum frequency occurs in the beginning or at the end of the distribution.
iii. If there are irregularities in the distribution, the modal class is determined by the method
of grouping.
Steps for Calculation
1. Prepare a grouping table with 6 columns.
2. In column I, write down the given frequencies.
3. Column II is obtained by combining the frequencies two by two.
4. Leave the 1
st
frequency and combine the remaining frequencies two by two and write in column
III.
5. Column IV is obtained by combining the frequencies three by three.
6. Leave the 1st frequency and combine the remaining frequencies three by three and write in
column V.
7. Leave the 1
st
and 2
nd
frequencies and combine the remaining frequencies three by three and
write in column VI.
8. Mark the highest frequency in each column.
9. Form an analysis table to find the modal class.
10. After finding the modal class use the formula to calculate the modal value.
Example 4
Calculate the mode for the following frequency distribution.
Class Interval 0−5 5−10 10−15 15−20 20−25 25−30 30−35 35−40
Frequency 9 12 15 16 17 15 10 13

BMCU002: QUANTITATIVE METHODS NOTES
Page 4 of 13

Solution
Grouping Table
Class Interval Frequency 2 3 4 5 6
0−5 9
5−10 12 21 36
10−15 15 27 43
15−20 16 31 48
20−25 17 33 48
25−30 15 32 42
30−35 10 25 38
35−40 13 23

Analysis Table
Columns 0−5 5−10 10−15 15−20 20−25 25−30 30−35 35−40
1 1
2 1 1
3 1 1
4 1 1 1
5 1 1 1
6 1 1 1
Total 1 2 4 5 2
The maximum occurred corresponding to 20−25, and hence it is the modal class.
Mode = M0 = 20+
17−16
(17−16)+(17−15)
??????5
M
0=20+
1
1+2
??????5=21.6 approx.
Example 5
The following table gives some frequency data:

BMCU002: QUANTITATIVE METHODS NOTES
Page 5 of 13

Size of Item Frequency Cummulative Currency
10−20 10 10
20−30 18 28
30−40 25 53
40−50 26 79
50−60 17 96
60−70 4 100
Total 100
Required:
Calculate the mode
Solution
Grouping Table
Class Interval Frequency 2 3 4 5 6
10−20 10
20−30 18 28 53
30−40 25 43 69
40−50 26 51 68
50−60 17 43 47
60−70 4 21
Analysis Table
Columns 10−20 20−30 30−40 40−50 50−60 60−70
1 1
2 1 1
3 1 1 1 1
4 1 1 1
5 1 1 1
6 1 1 1
Total 1 3 5 5 2

BMCU002: QUANTITATIVE METHODS NOTES
Page 6 of 13

Mode = 3 median - 2 mean
Median=
n+1
2
=
100+1
2
=50.5th item
This lies in the class 30−40.
??????��????????????�=�
1+
�
2−�
1
�
(�−�)=30+
40−30
25
(50.50−28)=30+9=39
Calculation of Arithmetic Mean
Class- Interval Frequency Mid- Points d d'=d/10 fd’
10−20 10 15 −20 −2 −20
20−30 18 25 −10 −1 −18
30−40 25 35 0 0 0
40−50 26 45 10 1 26
50−60 17 55 20 2 34
60−70 4 65 30 3 12
Total 100 34
Assumed mean= 35
Median=A+
∑fd′
n
xi

Median=A35+
34
100
x10=38.4
Mode=3 median−2 mean=3(39)−2(38.4)=117−76.8=40.2
Merits of Mode
1. It is easy to calculate and in some cases it can be located mere inspection.
2. Mode is not at all affected by extreme values.
3. It can be calculated for open-end classes.
4. It is usually an actual value of an important part of the series.
5. In some circumstances it is the best representative of data.

BMCU002: QUANTITATIVE METHODS NOTES
Page 7 of 13

Demerits of Mode
1. It is not based on all observations.
2. It is not capable of further mathematical treatment.
3. Mode is ill-defined generally, it is not possible to find mode in some cases.
4. As compared with mean, mode is affected to a great extent,by sampling fluctuations.
5. It is unsuitable in cases where relative importance of items has to be considered.
QUARTILES
Meaning
The quartiles divide the distribution in four parts. There are three quartiles. The second quartile
(Q2) divides the distribution into two halves and therefore is the same as the median. The first
(lower) quartile (Q1) marks off the first one-fourth, the third (upper) quartile (Q3) marks off the
three-fourth. In other words, the three quartiles Q1, Q2 and Q3 are such that 25 percent of the data
fall below Q1, 25 percent fall between Q1 and Q2, 25 percent fall between Q2 and Q3 and 25 percent
fall above Q3.
Computation of the Mode
1. Raw or Ungrouped Data
First arrange the given data in the increasing order and use the formula for Q1 and Q3.
Q
1=(
n+1
4
)th item
Q
3=3(
n+1
4
)th item
Example 1
Compute quartiles for the data given below:
25,18,30, 8, 15, 5, 10, 35, 40, 45
Solution
5, 8, 10, 15, 18,25, 30,35,40, 45
Q
1=(
n+1
4
)th item

BMCU002: QUANTITATIVE METHODS NOTES
Page 8 of 13

Q
1=(
10+1
4
)th item
Q
1=(2.75)th item
Q
1=2
nd
item+(
3
4
)(3
rd
item−2
nd
item)
Q
1=8+(
3
4
)(10−8)=9.5
Q
3=3(
n+1
4
)th item
Q
3=3(2.75)th item
Q
3=(8.25)th item
Q
3=8
th
item+(
1
4
)(9
th
item−8
th
item)
??????
3=35+(
1
4
)(40−35)=36.25
2. Discrete Series
Step1: Find cumulative frequencies.
Step2: Find (
??????+1
4
)
Step3: See in the cumulative frequencies, the value just greater than (
??????+1
4
), then the corresponding
value of x is Q1.
Step 4: Find 3 (
??????+1
4
)
Step 5: See in the cumulative frequencies, the value just greater than 3 (
??????+1
4
), then the
corresponding value of x is Q3.

Example 2
Compute quartiles for the data given bellow:
X 5 8 12 15 19 24 30
F 4 3 2 4 5 2 4

BMCU002: QUANTITATIVE METHODS NOTES
Page 9 of 13

Solution :
X F CF
5 4 4
8 3 7
12 2 9
15 4 13
19 5 18
24 2 20
30 4 24
Total 24
Q
1=(
N+1
4
)th item=(
24+1
4
)=(
25
4
)=6.25
th
item
Q
3=3(
N+1
4
)th item=3(
24+1
4
)=3(
25
4
)=18.25
th
item
Q
1=8; Q
3=24
3. Continuous Series
Step1: Find cumulative frequencies
Step2: Find (
N
4
)
Step 3: See in the cumulative frequencies, the value just greater than (
??????
4
), then the corresponding
class interval is called first quartile class.
Step 4: Find 3(
3
4
)
Step 5: See in the cumulative frequencies the value just greater than 3(
3
4
), then the corresponding
class interval is called 3rd quartile class.
Step 6: Apply the respective formulae.
Q
1=l
1+(
N
4
−m
1
f
1
)x c
1

BMCU002: QUANTITATIVE METHODS NOTES
Page 10 of 13

Q
3=l
3+(
3(
N
4
)−m
3
f
3
)xc
3
Where: l
1=lower limit of the first quartile class
f
1=frequency of the first quartile class
c
1=width of the first quartile class
m
1=cf preceding the first quartile class
l
3=lower limit of the third quartile class
f
3=frequency of the third quartile class
c
3=width of the third quartile class
m
3=cf preceding the third quartile class
Example 3
The following series relates to the marks secured by students in an examination.
Marks Number of Students
0−10 11
10−20 18
20−30 25
30−40 28
40−50 30
50−60 33
60−70 22
70−80 15
80−90 12
90−100 10

BMCU002: QUANTITATIVE METHODS NOTES
Page 11 of 13

Required:
Find the quartiles.
Solution:
Marks Number of Students Cummulative Frequency
0−10 11 11
10−20 18 29
20−30 25 54
30−40 28 82
40−50 30 112
50−60 33 145
60−70 22 167
70−80 15 182
80−90 12 194
90−100 10 204
Total 204
(
N
4
)=(
204
4
)=51;3(
N
4
)=153
Q
1=20+(
51−29
25
)x 10=28.8
Q
1=60+(
153−145
22
)x 10=63.64
PERCENTILES
The percentile values divide the distribution into 100 parts each containing 1 percent of the cases.
The percentile (Pk) is that value of the variable up to which lie exactly k% of the total number of
observations.
1. Percentile for Raw Data or Ungrouped Data
Relationship :
P25 = Q1 ; P50 = Q2 = Median and P75 = Q3

BMCU002: QUANTITATIVE METHODS NOTES
Page 12 of 13

Example 4
Calculate P15 for the data given below:
5, 24 , 36 , 12 , 20 , 8
Solution:
Arranging the given values in the increasing order.
5, 8, 12, 20, 24, 36
P
15=(
15(n+1)
100
)th item
P
15=(
15(6+1)
100
)th item
P
15=(
(15x7)
100
)th item
P
15=(1.05)th item
P
15=1
st
item+0.05(2
nd
item−1
st
item)
P
15=5+0.05(8−5)=5.15
2. Percentile for Grouped Data
Example 5
Find P53 for the following frequency distribution:
Class Interval 0−5 5−10 10−15 15−20 20−25 25−30 30−35 35−40
Frequency 5 8 12 16 20 10 4 3
Solution :
Class Interval Frequency Cummulative Frequency
0−5 5 5
5−10 8 13
10−15 12 25
15−20 16 41
20−25 20 61

BMCU002: QUANTITATIVE METHODS NOTES
Page 13 of 13

25−30 10 71
30−35 4 75
35−40 3 78
Total 78

P
53=l
1+
53N
100
−m
f
xc
P
53=20+
53(78)
100
−41
f
x5=20.085

BPCU004: ADVANCED BUSINESS STATISTICS
Page 1 of 23

MEASURES OF DISPERSION
MEANING
Dispersion (also known as scatter, spread or variation) measures the extent to which the items
vary from some central value.

SIGNIFICANCE OF MEASURING VARIATION
1. Measures of variation point out as to how far an average is representative of the mass.
2. Measures of dispersion determine nature and cause of variation in order to control the
variation itself.
3. Measures of dispersion enable a comparison to be made of two or more series with regard
to their variability.
4. Measures of dispersion are the basis of Many powerful analytical tools in statistics such as
correlation analysis, testing of hypothesis, analysis of variance, the statistical quality control
and regression analysis.
Characteristics/Properties of a Good Measure of Dispersion
1. It should be simple to understand.
2. It should be easy to compute.
3. It should be rigidly defined.
4. It should be based on each and every item of the distribution.
5. It should be amenable to further algebraic treatment.
6. It should have sampling stability.
7. Extreme items should not unduly affect it.

ABSOLUTE AND RELATIVE MEASURES OF DISPERSION
There are two kinds of measures of dispersion, namely:
1. Absolute measure of dispersion.
2. Relative measure of dispersion.
Absolute measure of dispersion indicates the amount of variation in a set of values in terms of
units of observations. For example, when rainfalls on different days are available in mm, any
absolute measure of dispersion gives the variation in rainfall in mm. On the other hand relative
measures of dispersion are free from the units of measurements of the observations. They are

BPCU004: ADVANCED BUSINESS STATISTICS
Page 2 of 23

pure numbers. They are used to compare the variation in two or more sets, which are having
different units of measurements of observations.
Absolute measure Relative measure
1. Range 1. Co-efficient of Range
2. Quartile deviation 2. Co-efficient of Quartile deviation
3. Mean deviation 3. Co-efficient of Mean deviation
4. Standard deviation 4. Co-efficient of variation
RANGE AND COEFFICIENT OF RANGE
1. Range
This is the simplest possible measure of dispersion and is defined as the difference between the
largest and smallest values of the variable.
Range=L−S
??????ℎ�??????�: L=Largest Value
S=Smallest Value
In individual observations and discrete series, L and S are easily identified. In continuous series,
the following two methods are followed.
Method 1:
L=Upper boundary of the highest class
S=Lower boundary of the highest class
Method 2:
L=Mid value of the highest class
S=Mid value of the lowest class
2. Co-efficient of Range
Coefficient of Range=
L−S
L+S

Example 1
Find the value of range and its co-efficient for the following data.
7, 9, 6, 8, 11, 10

BPCU004: ADVANCED BUSINESS STATISTICS
Page 3 of 23

Solution:
Range=L−S=11−4=7
Coefficient of Range=
L−S
L+S
=
11−4
11+4
=0.4667
Example 2:
Calculate range and its co efficient from the following distribution.
Size : 60−63 63−66 66−69 69−72 72−75
Number : 5 18 42 27 8
Solution:
Range=L−S=75−60=15
Coefficient of Range=
L−S
L+S
=
75−60
75+60
=0.1111
Merits
1. It is simple to understand.
2. It is easy to calculate.
3. In certain types of problems like quality control, weather forecasts, share price analysis, et
c., range is most widely used.
Demerits:
1. It is very much affected by the extreme items.
2. It is based on only two extreme observations.
3. It cannot be calculated from open-end class intervals.
4. It is not suitable for mathematical treatment.
5. It is a very rarely used measure.
QUARTILE DEVIATION AND CO -EFFICIENT OF QUARTILE DEVIATION
1. Quartile Deviation (Q.D)
Definition: Quartile Deviation is half of the difference between the first and third quartiles.
Hence, it is called Semi-Inter Quartile Range.
??????.�=
??????
3−??????
1
2

BPCU004: ADVANCED BUSINESS STATISTICS
Page 4 of 23

Among the quartiles Q1, Q2 and Q3, the range Q3 – Q1 is called inter quartile range and
??????3−??????1
2
,
semi inter quartile range.
2. Co-efficient of Quartile Deviation
Co−efficient of Q.D=
Q
3−Q
1
Q
3+Q
1

Example 3
Find the Quartile Deviation for the following data:
391, 384, 591, 407, 672, 522, 777, 733, 1490, 2488
Solution:
Arrange the given values in ascending order.
384, 391, 407, 522, 591, 672, 733, 777, 1490, 2488.
Position of Q
1 is
N+1
4
=
10+1
4
=12.75
th
item
Q
1=2
nd
item+0.75(3
rd
Item−2
nd
Item)
??????
1=391+0.75 (4.7−391)=403
Position of Q
3 is 3(
N+1
4
)=3(12.75)=8.25
th
item
Q
3=8
th
Item+0.25(9
th
Item−8
th
Item)
Q
3=777+0.25(1490−777)=955.25
??????.�=
955.25−403
2
=276.125
Example 4
Weekly wages of labours are given below. Calculated Q.D and Coefficient of Q.D.
Weekly Wage (Kshs.) 100 200 400 500 600
No. of Weeks 5 8 21 12 6

BPCU004: ADVANCED BUSINESS STATISTICS
Page 5 of 23

Solution :
Weekly Wage (Kshs.) No. of Weeks Cum. No. of Weeks
100 5 5
200 8 13
400 21 34
500 12 46
600 6 52
Total 52
Position of Q
1 is
N+1
4
=
52+1
4
=13.25
th
item
Q
1=13
th
Item+0.25(14
th
Item−13
th
Item)
??????
1=200+0.25 (400−200)=250
Position of Q
3 is 3(
N+1
4
)=3(13.25)=39.75
th
item
Q
3=39
th
Item+0.75(40
th
Item−39
th
Item)
Q
3=500+0.75(600−500)=575
??????.�=
575−250
2
=162.5
Co−efficient of Q.D=
Q
3−Q
1
Q
3+Q
1

Co−efficient of Q.D=
575−250
575+250
=
325
825
=0.394
Example 5
For the data given below, give the quartile deviation and coefficient of quartile deviation.
X 351−500 501−650 651−800 801−950 951−1100
F 48 189 88 47 28

BPCU004: ADVANCED BUSINESS STATISTICS
Page 6 of 23

Solution:
X True Class Intervals F Cumulative
Frequency
351−500 350.5−500.5 48 48
501−650 500.5−650.5 189 237
651−800 650.5−800.5 88 325
801−950 800.5−950.5 47 372
951−1100 950.5−1100.5 28 400
Total 400
Q
1=
N
4
=
400
4
=100; Q
2=3 (
N
4
)=3 (100)=300
Q
1=l
1+(
N
4
−m
1
f
1
)x c
1
Q
1=500.5+(
100−48
189
)x 150=541.77
Q
3=l
3+(
3(
N
4
)−m
3
f
3
)xc
3
Q
3=650.5+(
300−237
88
)x150=757.89
Q.D=
Q
3−Q
1
2
=
757.89−541.77
2
=108.06
Co−efficient Q.D=
Q
3−Q
1
Q
3+Q
1
=
757.89−541.77
757.89+541.77
=0.1663
Merits of Quartile Deviation
1. It is simple to understand and easy to calculate.
2. It is not affected by extreme values.
3. It can be calculated for data with open end classes also.
Demerits of Quartile Deviation
1. It is not based on all the items. It is based on two positional values Q1 and Q3 and ignores
the extreme 50% of the items.

BPCU004: ADVANCED BUSINESS STATISTICS
Page 7 of 23

2. It is not amenable to further mathematical treatment.
3. It is affected by sampling fluctuations.
MEAN DEVIATION AND COEFFICIENT OF MEAN DEVIATION
1. Mean Deviation
The mean deviation is measure of dispersion based on all items in a distribution. Mean deviation
is the arithmetic mean of the deviations of a series computed from any measure of central
tendency; i.e., the mean, median or mode, all the deviations are taken as positive i.e., signs are
ignored. But in general practice and due to wide applications of mean, the mean deviation is
generally computed from mean. M.D can be used to denote mean deviation.
2. Coefficient of mean deviation:
Mean deviation calculated by any measure of central tendency is an absolute measure. For the
purpose of comparing variation among different series, a relative mean deviation is required.
The relative mean deviation is obtained by dividing the mean deviation by the average used for
calculating mean deviation.
Co−efficient of Mean Deviation=
Mean Deviation
Mean or Median or Mode

If the result is desired in percentage, the coefficient of mean deviation.
Co−efficient of Mean Deviation=
Mean Deviation
Mean or Median or Mode
x100
COMPUTATION OF MEAN DEVIATION
1. Individual Series
a. Calculate the average mean, median or mode of the series.
b. Take the deviations of items from average ignoring signs and denote these deviations
by |D|.
c. Compute the total of these deviations, i.e., Σ |D|
d. Divide this total obtained by the number of items.
M.D.=
D
n

BPCU004: ADVANCED BUSINESS STATISTICS
Page 8 of 23

Example 6
Calculate mean deviation from mean and median for the following data: 100, 150, 200, 250,
360, 490, 500, 600, 671 also calculate coefficients of M.D.
Solution:
Mean=
 X
N
=
3321
9
=369
Now arrange the data in ascending order
100, 150, 200, 250, 360, 490, 500, 600, 671
Mean=Value of (
n+1
2
)th item=Value of (
9+1
2
)th item=Value of 5
th
item=360
X D=X−Mean D=X−Median
100 269 260
150 219 210
200 169 160
250 119 110
360 9 0
490 121 130
500 131 140
600 231 240
671 302 311
3321 1570 1561

M.D.from mean=
 D
n
=
1570
9
=174.44
Co−efficient of M.D.=
MD
Mean
=
174.44
369
=0.47
M.D.from median=
 D
n
=
1561
9
=173.44
Co−efficient of M.D.=
MD
Median
=
173.44
360
=0.48

BPCU004: ADVANCED BUSINESS STATISTICS
Page 9 of 23

2. Mean Deviation −Discrete Series
Step 1: Find out an average (mean, median or mode).
Step 2: Find out the deviation of the variable values from the average, ignoring signs and denote
them by |D|
Step 3: Multiply the deviation of each value by its respective frequency and find out the total
Σf | D|
Step 4: Divide Σf | D| by the total frequencies N
Example 7
Compute Mean deviation from mean and median from the following data:
Height in cms 158 159 160 161 162 163 164 165 166
No. of
persons
15 20 32 35 33 22 20 10 8
Also compute coefficient of mean deviation.
Solution:
Height (X) No. of
persons (f)
d = x−A
A = 162
fd D=X−mean fD
158 15 −4 −60 3.51 52.65
159 20 −3 −60 2.51 50.20
160 32 −2 −64 1.51 48.32
161 35 −1 −35 0.51 17.85
162 33 0 0 0.49 16.17
163 22 1 22 1.49 32.78
164 20 2 40 2.49 49.80
165 10 3 30 3.49 34.90
166 8 4 32 4.49 35.92
Total 195 −95 338.59
Mean=A+
fd
N
=162+
−95
195
=161.51
M.D.=
fD
N
=
338.59
195
=1.74

BPCU004: ADVANCED BUSINESS STATISTICS
Page 10 of 23

Co−efficient M.D.=
M.D.
Mean
=
1.74
161.51
=0.0108
Height (x) No. of persons (f) c.f. D=X−median fD
158 15 15 3 45
159 20 35 2 40
160 32 67 1 32
161 35 102 0 0
162 33 135 1 33
163 22 157 2 44
164 20 177 3 60
165 10 187 4 40
166 8 195 5 40
195 334
Median=Size of (
N
2
)th item=Size of (
195
2
)th item=Size of 98
th
item=161
M.D.=
fD
N
=
334
195
=1.71
Co−efficient M.D.=
M.D.
Median
=
1.71
161
=0.0106
3. Mean Deviation-Continuous Series
The method of calculating mean deviation in a continuous series same as the discrete series. In
continuous series we have to find out the mid points of the various classes and take deviation
of these points from the average selected. Thus
M.D.=
fD
N

Where:D=m−Average ;m=mid point
Example 8:
Find out the mean deviation from mean and median from the following series.
Age in years No. of persons
0−10 20

BPCU004: ADVANCED BUSINESS STATISTICS
Page 11 of 23

10−20 25
20−30 32
30−40 40
40−50 42
50−60 35
60−70 10
70−80 80
Also compute co-efficient of mean deviation.
Solution:
x m f
�=
�−??????
�
??????=35;�=10

fd D=X−mean fD
0−10 5 20 −3 −60 31.5 630.0
10−20 15 25 −2 −50 21.5 537.5
20−30 25 32 −1 −32 11.5 368.0
30−40 35 40 0 0 1.5 60.0
40−50 45 42 1 42 8.5 357.0
50−60 55 35 2 70 18.5 647.5
60−70 65 10 3 30 28.5 285.0
70−80 75 8 4 32 38.5 308.0
Total 212 3192.5
Mean=A+
∑fd
N
∗c=35+
320
212
x10=36.5
M.D.=
∑fD
N
=
3192.5
212
=15.06

BPCU004: ADVANCED BUSINESS STATISTICS
Page 12 of 23

Calculation of Median and M.D. from Median
x m f c.f D=m−Md fD
0−10 5 20 20 32.25 645.00
10−20 15 25 45 22.25 556.25
20−30 25 32 77 12.25 392.00
30−40 35 40 117 2.25 90.00
40−50 45 42 159 7.75 325.50
50−60 55 35 194 17.75 621.25
60−70 65 10 204 27.75 277.50
70−80 75 8 212 37.75 302.00
Total 212 3209.50
Median=(
N
2
)th item=
212
2
=106
Median=�+
N
2
−m
f
∗c=30+
106−77
40
∗10=37.25
M.D.=
∑fD
N
=
3209.5
212
=15.14
Co−efficient of M.D.=
M.D.
Median
=
15.14
37.25
=0.41
Merits of M.D.
1. It is simple to understand and easy to compute.
2. It is rigidly defined.
3. It is based on all items of the series.
4. It is not much affected by the fluctuations of sampling.
5. It is less affected by the extreme items.
6. It is flexible, because it can be calculated from any average.
7. It is better measure of comparison.
Demerits of M.D.
1. It is not a very accurate measure of dispersion.
2. It is not suitable for further mathematical calculation.
3. It is rarely used. It is not as popular as standard deviation.

BPCU004: ADVANCED BUSINESS STATISTICS
Page 13 of 23

4. Algebraic positive and negative signs are ignored. It is mathematically unsound and
illogical.
STANDARD DEVIATION AND COEFFICIENT OF VARIATION
1. Definition
It is defined as the positive square-root of the arithmetic mean of the Square of the deviations
of the given observation from their arithmetic mean. It is the square–root of the mean of the
squared deviation from the arithmetic mean. Square of standard deviation is called Variance.
2. Calculation of Standard Deviation-Individual Series
There are two methods of calculating Standard deviation in an individual series.
a) Deviations taken from Actual mean
b) Deviation taken from Assumed mean
(a) Deviation taken from Actual mean
This method is adopted when the mean is a whole number.
Steps:
1. Find out the actual mean of the series ( )
2. Find out the deviation of each value from the mean (X = X – )
3. Square the deviations and take the total of squared deviations ∑X
2

4. Divide the total (∑X
2
) by the number of observation (
∑X
2
n
)
Formulae:
Standard Deviation ()=√(
∑X
2
n
)????????????

(X−X)
2
n

(b) Deviations Taken from Assumed Mean
This method is adopted when the arithmetic mean is fractional value. Taking deviations from
fractional value would be a very difficult and tedious task. To save time and labour, the short–
cut method is applied. In this method, the deviations are taken from an assumed mean.
The formula is:

BPCU004: ADVANCED BUSINESS STATISTICS
Page 14 of 23

=√(
∑d
2
N
)−(
∑d
N
)
2

Where:d stands for the deviations from the assumed mean=(X−A)
Steps:
1. Assume any one of the item in the series as an average (A)
2. Find out the deviations from the assumed mean; i.e., X-A denoted by d and also the total of
the deviations Σd
3. Square the deviations; i.e., d
2
and add up the squares of deviations, i.e, Σd
2

4. Then substitute the values in the following formula:
=√(
∑d
2
N
)−(
∑d
N
)
2

Note: We can also use the simplified formula for standard deviation.
=
1
n
√(n∑d
2
)−(∑d)
2

For the frequency distribution
=
c
n
√(N∑fd
2
)−(∑fd)
2

Example 9
Calculate the standard deviation from the following data.
14, 22, 9, 15, 20, 17, 12, 11

BPCU004: ADVANCED BUSINESS STATISTICS
Page 15 of 23

Solution:
Deviations from actual mean.
Values (X) (X−X) (X−X)
2

14 –1 1
22 7 49
9 –6 36
15 0 0
20 4 16
17 2 4
12 –3 9
11 –4 16
120 140
X=
120
8
=15
=

(X−X)
2
n
=√
140
8
=4.18
Example 10
The table below gives the marks obtained by 10 students in statistics. Calculate standard
deviation.
Student Nos : 1 2 3 4 5 6 7 8 9 10
Marks 43 48 65 57 31 60 37 48 78 59
Solution
Deviations from assumed mean
Student Nos : Marks (X) d=X−A (A=57) d
2

1 43 –14 196
2 48 –9 81
3 65 8 64
4 57 0 0
5 31 –26 676

BPCU004: ADVANCED BUSINESS STATISTICS
Page 16 of 23

6 60 3 9
7 37 –20 400
8 48 –9 81
9 78 21 441
10 59 2 4
N=10 d=–44 d
2
=1952

=√(
∑d
2
N
)−(
∑d
N
)
2

=√(
1952
10
)−(
−44
10
)
2
=13.26
3. Calculation of Standard Deviation for Discrete Series
There are three methods for calculating standard deviation in discrete series:
(a) Actual mean methods
(b) Assumed mean method
(c) Step-deviation method.
(a) Actual mean method
Steps:
1. Calculate the mean of the series.
2. Find deviations for various items from the means i.e., d=X−X
3. Square the deviations (d
2
) and multiply by the respective frequencies (f) to get fd
2
.
4. Total to product (Σfd
2
) Then apply the formula:
=√
∑fd
2
∑f

If the actual mean in fractions, the calculation takes lot of time and labour; and as such this
method is rarely used in practice.

BPCU004: ADVANCED BUSINESS STATISTICS
Page 17 of 23

(b) Assumed Mean Method
Here deviation are taken not from an actual mean but from an assumed mean. Also this method
is used, if the given variable values are not in equal intervals.
Steps:
1. Assume any one of the items in the series as an assumed mean and denoted by A.
2. Find out the deviations from assumed mean, i.e, X-A and denote it by d.
3. Multiply these deviations by the respective frequencies and get the Σfd.
4. Square the deviations (d
2
).
5. Multiply the squared deviations (d
2
) by the respective frequencies (f) and get Σfd
2
.
6. Substitute the values in the following formula:
=√
∑fd
2
∑f
−(
∑fd
∑f
)
2

Where:d=A−A,N=f
Example 11:
Calculate Standard deviation from the following data.
X 20 22 25 31 35 40 42 45
f 5 12 15 20 25 14 10 6
Solution :
Deviations from assumed mean
X f d=X−A (A=31) d
2
fd fd
2

20 5 −11 121 −55 605
22 12 −9 81 −108 972
25 15 −6 36 −90 540
31 20 0 0 0 0
35 25 4 16 100 400
40 14 9 81 126 1134
42 10 11 121 110 1210
45 6 14 196 84 1176
Total N=107 fd=167 fd
2
=6037

BPCU004: ADVANCED BUSINESS STATISTICS
Page 18 of 23

=√
∑fd
2
∑f
−(
∑fd
∑f
)
2

=√
6037
107
−(
167
107
)
2
=7.35
(c) Step-deviation method:
If the variable values are in equal intervals, then we adopt this method.
Steps:
1. Assume the center value of the series as assumed mean A.
2. Find out d

=
X−A
C
, where C is the interval between each value.
3. Multiply these deviations d

by the respective frequencies and get ∑fd

.
4. Square the deviations and get d
′2
.
5. Multiply the squared deviation (d
′2
) by the respective frequencies (f) and obtain the total
∑fd
′2
.
6. Substitute the values in the following formula to get the standard deviation.
=√
∑fd
′2
∑f
−(
fd
′2
∑f
)
2
*C
Example 12
Compute Standard deviation from the following data.
Marks 10 20 30 40 50 60
No. of students 8 12 20 10 7 3
Solution:
Marks (X) No. of students (f)
d

=
X−30
10

d
2
fd fd
2

10 8 −2 4 −16 32
20 12 −1 1 −12 12
30 20 0 0 0 0
40 10 1 1 10 10
50 7 2 4 14 28
60 3 3 9 9 27
N=60 fd=5 fd
2
=109

BPCU004: ADVANCED BUSINESS STATISTICS
Page 19 of 23

=√
∑fd
′2
∑f
−(
fd
′2
∑f
)
2
*C
=√
∑109
2
60
−(
5
60
)
2
∗10=13.45
4. Calculation of Standard Deviation for Continuous series
In the continuous series the method of calculating standard deviation is almost the same as in a
discrete series. But in a continuous series, mid-values of the class intervals are to be found out.
The step- deviation method is widely used.
The formula is,
=√
∑fd
′2
N
−(
fd
′2
N
)
2
*C
Where d

=
m−A
C
;C=Class interval
Steps:
1. Find out the mid-value of each class.
2. Assume the center value as an assumed mean and denote it by A.
3. Find out d

=
m−A
C

4. Multiply the deviations d

by the respective frequencies and get fd


5. Square the deviations and get �
′2
.
6. Multiply the squared deviations �
′2
) by the respective frequencies and get fd
′2

7. Substituting the values in the following formula to get the standard deviation.
=√
∑fd
′2
N
−(
fd
′2
N
)
2
*C
Example 13:
The daily temperature recorded in a city in Russia in a year is given below.
Temperature C
0
No. of days
−40 to −30 10
−30 to −20 18
−20 to −10 30
−10 to 0 42

BPCU004: ADVANCED BUSINESS STATISTICS
Page 20 of 23

0 to −10 65
10 to −20 180
20 to 30 20
Required:
Calculate Standard Deviation.
Solution :
Temperature
(X)
Mid-Point
(m)
No. of days
(f)
d

=
m−(−5)
10

d
′2
fd

fd
′2

−40 to −30 −35 10 −3 9 −30 90
−30 to −20 −25 18 −2 4 −36 72
−20 to −10 −15 30 −1 1 −30 30
−10 to 0 −5 42 0 0 0 0
0 to −10 5 65 1 1 65 65
10 to −20 15 180 2 4 360 720
20 to 30 25 20 3 9 60 180
N=365 fd=389 fd
2
=1157
=√
∑fd
′2
N
−(
fd

N
)
2
*C
=√
1157
365
−(
389
365
)
2
*10 =14.26
0
�
Merits of Standard Deviation
1. It is rigidly defined and its value is always definite and based on all the observations and
the actual signs of deviations are used.
2. As it is based on arithmetic mean, it has all the merits of arithmetic mean.
3. It is the most important and widely used measure of dispersion.
4. It is possible for further algebraic treatment.
5. It is less affected by the fluctuations of sampling and hence stable.
6. It is the basis for measuring the coefficient of correlation and sampling.
Demerits of Standard Deviation
1. It is not easy to understand and it is difficult to calculate.
2. It gives more weight to extreme values because the values are squared up.

BPCU004: ADVANCED BUSINESS STATISTICS
Page 21 of 23

3. As it is an absolute measure of variability, it cannot be used for the purpose of comparison.
Coefficient of Variation
The standard deviation is an absolute measure of dispersion. It is expressed in terms of units in
which the original figures are collected and stated. The standard deviation of heights of students
cannot be compared with the standard deviation of weights of students, as both are expressed
in different units, i.e heights in centimeter and weights in kilograms. Therefore the standard
deviation must be converted into a relative measure of dispersion for the purpose of
comparison. The relative measure is known as the coefficient of variation.
The coefficient of variation is obtained by dividing the standard deviation by the mean and
multiply it by 100. symbolically,
Coefficient of Variation (C.V.)=

X
x100
If we want to compare the variability of two or more series, we can use C.V. The series or
groups of data for which the C.V. is greater indicate that the group is more variable, less stable,
less uniform, less consistent or less homogeneous. If the C.V. is less, it indicates that the group
is less variable, more stable, more uniform, more consistent or more homogeneous.
Example 15
In two factories A and B located in the same industrial area, the average weekly wages (in
rupees) and the standard deviations are as follows:
Factory Average Standard Deviation No. of workers
A 34.5 5 476
B 28.5 4.5 524
Required:
(a) Which factory A or B pays out a larger amount as weekly wages?
(b) Which factory A or B has greater variability in individual wages?
Solution:
Total wages paid by factory A=34.5x476=Kshs.16,422

(a) Total wages paid by factory B=28.5x524=Kshs.14,934

BPCU004: ADVANCED BUSINESS STATISTICS
Page 22 of 23


Therefore factory A pays out larger amount as weekly wages.
(b) C.V. of distribution of weekly wages of factory A and B are
CV (A)=

X
x100=
5
34.5
x100=14.49%
CV (B)=

X
x100=
4.5
28.5
x100=15.79%
Factory B has greater variability in individual wages, since C.V. of factory B is greater than
C.V of factory A.
Example 16
Prices of a particular commodity in five years in two cities are given below:
Price in City A Price in City B
20 10
22 20
19 18
23 12
16 15
Which city has more stable prices?
Solution:
Actual mean method
City A City B
Prices (X) dx=X− 20 dx
2
Prices (Y) dy=Y− 15 dy
2

20 0 0 10 −5 25
22 2 4 20 5 25
19 −1 1 18 3 9
23 3 9 12 −3 9
16 −4 16 15 0 0
X=100 dx dx
2
Y=75 dy=0 dy
2
=68

City A: X=
∑X
n
=
100
5
=20

BPCU004: ADVANCED BUSINESS STATISTICS
Page 23 of 23

=√
∑dx
2
n
=√
30
5
=2.45
CV (A)=

X
x100=
2.45
20
x100=12.25%
City B: X=
∑X
n
=
75
5
=15
=√
∑dx
2
n
=√
68
5
=3.69
CV (A)=

X
x100=
3.69
15
x100=24.6%
City A had more stable prices than City B, because the coefficient of variation is less in City A.

BMCU002: QUANTITATIVE METHODS NOTES
Page 1 of 19

LESSON THREE: OVERVIEW OF HYPOTHESIS TESTING
3.0 Introduction




3.1 Lesson Objectives






3.2 Definition of Hypothesis Testing
Hypothesis: It’s a statement about a population parameter developed for the purpose of testing.
Hypothesis testing: It’s a procedure based on sample evidence and probability theory to determine
whether the hypothesis is a reasonable statement.
3.2 Procedure for Testing a Hypothesis
The following are the steps that are followed when testing hypothesis
1. State the null and alternate hypothesis
2. Select a level of significance.
3. Identify the test statistic
4. Formulate a decision rule and identify the rejection region
5. Compute the value of the test statistic
6. Make a conclusion.

This lesson gives an overview of the concepts in hypothesis testing. It describes the procedure
of testing a hypothesis, differentiates between one-tailed and two-tailed tests and type I and
Type II errors. Examples of testing hypothesis about a single population mean when the
population variance and not given are discussed.
By the end of the lesson, the students should be able to;
 Define the term hypothesis
 Differentiate between one-tailed and two-tailed tests
 Describe the procedure for testing hypothesis
 Test hypothesis about the mean when the population variance is known
 Test hypothesis about the mean when the population variance is unknown

BMCU002: QUANTITATIVE METHODS NOTES
Page 2 of 19

State the null hypothesis (HO) and alternate hypothesis (HA)
 The null hypothesis is a statement about the value of a population parameter. It should be
stated as “There is no significant difference between ……………”. It should always contain
an equal sign.
 The alternate hypothesis is a statement that is accepted if sample data provide enough
evidence that the null hypothesis is false.

Select a Level of Significance
A level of significance is the probability of rejecting the null hypothesis when it is true. It is
designated by  and should be between 0 –1.
Types of errors that can be committed
i. Type I error: it is rejecting the null hypothesis, when it is true.
ii. Type II error: It is not rejecting the null hypothesis, when it is false.
Null hypothesis Do not reject HO Reject HO
HO is True Correct decision Type I error
HO is false Type II error Correct decision

Identify the Test Statistic
A test statistic is the statistic that will be used to test the hypothesis e.g. )(,,
2
squarechiFand  

Formulate a decision rule
A decision rule is a statement of the conditions under which the null hypothesis is rejected and
the conditions under which it is not rejected. The region or area of rejection defines the location of
all those values that are so large or so small that the probability of their occurrence under a true
null hypothesis is rather remote.
Compute the value of the test statistic and make a conclusion
The value of the test statistic is determined from the sample information, and is used to determine
whether to reject the null hypothesis or not.

BMCU002: QUANTITATIVE METHODS NOTES
Page 3 of 19

3.4 One-Tailed and Two-Tailed Tests
 A test is one tailed when the alternate hypothesis states a direction e.g.
Ho: The mean income of women is equal to the mean income of men
HA: The mean income of women is greater than the mean income of men
 A test is two tailed if no direction is specified in the alternate hypothesis
Ho: There is no difference between the mean income of women and the mean income
of men
HA: There is a difference between the mean income of women and the mean income of
men
3.5 Testing The Population Mean When the Population Variance is Known
When the population variance is known and the population is normally distributed, the test
statistic for testing hypothesis about  is n
x
Z


 . The confidence interval estimator of 
when 2
 is known is n
Zx


2

Example One
A study by the Coca-Cola Company showed that the typical adult Kenyan consumes 18 gallons of
Coca-Cola each year. According to the same survey, the standard deviation of the number of
gallons consumed is 3.0. A random sample of 64 college students showed they consumed an
average (mean) of 17 gallons of cola last year. At the 0.05 significance level, can we conclude that
there is a significance difference between the mean consumption rate of college students and other
adults?
Solution
1. Stating the null and alternate hypothesis 18:
18:
0




AH
H

2. Level of significance: 05.0

BMCU002: QUANTITATIVE METHODS NOTES
Page 4 of 19

3. Test statistic n
X
Z



4. Rejection region occ025.02/
HReject ,96.1or Z 96.1 ZIf 96.1 ZZ


5. Value of the test statistic 96.167.2
64
3
1817





n
X
Z
c



6. Conclusion
Reject H0. Yes, there is a significance difference between the mean consumption rate of college
students and other adults.
Example Two
Past experience indicates that the monthly long distance telephone bill per household in a particular
community is normally distributed, with a mean of Sh. 1012 and a standard deviation of Sh. 327.
After an advertising campaign that encouraged people to make long distance telephone calls more
frequently, a random sample of 57 households revealed that the mean monthly long distance bill
was Sh. 1098. Can we conclude at the 10% significance level that the advertising campaign was
successful?
Solution
1. Stating the null and alternate hypothesis 1012:
1012:
0




AH
H

2. Level of significance: 1.0
3. Test statistic n
X
Z



BMCU002: QUANTITATIVE METHODS NOTES
Page 5 of 19

4. Rejection region oc1.0
HReject ,28.1 ZIf 28.1 ZZ


5. Value of the test statistic 28.199.1
57
327
10121098





n
X
Z
c



6. Conclusion
Reject H0. Yes, there is sufficient evidence to conclude that the advertising campaign was
successful
3.6 Testing the Population Mean when the Population Variance is Unknown
When the population variance is unknown and the population is normally distributed, the test
statistic for testing hypothesis about  is n
s
x
t

 which has a student t distribution with 1n
degrees of freedom.
We now have two different test statistic for testing the population mean. The choice of which one
to use depends on whether or not the population variance is known.
 If the population variance is known, the test statistic is n
x
Z



 If the population variance is unknown, the test statistic is n
s
x
t

 1. nfd

The confidence interval estimator of  when 2
 is unknown is n
s
tx
2

 1.. nfd
Example One
A manufacturer of automobile seats has a production line that produces an average of 100 seats
per day. Because of new government regulations, a new safety device has been installed, which
the manufacturer believes will reduce average daily output. A random sample of 15 days’ output
after the installation of the safety device is shown below:

BMCU002: QUANTITATIVE METHODS NOTES
Page 6 of 19

93, 103, 95, 101, 91, 105, 96, 94, 101, 88, 98, 94, 101, 92, 95
Assuming that the daily output is normally distributed, is there sufficient evidence at the 5%
significance level, to conclude that average daily output has decreased following the installation
of the safety device?
Solution
1. Stating the null and alternate hypothesis 100:
100:
0




AH
H

2. Level of significance: 05.0
3. Test statistic n
s
X
t


4. Rejection region oc14,05.01
HReject ,761.1 tIf 761.1, 

tt
n

5. Value of the test statistic 
761.182.2
15
85.4
10047.96
85.4
14
15
1447
139917
1
47.96
15
1447
139917X 1447
2
2
2
2
















 
n
s
X
t
n
n
X
X
S
n
X
X
X
c


6. Conclusion

BMCU002: QUANTITATIVE METHODS NOTES
Page 7 of 19

Reject H0. Yes, there is sufficient evidence to conclude that average daily output has decreased
following the installation of the safety device
Example Two
A courier service advertises that its average delivery time is less than six hours for local deliveries.
A random sample of the amount of time this courier takes to deliver packages to an address across
town produced the following times (rounded to the nearest hour).
7, 3, 4, 6, 10, 5, 6, 4, 3, 8
Is there sufficient evidence to support the courier’s advertisement at the 5% level of significance?
Solution
1. Stating the null and alternate hypothesis 6:
6:
0




AH
H

2. Level of significance: 05.0
3. Test statistic n
s
X
t


4. Rejection region oc9,05.01
HReject ,833.1 tIf 833.1, 

tt
n

5. Value of the test statistic

BMCU002: QUANTITATIVE METHODS NOTES
Page 8 of 19

833.156.0
10
27.2
66.5
27.2
9
10
56
360
1
6.5
10
56
360X 56
2
2
2
2
















 
n
s
X
t
n
n
X
X
S
n
X
X
X
c


6. Conclusion
Do not Reject H0. No, there is no sufficient evidence to conclude that the advertising campaign
was successful
3.7 Chi-Square Test
A chi-squared test is any statistical hypothesis test in which the sampling distribution of the test
statistic is a chi-squared distribution when the null hypothesis is true. Also considered a chi-
squared test is a test in which this is asymptotically true, meaning that the sampling distribution (if
the null hypothesis is true) can be made to approximate a chi-squared distribution as closely as
desired by making the sample size large enough. The chi-square test is used to determine whether
there is a significant difference between the expected frequencies and the observed frequencies in
one or more categories.
3.7.1 Chi-Square Test of a Multinomial Experiment (Goodness-Of-Fit Test)
A multinomial experiment is a generalized version of a binomial experiment that allows for more
than two possible outcomes on each trial of the experiment. The following are the properties of a
multinomial experiment
 The experiment consists of a fixed number n of trials.
 The outcome of each trial can be classified into exactly one of k categories called cells
 The probability 1
P that the outcome of a trial will fall into a cell i remains constant for each
trial, for ..........k 3, 2, 1,i moreover, 1........
21

k
PPP .

BMCU002: QUANTITATIVE METHODS NOTES
Page 9 of 19

 Each trial of the experiment is independent of the other trials.
Test Statistic  




k
i i
ii
e
eo
1
2
2


Rejection Region
1-k ,
22


Example One
Two companies A and B have recently conducted aggressive advertising campaigns in order to
maintain and possibly increase their respective shares of the market for a particular product. These
two companies enjoy a dominant position in the market. Before advertising campaigns began, the
market share for Company A was 45% while Company B had a market share of 40%. Other
competitors accounted for the remaining market share of 15%. To determine whether these market
shares changed after the advertising campaigns, a marketing analyst solicited the preferences of a
random sample of 200 consumers of this product. Of the 200 consumers, 100 indicated a
preference for Company’s A’s product, 85 preferred Company’s B product and the remainder
preferred one or another of the products distributed by other competitors. Conduct a test to
determine at the 5% level of significance, whether the market shares have changed from the levels
they were at before the advertising campaigns occurred.
Solution
1. Stating the null and alternate hypothesis
Ho: P1= 0.45, P2 = 0.4, P3 = 0.15
HA: At least one of the i
P is not equal to its specified value.
2. Level of significance: 05.0
3. Test statistic: 



k
i i
ii
e
eo
1
2
2 )(

4. Rejection region :99147.5
2
2,05.
1,
22
  k

BMCU002: QUANTITATIVE METHODS NOTES
Page 10 of 19

5. Value of the test statistic: assuming that the null hypothesis is correct, we can calculate the
expected number of consumers who prefer A, B and others using the formula npe
i
 .
Company Observed
frequency
Expected
frequency  
2
iieo

 
i
ii
e
eo
2


A
B
Others
100
85
15
90
80
30
100
25
225
1.11
.31
7.50
Total 200 200 8.92

Therefore 92.8
)(
1
2
2




k
i i
ii
e
eo

6. Conclusion: Reject Ho
There is sufficient evidence at the 5% level of significance to allow us to conclude that the
market shares have changed from the levels they were at before the advertising campaigns
occurred.
Example Two
To determine if a single die, is balanced, or fair, the die was rolled 600 times. The observed
frequencies with which each of the six sides of the die turned up are recorded in the following
table: -
Face 1 2 3 4 5 6
Observed frequency 114 92 84 101 107 102
Is there sufficient evidence to conclude at the 5% level of significance, that the die is not fair?
Solution
1. Stating the null and alternate hypothesis valuespecified itsot equalnot is sP theof oneleast At :
6
1:
i
654321
A
o
H
ppppppH 

2. Level of significance: 05.0

BMCU002: QUANTITATIVE METHODS NOTES
Page 11 of 19

3. Test statistic: 



k
i i
ii
e
eo
1
2
2 )(

4. Decision Rule :HoReject ,0705.11 If ,0705.11
22
5,05.
1,
22
   k
5. Value of the test statistic:
Assuming that the null hypothesis is correct, we can calculate the expected number of
consumers who prefer A, B and others using the formula npe
i
 .
Face Observed
frequency
Expected
frequency  
i
ii
e
eo
2


1
2
3
4
5
6
114
92
84
101
107
102
100
100
100
100
100
100
1.96
0.64
2.56
0.01
0.49
0.04
Total 600 600 5.7

Therefore 0705.117.5
)(
1
2
2




k
i i
ii
e
eo

6. Conclusion:
Do not Reject Ho. There is no sufficient evidence at the 5% level of significance to allow us to
conclude that that the die is not fair.
Rule of Five
For the discrete distribution of the test statistic 2
 to be adequately approximated by the
continuous chi-square distribution, the conventional rule is to require that the expected frequency
for each cell be at least 5. Where necessary, cells should be combined in order to satisfy this
condition. The choice of cells to be combined should be made in such a way that meaningful
categories result from the combination.

BMCU002: QUANTITATIVE METHODS NOTES
Page 12 of 19

3.7.2 Chi-Square Test of a Contingency Table
A contingency table is a rectangular table which items from a population are classified according
to two characteristics. The objective is to analyze the relationship between two qualitative
variables i.e. to investigate whether a dependence relationship exists between two variables or
whether the variables are statistically independent. The number of degrees of freedom for a
contingency table with r rows and c columns is 1 1 -r ..  cfd .
Example One
A sample of employees at a large chemical plant was asked to indicate a preference for one of
three pension plans. The results are given in the following table: -

Job Class
Pension Plan
Plan A Plan B Plan B
Supervisor
Clerical
Laborer
10
19
81
13
80
57
29
19
22
At the 1% significance level, determine whether there is a relationship between the pension
plan selected and the job classification of employees?
Solution

Job Class
Pension Plan Total
Plan A Plan B Plan B
Supervisor
Clerical
Laborer
10
19
81
13
80
57
29
19
22
52
118
160
Total 110 150 70 330
We need to conduct a chi-square of the contingency table to determine whether the classifications
are statistically independent.
Ho: The two classifications are independent
HA: the two classifications are dependent

BMCU002: QUANTITATIVE METHODS NOTES
Page 13 of 19

Test statistic: 



k
i i
ii
e
eo
1
2
2 )(

Rejection region :2767.13
2
4,01.0
)1)(1(,
22
    cr
The value of the test statistic
To compute the expected values for each cell, multiply the row total by the column total and divide
by the total number of shirts sampled.
Cell i Observed frequency o

Expected frequency e

e
eo
2


1
2
3
4
5
6
7
8
9
10
13
29
19
80
19
81
57
22
17.33
23.64
11.03
39.33
53.64
25.03
53.33
72.73
33.94
3.1003
4.7889
29.2766
10.5087
12.9539
1.4527
14.3564
3.4021
4.2005
Total 84.0401

Value of the test statistic : 0401.84
)(
1
2
2




k
i i
ii
e
eo

Conclusion: Reject Ho.
There is enough evidence at the 1% significance level to conclude that the two classifications are
dependent.
Example Two
The Coca Cola Company sells four brands of sodas in East Africa. To help determine if the same
marketing approach used in Kenya can be used in Uganda and Tanzania, one of the firm’s
marketing analysts wants to ascertain if there is an association between the brand of Soda preferred
and the nationality of the consumer. She first classifies the population according to the brand of

BMCU002: QUANTITATIVE METHODS NOTES
Page 14 of 19

soda preferred i.e. Fanta, Sprite, Coke and Krest. Her second classification consists of the three
nationalities; Kenyan, Tanzanian and Ugandan. The marketing analyst then interviews a random
sample of 250 Soda drinkers from the three countries, classifies each according to the two criteria
and records the observed frequency of drinkers falling into each of the cells as shown in the table
below.

Nationality
Soda preference
Total Coke Krest Sprite Fanta
Kenyan
Ugandan
Tanzanian
72
26
7
8
10
10
12
16
14
23
33
19
115
85
50
Total 105 28 42 75 250
Based on the above sample data, can we conclude at the 1% level of significance that there is a
relationship between the preference of the soda drinkers and their nationality?
Solution
We need to conduct a chi-square of the contingency table to determine whether the classifications
are statistically independent.
Ho: The two classifications are independent
HA: the two classifications are dependent
Test statistic: 



k
i i
ii
e
eo
1
2
2 )(

Rejection region :8119.16
2
6,01.0
)1)(1(,
22
    cr
The value of the test statistic
To compute the expected values for each cell, multiply the row total by the column total
and divide by the total number of respondents sampled.

BMCU002: QUANTITATIVE METHODS NOTES
Page 15 of 19

Cell i Observed frequency o

Expected frequency e

e
eo
2


1
2
3
4
5
6
7
8
9
10
11
12
72
26
7
8
10
10
12
16
14
23
33
19
48.30
35.70
21.00
12.88
9.52
5.60
19.32
14.28
8.40
34.50
25.50
15.00
11.63
2.64
9.33
1.85
0.02
3.46
2.77
0.21
3.73
3.83
2.21
1.07

Value of the test statistic: 75.42
)(
1
2
2




k
i i
ii
e
eo

Conclusion: Reject Ho.
Based on the sample data, we can conclude at the 1% significance level that there is a relationship
between preferences of soda drinkers and their nationality.
3.7.3 Chi-Square Test for Normality
The chi-square goodness of fit test for a normal distribution proceeds in essentially the same way
as the chi-square test for a multinomial population. The multinomial test dealt with a single
population of qualitative data, where as a normal distribution involves quantitative data. Therefore,
we must begin by subdividing the range of the normal distribution into a set of intervals or
categories in order to obtain qualitative data.
Example One
A battery manufacturer who wants to determine if the lifetimes of his batteries are normally
distributed. Such information would be helpful in establishing the guarantee that should be offered.
The lifetimes of a sample of 200 batteries are measured and the resulting data are grouped into a

BMCU002: QUANTITATIVE METHODS NOTES
Page 16 of 19

frequency distribution as shown in the table below. The mean and the standard deviation of the
sample life times are 164 and 10 respectively.





Is there evidence at the 5% level of significance that the lifetimes of his batteries are normally
distributed?
Solution
1. Stating the null and alternate hypothesis
H0: The data are normally distributed
HA: The data are not normally distributed
2. Level of significance: 05.0
3. Test statistic: 3-kd.f.
)(
1
2
2




k
i i
ii
e
eo

4. Decision Rule :HoReject ,9915.5 If ,9915.5
22
2,05.
1,
22
   k
5. Value of the test statistic: 10 ,164X
6.2
10
164-190
Z,6.1
10
164-180
Z,6.0
10
164170
4.0
10
164160
Z,4.1
10
164150
,4.2
10
164140










Z
ZZ

Life Time in Hours Number of Batteries
140 up to 150
150 up to 160
160 up to 170
170 up to 180
180 up to 190
15
54
78
42
11
Total 200

BMCU002: QUANTITATIVE METHODS NOTES
Page 17 of 19

Lifetime Probability Observed
frequency
Expected
frequency  
i
ii
e
eo
2


Less than 150
150 up to 160
160 up to 170
170 up to 180
180 or more
0.0808
0.2638
0.3811
0.2195
0.0548
15
54
78
42
11
16.16
52.76
76.22
43.9
10.96
0.0833
0.0291
0.0416
0.0822
0.0001
200 200 0.2363

Therefore 9915.502363.0
)(
1
2
2




k
i i
ii
e
eo

6. Conclusion:
Do not Reject Ho. There is no sufficient evidence at the 5% level of significance to allow us
to conclude that the lifetimes of his batteries are normally distributed?
Example Two
The instructors for an introductory accounting course attempt to construct the final examination
so that the grades are normally distributed with a mean of 65.
Grade Frequency
30 up to 40
40 up to 50
50 up to 60
60 up to 70
70 up to 80
80 up to 90
4
17
29
49
33
18
From the sample of grades appearing in the accompanying frequency distribution table, can
you conclude that they have achieved their objective? (Use 05.0 )

BMCU002: QUANTITATIVE METHODS NOTES
Page 18 of 19

Solution
1. Stating the null and alternate hypothesis
H0: The data are normally distributed
HA: The data are not normally distributed
2. Level of significance: 05.0
3. Test statistic: 3-kd.f.
)(
1
2
2




k
i i
ii
e
eo

4. Decision Rule :HoReject ,81373.7 If ,81473.7
22
3,05.
1,
22
   k
5. Value of the test statistic:
x f xf Dx 10
'
Dx
Dx
2
'Dx
'fDx 2
'fDx
30 up to 40
40 up to 50
50 up to 60
60 up to 70
70 up to 80
80 up to 90
35
45
55
65
75
85
4
17
29
49
33
18
140
765
1595
3185
2475
1530
-30
-20
-10
0
10
20
-3
-2
-1
0
1
2
9
4
1
0
1
4
-12
-34
-29
0
33
36
36
68
29
0
33
72
150 9690 -6 238 6.1210*
150
6
150
238
6.64
150
9690
2










n
xf
x
12.6 ,6.64  X

BMCU002: QUANTITATIVE METHODS NOTES
Page 19 of 19
22.1
12.6
64.6-80
Z,43.0
6.12
6.6470
37.0
6.12
6.6460
Z,16.1
6.12
6.6450
,95.1
6.12
6.6440










Z
ZZ

Lifetime Probability Observed
frequency
Expected
frequency  
i
ii
e
eo
2


Less than 40
40 up to 50
50 up to 60
60 up to 70
70 up to 80
80 or more
0.0256
0.0974
0.229
0.3144
0.2224
0.1112
4
17
29
49
33
18
3.84
14.61
34.35
47.16
33.36
16.68
0.0067
0.3910
0.8333
0.0718
0.0039
0.1045
150 150 1.4112

Therefore 81473.74112.1
)(
1
2
2




k
i i
ii
e
eo

6. Conclusion:
Do not Reject Ho. The data is normally distributed therefore we can conclude that they have
achieved their objective

Page 1 of 19

LESSON THREE: REGRESSION ANALYSIS
3.0 Introduction
Regression involves developing a mathematical equation that analyses the relationship between
the variable to be forecast (dependent variable) and the variables that the statistician believes
are related to the forecast variable (independent variable). Regression is the estimation of
unknown values or the prediction of one variable from known values of other variables. Simple
linear regression involves a relationship between two variables only. Multiple regression
analyses or considers the relationship between three or more variables.
3.1 Lesson Objectives
By the end of the lesson, the students should be able to:
i. Formulate a simple regression model
ii. Calculate the coefficient of correlation and determination and interpret them
iii. Test hypothesis about the regression coefficients
3.2 Simple Regression
The first step in establishing the relationship between X and Y is to obtain observations on the
two variables and analyze the data using a scatter diagram to indicate whether a positive or
negative relationship exists between X and Y. the relationship can be approximated by a
straight line. Algebraically, the relationship is tt
XbbY
10

The above function is deterministic since it gives exact relationship between X and Y. when
the line is plotted, not all the points will fall on the line because of the following reasons:
 Omission of other explanatory variables from the function
 Random behavior of human beings
 Imperfect specification of the functional form of the model
 Errors of aggregation
 Errors of measurement
To account for the deviations of some points from the straight line, the error term is introduced.
The introduction of the error term makes the function stochastic ttt
eXbbY 
10 . To
estimate the values of the coefficients 0
b and 1
b , we need observations on Y, X and the error
term. However, the error term is not observable and therefore we make assumptions about the
error term.

Page 2 of 19

3.3 Assumptions of the Error Term
The following are the assumptions of the error term
 The error term is a real random variable which has a mean of zero and constant variance
(Assumption of homoscedasticity)
 The error term is normally distributed
 The error term corresponding to different values of X for different periods are not correlated
(assumption of no autocorrelation)
 There is no relationship between the explanatory variables and the error term
 The explanatory variables are measured without error. The error absorbs the influence of
omitted variables and errors of measurement in the dependent variable.
All the above assumptions are called stochastic assumptions
Other Assumptions
 The explanatory variables are not perfectly linearly related or correlated (No
multicollinearity)
 The variables are correctly aggregated
 The relation being estimated is identified
 The relationship is correctly specified
The regression equation of Y on X
 It used to predict the values of Y from the given values of X.
 It is expressed as follows XbbY
10

 To determine the values of 0
b and 1
b the following two normal equations are to be solved
simultaneously
  



2
10
10
XbXbXY
XbnbY

 Alternatively the values of 0
b and 1
b can be got using the following formula’s XbYb
10





2
2
1
XnX
YXnXY
b

Page 3 of 19

3.4 Correlation
Definition: It is the existence of some definite relationship between two or more variables.
Correlation analysis is a statistical tool used to describe the degree to which one variable is
linearly related to another variable.
Types of Correlation
Correlation may be classified in the following ways:-
(a) Positive and negative correlation.
Correlation is said to be positive if two series move in the same direction, otherwise it is
negative (opposite Direction).
(b) Linear and Non-Linear correlation
Correlation is linear if the amount of change in one variable tends to bear a constant ratio to
the amount of change in the other variable otherwise it is non-linear.
(c) Simple, partial and multiple correlation
Simple correlation is where two variables are studied while partial or multiple involves three
or more variables.
3.5 Methods of Calculating Simple Correlation
 Scatter diagram
 Karl Pearson’s coefficient of correlation
 Spearman’s rank correlation coefficient
 Method of least squares
Karl Pearson’s coefficient of correlation (Product moment coefficient of correlation)
The coefficient of correlation (r) is a measure of strength of the linear relationship between two
variables. 




2
2
2
2
YnYXnX
YXnXY
r

Interpretation of the coefficient of correlation
1. When r = +1, there is a perfect positive correlation between the variables
2. When r = -1, there is a perfect negative correlation between the variables

Page 4 of 19

3. When r = 0, there is no correlation between the variables
4. The closer r is to +1 or to –1, the stronger the relationship between the variables and the
closer r is to 0, the weaker the relationship.
5. The following table lists the interpretations for various correlation coefficients:
Value Comment
0.8 to 1.0
0.6 to 0.8
0.4 to 0.6
0.2 to 0.4
0.0 to 0.2
Very strong
Strong
Moderate
Weak
Very weak
Method of least squares yyxx
xy
SSSS
SS
r
*


Coefficient of determination (r
2
)
It is the square of the correlation coefficient. It shows the proportion of the total variation in
the dependent variable Y that is explained or accounted for by the variation in the independent
variable X. e.g. If the value of r = 0.9, r
2
= 0.81, this means 81% of the variation in the
dependent variable has been explained by the independent variable.
Example One
A random sample of eight auto drivers insured with a company and having similar auto
insurance policies was selected. The following table lists their driving experience (in years)
and the monthly auto insurance premium (in Sh.000) paid by them.
Driving experience (Years) 5 2 12 9 15 6 25 16
Monthly auto insurance premium
(In Sh.000)
64 87 50 71 44 56 42 69
i. Find the least squares regression line by identifying the appropriate dependent and
independent variable
ii. Interpret the meaning of the constants calculated in part (i).
iii. Compute the coefficient of correlation and coefficient of determination and interpret
them.

Page 5 of 19

Solution:
i. xy
10
ˆ  xx
xy
SS
SS

1
ˆ
 xy
10
ˆˆ
  90x
1396
2
x 4739xy 474y 29642
2
y 
5.383
8
90
1396
2
2
2



n
x
xSS
xx

5.593
8
474*90
4739 


n
yx
xySS
xy

5.1557
8
474
29642
2
2
2



n
y
ySS
yy
55.1
5.383
5.593
ˆ
1 


xx
xy
SS
SS

69.76)25.11*55.1(25.59
ˆˆ
10  xy
xxy 55.169.76ˆ
10


ii. 55.1
ˆ
1 it indicates the rate at which the insurance premium reduces with an
additional year of driving experience 69.76
ˆ
0
It indicates the amount of premium that would be paid by a driver without
any years of experience.
iii. 77.0
5.1557*5.383
5.593
*



yyxx
xy
SSSS
SS
r

There is a strong negative relationship between the years of experience and the monthly auto
insurance premiums %29.5977.0
22
r

59.29% of the premium paid is determined by the driving experience
Example Two
A company is using a system of payment by results. The union claims that this seriously
discriminates against the workers. there is a fairly steep learning curve which workers follow
with the apparent outcome that more experienced workers can perform the task in about half
of the time taken by the new employee. You have been asked to find out if there is any basis

Page 6 of 19

for this claim. To do this, you have observed ten workers on the shop floor, timing how long it
takes them to produce an item. It was then possible for you to match these times with the length
of worker’s experience. The results obtained are shown below:
Month’s experience 2 5 3 8 5 9 12 16 1 6
Time taken 27 26 30 20 22 20 16 15 30 19
Required:
(a) Find the regression line of time taken on month’s experience
(b) Compute the coefficient of correlation and coefficient of determination and interpret them.

Solution: xbbY
10

xx
xy
SS
SS
b
1 XbYb
10 76X
645
2
X 1300XY 225Y 5331
2
Y 
1.196
10
67
645
2
2
2



n
X
XSS
xx

5.207
10
225*67
1300 


n
YX
XYSS
xy

5.268
10
225
5331
2
2
2



n
Y
YSS
yy
0581.1
1.196
5.207
1 


xx
xy
SS
SS
b
41073.15)7.6*0581.1(5.22
10  XbYb
XY 0581.141073.15 


iv. 0581.1
1
b : It indicates the rate at which the time taken would reduce by for every
additional month of experience 41073.15
0
b
It indicates the time taken by an employee without any experience 9043.0
5.268*1.196
5.207
*



yyxx
xy
SSSS
SS
r

There is a very strong negative correlation between the month’s experience and the time
taken

Page 7 of 19
%78.81100*8178.09043.0
22
r

81.78% of the variation in the time taken is explained by the month’s experience
Example Three
Students in the BMS 302 class were polled by a researcher attempting to establish a relationship
between hours of study in the week immediately preceding the end of semester exam and the
marks received on the exam. The surveyor gathered the data listed in the accompanying table
Hours of study Exam score
25
12
18
26
19
20
23
15
22
8
93
57
55
90
82
95
95
80
85
61
i. Find the least squares regression line by identifying the appropriate dependent and
independent variable.
ii. Interpret the meaning of the values of 0 and 1 calculated in part (i).
iii. Compute the correlation of coefficient and coefficient of determination and interpret them.

Solution xy
10
ˆ 
xx
xy
SS
SS

1
ˆ
 xy
10
ˆˆ
  188x
3832
2
x 15540xy 793y 65143
2
y 
6.297
10
188
3832
2
2
2



n
x
xSS
xx

6.631
10
793*188
15540 


n
yx
xySS
xy

Page 8 of 19

1.2258
10
793
65143
2
2
2



n
y
ySS
yy
122.2
6.297
6.631
ˆ
1 
xx
xy
SS
SS

4064.39)8.18*122.2(3.79
ˆˆ
10  xy
xxy 122.24064.39ˆ
10


i. 122.2
ˆ
1
 it indicates the rate at which the exam score would increase with an
additional hour of study 04.39
ˆ
0
It indicates the exam score that would be attained by a student who does
not study a week to exams.

ii. 77.0
1.2258*6.297
6.631
*

yyxx
xy
SSSS
SS
r

There is a strong positive relationship between the exam score and the number of hours studied %29.5977.0
22
r

59.29% of the exam score is determined by the number of hours studied
3.6 Spearman’s Rank Correlation
 It is the correlation between the ranks assigned to individuals by two different people.
 It is a non-parametric technique for measuring strength of relationship between paired
observations of two variables when the data are in ranked form.

It is denoted by R or p NN
d
NN
d
R
i





3
2
2
2
6
1
)1(
6
1

In rank correlation, there are two types of problems:-
i. Where actual ranks are given
ii. Where actual ranks are not given

Page 9 of 19

Where actual ranks are given
Steps:
 Take the differences of the two ranks i.e. (R1-R2) and denote these differences by d.
 Square these differences and obtain the total 
2
d
 Use the formula NN
d
R



3
2
6
1
Example
The ranks given by two judges to 10 individuals are given below.
Individual 1 2 3 4 5 6 7 8 9 10
Judge 1(X) 1 2 7 9 8 6 4 3 10 5
Judge 2 (Y) 7 5 8 10 9 4 1 6 3 2
Calculate
(a) The spearman’s rank correlation.
(b) The Coefficient of correlation
Where ranks are not given
Ranks can be assigned by taking either the highest value as 1 or the lowest value as 1. the same
method should be followed in case of all the variables.
Example
Calculate the Rank correlation coefficient for the following data of marks given to 1
st
year B
Com students:
CMS 100 45 47 60 38 50
CAC 100 60 61 58 48 46

Equal Ranks or Tie in Ranks
 Where equal ranks are assigned to some entries, an adjustment in the formula for
calculating the Rank coefficient of correlation is made.
 The adjustment consists of adding  mm
3
12
1 to the value of 
2
d where m stands
for the number of items whose ranks are common.

Page 10 of 19

Example
An examination of eight applicants for a clerical post was taken by a firm. From the marks
obtained by the applicants in the accounting and statistics papers, compute the Rank coefficient
of correlation.
Applicant A B C D E F G H
Marks in accounting 15 20 28 12 40 60 20 80
Marks in statistics 40 30 50 30 20 10 30 60
3.7 Assessing the Regression Model
3.7.1 Estimating the variance of the error variable
The sample statistic 2
2


n
SSE
S
e is an unbiased estimator of 2
e . The square root of 2
eS is called
the standard error of estimate i.e. 2

n
SSE
S
e xx
yy
SS
SS
SSSSE
xy
2


Interpretation of the Standard Error of Estimate
 The smallest value that the standard error of estimate can assume is zero, which occurs
when SSE = 0 i.e. when all the points fall on the regression line.
 If 
S is close to zero, the fit is excellent and the linear model is likely to be a useful and
effective analytical and forecasting tool
 If 
S is large, the model is a poor one and the statistician should either improve it or
discard it.
 In general, the standard error of estimate cannot be used as an absolute measure of the
model’s utility. Nonetheless, it is useful in comparing models.
3.7.2 Drawing inferences about 1

This involves determining whether a linear relationship actually exists between x and y . The
null hypothesis will always state that there is no linear relationship between the variables i.e. 0:
10
H
. Any of the following three alternate hypothesis can be tested:-
i. 0:
1

A
H Tests whether some linear relationship exists between x and y
ii. 0:
1

A
H Tests for a positive linear relationship exists between x and y

Page 11 of 19

iii. 0:
1

A
H Tests for a negative linear relationship exists between x and y
The test statistic is 1
11
bs
b
t

 where xx
e
b
SS
S
S
1
Assuming that the error variable is normally distributed, the test statistic follows a student
distribution with 2n degrees of freedom
The confidence interval estimator of 12,2/11 bn
Stb




3.7.3 Measuring the strength of the linear relationship 1

is useful in measuring the strength of the linear relationship particularly when we want to
compare different models to see which one fits the data better.
(a) Coefficient of Correlation
The coefficient of correlation denoted by )(Rho measures the similarity of the changes in the
values of x and y . Its range is 11 . Since  is a population parameter, its value is
estimated from the data. The sample coefficient of correlation r is defined as follows:- yyxx
xy
SSSS
SS
r
*


(b) Testing the Coefficient of Correlation
If 0 the values of x and y are uncorrelated and the linear model is not appropriate. We
can determine if x and y are correlated by testing the following hypothesis 0:
0:
0




AH
H

Test statistics for rs
r
t where 2
1
2



n
r
s
r
The test statistics is student t distributed with n-2 degrees of freedom if the error variable is
normally distributed
(c) Coefficient of Determination )(
2
r
This measures the proportion of variability in the dependent variable that is explained by
variability of the independent variable.

Page 12 of 19
yyxxSSSS
SS
r
xy
2
2


3.7.4 Predicting the particular value of y for a given x (The prediction Interval)
The prediction interval is given by: -  
SSxx
xx
n
StyY
g
en
2
2,2/
1





Where gx is the given value of x and g
xbby
10
ˆ 
3.7.5 Estimating the expected value of y for a given x

(The confidence Interval)
The confidence interval is given by: -  
SSxx
xx
n
StyY
g
en
2
2,2/
1
ˆ




Where gx is the given value of x and g
xbby
10
ˆ 
Example One
A real estate agent would like to predict the selling price of single family homes. After careful
consideration, she concludes that the variable likely to be mostly closely related to the selling
price is the size of the house. As an experiment, she takes a random sample of 15 recently sold
houses and records the selling price in Sh.000’s and size in 100 ft
2
of each. The data is shown
in the table below: -
House size
(100 ft
2
)
20.0 14.8 20.5 12.5 18.0 14.3 27.5 16.5 24.3 20.2
Selling price
(Sh’000)
89.5 79.9 83.1 56.9 66.6 82.5 126.3 79.3 119.9 87.6

22.0 19.0 12.3 14.0 16.7
112.6 120.8 78.5 74.3 74.8
Required: -
(a) Find the sample regression line for the data
(b) Estimate the variance of the error variable and the standard error of estimate.

Page 13 of 19

(c) Can we conclude at the 1% significance level that the size of a house is linearly related
to its selling price?
(d) Estimate the 99% confidence interval estimate of 1

(e) Compute the coefficient of correlation and interpret its value
(f) Can we conclude at the 1% significance level that the two variables are correlated?
(g) Compute the coefficient of determination and interpret its value
(h) Predict with 95% confidence the selling price of a house that occupies 2,000ft
2
.
(i) In a certain part of the city, a developer built several thousand houses whose floor plans
and exteriors differ but whose sizes are all 2,000 ft
2
. To date, they have been rented but
the builder now wants to sell them and wants to know approximately how much money
in total he can expect from the sale of the houses. Help him by estimating a 95%
confidence interval estimate of the mean selling price of the houses.
Solution
(a) Find the least squares regression line xbby
10
ˆ
xx
xy
SS
SS
b
1 xyb
10
ˆ
 6.272X
 6.1332Y  97.25257XY  24.5222
2
X
42.124618
2
Y 
189.268
15
6.272
24.5222
2
2
2



n
x
xSS
xx

186.1040
15
6.1332*6.272
97.25257 


n
yx
xySS
xy

24.6230
15
2.1332
42.124618
2
2
2



n
y
ySS
yy
88.3
189.268
186.1040
1 
xx
xy
SS
SS
b
34.18)17.18*88.3(84.88
10  xbyb
xxbby 88.334.18ˆ
10


Page 14 of 19

(b) Estimate the variance of the error variable and the standard error of estimate. 16913
13
215
88.2195
2
88.2195
19.268
18.1040
24.6230
22
2
2







e
e
xx
yy
S
n
SSE
S
SS
SS
SSSSE
xy

(c) Can we conclude at the 1% significance level that the size of a house is linearly related to
its selling price? 0:
b
t:StatisticTest 0.05 0:
1
1
1
1




A
b
o
H
S
H

Decision rule 013,025.02,2/
HReject ,012.3or 012.3 If 3.012 
 ccn
tttt


Value of the test statistic 012.389.4
794.0
88.3
794.0
19.268
13
1
1
1


b
xx
e
b
S
b
t
SS
S
S

Conclusion: Reject Ho. Yes, the data provides sufficient evidence to conclude that the
house size is linearly related to its selling price
(d) Estimate the 99% confidence interval estimate of 1
 27.649.1
39.288.3)794.0*012.3(88.3
1
1
12,2/11







 bnStb

(e) Compute the coefficient of correlation and interpret its value 805.0
24.6230*19.268
18.1040
*

yyxx
xy
SSSS
SS
r

There is a very strong positive correlation between the size of the house and its selling
price

Page 15 of 19

(f) Can we conclude at the 1% significance level that the two variables are correlated? 0:
r t:StatisticTest 0.05 0:




A
r
o
H
S
H

Decision rule 013,025.02,2/
HReject ,012.3or 012.3 If 3.012 
 ccn
tttt


Value of the test statistic 012.389.4
165.0
805.0
165.0
215
805.01
2
1
22








r
r
S
r
t
n
r
S

Conclusion: Reject Ho. Yes, the data provides sufficient evidence to conclude that the
house size is correlated to its selling price
(g) Compute the coefficient of determination and interpret its value %8.64100*648.0805.0
22
r

64.8% of the variation in the selling price is explained by the variation in the size of the
house and the remaining 35.2% is explained by other factors
(h) Predict with 95% confidence the selling price of a house that occupies 2,000ft
2
.  
 
11.12577.66
17.2994.95
19.268
17.1820
15
1
1*13*16.294.95
94.95)20(88.334..18ˆ
1

2
2
2,2/








Y
Y
y
SSxx
xx
n
StyY
g
en

(i) In a certain part of the city, a developer built several thousand houses whose floor plans
and exteriors differ but whose sizes are all 2,000 ft
2
. To date, they have been rented but
the builder now wants to sell them and wants to know approximately how much money in
total he can expect from the sale of the houses. Help him by estimating a 95% confidence
interval estimate of the mean selling price of the houses.

Page 16 of 19
 
 
84.10304.88
9.794.95
19.268
17.1820
15
1
*13*16.294.95
94.95)20(88.334..18ˆ
1
ˆ
2
2
2,2/








Y
Y
y
SSxx
xx
n
StyY
g
en

Example Two
The different interest rates charged by some financial institutions may reflect how stringent
their standards are for their loan appraisals i.e. the lower the rate, the higher the standards and
hence the lower the default rate. The following data were collected from a sample of nine
financial companies selected at random.
Interest rate (%) X 7.0 6.6 6.0 8.5 8.0 7.5 6.5 7.0 8.0
Default rate (per 1000 loans) Y 38 40 35 46 48 39 36 37 44
Required: -
(a) Find the least squares regression line of the default rate on interest rate
(b) Calculate the coefficient of correlation and the coefficient of determination and
interpret their values
(c) Find the standard error of estimate.
(d) Do these data provide sufficient evidence to indicate that there is a positive linear
relationship between the interest rate and the default rate?
(e) Find the 95% prediction interval for the default rate when the interest rate is 8%.
Solution:
(a) Find the least squares regression line of the default rate on interest rate xbby
10
ˆ
xx
xy
SS
SS
b
1 xyb
10
ˆ
 1.65x
31.476
2
x  5.2652xy 363y
14811
2
y 
42.5
9
1.65
31.476
2
2
2



n
x
xSS
xx

8.26
9
363*1.65
5.2652 


n
yx
xySS
xy

Page 17 of 19

170
9
363
14811
2
2
2



n
y
ySS
yy
9446.4
42.5
8.26
1 
xx
xy
SS
SS
b
 5674.4)
9
1.65
*9446.4(
9
363
10  xbyb
xxbby 9446.45674.4ˆ
10


(b) Calculate the coefficient of correlation and the coefficient of determination and interpret
their values 8829.0
170*42.5
8.26
*

yyxx
xy
SSSS
SS
r

There is a very strong positive correlation between the interest rate and default rate %95.77100*7795.08829.0
22
r

77.95% of the variation in the default rate is explained by the variation in the interest rate
while the remaining 21.46% is explained by other factors
(c) Find the standard error of estimate 314.2
29
4834.37
2
4834.37
42.5
8.26
170
2
2






n
SSE
S
SS
SS
SSSSE
e
xx
yy
xy

(d) Do these data provide sufficient evidence to indicate that there is a positive linear
relationship between the interest rate and the default rate? 0:
b
t:StatisticTest 0.05 0:
1
1
1
1




A
b
o
H
S
H

Decision rule 07,05.02,
HReject ,895.1 If 895.1 
 cn
ttt


Value of the test statistic 895.19749.4
9939.0
9446.4
9939.0
42.5
314.2
1
1
1


b
xx
e
b
S
b
t
SS
S
S

Page 18 of 19

Conclusion: Reject Ho. Yes, the data provides sufficient evidence to conclude that there
is a positive linear relationship between the interest rate and default rate
(e) Find the 95% prediction interval for the default rate when the interest rate is 8%.  
 
1701.500783.38
0459.61242.44
42.5
23.78
9
1
1*314.2*365.21242.44
1242.44)8(9446.45674.4ˆ
1

2
2
2,2/








Y
Y
y
SSxx
xx
n
StyY
g
en

3.8 Activities
1. A Farmer wanted to find out the relationship between the amount of fertilizer used and the
yield of corn. He selected seven acres of his land on which he used different amounts of
fertilizer to grow corn. The following table gives the amount (in kg) of fertilizer used and
the yield (in Tonnes) of corn for each of the seven acres.
Fertilizer used 120 80 100 70 88 75 110
Yield of corn 138 112 129 96 119 104 134
i. Find the least squares regression line by identifying the appropriate dependent and
independent variable.
ii. Interpret the meaning of the values of 0 and 1 calculated in part (i).
iii. Compute the coefficient of correlation and coefficient of determination and interpret
them.
iv. Predict the yield of corn per acre for 105 kg of fertilizer used.

2. In an attempt to get a better idea of some of the determinants of medical expenditures by
families, a social worker collected data on family size and average weekly medical bills,
with the results shown in the following table;
Family size 2 2 4 5 7 3 8 10 5 2 3 5 2
Weekly medical
expenses ( in Sh. ’00’)

20

28

52

50

78

35

102

88

51

22

29

49

25

i. Find the least squares regression line by identifying the appropriate dependent and
independent variable.

Page 19 of 19

ii. Interpret the meaning of the values of 0 and 1 calculated in part (i).
iii. Compute the coefficient of correlation and coefficient of determination and interpret them.

3. The sales manager of a large manufacturer of computers wants to determine the extent of
the relationship between the sales performance of salesmen based upon their sales
experience. A sample of 12 salesmen was randomly selected and their annual sales figures
in thousands of dollars and their years of sales experience are recorded in the following
table: -
Experience in
Years (X)
2 2 1 1 5 5 3 4 4 3 8 10
Annual sales (Y) 20 12 8 10 70 60 32 90 30 30 100 60

a) Find the least squares simple regression line that can be used to estimate the annual sales
given that the experience in years is known.
b) Interpret the meaning of the constants calculated in (a) above
c) Compute the coefficient of correlation and coefficient of determination and interpret their
values
d) Do the data provide enough evidence at the 5% significance level to conclude that the
experience in years and the annual sales are linearly related?
e) Construct a 99% confidence interval for 1.
f) Predict with 95% confidence the annual sales for a salesperson with nine years of
experience

Page 1 of 7

DIFFERENTIATION
2.0 Introduction






2.1 Lesson Objectives











2.2 Rules of Differentiation
(a) Power Rule
If 1
,


nn
nx
dx
dy
xy . This holds at all points except at 0x
Example
If 4155
55, xx
dx
dy
xy 

(b) Constant Function
If cy a constant, then 0
dx
dy
By the end of the lesson, the students should be able to:
1. Differentiate any given function, using the rules of differentiation.
2. Find the second derivative of any given function
3. Find the partial derivatives of a given function
4. Find the gradient of a given curve at a given point
5. Identify maximum and minimum turning points
Differentiation is the process of finding the derivative of a function. The derivative
represents the instantaneous rate of change in the dependent variable given a change in the
independent variable.

Page 2 of 7

Example
If 0,6
dx
dy
y
(c) Constant Times a Function
If cuy where )(xfu , dx
du
c
dx
dy

Example
If 223
123*4,4 xx
dx
dy
xy 
(d) Sum or Difference of Functions
If vuy where )(xfu and )(xgv , dx
dv
dx
du
dx
dy
 .
Example
If 1024,108
23
 x
dx
dy
xxy
If xx
dx
dy
xxy 109,53
223

(e) Product Rule
If y is a product of two functions i.e. uvy where u and v are functions of x , then dx
du
v
dx
dv
u
dx
dy


Example 1
Find the derivative of the function  352
2
 xxy
Solution
Let 2
2xu x
dx
du
4 and
35xv 5
dx
dv
 
xxxxx
xx
dx
du
v
dx
dv
u
dx
dy
1230122010
5355*2
222
2



Page 3 of 7

Example 2
Find the derivative of the function  35
2
 xxy
Solution
Let 5xu 1
dx
du and
3
2
xv x
dx
dv
2

  31031325
22
 xxxxx
dx
du
v
dx
dv
u
dx
dy
(f) Quotient Rule
If y is a quotient of two functions i.e. v
u
y where u and v are functions of x , then 2
v
dx
dv
u
dx
du
v
dx
dy



Example 1
Find the derivative of the function x
x
y
32
2


Solution
Let 32
2
xu x
dx
du
4 and
xv 1
dx
dv
 
2
2
2
2
2
321324
x
x
x
xxx
v
dx
dv
u
dx
du
v
dx
dy 





Example 2
Find the derivative of the function 2
2
3
x
x
y

Page 4 of 7

Solution
Let 3
2
xu x
dx
du
2 and
2
xv x
dx
dv
2
 

344
33
2
2
22
2
66622232
xx
x
x
xxx
x
xxxx
v
dx
dv
u
dx
du
v
dx
dy







(g) Chain rule
Chain rule is used to differentiate compound functions. If 
n
uy where u is a function of x ,
then dx
du
nu
dx
dy
n1
 or dx
du
du
dy
dx
dy
*
Example 1
Find the derivative of the function 35
3
 xxy
Solution
The above function can be re-written as  
2
1
3
35 xxy
Let  35
3
 xxu 53
2
x
dx
du
Therefore 2
1
uy 2
1
2
1
u
du
dy 352
53
)35)(53(
2
1
)53(*
2
1
*
3
2
2
1
3222
1





xx
x
xxxxu
dx
du
du
dy
dx
dy

Example 2
Find the derivative of the function  
2
3
2
563  xxy
Solution
Let 563
2
 xxu 66x
dx
du

Page 5 of 7

Therefore 2
3
uy 2
1
2
3
u
du
dy
   
2
1
22
1
2
2
1
56399)66()563(
2
3
)66(*
2
3
*


xxxxxx
xu
dx
du
du
dy
dx
dy

2.3 Second derivative of a function
The second derivative of )(xfy is denoted by )(''xf or 2
2
dx
yd .
Example
Find the second derivative of the function 251045
23
 xxxy
Solution 830
10815
2
2
2


x
dx
yd
xx
dx
dy

2.4 Partial Derivatives
If a function has more than one variable, partial derivatives can be calculated for each variable, by
assuming all the other variables to be constants.
Example
Find the partial derivatives the following function with respect to x and y. 2081510
22
 yxyxZ

Solution 2
830
1610
xy
y
Z
xy
x
Z





Page 6 of 7

2.5 Finding the Gradient
When finding the gradient of a curve at a given point, the function is differentiated and the value
of x is inserted at the required points in the derivative function.
Example
Find the gradient of the curve 535
2
 xxY when x = 2.
Solution 310x
dx
dy

Therefore when x = 2, the gradient of the curve 233)2(10 

2.6 Finding a Stationary Point
A stationary point is that point where the gradient is zero. It may be a maximum or a minimum
point. The following steps are taken to find out a stationary point
 Find the derivative of the function
 Equate the derivative to Zero
 Solve the resulting equation
Example
Find the point at which the curve 1032
2
 xxY will have a stationary point.
Solution 75.0
4
3
034
34



x
x
x
dx
dy

There will be a stationery point when 75.0x
2.7 Turning Points (Maximum and Minimum)
The turning points show the maximum or minimum values of a given function. The gradient at
both maximum and minimum points is zero. In order to find out the maximum or minimum points,
the second derivative is calculated. The following rules apply:

Page 7 of 7

First derivative Second derivative
Maximum
Minimum
Zero
Zero
Negative
Positive

Example
Find the point at which the curve 23
3
 xxY will have a maximum or minimum point.
Solution 1or 1 ,1 033 33
222
 xxxxx
dx
dy

There will be stationary points when x = +1 or x = -1
Finding the second derivative 61*6
d
then 1- x if
61*6
dx
d
then 1 xif
6
2
2
2
2
2
2



dx
y
y
x
dx
yd

Therefore, there is a minimum point when x = +1 and a maximum point when x = -1.

Page 1 of 8

INTERGRAL CALCULUS
2.0 Introduction












2.1 Lesson Objectives








2.2 Rules of Integration
Integration is the reverse of differentiation. Given the derivative of a function, the process of
finding the original function is the reverse of differentiation, f is said to be an antiderivative of 'f
. Given 4)('xf then Cxxf 4)(
Example 1:
Find the antiderivative of any constant of 0)('xf .

By the end of the lesson, the students should be able to: -
1. Integrate various functions by using the rules of integration
2. Differentiate between integration and differentiation
3. Apply integration in solving business problems and
4. Find the area of a curve between two points.
Integration is the branch of calculus concerned with the theory and applications of
integrals. While differential calculus focuses on rates of change, such as slopes of tangent
lines and velocities, integral calculus deals with total size or value, such as lengths, areas, and
volumes. The two branches are connected by the fundamental theorem of calculus, which
shows how a definite integral is calculated by using its antiderivative (a function whose rate
of change, or derivative, equals the function being integrated). For example, integrating a
velocity function yields a distance function, which enables the distance traveled by an object
over an interval of time to be calculated. As a result, much of integral calculus deals with the
derivation of formulas for finding antiderivatives.

.

Page 2 of 8

Solution
The derivative of any constant function is 0. Therefore, the antiderivative is Cxf)(
Example 2:
Find the derivative of 52)(' xxf
Solution Cxxxf  5)(
2

Given that f is a continuous function

 CxFdxxf )()(
If F´(x) = f(x), C is the constant of integration.

RULE 1: Constant Functions:

 Ckxkdx , Where k is any constant.

Examples:
a) 
 Cxdx2)2( .
b) 
 Cxdx
2
3
2
3 .
c) Cxdx 
22 .
d) 
 CCxdx)0(0 .

RULE 2: Power Rule
C
n
x
dxx
n
n




1
1 provided that 1n

Examples
a) C
x
xdx 
2
2

Page 3 of 8

b) C
x
dxx 
3
3
2
c) CxC
x
dxxdxx 
2
3
2
3
2
1
3
2
2
3
d) C
x
C
x
dxxdx
x





 2
2
3
3
2
1
2
1


RULE 3: Constant Times a Function
 
 .)()( dxxfkdxxKf Where k is any constant.


Examples a) C
x
xdxxdx 
2
5
55
2
b) CxC
x
dxxdx
x

3
3
2
2
6
1
3
*
2
1
2
1
2
c) CxCx
x
dxxdx
x
dx
x
 

66
2
1
*33
1
3
3
2
1
2
1
2
1
RULE 4: Addition or Difference Rule
If 
dx )(xf and 
dxxg)( exist, then   
 dxxgdxxfdxxgxf )()()()(

Example
a)   Cx
x
CxC
x
dxxdxdxx  
6
2
3
)6(
2
3
6363
2
21
2

N.B Even though the two integrals technically results in separate constants of integration, these
constants may be considered together as one.

RULE 5: Power Rule Exception
Cxdxx 

ln
1

Page 4 of 8

RULE 6: Exponential Function
Cdx
xx



RULE 7: 

1,
1
)(
)('.)(
1





nC
n
xf
dxxfxf
n
n


Example  
 
C
x
dxx 


4
35
535
4
3


RULE 8:

 
Cdxxf
xfxf

'
Examples:
(a) Evaluate  dxx
x
2
2
Solution Cdxx
xx

22
2 


(b) Evaluate  dxx
x
3
32

Solution
3
3)( xxf and 2
9)(' xxf

 dxx
x
3
32
 =dxx
x
3
32
9
9
 =dxx
x
3
32
9
9
1
 =Cdxx
xx

33
333
9
1


2.3 Application of Integral Calculus
Integration can be used to determine the area between curves and also to solve business and
problems.

Page 5 of 8

2.3.1 Finding the Area Between a Curve and the X-Axis
For a function f that is continuous over [a, b], the area between )(xfy and the x axis from x =
a to x = b can be found using definite integrals as follows:
 For 0)(xf over [a, b]: Area = 
b
a
dxxf)(
 For 0)(xf over [a, b]: Area = 

b
a
dxxf)]([
Example
Find the area bounded by 2
6)( xxxf  and 0y for 41x .
Solution
Area = 242448
3
)1(
)1(3
3
)4(
)4(3
3
3)6(
3
2
3
2
4
1
4
1
3
22






















x
xdxxx

2.3.2 Finding the Area Between Two Curves
If f and g are continuous and )()( xgxf over the interval [a, b] , then the area bounded by )(xfy
and )(xgy for bxa is given by
Area = 

b
a
dxxgxf )]()([
Example
Find the area bounded by the graphs of ,3
2
1)( xxf 1 and 2 ,1)(
2
 x-xxxg
Solution
Area =  
25.8
4
33
4
4
4
3
8
2
4
1
3
1
2
43
2
2
13
2
)]()([
1
2
23
1
2
2
1
2
2
1
2













































xx
dx
x
xdxx
x
dxxgxf

Page 6 of 8

2.3.3 Application of Integration to Cost, Revenue and Profit Functions
Examples:
1. The function describing the marginal cost of producing a product is 100xMC
Where x equals the number of units produced .It is also known that total cost equals Sh. 40,000
when 100x . Determine the total cost function.
Solution:
To determine the total cost, we must first find the antiderivative of the marginal cost function, or Cx
x
xC  100
2
)(
2

Given that C (100) = 40,000, we can solve for the value of C, which happens to represent the fixed
cost.
000,25
000,10000,5000,40
)100(100
2
100
40000
2



c
C
C
The specific function representing the total cost of producing the product is
000,25100
2
)(
2
 x
x
xC
2. The marginal revenue function for a company’s product is
xMR 000,50
Where x equals the number of units produced and sold. If total revenue equals 0 when no units
are sold, determine the total revenue function for the producer.
Solution:
Because the marginal revenue function is the derivative of the total function is the antiderivative
of MR.
C
x
xxR 
2
000,50)(
2
Since we are told that 0)0(R , substitution of 0x and 0R into the above equation yields
0
2
0
)0(000,500
2


C
C

Page 7 of 8

Thus the total revenue function for the company’s product is 2
)(000,50)(
2
x
xxR 

3. An automobile manufacturer estimates that the annual rate of expenditure )(tr for maintenance
on one of its model is represented by the function 2
10100)( ttr  .
(a) What is the expected maintenance expenditures during the automobile first 5 years?
(b) Of the above expenditures, what is expected to be incurred during the fifth year?
Solution
(a) 
5
0
3
5
0
2
3
10
100[)10100(
t
tdtt  =67.416500
3
)5(10
)5(100
3

=$916.67

(b) Of these expenditures, those expected to be incurred during the firth year are
estimated as 
5
4
35
4
2
3
10
100)10100(
t
tdtt 
=













3
)4(10
)4(100
3
)5(10
)5(100
33
=916.67- )33.213400 =Sh. 303.34
4. A state civic organization is conducting its annual fund-raising campaign for its summer camp
program for the disadvantaged. Campaign expenditures will be incurred at a rate of $10,000
per day. From past experience it is known that contribution will be high during the early stages
of the campaign and will tend to fall off as the campaign continues. The function describing
the rate at which contribution are received is 20000100)(
2
 ttC
Where t represents the day of the campaign, and )(tC equals the rate at which contributions
are received, measured in shillings per day. The organization wishes to maximize the net
proceeds from the campaign.
a) Determine how long the campaign should be conducted in order to maximize net proceeds.
b) What are the total campaign expenditures expected to equal?
c) What are the total contributions expected to equal?
d) What are the net proceeds (total contribution less total expenditures) expected to equal?

Page 8 of 8

Solution
a) The function which describes the rate at which expenditures )(te are incurred is
e (t)=10,000
)(tC = )(te
-100t
2
+20,000=10,000
t
2
= 100
t = 10 days’ negative root is meaningless
b) Total campaign expenditures are represented by
E = e (t)
= (Sh. 10,000 per day) (10 days)
= Sh.100,000
c) Total contribution during the 10 days are represented by
C= dtt )000,20100(
10
0
2

= -100 10
0
3
000,20
3
t
t

= )10(000,20
3
)10(100
3



= -33,333.33+200,000 = $166,666.67

d) Net Proceeds = Costs – Expenditures = Sh.166, 666.67 - Sh.100, 000
=Sh. 66,666.67

Page 1 of 19

PROBABILITY THEORY
SOME BASIC CONCEPTS
(1) Probability
Probability refers to the chance of occurrence of an event or happening. probability is a
quantitative measure of uncertainty − a number that conveys the strength of our belief in the
occurrence of an uncertain event.
Probability theory provides us with the ways and means to quantify the uncertainties involved
in such situations.
(2) Experiment
An experiment is a process that leads to one of several possible outcomes. An outcome of an
experiment is some observation or measurement. The experiments in probability theory have
three things in common:
 there are two or more outcomes of each experiment
 it is possible to specify the outcomes in advance
 there is uncertainty about the outcomes
A single outcome of an experiment is called a basic outcome or an elementary event.
(3) Sample Space
The sample space is the universal sets pertinent to a given experiment. It is the set of all
possible outcomes of an experiment and is denoted by Ω.
The sample spaces for the following experiments are:
Experiment Sample Space
Drawing a Card {all 52 cards in the deck}
Reading the Temperature {all numbers in the range of temperatures}
Measurement of a Product's Dimension {undersize, outsize, right size}
Launching of a New Product {success, failure}
(4) Event
An event, in probability theory, constitutes one or more possible outcomes of an experiment.
An event is a subset of a sample space. It is a set of basic outcomes.

Page 2 of 19

APPROACHES TO PROBABILITY THEORY
(1) The Classical Approach
In the Classical Approach, probability of an event is defined as the relative size of the event
with respect to the size of the sample space. For example, consider an experiment of drawing
a card out of a deck of 52 cards and event A that on any draw, a king is there. Since there are
4 kings and there are 52 cards, the size of A is 4 and the size of the sample space is 52.
Therefore, the probability of A is equal to 4/52.
The rule used in computing probabilities, assuming equal likelihood of all
basic outcomes, is as follows:
Probability of the event A:




Where:

(2) The Relative Frequency Approach
As per this approach, the probability of occurrence of an event is the ratio of the number of
times the event occurs to the total number of trials. For example, consider an experiment of
administering a taste test for a new soup and event B that a consumer likes the taste. In this
situation, the past data on people who were administered the soup and the number that liked
the taste is taken. In the absence of past data, an experiment is undertaken, where the taste
test is administered on a group of people to check its effect. The probability of the event B:




Where:

(3) The Subjective Approach
The Subjective Approach involves personal judgment, information, intuition, and other
subjective evaluation criteria. For example consider an experiment of commissioning a solar
power plant and event C that the plant turns out to be a successful venture. Since a solar
power plant being a relatively new development involving the latest technology, past

Page 3 of 19

experiences are not available. Experimentation is also ruled out because of high cost and time
involved, unlike the taste testing situation.
NOTE: The different approaches have evolved to cater to different kinds of situations. These
complement each other in the sense that where one fails, the other becomes applicable.
Example 1
A fair coin is tossed twice. Find the probabilities of the following events:
(a) A, getting two heads
(b) B, getting one head and one tail
(c) C, getting at least one head or one tail
(d) D, getting four heads
Solution:
Being a Two-Trial Coin Tossing Experiment, it gives rise to the following O
n
= 2
n
= 4,
possible equally likely outcomes:
HH HT TH TT
Thus, for the sample space: N(S) = 4
We can use the Classical Approach to find out the required probabilities.
(a) Probability for event A of getting two heads








(b) Probability for even B of getting one head and one tail











(c) Probability for event C of getting at least one head or one tail

Page 4 of 19

(d) Probability for event D of getting four heads








Note: The occurrence of C is certainty, whereas D is an impossible event.
Example 2
A newspaper boy wants to find out the chances that on any day he will be able to sell more
than 90 copies of The Standard. From his dairy where he recorded the daily sales of the last
year, he finds out that out of 365 days, on 75 days he had sold 80 copies, on 144 days he had
sold 85 copies, on 62 days he had sold 95 copies and on 84 days he had sold 100 copies of
The Standard. Find out the required probability for the newspaper boy.
Solution
Taking the Relative Frequency Approach, we find:
Sales(Event) No. of Days (Frequency) Relative Frequency
80 75 75/365
85 144 144/365
95 62 62/365
100 84 84/365
 the number of days when his sales were more than 90 = (62 + 84) days = 146 days







PROBABILITY AXIOMS
(a) The probability of an event A, written as P(A), must be a number
between zero and one, both values inclusive.

(b) The probability of occurrence of one or the other of all possible events is equal to one.
As S denotes the sample space or the set of all possible events, we write

Page 5 of 19

Thus in tossing a coin once; P(a head or a tail) = 1.
(c) If two events are such that occurrence of one implies that the other cannot occur, then
the probability that either one or the other will occur is equal to the sum of their
individual probabilities. Thus, in a coin-tossing situation, the occurrence of a head
rules out the possibility of occurrence of tail. These events are called mutually
exclusive events. In such cases then, if A and B are the two events respectively, then

If two or more events together define the total sample space, the events are said to be
collectively exhaustive.
INTERPRETATION OF A PROBABILITY
The values 0 and 1 sets the range of values that the probability measure may take. In other
words 0 ≤ P(A) ≤ 1
When an event cannot occur (impossible event), its probability is zero. The probability of the
empty set is zero: P(Φ) = 0. In a deck where half the cards are red and half are black, the
probability of drawing a green card is zero because the set corresponding to that event is the
empty set: there are no green cards.
Events that are certain to occur have probability 1.00. The probability of the entire sample
space S is equal to 1.00: P(S) = 1.00. If we draw a card out of a deck, 1 of the 52 cards in the
deck will certainly be drawn, and so the probability of the sample space, the set of all 52
cards, is equal to 1.00.
Within the range of values 0 to 1, the greater the probability, the more confidence we have in
the occurrence of the event in question. A probability of 0.95 implies a very high confidence
in the occurrence of the event. A probability of 0.80 implies a high confidence. When the
probability is 0.5, the event is as likely to occur as it is not to occur. When the probability is
0.2, the event is not very likely to occur. When we assign a probability of 0.05, we believe
the event is unlikely to occur, and so on.

Page 6 of 19

PROBABILITY RULES
(1) The Union Rule
The Rule of Unions allows us to write the probability of the union of two events in terms of
the probabilities of the two events and the probability of their intersection.
(a) Two Overlapping Events
Consider two events A and B defined over the sample space S. The probability of the
intersection or union of two events is:

Example 3
A card is drawn from a well-shuffled pack of playing cards. Find the probability that the card
drawn is either a club or a king.
Solution
Let A be the event that a club is drawn and B the event that a king is drawn. Then,

Example 4
Suppose your chance of being offered a certain job is 0.45, your probability of getting
another job is 0.55, and your probability of being offered both jobs is 0.30. What is the
probability that you will be offered at least one of the two jobs?
Solution:
Let A be the event that the first job is offered and B the event that the second job is offered.
Then,


(b) Mutually Exclusive Events
When the sets corresponding to two events are disjoint (i.e., have no intersection), the two
events are called mutually exclusive. For mutually exclusive events, the probability of the
intersection of the events is zero. For mutually exclusive events the Rule of Unions is:

Page 7 of 19


Example 5
A card is drawn from a well-shuffled pack of playing cards. Find the probability that the card
drawn is either a king or a queen.
Solution:
Let A be the event that a king is drawn and B the event that a queen is drawn. Since A and B
are two mutually exclusive events, we have,

(2) The Complement Rule
The Rule of Complements defines the probability of the complement of an event in terms of
the probability of the original event. Consider event A defined over the sample space S. The
complement of set A, denoted by , is a subset, which contains all outcomes, which do not
belong to A.

As a simple example, if the probability of rain tomorrow is 0.3, then the
probability of no rain tomorrow must be 1 - 0.3 = 0.7. If the probability
of drawing a king is 4/52, then the probability of the drawn card's not
being a king is 1 - 4/52 = 48/52.
Example 7
Find the probability of the event of getting a total of less than 12 in the experiment of
throwing a die twice.
Solution
Let A be the event of getting a total 12.


(3) The Conditional Probability Rule

Page 8 of 19

As a measure of uncertainty, probability depends on information. There are situations where
the probability of an event A is influenced by the information that another event B has
occurred.Thus, the probability of event A given the occurrence of event B is defined as the
probability of the intersection of A and B, divided by the probability of event B.




Example 8
For an experiment of throwing a die twice, find the probability:
(a) of the event of getting a total of 9, given that the die has shown up points between 4
and 6 (both inclusive)
(b) of the event of getting points between 4 and 6 (both inclusive), given that a total of 9
has already been obtained
Solution:
Let getting a total 9 be the event A and the die showing points between 4 and 6 (both
inclusive) be the event B



(a)
(b)

(4) The Product Rule
The Product Rule (also called Multiplication Theorem) allows us to write the probability of
the simultaneous occurrence of two (or more) events. The Product Rule states that the
probability that both A and B will occur simultaneously is equal to the probability that B (or
A) will occur multiplied by the conditional probability that A (or B) will occur, when it is
known that B (or A) is certain to occur or has already occurred.

Example 9

Page 9 of 19

A box contains 10 balls out of which 2 are green, 5 are red and 3 are black. If two balls are
drawn at random, one after the other without replacement, from the box. Find the
probabilities that:
(a) both the balls are of green color
(b) both the balls are of black color
(c) both the balls are of red color
(d) the first ball is red and the second one is black
(e) the first ball is green and the second one is red
Solution:
(a) Probability that both the balls are of green color


















(b) Probability that both the balls are of black color


















(c) Probability that both the balls are of red color


















(d) Probability that the first ball is red and the second one is black


















(e) Probability that the first ball is green and the second one is red


















Example 10
A consulting firm is bidding for two jobs, one with each of two large multinational
corporations. The company executives estimate that the probability of obtaining the
consulting job with firm A, event A, is 0.45. The executives also feel that if the company
should get the job with firm A, then there is a 0.90 probability that firm B will also give the
company the consulting job. What are the company's chances of getting both jobs?
Solution:

Page 10 of 19


INDEPENDENT EVENTS
Two events are said to be independent of each other if the occurrence or non-occurrence of
one event in any trial does not affect the occurrence of the other event in any trial. Events A
and B are independent of each other if and only if the following three conditions hold:
(1)
(2) and
(3)
The rules for union and intersection of two independent events can be extended to sequences
of more than two events.
Intersection Rule
The probability of the intersection of several independent events A1, A2, ……is just the
product of separate probabilities i.e.







Union Rule
The probability of the union of several independent events A1, A2, ……is given by the
following equation.







Example 11
A problem in mathematics is given to five students A, B,C, D and E. Their chances of solving
it are 1/2, 1/3, 1/3, 1/4 and 1/5 respectively. Find the probability that the problem will
(a) not be solved
(b) be solved

Solution:
The problem will not be solved when none of the students solve it.

Page 11 of 19



The problem will be solved when at least one of the students solve it.

Page 12 of 19

BINOMIAL DISTRIBUTION
A Binomial Distribution is a probability distribution that summarizes the likelihood that a
value will take one of the two independent values under a given set of parameters or
assumptions.
A Binomial Distribution is the probability distribution of Binomial Random Variable. A
Binomial Random Variable is the random variable that counts the number of successes in
many independent, identical Bernoulli trials. Bernoulli trials are trials that have only two
possible outcomes, which are mutually exclusive. In other words, these are trials, whose
outcome can only be either a success or a failure.
CONDITIONS FOR A BINOMIAL RANDOM VARIABLE
The condition to be satisfied for a binomial random variable is that the experiment should be
a Bernoulli Process. A Bernoulli Process is a sequence of identically and independently
distributed Bernoulli variables. Any uncertain situation or experiment that is
marked by the following three properties is known as a Bernoulli Process:
(1) There are only two mutually exclusive and collectively exhaustive outcomes in the
experiment i.e. S = {success, failure}
(2) In repeated trials of the experiment, the probabilities of occurrence of these events
remain constant
(3) The outcomes of the trials are independent of one another
BINOMIAL PROBABILITY FUNCTION














CHARACTERISTICS OF A BINOMIAL DISTRIBUTION
(1) A fixed number of observations.
(2) Each observation is categorized as to whether or not the “event of interest” occured.
(3) Constant probability for the event of interest occuring for each observation.

Page 13 of 19

(4) Observations are independent.
(5) The expected value or the mean, denoted by μ, of a Binomial distribution is computed as

(6)The variance, denoted by σ
2
, of a Binomial distribution is computed as


(7) The r
th
moment about the origin denoted by


, of a Binomialdistribution is computed as:






(8) The r
th
moment about the mean denoted by


, of a binomial distribution is computed
as:





(9) To bring out the skewness of a Binomial distribution we can calculate, moment
coefficient of skewness,

(10) A measure of kurtosis of the Binomial distribution is given by the moment coefficient
of kurtosis

(11) If n is large and if neither of p or q is too close to zero, the Binomial distribution can
be closely approximated by a Normal distribution with standardized variable

(12) Binomial distribution can reasonably be approximated by the Poisson distribution
when n is infinitely large and p is infinitely small i. e. When
Example 1
Assuming the probability of male birth as ½, find the probability distribution of number of
boys out of 5 births.
(a) Find the probability that a family of 5 children have
(i) at least one boy
(ii) at most 3 boys
(b) Out of 960 families with 5 children each find the expected number of families with (i) and
(ii) above
Solution:








The probability distribution of X is given below
X = x: 0 1 2 3 4 5
P (X = x): 1/32 5/32 10/32 10/32 5/32 1/32

Page 14 of 19

(a)(i)
(ii)

(b)(i)
(ii)

Page 15 of 19

THE NORMAL DISTRIBUTION
The Normal Distribution is useful in characterizing uncertainties in many real-life processes,
in statistical inferences, and in approximating other probability distributions.
PROPERTIES OF NORMAL DISTRIBUTION
(1) The normal curve is not a single curve representing only one continuous distribution. It
represents a family of normal curves; since for each different value of μ and σ, there is a
specific normal curve different in its positioning on the X-axis and the extent of spread
around the mean.

(2) The normal curve is bell-shaped and perfectly symmetric about its mean. As a result 50%
of the area lies to the right of mean and balance 50% to the left of mean.
(3) The normal curve has a (relative) kurtosis of 0, which means it has average peakedness
and is mesokurtic.
(4) Theoretically, the normal curve never touches the horizontal axis and extends to infinity
on both sides. That is the curve is asymptotic to X-axis.
(5) If several independent random variables are normally distributed, then their sum will also
be normally distributed.
(6) If a normal variable X under goes a linear change in scale such as Y = aX + b, where a
and b are constants and a ≠ 0; the resultant Y variable will also be normally distributed
with mean = a E(X) + b and Variance = a
2
V(X).

Page 16 of 19

Example
A cost accountant needs to forecast the unit cost of a product for the next year. He notes that
each unit of the product requires 10 labor hours and 5 kg of raw material. In addition, each
unit of the product is assigned an overhead cost of Kshs 200. He estimates that the cost of a
labor hour next year will be normally distributed with an expected value of Rs 45 and a
standard deviation of Kshs 2; the cost of raw material will be normally distributed with an
expected value of Kshs 60 and a standard deviation of Kshs 3. Find the distribution of the
unit cost of the product. Find its expected value and variance.
Solution:
Since the cost of labor L may not influence the cost of raw material M, we can assume that
the two are independent. This makes the unit cost of the product Q a random variable. So if























Page 17 of 19

THE CENTRAL LIMIT THEOREM
The Central Limit Theorem is remarkable because it states that the distribution of the
sample mean tends to a normal distribution regardless of the distribution of the
population from which the random sample is drawn.
The Central Limit Theorem states that when sampling is done from a population with mean
and standard deviation , the sampling distribution of the sample mean tends to a
normal distribution with mean μ and standard deviation as the sample size n increases.




)
APPLICATION OF THE CENTRAL LIMIT THEOREM
(1) The theorem allows us to make probability statements about the possible
range of values the sample mean may take.
(2) It allows us to compute probabilities of how far away may be from
the population mean it estimates.

Figure 1 Sampling Distributions of for different Sample Sizes
The central limit theorem says that, in the limit, as n goes to infinity (n → ), the
distribution of becomes a normal distribution (regardless of the distribution of the
population). The rate at which the distribution approaches a normal distribution does depend,
however, on the shape of the distribution of the parent population:
(1) if the population itself is normally distributed, the distribution of is normal for
any sample size n.

Page 18 of 19

(2) if the population distributions are very different from a normal distribution, a
relatively large sample size is required to achieve a good normal approximation for
the distribution of .
In general, a sample of 30 or more elements is considered “Large Enough” for the central
limit theorem to be applicable.

Figure 12-5 Population Distribution and the Sampling Distribution of
The figure emphasizes the three aspects of the central limit theorem:
(1) When the sample size is large enough, the sampling distribution of is normal
(2) The expected value of is
(3) The standard deviation of is
Example
ABC Tool Company makes Laser XR; a special engine used in speedboats. The company’s
engineers believe that the engine delivers an average power of 220 horsepower, and that the
standard deviation of power delivered is 15 horsepower. A potential buyer intends to sample
100 engines (each engine to be run a single time). What is the probability that the sample
mean will be less than 217 horsepower?
Solution:
Given that:

Page 19 of 19




























So there is a small probability that the potential buyer’s tests will result in a sample mean less
than 217 horsepower.

Page 1 of 14

INDEX NUMBERS
DEFINITION
Index numbers are statistical devices designed to measure the relative changes in the level of a
certain phenomenon in two or more situations. According to Wheldon an index number is a
statistical device for indicating the relative movements of the data where measurement of
actual movements is difficult or incapable of being made. The phenomenon under
consideration may be any field of quantitative measurements. It may refer to a single variable
or a group of distinct but related variables. In Business and Economics, the phenomenon
under consideration may be:
 the prices of a particular commodity like steel, gold, leather, etc. or a group of
commodities like consumer goods, cereals, milk and milk products, cosmetics, etc.
 volume of trade, factory production, industrial or agricultural production, imports or
exports, stocks and shares, sales and profits of a business house and so on.
 the national income of a country, wage structure of workers in various sectors, bank
deposits, foreign exchange reserves, cost of living of persons of a particular community,
class or profession and so on.
The various situations requiring comparison may refer to either
 the changes occurring over a time, or
 the difference(s) between two or more places, or
 the variations between similar categories of objects/subjects, such as persons, groups of
persons, organisations etc. or other characteristics such as income, profession, etc.
CHARACTERISTICS OF INDEX NUMBERS
1. Index Numbers are specialized averages.
2. Index Numbers are expressed in percentages.
3. Index Numbers measure changes not capable of direct measurement, e.g. price level, cost
of living, business or economic, activity etc.
4. Index Numbers are for comparison.
USES OF INDEX NUMBER
1. Index numbers as economic barometers
2. Index numbers help in studying trends and tendencies.
3. Index numbers help in formulating decisions and policies.

Page 2 of 14

4. Price indices measure the purchasing power of money
5. Index numbers are used for deflation.
6. They measure the relative change.
7. They are of better comparison.
8. They are good guides.
9. They are the pulse of the economy.
10. They compare the wage adjuster.
11. They compare the standard of living.
12. They are a special type of averages.
13. They provide guidelines to policy.
14. To measure the purchasing power of money.
TYPES OF INDEX NUMBERS
1. Price Index Numbers: The price index numbers measure the general changes in the
prices. They are further sub-divided into the following classes:
(i) Wholesale Price Index Numbers: The wholesale price index numbers reflect the changes
in the general price level of a country.
(ii) Retail Price Index Numbers: These indices reflect the general changes in the retail
prices of various commodities such as consumption goods, stocks and shares, bank
deposits, government bonds, etc.
(iii)Consumer Price Index: Commonly known as the Cost of living Index, CPI is a
specialized kind of retail price index and enables us to study the effect of changes in the
price of a basket of goods or commodities on the purchasing power or cost of living of a
particular class or section of the people like labour class, industrial or agricultural worker,
low income or middle income class etc.
2. Quantity Index Numbers: Quantity index numbers study the changes in the volume of
goods produced (manufactured), consumed or distributed, like: the indices of agricultural
production, industrial production, imports and exports, etc.
3. Value Index Numbers: These are intended to study the change in the total value (price
multiplied by quantity) of output such as indices of retail sales or profits or inventories.
PROBLEMS IN THE CONSTRUCTION OF INDEX NUMBERS
1. Purpose of the index numbers.
2. Selection of weights to be used.

Page 3 of 14

3. Type of average to be used.
4. Choice of Index:
5. Selection of base period.
6. Selection of commodities.
7. Selection of source of data.
8. Collection of data.
NOTATIONS USED FOR INDEX NUMBERS
Since index numbers are computed for prices, quantities, and values, these are denoted by the
lower case letters:
p,q and v represent respectively the price,the quantity,and the value of an
individual commodity
Subscripts 0, 1, 2,… i, ... are attached to these lower case letters to distinguish price, quantity,
or value in any one period from those in the other. Thus,
P
0=denotes the price of a commodity in the base period,
P
1=denotes the price a commodity in period 1,or the current period,and
P
i=denotes the price a commodity in the i
th
period,where i=1,2,3,……..
Similar meanings are assigned to q0, q1, ... qi, ... and v0, v1, … vi, …
Capital letters P, Q and V are used to represent the price, quantity, and value index numbers,
respectively. Subscripts attached to P, Q, and V indicates the years compared. Thus,
P
01=means the price index for period 1 relative to period 0,
P
02=means the price index for period 2 relative to period 0,
P
12=means the price index for period 2 relative to period 1,and so on
Similar meanings are assigned to quantity Q and value V indices. It may be noted that all
indices are expressed in percent with 100 as the index for the base period, the period with
which comparison is to be made.
SIMPLE INDEX NUMBERS
A simple price index number is based on the price or quantity of a single commodity. To
construct a simple index, we first have to decide on the base period and then find ratio of the

Page 4 of 14

value at any subsequent period to the value in that base period - the price/quantity
relative.This ratio is then finally converted to a percentage.
??????���� ��� ��� ���??????�� ??????=
??????????????????�� ??????� ���??????�� ??????
??????????????????�� ??????� �??????�� ??????�??????�
∗100
Simple Price Index for period i = 1,2,3 ... will be
�
01=
�
??????
�
0
∗100
Similarly, Simple Quantity Index for period i = 1,2,3 ... will be
�
01=
�
??????
�
0
∗100
Example 1
Given are the following price-quantity data of fish, with price quoted in Kshs per kg and
production in qtls.
Year 1980 1981 1982 1983 1984 1985
Price 15 17 16 18 22 20
Production 500 550 480 610 650 600
Construct:
(a) the price index for each year taking price of 1980 as base,
(b) the quantity index for each year taking quantity of 1980 as base.
Solution:
Simple Price and Quantity Indices of Fish (Base Year = 1980
Year Price(�
??????) Quantity (�
??????) Price Index
�
0??????=
�
??????
�
0
∗100
Quantity Index
�
0??????=
�
??????
�
0
∗100
1980 15 500 100.00 100.00
1981 17 550 113.33 110.00
1982 16 480 106.67 96.00
1983 18 610 120.00 122.00
1984 22 650 146.67 130.00
1985 20 600 133.00 120.00

Page 5 of 14

COMPOSITE INDEX NUMBERS
Aggregative (or composite) index numbers are indices constructed for a group of
commodities or variables are known as aggregative (or composite) index numbers. Depending
upon the method used for constructing an index, composite indices may be:
1. Simple Aggregative Price/ Quantity Index
2. Index of Average of Price/Quantity Relatives
3. Weighted Aggregative Price/ Quantity Index
4. Index of Weighted Average of Price/Quantity Relatives
SIMPLE AGGREGATIVE PRICE/ QUANTITY INDEX
Irrespective of the units in which prices/quantities are quoted, this index for given
prices/quantities, of a group of commodities is constructed in the following three steps:
(i) Find the aggregate of prices/quantities of all commodities for each period (or place).
(ii) Selecting one period as the base, divide the aggregate prices/quantities corresponding to
each period (or place) by the aggregate of prices/ quantities in the base period.
(iii)Express the result in percent by multiplying by 100.
The computation procedure contained in the above steps can be expressed as:
�
0??????=
∑�
??????
∑�
0
∗100 ??????��
�
0??????=
∑�
??????
∑�
0
∗100
Example 2
Given are the following price-quantity data, with price quoted in Kshs per kg and production
in qtls.
1980 1985
Item Price Production Price Production
Fish 15 500 20 600
Mutton 18 590 23 640
Chicken 22 450 24 500
Find
(a) Simple Aggregative Price Index with 1980 as the base.
(b) Simple Aggregative Quantity Index with 1980 as the base.

Page 6 of 14

Calculations for Simple Aggregative Price and Quantity Indices (Base Year = 1980)
Prices Quantities
Item 1980 (�
0) 1985 (�
??????) 1980 (�
0) 1985 (�
??????)
Fish 15 20 500 600
Mutton 18 23 590 640
Chicken 22 24 450 500
Sum 55 67 1540 1740
(a) Simple Aggregative Price Index with 1980 as the base.
�
0??????=
∑�
??????
∑�
0
∗100=
67
55
∗100=121.82
(b) Simple Aggregative Quantity Index with 1980 as the base
�
0??????=
∑�
??????
∑�
0
∗100=
1740
1540
∗100=112.98
INDEX OF AVERAGE OF PRICE/QUANTITY RELATIVES
Given the prices/quantities of a number of commodities that enter into the construction of this
index, it is computed in the following two steps:
(i) After selecting the base year, find the price relative/quantity relative of each commodity
for each year with respect to the base year price/quantity. As defined earlier, the price
relative/quantity relative of a commodity for a given period is the ratio of the
price/quantity of that commodity in the given period to its price/quantity in the base
period.
(ii) Multiply the result for each commodity by 100, to get simple price/quantity indices for
each commodity.
(iii) Take the average of the simple price/quantity indices by using arithmetic mean,
geometric mean or median.
Thus it is computed as:
�
0??????=����??????�� �� (
�
??????
�
??????
∗100) ??????��
�
0??????=����??????�� �� (
�
??????
�
??????
∗100)

Page 7 of 14



Using arithmetic mean
�
0??????=
∑(
�
??????
�
0
∗100)
??????
??????��
�
0??????=
∑(
�
??????
�
0
∗100)
??????

Using geometric mean
�
0??????=���?????? Log[
1
??????
∑log(
�
??????
�
0
∗100)] ??????��
�
0??????=���?????? Log[
1
??????
∑log(
�
??????
�
0
∗100)]
Example 3
From the data in Example 2 find:
(a) Index of Average of Price Relatives (base year 1980); using mean, median and geometric
mean.
(b) Index of Average of Quantity Relatives (base year 1980); using mean, median and
geometric mean.
Solution:
Index of Average of Price Relatives and Quantity Relatives (Base Year = 1980)
Item Price Relative
=(
p
i
p
0
∗100)
=Log(
p
i
p
0
∗100)
Quantity Relative
=(
p
i
p
0
∗100)
=Log(
p
i
p
0
∗100)
Fish 133.33 2.1249 120.00 2.0792
Mutton 127.78 2.1065 108.47 2.0353
Chicken 109.09 2.0378 111.11 2.0458
Sum 370.19 6.2692 339.58 6.1603
(a) Index of Average of Price Relatives (base year 1980)
i) Using Arithmetic Mean

Page 8 of 14

P
0i=
∑(
p
i
p
0
∗100)
N
=
339.58
3
=113.19

Page 9 of 14

ii) Using Median
P
0i=Size of (
N+1
2
)
th
item
=Size of (
3+1
2
)
th
item=Size of 2
nd
item=111.11
iii) Using Geometric Mean
�
0??????=���?????? Log[
1
??????
∑log(
�
??????
�
0
∗100)]
=���?????? Log[
1
3
(6.2689)]
=���?????? Log[2.08963]=122.92
b) Index of Average of Quantity Relatives (base year 1980)
i) Using Arithmetic Mean
Q
0i=
∑(
q
i
q
0
∗100)
N
=
370.19
3
=123.39
ii) Using Median
Q
0i=Size of (
N+1
2
)
th
item
=Size of (
3+1
2
)
th
item=Size of 2
nd
item=127.78
iii) Using Geometric Mean
�
0??????=���?????? Log[
1
??????
∑log(
�
??????
�
0
∗100)]
=���?????? Log[
1
3
(6.1603)]
=���?????? Log[2.05343]=113.092
WEIGHTED AGGREGATIVE PRICE/QUANTITY INDICES
Weighted aggregative indices assign proper weights to individual items. There are various
methods of assigning weights and consequently a large number of formulae for constructing
index numbers have been devised of which some of the most important ones are:
(i) Laspeyre’ s method
(ii) Paasche’ s method

Page 10 of 14

(iii) Fisher’ s ideal Method
(iv) Bowley’ s Method
(v) Marshall- Edgeworth method
LASPEYRE’S INDEX
Laspeyre’s Price Index, using base period quantities as weights is obtained as
�
0??????
????????????
=
∑�
??????�
0
∑�
0�
0
∗100
Laspeyre’s Quantity Index, using base period prices as weights is obtained as
�
0??????
????????????
=
∑�
??????�
0
∑�
0�
0
∗100
PAASCHE’S INDEX
Paasche’s Price Index, using base period quantities as weights is obtained as
�
0??????
�??????
=
∑�
??????�
??????
∑�
0�
??????
∗100
Paasche’s Quantity Index, using base period prices as weights is obtained as
�
0??????
�??????
=
∑�
??????�
??????
∑�
0�
??????
∗100
MARSHALL-EDGEWORTH INDEX
The Marshall-Edgeworth Index uses the average of the base period and given period
quantities/prices as the weights, and is expressed as
=P
0i
ME
=
∑(�
0�
??????+�
??????�
??????
)
∑(�
0�
0+�
??????�
0
)
∗100 ??????��
=P
0i
ME
=
∑(�
0�
??????+�
??????�
??????
)
∑(�
0�
0+�
??????�
0
)
∗100
DORBISH AND BOWLEY INDEX
The Dorbish and Bowley Index is defined as the arithmetic mean of the Laspeyre’s and
Paasche’s indices.
�
0??????
????????????
=
�
0??????
????????????
+�
0??????
�??????
2

�
0??????
????????????
=
�
0??????
????????????
+�
0??????
�??????
2

Page 11 of 14

FISHER’S IDEAL INDEX
The Fisher’s Ideal Index is defined as the geometric mean of the Laspeyre’s and Paasche’s
indices.
�
0??????
??????
=√�
0??????
????????????
.�
�??????
�??????
??????��
�
0??????
??????
=√�
0??????
????????????
.�
�??????
�??????

Example 4
From the data in Example 6.2 find:
(a) Laspeyre’s Price Index for 1985, using 1980 as the base.
(b) Laspeyre’s Quantity Index for 1985, using 1980 as the base.
(c) Paasche’s Price Index for 1985, using 1980 as the base.
(d) Paasche’s Quantity Index for 1985, using 1980 as the base.
Solution:
Calculations for Laspeyre’s and Paasche’s Indices (Base Year = 1980)
Item �
��
� �
��
� �
��
� �
��
�
Fish 7,500 10,000 9,000 12,000
Mutton 10,620 13,570 11,520 14,720
Chiken 9,900 10,800 11,000 12,000
Sum 28,020 34,370 31,520 38,720
(a) Laspeyre’s Price Index for 1985, using 1980 as the base
�
0??????
????????????
=
∑�
??????�
0
∑�
0�
0
∗100=
34,370
28,020
∗100=122.66
(b) Laspeyre’s Quantity Index for 1985, using 1980 as the base
�
0??????
????????????
=
∑�
??????�
0
∑�
0�
0
∗100=
31,520
28,020
∗100=112.49

Page 12 of 14

(c) Paasche’s Price Index for 1985, using 1980 as the base
�
0??????
�??????
=
∑�
??????�
??????
∑�
0�
??????
∗100=
38,720
31,520
∗100=122.84
(d) Paasche’s Quantity Index for 1985, using 1980 as the base
�
0??????
�??????
=
∑�
??????�
??????
∑�
0�
??????
∗100=
38,720
34,370
∗100=112.66
Example 5
Construct price index number from the following data by applying
(a) Laspeyere’ s Method
(b) Paasche’ s Method
(c) Fisher’ s ideal Method
Commodity 2000 2001
Price Quantity Price Quantity
A 2 8 4 5
B 5 12 6 10
C 4 15 5 12
D 2 18 4 20
Solution:
Commodity �
��
� �
��
� �
��
� �
��
�
A 16 32 10 20
B 60 72 50 60
C 60 75 48 60
D 36 72 40 80
Sum 172 251 148 220

Laspeyre

s Price Index=P
01
La
=
∑p
1q
0
∑p
0q
0
∗100=
251
172
∗100=145.93
Pasche

s Price Index=P
01
Pa
=
∑�
1�
1
∑�
0�
1
∗100=
220
148
∗100=148.65

Fisher

s Ideal Index=P
01
F
=√�
01
????????????
.�
�1
�??????
=√145.93∗148.65=√21692.4945=147.28

Page 13 of 14

Interpretation:
The results can be interpreted as follows: If Kshs 100 were used in the base year to buy the
given commodities, we have to use Kshs 145.90 in the current year to buy the same amount of
the commodities as per the Laspeyre’ s formula. Other values give similar meaning.
Example 6
Calculate the index number from the following data by applying
(a) Bowley’ s price index
(b) Marshall- Edgeworth price index
Commodity Base year Current year
Quantity Price Quantity Price
A 10 3 8 4
B 20 15 15 20
C 2 25 3 30
Solution:
Commodity �
��
� �
��
� �
��
� �
��
�
A 30 40 24 32
B 300 400 225 300
C 50 60 75 90
Sum 380 500 324 422

Dorbish and Bowley Index=P
01
DB
=
P
01
La
+P
01
Pa
2
=
1
2
[
p
1q
0
p
0p
0
+
p
1q
1
p
0q
1
]∗100
=
1
2
[
500
380
+
422
324
]∗100
=
1
2
[1.316+1.302]∗100
=
1
2
[1.316+1.302]∗100
=
1
2
[2.168]∗100=130.9
Marshall−Edgeworth Index=P
01
ME
=
∑(�
0�
1+�
1�
1
)
∑(�
0�
0+�
1�
0
)
∗100

Page 14 of 14

=[
500+422
380+324
]∗100
=[
922
704
]∗100=130.966