Correlation and regression

5,560 views 20 slides Jun 02, 2021
Slide 1
Slide 1 of 20
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20

About This Presentation

Correlation and regression.
It shows different aspects of Correlation and regression.
A small comparison of these two is also listed in this presentation.


Slide Content

By: Aanchal Roll No. :20001534001 CORRELATION & REGRESSION 1

2 Correlation Introduction: Two variables are said to be correlated if the change in one variable results in a corresponding change in the other variable. The correlation is a statistical tool which studies the relationship between two variables. Correlation analysis involves various methods and techniques used for studying and measuring the extent of the relationship between the two variables. Correlation is concerned with the measurement of “ strength of association between variables ”. The degree of association between two or more variables is termed as correlation.

3 Definition The correlation is the measure of the extent and the direction of the relationship between two variables in a bivariate distribution. Example: Height and weight of children. An increase in the price of the commodity by a decrease in the quantity demanded. Types of Correlation: The following are the types of correlation Positive and Negative Correlation Simple, Partial and Multiple Correlation Linear and Non-linear Correlation

Correlation first developed by Sir Francis Galton (1822 – 1911) and then reformulated by Karl Pearson (1857 – 1936 ) 4

5 Types of Correlation i. Positive and Negative correlation : If both the variables are varying in the same direction i.e. if one variable is increasing and the other on an average is also increasing or if as one variable is decreasing, the other on an average, is also decreasing, correlation is said to be positive. If on the other hand, the variable is increasing, the other is decreasing or vice versa, correlation is said to be negative . Example 1: a) heights and weights (b) amount of rainfall and yields of crops (c) price and supply of a commodity (d) income and expenditure on luxury goods (e) blood pressure and age

6 ii. Simple, Partial and Multiple Correlation : When only two variables are studied, it is a case of simple correlation . In partial and multiple correlation , three or more variables are studied . In partial correlation, we have more than two variables, but consider only two variables to be influencing each other, the effect of the other variables being kept constant.

7 iii. Linear and Non-linear Correlation : If the change in one variable tends to bear a constant ratio to the change in the other variable, the correlation is said to be linear . Correlation is said to be non- linear if the amount of change in one variable does nor bear a constant ratio to the amount of change in the other variable.

Methods of Studying Correlation Cor r el a tion G r aph i c al Method Scatter Diag r am A l g eb r aic Method Karl Pearson’s Coefficient of Correlation 8

9 Methods of Studying Correlation The following are the methods of determining correlation Scatter diagram method Karl Pearson’s Coefficient of Correlation Scatter Diagram: This is a graphic method of finding out relationship between the variables. Given data are plotted on a graph paper in the form of dots i.e. for each pair of x and y values we put a dot and thus obtain as many points as the number of observations. The greater the scatter of points over the graph, the lesser the relationship between the variables.

Scatter Diagram Perfect Positive X O Y Correlation Perfect Negative O Y Correlation X O Low Degree of Y Negative Correlation Low Degree of Positive Correlation X X O Y High Degree of X O Positive Correlation Y O Y No Correlation X O High Degree of Negative Correlation Y No Correlation X X O Y 10

11 The Coefficient of Correlation: Correlation between the variables is expressed through the numerical value or degree of Correlation. The numerical value or degree of Correlation between two variables is known as Cofficient of Correlation. It is expressed as “r”. The values range between -1.0 and 1.0.

12 Properties of Karl Pearson’s Correlation Coefficient The coefficient of correlation ‘r’ is always a number between -1 and +1 inclusive . If r = +1 or -1, the sample points lie on a straight line . If ‘r’ is near to +1 or -1, there is a strong linear association between the variables . If ‘r’ is small(close to zero), there is low degree of correlation between the variables .

Utility of Correlation It helps to find whether the two variables are correlated or not. It helps to study the nature of relationship between the variables-expressed as positive or negative correlation. It helps to study the degree of correlation between the given variables. 13

Definition: Regression analysis is a mathematical measure of the average relationship between two or more variables in terms of the original units of the data. Thus term regression is used to denote estimation or prediction of the average value of one variable for a specified value of the other variable. The estimation is done by means of suitable equation , derived on the basis of available bivariate data. Such an equation and its geometrical representation is called regression curve. Regression 14

15 In regression analysis there are two types of variables and they are: i . Independent ii . Dependent . Dependent variable(Y): The variable whose value is influenced or is to be predicted is called dependent variable . Independent variable(X): The variable which influences the values or is used for prediction is called independent variable .

Utility of Regression Analysis It helps in making predictions or estimates of dependent variable for given values of independent variable. It establishes cause and effect relationship between the variables. It explains the nature of relationship between the variables, i.e. positive or negative relationship. It helps to determine the rate of change in one variable in terms of change in the other variable. 16

Limitations of Regression Analysis It is based on the assumption of linear relationship between the variables- it is not always true. It assumes constant relationship between the variables. Relationship between dependent and independent variables is true within the limits of experiment only. 17

Correlation Regression ‘Correlation’ as the name says it determines the interconnection or a co-relationship between the variables. ‘Regression’ explains how an independent variable is numerically associated with the dependent variable. In Correlation, both the independent and dependent values have no difference. However, in Regression, both the dependent and independent variable are different. The primary objective of Correlation is, to find out a quantitative/numerical value expressing the association between the values. When it comes to regression, its primary intent is, to reckon the values of a haphazard variable based on the values of the fixed variable. Correlation stipulates the degree to which both of the variables can move together. However, regression specifies the effect of the change in the unit, in the known variable(p) on the evaluated variable (q). Correlation helps to constitute the connection between the two variables. Regression helps in estimating a variable’s value based on another given value. 18

19

20 “THANK YOU”