Chi Square & Anova

10,986 views 20 slides Mar 24, 2019
Slide 1
Slide 1 of 20
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20

About This Presentation

It includes various problems related to Chi Square and Anova. Briefed Computational procedures are mentioned for the better understanding.


Slide Content

Chi Square & Anova

Difference between Chi Square & Anova Birinder Singh, Assistant Professor, PCTE Ludhiana 2 It enables us to test whether more than two population proportions can be considered equal Analysis of Variance ( Anova ) enables us to test whether more than two population means can be considered equal. Chi Square ( χ 2 Test) Anova (F Test)

Characteristics of Chi Square Every Chi square distribution extends indefinitely to right from zero. It is skewed to right As df increases, Chi square curve become more bell shaped and approaches normal distribution. Its mean is degree of freedom Its variance is twice degree of freedom 3 Birinder Singh, Assistant Professor, PCTE Ludhiana

Chi Square ( χ 2 Test ) Chi Square Test deals with analysis of categorical data in terms of frequencies / proportions / percentages. It is primarily of three types: Test of Homogeneity: To determine whether different population are similar w.r.t some characteristics. Test of Independence: Tests whether the characteristics of the elements of the same population are related or independent. Test of Goodness of Fit: To determine whether there is a significant difference between an observed frequency distribution and theoretical probability distribution. 4 Birinder Singh, Assistant Professor, PCTE Ludhiana

Computational Procedure – CHI Square Test Formulate Null & Alternative Hypothesis State type of test Select LOS Compute expected frequencies assuming H to be true. Compute χ 2 calculated value using cal = Extract crit value from table Compare cal & crit and make decision.   5 Birinder Singh, Assistant Professor, PCTE Ludhiana

Chi Square – Test of Homogeneity Based on a study, it is expected that 50% of the students opt for marketing, 30% for finance and 20% for HR. In a sample for 100, it was observed that 61, 24 and 15 opt for these subjects respectively. Do you agree with study findings at 10% LOS? ( cal = 4.87) ( crit = 4.605) (Rejected)   6 Birinder Singh, Assistant Professor, PCTE Ludhiana

Chi Square – Test of Homogeneity A shoe seller has received the consignment of the order that he had placed for 10000 pair of different sizes. Without physically segregating the sizes and counting no. of pair of shoes of each size, he wants to ascertain that consignment received is as per order . Check at 5% LOS. ( cal = 6.87) ( crit = 12.592) (Accepted)   7 Birinder Singh, Assistant Professor, PCTE Ludhiana Size 4 5 6 7 8 9 10 Order qty 500 1500 2000 2000 2000 1500 500 Rec. qty 700 1800 2200 2000 2000 1300

Chi Square – Test of Independence Formulae to be used Computation of expected frequency Fe = (RT x CT) / GT where RT = Row Total CT = Column Total GT = Grand Total Computation of degree of freedom Df = (r – 1) (c – 1) 8 Birinder Singh, Assistant Professor, PCTE Ludhiana

Chi Square – Test of Independence Following data was collected when a survey was carried out on preference for formal wear in work place: Is there difference in preference due to sex? Check at 20% LOS. ( cal = 0.2522) ( crit = 1.642) (Accepted)   9 Birinder Singh, Assistant Professor, PCTE Ludhiana Gents Ladies Yes 520 60 No 80 11

Chi Square – Test of Independence Sample data in respect of viewership of a TV Program for various age groups was collected and is as follows: Is viewership of program independent of age ? Check at 5% LOS. ( cal = 26.01 ) ( crit = 9.488) (Rejected)   10 Birinder Singh, Assistant Professor, PCTE Ludhiana 15-25 26-40 41-50 Always 75 180 105 Sometimes 50 60 40 Never 25 20 5

Chi Square – Test of Goodness of Fit Gordon Company requires that college seniors who are seeking positions will be interviewed. For staffing purposes, the director of recruitment thinks that the interview process can be approximated by a binomial distribution with p = 0.40 i.e. Can he conclude that BD at p = 0.4 provides a good description of observed frequencies. Check at 20% LOS. ( cal = 5.041 ) ( crit = 4.642) (Rejected)   11 Birinder Singh, Assistant Professor, PCTE Ludhiana

Analysis of Variance ( Anova ) It enables us to test for the significance of the differences among more than two sample means. Using Anova , we will be able to make inferences about whether our samples are drawn from population having the same mean. Examples: Comparing the mileage of five different brands of cars Testing which of the four different training methods produces the fastest learning record Comparing the average salary of three different companies In each of these cases, we would compare the means of more than two sample means. F-Distribution is used to analyze certain situations 12 Birinder Singh, Assistant Professor, PCTE Ludhiana

Assumptions Populations are normally distributed Samples are random and independent Population Variances are equal. 13 Birinder Singh, Assistant Professor, PCTE Ludhiana

Computational Procedure in Anova (One Way) Define Null & Alternative Hypothesis Select estimator & determine its distribution Select Significance Level Calculate Sum of all observations: T = Ʃx i Calculate correction factor: CF = T 2 / n T where n T = sample size Calculate Sum of squares total, SST = Calculate Sum of squares between columns, SSB = Calculate Sum of squares within columns, SSW = SST – SSB Calculate Mean of squares between groups, MSB = SSB / (k – 1) where k = no. of samples Calculate Mean of squares within groups, MSW = SSW / ( n T – k) Calculate F cal = MSB / MSW Calculate F crit = F( df num , df den , α ) where df num = k – 1, df den = n T – k Compare F cal & F crit and make your statistical & managerial decisions   14 Birinder Singh, Assistant Professor, PCTE Ludhiana

Practice Problem – One Way Anova Three group of students are taught a statistical technique by three different methods. When tested on one problem, sample scores of 3 students selected at random from each of the group as under: At 0.05 LOS, do the means of populations taught by three methods differ? ( F cal = 1.5, F crit = 5.14) (Accepted) 15 Birinder Singh, Assistant Professor, PCTE Ludhiana Group 1 Group 2 Group 3 3 5 3 4 7 7 5 6 5

16 Birinder Singh, Assistant Professor, PCTE Ludhiana

Practice Problem – One Way Anova IAA wanted to find out if average sale of small cars namely Swift, Jazz & Figo is same in Tier II cities. It obtained quarterly sales data from 5 such cities A,B,C,D,E as shown. What conclusion can be drawn at 0.05 LOS? ( F cal = 1.5, F crit = 5.14) (Accepted) 17 Birinder Singh, Assistant Professor, PCTE Ludhiana City Swift Jazz Figo A 32 26 30 B 28 34 - C 25 - 28 D 34 33 32 E 31 31 26

Computational Procedure in Anova (Two Way) Define Null & Alternative Hypothesis Select estimator & determine its distribution Select Significance Level Calculate Sum of all observations: T = Ʃx i Calculate correction factor: CF = T 2 / n T where n T = sample size Calculate Sum of squares columns , SSC = Calculate Sum of squares rows, SSR = Calculate Sum of squares total, SST = Calculate Sum of square error, SSE = SST – (SSC + SSR) Calculate Mean of squares column, MSC = SSC / (c – 1) Calculate Mean of squares rows, MSR = SSR / (r – 1) Calculate Mean of squares error, MSE = SSE / (c – 1 ) (r – 1) Calculate F cal = MSC/MSE & MSR/MSE Calculate F crit = F( df num , df den , α ) where df num = c-1 or r-1, df den = (c-1)(r-1) Compare F cal & F crit and make your statistical & managerial decisions   18 Birinder Singh, Assistant Professor, PCTE Ludhiana

Practice Problem – Two Way Anova Three salesmen Kallu , Lallu & Mallu were assigned three cities A,B & C. The data on sales for quarter ending June 2017 achieved by them is: Is there any significant difference in sales made by 3 of them? Is there any significant difference in sales made in 3 cities? Check at 0.05 LOS? ( F1 cal = 9.23, F1 crit = 6.94) (F2 cal = 3.25, F2 crit = 6.94) (Rejected) & (Accepted) 19 Birinder Singh, Assistant Professor, PCTE Ludhiana City Kallu Mallu Lallu A 4 3 4 B 3 2 5 C 5 3 6

Decision Flow Diagram - Estimation 20 Birinder Singh, Assistant Professor, PCTE Ludhiana Start Is n≥30 Is pop. Known to be normally distributed Use ‘Z’ table Stop Use a Statistician Is SD known? Use ‘Z’ table Stop Use ‘t’ table Stop