Discriminate Analysis for health statistics .pptx

BRGautam 15 views 23 slides Aug 06, 2024
Slide 1
Slide 1 of 23
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23

About This Presentation

Discriminate Analysis for health statistics


Slide Content

Discriminate analysis Presented by: Bhoj Raj Gautam Roshan thapa

Table of content Introduction Assumptions of discriminate analysis Discriminate Analysis Model Types of DA Steps of Analysis Hypothesis Strengths and Weakness Bibilogrpahy

Introduction Discriminate analysis is a statistical method used for Classifying a set of observation into predefined groups In simple, DA helps to understand the relationship between a “dependent variable” and one/ more “independent variables” It is used to analyze and interpret the differences between two or more groups or categories based on multiple predictor variables DA is sometimes also called as: Discriminate factor analysis

Contd … DA is used when the data are normally distributed whereas the logistic regression is used when the data are not normally distributed Dependent variable – categorical Independent variable – intervals or ratio scale

5 Example Examples includes; Dependent variable is person diseased status (1 yes, 2 no) and independent variables are plasma lipid profile, that is total cholesterol (X1) , triglycerides (X2), HDL (X3) and LDL (X4). Since dependent variable is having two categories, so it it an example of two-group DA. Here independent variables are continuous in nature. We want to know whether somebody has lung cancer. Hence, we wish to predict a yes or no outcome. Possible predictor variables: number of cigarettes smoked a day, coughing frequency and intensity etc.

Objectives To develop discriminant functions To examine whether significant differences exist among the groups, in terms of the predictor variables To determine which predictor variables contribute to most of the intergroup differences To develop the accuracy of classification

When to use da? Data must be from different groups To analysis of differences in groups For classification of new objects

Discriminate analysis model The discriminate analysis model involves linear combination of the following form: where D = discriminate score b’s = discriminate coefficient or weight X’s = predictor or independent variable The coefficient, or weight (b), are estimated so that the groups differ as much as possible on the values of the discriminate function Discriminate analysis– creates an equation which will minimize the possibility of misclassifying cases into their respective groups or categories D = b + b 1 X 1 + b 2 X 2 + b 3 X 3 + …...+ b k X k

Variable with the higher weightings are going to contribute the final grade Grade = b + 0.1X attendance + 0.2X assignment + 0.2X first + 0.5X second + 0.4X final

Assumptions of discriminate analysis

Types of da Linear discriminate analysis: When the criterion/dependent variable has two categories eg : married and unmarried Given by Ronald Fisher in 1936 This methods group images of the same classes and separate images of the different classes This classification involves 2 target categories and 2 predictor variables. These features divide or characterize two or more than two objects or events. Multiple discriminate analysis: When three or more categories of dependent variable are involved

Steps in analysis STEP 1 In step one the independent variables which have the discriminating power are being chosen STEP 2 A discriminant function model is developed by using the coefficients of independent variables

Steps in analysis Contd … STEP 3 In step three Wilk’s Lambda is computed for testing the significance of discriminant function STEP 4 In the step four the independent variables which possess importance in discriminating the groups are being found STEP 5 In step five classification of subjects to their respective group is being made

Steps of discriminant analysis in SPSS 1. Analyze >> Classify >> Discriminant 2. Select ‘dependent variable’ as your grouping variable and enter it into the Grouping Variable Box 3. Click Define Range button and enter the lowest and highest code for your groups 4. Click Continue. 5. Select your predictors (IV’s) and enter into Independents box and select Enter Independents Together. If you planned a stepwise analysis you would at this point select Use Stepwise Method and not the previous instruction. 6. Click on Statistics button and select Means, Univariate Anovas, Box’s M, Unstandardized and Within-Groups Correlation 7. Continue >> Classify. Select Compute From Group Sizes, Summary Table, Leave One Out Classification, Within Groups, and all Plots 8. Continue >> Save and select Predicted Group Membership and Discriminant Scores 9. OK.

Statistics Associated with Discriminant Analysis Eigen values: It is the ratio of between groups to within group sum of squares. Higher the eigen value, more will be the appropriateness of the discriminant function. For a two-group DA, there will be one function and one eigen value. It accounts for explained variance in the model. In case of multiple DA, the eigen values will be more but the values will gradually decline. The first eigen value will be largest and most important, the second will be less than the firstone having less explanatory power.

Box’s M statistic : It is a test for equality of the covariance matrices of the independent variables across the groups. Here the null hypothesis (Ho) is that the observed covariance matrices are equal across groups. So, a nonsignificant test result (i.e., one with a large p- value) will indicate that the covariance matrices are equal.

Wilks’ Lambda : It provides a statistical test to assess discriminating power of the independent variables . If it is significant (revealed by significance of Chisquare ), we reject the “HN: No group separability”, and conclude that the discriminant function is statistically significant.

hypothesis Discriminant analysis tests the following hypothesis: H : the group means of a set of independent variable for two or more groups are equal H1: the group means for two or more groups are not equal This group means is referred to as a centroid

strengths Classification accuracy: discriminate analysis can be effective when the data is well-separated, and the assumptions of the technique are met. It can produce accurate results when the classes are well-defined and distinct Multiclass classification: Unlike some other classification methods, discriminate analysis can handle multiple classes efficiently. It extends naturally to more than two classes without requiring significant modifications Dimensionally reduction: Discriminant analysis can be used to reduce the number of predictors (independent variables) by creating new variables (discriminant function) that capture most of the information about the group differences Normality assumptions: Discriminant analysis can still perform reasonably well when the normality assumptions is slightly violated, especially with large sample sizes

Weakness/Limitations Sensitivity to assumption Overfitting Outliers Non-linear boundaries

Similarities and differences among anova , regression and discriminate analysis ANOVA Regression Discriminate Analysis Similarities Number of dependent variables One One One Number of independent variables Multiple Multiple Multiple Differences Nature of dependent variable Metric Metric Categorical Binary Nature of independent variable Categorical Metric Metric

Bibilogrpahy https://www.statisticssolutions.com/discriminant-analysis/ https://en.wikipedia.org/wiki/Linear_discriminant_analysis https://www.slideshare.net/slideshow/discriminant-function-analysis-dfa/238649297 https://www.slideshare.net/slideshow/discriminantfunctionanalysisdfa2009261213041pptx/256616478 https://www.sciencedirect.com/topics/agricultural-and-biological-sciences/discriminant-analysis https://www.wallstreetmojo.com/discriminant-analysis/ https://stats.oarc.ucla.edu/stata/dae/discriminant-function-analysis/
Tags