Linear & Multiple discriminant analysis using spss
Size: 3.05 MB
Language: en
Added: Mar 14, 2021
Slides: 60 pages
Slide Content
MULTIVARIATE
ANALYSIS
-Dr Nisha Arora
About Me Concepts
How it Works?
Q/A Session
Agenda
•Dr. Nisha Arora is a proficient educator, passionate trainer,
You Tuber, occasional writer, and a learner forever.
✓ PhD in Mathematics.
✓ Works in the area of Data Science, Statistical
Research, Data Visualization & Storytelling
✓ Creator of various courses
✓ Contributor to various research communities and
Q/A forums
✓ Mentor for women in Tech Global
3
About Me
An educator by heart & a
trainer by profession.
http://stats.stackexchange.com/users/79100/learner
https://stackoverflow.com/users/5114585/dr-nisha-arora
https://www.quora.com/profile/Nisha-Arora-9
https://www.researchgate.net/profile/Nisha_Arora2/contributions
http://learnerworld.tumblr.com/
https://www.slideshare.net/NishaArora1
https://scholar.google.com/citations?user=JgCRWh4AAAAJ&hl=en&authuser=
1
https://www.youtube.com/channel/UCniyhvrD_8AM2jXki3eEErw
https://groups.google.com/g/dataanalysistraining/search?q=nisha%20arora
https://www.linkedin.com/in/drnishaarora/detail/recent-activity/posts/
✓Research Queries
✓Coding Queries
✓Blog Posts
✓Slide Decks
✓My Talks
✓Publications
✓Lectures
✓Layman’s Term
Explanation
✓Mentoring
✓Articles & Much More
My Contribution to the Community
❖ Statistics
❖ Data Analysis
❖ Machine Learning
❖ Analytics & Data Science
❖ Data Visualization & Storytelling
❖ Mathematics & Operations Research
❖ Online Teaching
❖ Excel/SPSS/R/Python/Shiny
❖ Tableau/PowerBI
My Expertise
Connect With Me
HTTPS://WWW.LINKEDIN.COM/IN/DRNISHAARORA / [email protected] .
Discriminant Analysis
USING SPSS
My answer to ‘classification of multiple outcomes
with categorical and continuous predictors’:
https://stats.stackexchange.com/a/513616/79100
When to use LDA?
✓Non-Ordinalresponsevariable
✓Metricpredictors
✓Workswellforlowsamplesize
✓Workswellwhencasesarewellseparable
✓Morerestrictivethanlogisticregression
Assumptions of LDA
✓BothLDAandQDAassumethepredictorvariablesXare
drawnfromamultivariateGaussiandistribution.
✓LDAassumesequalityofcovariancesamongthepredictor
variablesXacrosseachalllevelsofY
✓LDAandQDArequirethenumberofpredictorvariables(p)to
belessthenthesamplesize(n).Asimpleruleofthumbisto
useLDA&QDAondatasetswheren≥5p
Box Test
Box'sMtests
NullHypothesis:Equalityofcovariances
acrossgroups
P-value<alpha(0.05)
NullRejected
Useseparatematricestoseeifitgives
radicallydifferentclassificationresults.
We will see using separate groups covariance
matrices later
Prior Probabilities for Groups
Apriorprobabilityisanestimateofthe
likelihoodthatacasebelongstoa
particulargroupwhennoother
informationaboutitisavailable
Classification Function Coefficients
These areusedtocompute
probabilitiesforgroupmembership.
Discriminant Analysis _Output
The classification table shows the practical
results of using the discriminant model. Of
the cases used to create the model, 94 of
the 124 people who previously defaulted are
classified correctly. 281 of the 375
nondefaultersare classified correctly.
Overall, 75.2% of the cases are classified
correctly.
Classifications based upon the cases used to
create the model tend to be too "optimistic"
in the sense that their classification rate is
inflated. The cross-validated section of the
table attempts to correct this by classifying
each case while leaving it out from the
model calculations; however, this method is
generally still more "optimistic" than subset
validation.
Subset validation is obtained by classifying
Get your hands dirty!
Playaroundwithdifferentmodels&seewhatworksbestforyourproblem
How to report the results?
1.ANOVATable[univariateanovainstatisticssubdialogbox)
relationofindividualpredictor
2.BOXM(Assumptionchecking)
it'snotverystrongmeasure...forlargesample,mostly,itgivespvalue>
0.05
3.Performance(EigenValue,Wilkslambda,Classificationtable)
4.Discriminantequations¢roidscores
5.Relativeimportance