Ch 10. SEM Structural Equation Modelling

MuhammadMishbah1 127 views 58 slides Aug 06, 2024
Slide 1
Slide 1 of 58
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39
Slide 40
40
Slide 41
41
Slide 42
42
Slide 43
43
Slide 44
44
Slide 45
45
Slide 46
46
Slide 47
47
Slide 48
48
Slide 49
49
Slide 50
50
Slide 51
51
Slide 52
52
Slide 53
53
Slide 54
54
Slide 55
55
Slide 56
56
Slide 57
57
Slide 58
58

About This Presentation

structural equation modelling


Slide Content

Chapter 10
Structural Equation Modelling
PowerPoint Lecture Slides
Multivariate Data Analysis
Seventh Edition
by Joseph F. Hair Jr.; William C. Black; Barry J. Babin; RolphE. Anderson

Chapter 10 Learning Outcomes
•The use and characteristics Structural Equation
Modelling1
•Model development in Structural Equation Modelling2
•Confirmatory Factor Analysis with SEM3
•Stages statistical assessment in CFA with SEM4

Complexity of Research
Question
•All too often the researcher is faced with a set of interrelated
questions. For example:
–What variables determine a store’s image?
–How does that image combine with other variables to affect purchase
decisions and satisfaction at the store?
–How does satisfaction with the store result in long-term loyalty to it?
•This series of issues has both managerial (practical) and theoretical
importance.
Store Image
Purchase
Decisions
Satisfaction Loyalty

The Use of Structural
Equation Modelling
•Structural equation modeling can examine a series of
dependence relationships simultaneously,particularly in
testing theories that contain multiple equations involving
dependence relationships.
•Example:
–If we believe that image creates satisfaction, and then
satisfaction creates loyalty, then satisfaction is both a
dependent and an independent variable in the same theory.
–Thus, a hypothesized dependent variable becomes an
independent variable in a subsequent dependence
relationship.
•An extension of several multivariate techniques, most
notably factor analysis and multiple regression analysis.

The Use of Structural
Equation Modelling
•SEM examines the structure of interrelationships
expressed in a series of equations, similar to a
series of multiple regression equations.
•These equations depict all of the relationships
among constructs (the dependent and
independent variables) involved in the analysis.

The Use of Structural
Equation Modelling
•SEM is known by many names: covariance structure
analysis, latent variable analysis, and sometimes it is
even referred to by the name of the specialized
software package used
–Covariance Based SEM e.g., a LISREL or AMOS model
–Variance Based SEM e.g., SmartPLS

The Characteristics of
Structural Equation Modelling
1.Estimation of multiple and interrelated
dependence relationships
2.An ability to represent unobserved concepts
in these relationships and account for
measurement error in the estimation process
3.Defining a model to explain the entire set of
relationships

Estimation of multiple and interrelated
dependence relationships
•SEM estimates a series of separate, but interdependent,
multiple regression equations simultaneously by
specifying the structural model used by the statistical
program.
•The structural model expresses these dependence
relationships among independent and dependent
variables, even when a dependent variable becomes an
independent variable in other relationships.

Estimation of multiple and interrelated
dependence relationships
•Interdependencyamong variables:
•The researcher draws model upon theory, prior experience,
and the research objectives to distinguish which independent
variables predict each dependent variable.
•Dependent variables in one relationship can become
independent variables in subsequent relationships, giving rise
to the interdependent nature of the structural model.
•Moreover, many of the same variables affect each of the
dependent variables, but with differing effects.

Incorporating Latent Variables That is
Not Measured Directly
•SEM also has the ability to incorporate latent variables
into the analysis.
•A latent construct (also termed a latent variable) is a
hypothesized and unobserved conceptthat can be
represented by observable or measurable variables.
•A latent construct is measured indirectly by examining
consistency among multiple measured variables,
sometimes referred to as manifest variables, or
indicators, which are gathered through various data
collection methods (e.g., surveys, tests, observational
methods).

Example Latent and Manifest
Variables

Benefit Using Latent
Construct (1)
1.Better represent theoretical concepts by using
multiple measures of a concept to reduce the
measurement error of that concept.
–From a theoretical perspective, most concepts are
relatively complex (e.g., patriotism, consumer confidence,
or even satisfaction) and have many meanings and/or
dimensions.
–With concepts such as these, the researcher tries to
design the best questions to measure the concept
knowing that individuals may interpret any single question
somewhat differently.
–The intent is for the collective set of questions to
represent the concept better than any single item.

Measurement Error in SEM
•The researcher must also be aware of measurement
error that occurs with any form of measurement.
•Any more theoretical or abstract concept is necessarily
subject to measurement error, due to inaccurate
responses.
–It occurs when respondents may be somewhat unsure
about how to respond or may interpret the questions in a
way that is different from what the researcher intended.
–Finally, it can result from a natural degree of inconsistency
on the part of the respondent when we use multiple
perspectives or items to measure the same concept.
•All of these situations give rise to measurement error

How to quantify
measurement error?
•SEM provides the measurement model, which specifies
the rules of correspondence between measured and
latent variables (constructs).
•The measurement model enables the researcher to use
any number of variables for a single independent or
dependent construct.
•Once the constructs are defined, then the model can be
used to assess the extent of measurement error (known
as reliability).

The measurement assessment
component of SEM
•The researcher identifies the specific indicator
variables associated with each construct, typically
based on a combination of previous similar studies and
the situation at hand.
•When SEM is applied, the researcher can assess the
contribution of each indicator variable in representing
its associated construct and measure how well the
combined set of indicator variables represents the
construct (reliability and validity).
•After the constructs have met the required
measurement standards, the relationships between
the constructs can be estimated.

Example
•PT ABC would like to determine which factors
may influence the job satisfaction of its
employees.
Perception
towards
Supervisor
Working
Environment
Satisfaction
INDICATORS
(1)My supervisor recognizes my potential;
(2)My supervisor helps me resolve
problems at work;
(3)My supervisor understands the
challenges of balancing work and home
demands.

Benefit Using Latent
Construct (2)
2.Improves the statistical estimation of the
relationships between concepts by accounting
for the measurement error in the concepts.
–From both practical and theoretical perspectives, we
cannot perfectly measure a concept without
measurement error.
–The measurement error affectsthe estimate of the
true structural coefficient

Reliability and Validity
Measurement Model
•Construct Reliability is a measure of the degree to which a
set of indicators of a latent construct is internally
consistent based on how highly interrelated the indicators
are with each other.
–It represents the extent to which the indicators all measure the
same thing.
–High reliability is associated with lower measurement error.
–As reliability goes up, the relationships between a construct and
the indicators are greater, meaning that the construct explains
more of the variance in each indicator.
–Just as we learned in multiple regression, as more of an
outcome is explained(in this case a measured indicator of a
construct), the amount of error decreases.

Reliability and Validity
Measurement Model
•Although reliability is important, high reliability does
not guarantee that a construct is measured
accurately.
•Reliability is a necessary but not sufficient condition
for validity.
•Construct validity
–Extenttowhichasetofmeasuredvariablesactually
representthetheoreticallatentconstructtheyare
designedtomeasure–fittoitspurpose.

Exogenousvs Endogenous
Latent Constructs
•Exogenous constructs are the latent, multi-item equivalent
of independent variables.
–They are determined by factors outside of the model (i.e., they are
not explained by any other construct or variable in the model).
–Visually an exogenous construct does not have any paths (one-
headed arrows) from any other construct or variable going into it.
•Endogenous constructs are the latent, multi-item equivalent
to dependent variables.
–These constructs are theoretically determined by factors within the
model.
–Thus, they are dependent on other constructs, and this dependence
is represented visually by a path to an endogenous construct from
an exogenous construct.

Model Development
•Amodelshouldnotbedevelopedwithoutsome
underlyingtheory.
•Amodelisarepresentationofatheory.
–Theorycanbethoughtofasasystematicsetofrelationships
providingaconsistentandcomprehensiveexplanationof
phenomena.
–Theoryisnottheexclusivedomainofacademia,butcanbe
rootedinexperienceandpracticeobtainedbyobservationof
real-worldbehavior.
–Theoryisoftenaprimaryobjectiveofacademicresearch,but
practitionersmaydeveloporproposeasetofrelationshipsthat
areascomplexandinterrelatedasanyacademicallybased
theory.

Model Development
•AmodelinSEMterminologyconsistsoftwomodels:
–themeasurementmodel(representinghow
measuredvariablescometogethertorepresent
constructs)
–thestructuralmodel(showinghowconstructsare
associatedwitheachother).
•Pathdiagramisvisualportrayaloftherelationships
employsspecificconventionsbothfortheconstructs
andmeasuredvariablesaswellastherelationships
betweenthem.

Depicting the Constructs
(Measurement Model)
•Latentconstructsarerelatedtomeasured
(observed)variableswithameasurement
relationship.
•Thesevariablesarereferredtoasindicators
becausenosinglevariablecancompletely
representaconstruct,butitcanbeusedasan
indicationoftheconstruct.
•Theresearchermustjustifythetheoreticalbasisof
theindicatorsbecauseSEMonlyexaminesthe
empiricalcharacteristicsofthevariables.

Depicting Structural
Relationship
•A structural model involves specifying structural
relationships between latent constructs.
•Specifying a relationship generally means that we
either specify that a relationship exists or that it
does not exist.
–If it exists, an arrow is drawn; if no relationship is
expected, then no arrow is drawn.
•Two types of relationships are possible among
constructs: dependencerelationships and
correlational(covariance) relationships.

Structural Equations Modeling
Overview

Structural Equations Modeling
Overview

Combining Measurement and
Structural Relationships

The Role of Theory in SEM
1.specifyingrelationshipsthatdefinethe
model;
2.establishingcausation,particularlywhen
usingcross-sectionaldata;
3.thedevelopmentofamodelingstrategy

Relationship Specification
•Although theory can be important in all
multivariate procedures, it is particularly
important for SEM because it is considered a
confirmatory analysis.
–It is useful for testing and potentially confirming
theory.
–Theory is needed to specify relationships in both
measurement and structural models,
modifications to the proposed relationships, and
many other aspects of estimating a model.

Establishing Causation
•Dependence relationships can sometimes be theoretically
hypothesized as causal.
–Causal research designs traditionally involve an experiment with some
controlled manipulation (e.g., a categorical independent variable as found
in ANOVA).
•SEM models are typically used in nonexperimental situations in
which the exogenous constructs are not experimentally
controlled variables.
–This limits the researcher’s ability to draw causal inferences and SEM
alone cannot establish causality.
•SEM can treat dependence relationships as causal if four types
of evidence are reflected in the SEM model, they are (1)
covariation, (2) sequence, (3) nonspurious covariation, and (4)
theoretical support.

Condition for Causation in SEM
models
1.Covariance
–Causality means that a change in a cause brings about a
corresponding change in an effect.
–Thus, systematic covariance (correlation) between the cause
and effect is necessary, but not sufficient, to establish causality.
2.Sequence
–Sequence in causation means that improvements in the cause
must occur before the effect increases, if the relationship
between the two variables is causal.
–Example: Improvements in supervisor perception must occur
before job satisfaction increases. OR, the changes in supervisor
perception cannot occur after the change in job satisfaction.

Condition for Causation in SEM
models
3.Nonspurious covariation
–A spurious relationship is one that is false or misleading. It happens
when there is high collinearity among predictors.
–Nonspurious relationship describes the size and nature of the
relationship between a cause and the relevant effect should not be
affected by including other constructs (or variables) in a model.
–A causal inference is supported when we can show that some third
construct does not affect the relationship between the cause and
effect (a lack of collinearity among the predictors is desirable)
–When collinearity is not present, the researcher comes closest to
reproducing the conditions that are present in an experimental
design.

Testing for Nonspurious
Relationship

Condition for Causation in SEM
models
4.Theoretical support
–Unless theory can be used to establish a causal ordering and a
rationale for the observed covariance, the relationships remain
simple association and should not be attributed with any
further causal power.
–This condition emphasizes the fact that simply testing a SEM
model and analyzing its results cannot establish causality.
–Theoretical support becomes especially important with cross-
sectional data.

Estimation Techniques in SEM
•Maximum likelihood estimation (MLE), which is
more efficient and unbiased when the assumption
of multivariate normality is met.
–MLE is a flexible approach to parameter estimation in
which the “most likely” parameter values to achieve the
best model fit are found.
–Cons: Sensitive to non-normal data
•MLE continues to be the most widely used approach
and is the default in most SEM programs. In fact, it
has proven fairly robust to violations of the
normality assumption.

Estimation Techniques in SEM
•Another methods: (1) weighted least squares (WLS); (2)
generalized least squares (GLS); and (3) asymptotically
distribution free (ADF) estimation.
–The ADF technique has received particular attention due to its
insensitivity to non-normality of the data, but its requirement
of rather large sample sizes limits its use.

Confirmatory Factor Analysis
With SEM

Highlight Exploratory Factor
Analysis (EFA)
•InEFA,thefactorsarederivedfromstatisticalresults,not
fromtheory.
–Thismeansthattheresearcherrunsthesoftwareandlets
theunderlyingpatternofthedatadeterminethefactor
structure.
•EFAisconductedwithoutknowinghowmanyfactorsreally
exist(ifany)orwhichvariablesbelongwithwhichconstructs.
•WhenEFAisapplied,theresearcherusesestablished
guidelinestodeterminewhichvariablesloadonaparticular
factorandhowmanyfactorsareappropriate.
•Thefactorsthatemergecanonlybenamedafterthefactor
analysisisperformed.

Intro to Confirmatory Factor
Analysis (CFA)
•CFAisappliedtotesttheextenttowhicharesearcher’s
theoreticalpatternoffactorloadingsonprespecified
constructsrepresentstheactualdata.
–InCFA,theresearchermustspecifyboththenumberoffactorsthat
existforasetofvariablesandwhichfactoreachvariablewillload
onbeforeresultscanbecomputed.
•Theresearchermakesthisassignmentbasedonthetheory
beingtestedbeforeanyresultscanbeobtained.
•Avariableisassignedtoonlyasinglefactor(construct),and
cross-loadings(loadingonmorethanasinglefactor)arenot
assigned.
•CFAstatisticstellushowwellourtheoreticalspecificationof
thefactorsmatchesreality(theactualdata).
–Inasense,CFAisatoolthatenablesustoeither“confirm”or
“reject”ourpreconceivedtheory.

Six-Stage Process for CFA with
SEM

Six-Stage Process for Structural
Equation Modeling

Measurement Model
Development
•Unidimensional measures
mean that a set of measured
variables (indicators) can be
explained by only one
underlying construct.
–Unidimensionalitybecomes
critically important when more
than two constructs are
involved.
–In such a situation, each
measured variable is
hypothesized to relate to only
a single construct.

Type of Relationships Impacts on
Unidimensionality
a single measured variable
is associated with more
than one construct
within-construct error
covariance
between-construct error
covariance

Items (indicators) per Latent
Construct –Rules of Thumb
•More items (measured variables or indicators) are not
necessarily better.
–Even though more items do produce higher reliability estimates and
generalizability [4], more items also require larger sample sizes and
can make it difficult to produce truly unidimensional factors.
–Good practice dictates a minimum of three items per factor,
preferably four, not only to provide minimum coverage of the
construct’s theoretical domain, but also to provide adequate
identification for the construct.
–Use four indicators whenever possible.
–Having three indicators per construct is acceptable, particularly
when other constructs have more than three.
–Constructs with fewer than three indicators should be avoided.

Items per construct and Model
Identification
•Number of indicators influences model identification, which
can be seen through degree of freedom.
•Possible SEM model identification:
a.Under-identified has more parameters to be estimated
than unique indicator variable variances and covariances in
the observed variance/covariance matrix. It has no
solution.
b.Just-Identifiedhas just enough degrees of freedom to
estimate all free parameters. It has exactly one solution.
c.Overidentifiedmodels have more unique covariance and
variance terms than parameters to be estimated. It has
more than one solution.

Underidentifiedvs Just Identified

Overidentified

Degree of Freedom
•p=totalmanifestvariables
•k=totalestimatedparametersbyadding,
a.Numberofregressioncoefficient(numberofarrowbetween
latentvariableandindicators)
b.Numberofmeasurementerrorforeachindicators
c.Numberofstructuralrelationship(exogenous-endogenous
structuralterm)
d.Numberofcovariance(correlation)betweenlatentvariables

Reflective vs Formative Construct
•A reflective measurement theory is based on the idea
that latent constructs cause the measured variables
and that the error results in an inability to fully explain
these measured variables.
–Thus, the arrows are drawn from latent constructs to measured variables.
•A formative measurement theory is modeled based on
the assumption that the measured variables cause the
construct.
–The error in formative measurement models, therefore, is an inability of
the measured variables to fully explain the construct.
–A key assumption is that formative constructs are not considered latent,
they are viewed as indices where each indicator is a cause of the
construct.

Example Reflective vs Formative
Construct
•LatentVariable:SATISFACTIONwithhotelaccommodations
•Reflectivemodelmighthavetherepresentativemeasures
–“Ifeelwellinthishotel”
–“Thishotelbelongstomyfavorites”
–“Irecommendthishoteltoothers”
–“Iamalwayshappytostayovernightinthishotel.”
•Formativemodel,incontrast,mighthavetheconstituent
measures
–“Theroomiswellequipped”
–“Icanfindsilencehere”
–“Thefitnessareaisgood”
–“Thepersonnelarefriendly”
–“Theserviceisgood”

COMPUTERSELF-EFFICACY
•Reflective
–Iamcapableatperformingtasksonmycomputer.
–Ifeelconfidentinmyabilitytoperformcomputer-relatedtasks.
•Formative
–IamconfidentatmyabilitytoperformtasksinMSWord.
–IamskillfulatusingExcel.
SYSTEMQUALITY
•Reflective
–Overall,Iwouldratethesystemqualityofthesystemhighly.
–Thequalityofthesystemisappropriateformyneeds.
•Formative
–Reliability,EaseofUse,Complexity,Accessibility,Responsiveness
Example Reflective vs Formative
Construct

Heywood Case
•An error variance estimate of less than zero (a negative
error variance) is termed a Heywood case.
•Such a result is logically impossible because it implies a
less than 0 percent error in an item, and by inference it
implies that more than 100 percent of the variance in an
item or a construct is explained.
•Heywood cases are particularly problematic in CFA
models with small samples or when the three-indicator
rule is not followed.
–Models with sample size greater than 300 that adhere to the
three-indicator rule are unlikely to produce Heywood cases.

Assessing Measurement Model (1)
•Reliabilityrefers to consistencyor stabilityof
measurement.
a.Can our measure or other form of observation be
confirmed by further measurements or observations?
b.If you measure the same thing would you get the same
score?
•Internal consistency: Internal consistency reliability
is a measure of how well the items on a test
measure the same construct or idea.
Measured by Composite Reliability and Cronbach’s
Alpha

•Validity: refers to the suitability or meaningfulness of
the measurement. It indicates how well an instrument
measures the construct it purpose to measure.
–Does this instrument describe accurately the
construct I am attempting to measure?
•Convergent Validity:measures of constructs that
theoretically should be related to each other are, in
fact, observed to be related to each other. Measured
using factor loading and average variance extracted
(AVE).
Assessing Measurement Model (2)

•Discriminant validity: measures of constructs that
theoretically should not be related to each other are, in
fact, observed to not be related to each other.
Measured using cross loadings and the Fornell–Larcker
criterion.
–Cross loadings -an indicator’s loading with its associated
latent construct should be higher than its loadings with all the
remaining constructs (i.e., the cross loadings)
–Fornell–Larckercriterion -the AVE of each latent construct
should be greater than the latent construct’s highest squared
correlation with any other latent construct
Assessing Measurement Model (3)

Rules of Thumb Construct Reliability
and Validity (Reflective Model)
•Standardized loading estimates (factor loading) should
be 0.5 or higher, and ideally 0.7 or higher.
•Construct reliability should be .7 or higher to indicate
adequate convergence or internal consistency.
•AVE should be .5 or greater to suggest adequate
convergent validity.
•To provide evidence of discriminant validity
–AVE estimates for two factors also should be greater than the
square of the correlation between the two factors.
–An indicator’s loadings should be higher than all of its cross
loadings

Rules of Thumb Construct Reliability
and Validity (Formative Model)
•Examine the significance of each indicator’s weight
(relative importance) and indicator’s loading (absolute
importance).
•Use two-tailed test for significance test with significance level
0,01 or 0,05 or 0,1 (T value = 2,58 or 1,96 or 1,69 respectively)
•Use bootstrapping (resampling) to assess the significance.
•Minimum bootstrap sample as much as the number of valid
samples.
•Examine multicollinearity(should be less than 0,9) or
each indicator’s variance inflation factor (VIF) value
should be less than 5.

Assessing Structural Model
•R square-coefficient of determination, measuring the
amount of variation accounted for in the endogenous
constructs by the exogenous constructs.
•Level of significance of the path coefficients–
assessing prior hypotheses (should be less than
significance level).
Tags