An example from economics to illustrate - the demand and supply of a good:
(1)
(2)
(3)
where = quantity of the good demanded
= quantity of the good supplied
S
t
= price of a substitute good
T
t
= some variable embodying the state of technology
Q PSu
dt t t t
Q PTv
st t t t
QQ
dt st
Q
dt
Q
st
•The point is that price and quantity are determined simultaneously (price
affects quantity and quantity affects price).
•P and Q are endogenous variables, while S and T are exogenous.
•We can obtain REDUCED FORM equations corresponding to (4) and (5) by
solving equations (4) and (5) for P and for Q (separately).
Simultaneous Equations Models:
The Structural Form
Q PSu
Q PTv
•(8) and (9) are the reduced form equations for P and Q.
Obtaining the Reduced Form (cont’d)
Q SuQ Tv
QQ T Suv
( )( ) ( ) Q T Suv
Q T S
uv
•But what would happen if we had estimated equations (4) and (5), i.e. the
structural form equations, separately using OLS?
•Both equations depend on P. One of the CLRM assumptions was that
E(Xu) = 0, where X is a matrix containing all the variables on the RHS of
the equation.
•It is clear from (8) that P is related to the errors in (4) and (5) - i.e. it is
stochastic.
•What would be the consequences for the OLS estimator, , if we ignore
the simultaneity?
•If the X’s are non-stochastic, E(Xu) = 0, which would be the case in a single
equation system, so that , which is the condition for unbiasedness.
•But .... if the equation is part of a system, then E(Xu) 0, in general.
Simultaneous Equations Bias (cont’d)
(')'
XXXy
1
yXu
E E EXXXu
XXEXu
(
)()((')')
(')(')
1
1
E(
)
uXXX
uXXXXXXX
uXXXX
')'(
')'(')'(
)(')'(ˆ
1
11
1
•Is the OLS estimator still consistent, even though it is biased?
•No - In fact the estimator is inconsistent as well.
•Hence it would not be possible to estimate equations (4) and (5)
validly using OLS.
Simultaneous Equations Bias (cont’d)
•We CAN estimate equations (10) & (11) using OLS since all the RHS
variables are exogenous.
•But ... we probably don’t care what the values of the coefficients
are; what we wanted were the original parameters in the structural
equations - , , , , , .
Avoiding Simultaneous Equations Bias
P T S
10 11 12 1
Q T S
20 21 22 2
•As well as simultaneity, we sometimes encounter another problem:
identification.
•Consider the following demand and supply equations
Supply equation(12)
Demand equation(13)
We cannot tell which is which!
•Both equations are UNIDENTIFIED or NOT IDENTIFIED, or
UNDERIDENTIFIED.
•The problem is that we do not have enough information from the equations to
estimate 4 parameters. Notice that we would not have had this problem with
equations (4) and (5) since they have different exogenous variables.
Identification of Simultaneous Equations
Q P
Q P
- The order condition - is a necessary but not sufficient condition for an
equation to be identified.
- The rank condition - is a necessary and sufficient condition for
identification. We specify the structural equations in a matrix form and
consider the rank of a coefficient matrix.
What Determines whether an Equation is Identified
or not? (cont’d)
•If more than G-1 are absent, it is over-identified. If less than G-1 are
absent, it is not identified.
Example
•In the following system of equations, the Y’s are endogenous, while the
X’s are exogenous. Determine whether each equation is over-, under-, or
just-identified.
(14)-(16)
Simultaneous Equations Bias (cont’d)
Y Y Y X Xu
Y Y Xu
Y Yu
1 0 12 33 41 52 1
2 0 13 21 2
3 0 12 3
G = 3;
If # excluded variables = 2, the eq
n
is just identified
If # excluded variables > 2, the eq
n
is over-identified
If # excluded variables < 2, the eq
n
is not identified
Equation 14: Not identified
Equation 15: Just identified
Equation 16: Over-identified
Simultaneous Equations Bias (cont’d)
2. Run the regression corresponding to equation (14).
3. Run the regression (14) again, but now also including the fitted values
as additional regressors:
(20)
4. Use an F-test to test the joint restriction that
2
= 0, and
3
= 0. If the
null hypothesis is rejected, Y
2
and Y
3
should be treated as endogenous.
Tests for Exogeneity (cont’d)
,YY
23
1
1
33
1
222514332101
ˆˆ
uYYXXYYY
•Assume that the error terms are not correlated with each other. Can we estimate the equations
individually using OLS?
•Equation 21: Contains no endogenous variables, so X
1
and X
2
are not correlated with u
1
. So
we can use OLS on (21).
•Equation 22: Contains endogenous Y
1
together with exogenous X
1
and X
2
. We can use OLS
on (22) if all the RHS variables in (22) are uncorrelated with that equation’s error term. In
fact, Y
1
is not correlated with u
2
because there is no Y
2
term in equation (21). So we can use
OLS on (22).
Recursive Systems
Y X Xu
Y Y X Xu
Y Y Y X Xu
1 10 111 122 1
2 20 211 211222 2
3 30 311 322 311 322 3
•If the system is just identified, ILS involves estimating the reduced
form equations using OLS, and then using them to substitute back to
obtain the structural parameters.
•However, ILS is not used much because
1. Solving back to get the structural parameters can be tedious.
2. Most simultaneous equations systems are over-identified.
Stage 1:
•Estimate the reduced form equations (17)-(19) individually by OLS and obtain the
fitted values, .
Stage 2:
•Replace the RHS endogenous variables with their stage 1 estimated values:
(24)-(26)
•Now and will not be correlated with u
1
, will not be correlated with u
2
, and
will not be correlated with u
3
.
Estimation of Systems
Using Two-Stage Least Squares (cont’d)
,,YYY
123
Y Y Y X Xu
Y Y Xu
Y Yu
1 0 12 33 41 52 1
2 0 13 21 2
3 0 12 3
Y
2
Y
3
Y
3
Y
2
•The standard error estimates also need to be modified compared with
their OLS counterparts, but once this has been done, we can use the
usual t- and F-tests to test hypotheses about the structural form
coefficients.
Estimation of Systems
Using Two-Stage Least Squares (cont’d)
The Problem With IV
•What are the instruments?
Solution: 2SLS is easier.
Other Estimation Techniques
1. 3SLS - allows for non-zero covariances between the error terms.
2. LIML - estimating reduced form equations by maximum likelihood
3. FIML - estimating all the equations simultaneously using maximum
likelihood
Instrumental Variables (cont’d)
•How Might the Option Price / Trading Volume and the Bid / Ask Spread be
Related?
Consider 3 possibilities:
1. Market makers equalise spreads across options.
2. The spread might be a constant proportion of the option value.
3. Market makers might equalise marginal costs across options irrespective
of trading volume.
An Example of the Use of 2SLS: Modelling
the Bid-Ask Spread and Volume for Options
where PR
i
& CR
i
are the squared deltas of the options
The Models
CBA CDUM C CL T CRe
i i i i i ii
0 1 2 3 4 5
CL CBA T T Mv
i i i i i i
0 1 2 3
2
4
2
PBA PDUM P PL T PRu
i i i i i i i
0 1 2 3 4 5
PL PBA TT Mw
i i i i i i
0 1 2 3
2
4
2
-3 . 8 5 4 2
(-1 0 . 5 0 )
4 6 . 5 9 2
( 3 0 . 4 9 )
-0 . 1 2 4 1 2
(-6 . 0 1 )
0 . 0 0 4 0 6
( 1 4 . 4 3 )
0 . 0 0 8 6 6
( 4 . 7 6 )
0 . 6 1 8
N o t e : t-r a t i o s i n p a r e n t h e s e s . S o u r c e : G e o r g e a n d L o n g s t a f f ( 1 9 9 3 ) . R e p r i n t e d w i t h t h e p e r m i s s i o n o f
t h e S c h o o l o f B u s i n e s s A d m i n i s t r a t i o n , U n i v e r s i t y o f W a s h i n g t o n .
P u t B i d -A s k S p r e a d a n d T r a d i n g V o l u m e R e g r e s s i o n
iiiiiii
uP RTP LPP DUMP B A
543210
( 6 . 5 7 )
iiiiii wMTTP B AP L
2
4
2
3210 ( 6 . 5 8 )
0
1
2
3
4
5 A d j . R
2
-2 . 8 9 3 2
(-8 . 4 2 )
4 6 . 4 6 0
( 3 4 . 0 6 )
-0 . 1 5 1 5 1
(-7 . 7 4 )
0 . 0 0 3 3 9
( 1 2 . 9 0 )
0 . 0 1 3 4 7
( 1 0 . 8 6 )
0 . 5 1 7
N o t e : t-r a t i o s i n p a r e n t h e s e s . S o u r c e : G e o r g e a n d L o n g s t a f f ( 1 9 9 3 ) . R e p r i n t e d w i t h t h e p e r m i s s i o n o f
t h e S c h o o l o f B u s i n e s s A d m i n i s t r a t i o n , U n i v e r s i t y o f W a s h i n g t o n .
•The authors argue that in the second part of the paper, they did indeed find
evidence of substitutability between calls & puts.
Comments
- No diagnostics.
- Why do the CL and PL equations not contain the CR and PR variables?
- The authors could have tested for endogeneity of CBA and CL.
- Why are the squared terms in maturity and moneyness only in the
liquidity regressions?
- Wrong sign on the squared deltas.
Conclusions
•Simplest case is a bivariate VAR
where u
it
is an iid disturbance term with E(u
it)=0, i=1,2; E(u
1t u
2t)=0.
•The analysis could be extended to a VAR(g) model, or so that there are g
variables and g equations.
Vector Autoregressive Models
y y y y y u
y y y y y u
t t ktk t ktk t
t t ktk t ktk t
1 10 1111 11 1121 12 1
2 20 2121 22 2111 21 2
... ...
... ...
y
t
=
0
+
1
y
t-1
+ u
t
g1 g1 gg g1 g1
Vector Autoregressive Models:
Notation and Concepts
y y y u
y y y u
t t t t
t t t t
1 10 1111 1121 1
2 20 2121 2111 2
y
y
y
y
u
u
t
t
t
t
t
t
1
2
10
20
11 11
21 21
11
21
1
2
•But what if the equations had a contemporaneous feedback term?
•We can write this as
•This VAR is in primitive form.
y y y u
y y y u
t t t t
t t t t
1 10 1111 1121 1
2 20 2121 2111 2
y y y yu
y y y yu
t t t t t
t t t t t
1 10 1111 1121 122 1
2 20 2121 2111 221 2
y
y
y
y
y
y
u
u
t
t
t
t
t
t
t
t
1
2
10
20
11 11
21 21
11
21
12
22
2
1
1
20
0
•We can then pre-multiply both sides by B
-1
to give
y
t
= B
-1
0
+ B
-1
1
y
t-1
+ B
-1
u
t
or
y
t
= A
0
+ A
1
y
t-1
+ e
t
•This is known as a standard form VAR, which we can estimate using OLS.
1
1
22
12 1
2
10
20
11 11
21 21
11
21
1
2
y
y
y
y
u
u
t
t
t
t
t
t
cause changes in y
2?” If y
1 causes y
2, lags of y
1 should be significant in the
equation for y
2
. If this is the case, we say that y
1
“Granger-causes” y
2
.
•If y
2
causes y
1
, lags of y
2
should be significant in the equation for y
1
.
•If both sets of lags are significant, there is “bi-directional causality”
Hypothesis Implied Restriction
1. Lags of y1t do not explain current y2t
21 = 0 and
21 = 0 and
21 = 0
2. Lags of y1t do not explain current y1t
11 = 0 and
11 = 0 and
11 = 0
3. Lags of y2t do not explain current y1t
12 = 0 and
12 = 0 and
12 = 0
4. Lags of y2t do not explain current y2t
22 = 0 and
22 = 0 and
22 = 0
•This is done by determining how much of the s-step ahead forecast error
variance for each variable is explained innovations to each explanatory
variable (s = 1,2,…).
•The variance decomposition gives information about the relative
importance of each shock to the variables in the VAR.