Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Chapter 23 Outline
• Review
o Demand and Supply Models
o Ordinary Least Squares (OLS) Estimation Procedure
o Reduced Form (RF) Estimation Procedure – One Way to Cope
with Simultaneous Equation Models
• Two Stage Least Squares (TSLS): An Instrumental Variable (IV) Two
Step Approach – A Second Way to Cope with Simultaneous Equation
Models
o 1st Stage: Use the exogenous explanatory variable(s) to estimate
the endogenous explanatory variable(s).
o 2nd Stage: In the original model, replace the endogenous
explanatory variable with its estimate.
• Comparison of Reduced Form (RF) and Two Stage Least Squares
(TSLS) Estimates
• Statistical Software and Two Stage least Squares (TSLS)
• Identification of Simultaneous Equation Models: Order Condition
o Taking Stock
o Underidentification
o Overidentification
o Overidentification and Two Stage Least Squares (TSLS)
• Summary of Identification Issues
EstP =
Generate a new variable, EstP, that estimates the price of beef based on the 1st
stage.
2. Next, consider the demand model:
Demand Model:QtD = β Const
D
+ β PD Pt + β ID Inct + etD
and the second stage of the two stage least squares (TSLS) estimation
procedure:
• 2nd Stage: In the original model, replace the endogenous explanatory
variable with its estimate.
o Dependent variable: Original dependent variable. In this case,
the original explanatory variable is the quantity of beef, Qt.
o Explanatory variables: 1st stage estimate of the endogenous
explanatory variable and the relevant exogenous explanatory
3
Review
Demand and Supply Models
For the economist, arguably the most important example of a simultaneous
equations model is the demand/supply model:
Demand Model:QtD = β Const
D
+ β PD Pt + β ID Inct + etD
Supply Model: QtS = β Const
S
+ β PS Pt + β FP
S
FeedPt + etS
Equilibrium: QtD = QtS = Qt
Endogenous Variables: Qt and Pt
Exogenous Variables: FeedPt and Inct
Project: Estimate the beef market demand and supply parameters
It is important to emphasize the distinction between endogenous and
exogenous variables in a simultaneous equation model. Endogenous variables are
variables whose values are determined “within” the model. In the demand/supply
example, both quantity and price are determined simultaneously “within” the
model; the model is explaining both the equilibrium quantity and the equilibrium
price as depicted by the intersection of the supply and demand curves. On the
other hand, exogenous are determined “outside” the context of the model; the
values of exogenous variables are taken as given. The model does not attempt to
explain how the values of exogenous variables are determined.
• Endogenous variables – Variables determined “within” the model:
Quantity and Price.
• Exogenous variables – Variables determined “outside” the model.
5
Price
Q = Equlibrium Quantity
P
P = Equilibrium Price
D
Quantity
Q
Figure 23.1: Demand/Supply Model
So, where did we go from here? We explored the possibility that the ordinary
least squares (OLS) estimation procedure might be consistent. After all, is not
“half a loaf” better than none? We took advantage of our Econometrics Lab to
address this issue. Recall the distinction between an unbiased and a consistent
estimation procedure:
• Unbiased: The estimation procedure does not systematically
underestimate or overestimate the actual value; that is, after many, many
repetitions the average of the estimates equals the actual value.
• Consistent but Biased: As consistent estimation procedure can be biased.
But, as the sample size, as the number of observations, grows:
o The magnitude of the bias decreases. That is, the mean of the
coefficient estimate’s probability distribution approaches the actual
value.
o The variance of the estimate’s probability distribution diminishes
and approaches 0.
Unfortunately, the Econometrics Lab illustrates the sad fact that the ordinary least
squares (OLS) estimation procedure is neither unbiased nor consistent.
7
Reduced Form (RF) Estimation Procedure – One Way to Cope with Simultaneous
Equation Models
We begin with the simultaneous equation model and then constructed the reduced
form (RF) equations:
Demand and Supply Models:
Demand Model:QtD = β Const
D
+ β PD Pt + β ID Inct + etD
Supply Model: QtS = β Const
S
+ β PS Pt + β FP
S
FeedPt + etS
Equilibrium: QtD = QtS = Qt
Endogenous Variables: Qt and Pt
Exogenous Variables: FeedPt and Inct
Reduced Form (RF) Estimates
Quantity Reduced Form (RF) Estimates: EstQ = aConst
Q
+ aFP
Q
FeedP + aIQ Inc
Price Reduced Form (RF) Estimates: EstP = aConst
P
+ aFP
P
FeedP + aIP Inc
We use the ordinary least squares (OLS) estimation procedure to estimate the
reduced form (RF) parameters and then use the ratio of the reduced form (RF)
estimates to estimate the “slopes” of the demand and supply curves:
Two Stage Least Squares (TSLS): An Instrumental Variable (IV) Two Step
Approach – A Second Way to Cope with Simultaneous Equation Models
Another way to estimate simultaneous equation model is the two stage least
squares (TSLS) estimation procedure. As the name suggests the procedure
involves two steps. As we shall see, two stage least squares (TSLS) uses a
strategy that is similar to the instrumental variable (IV) approach.
• 1st Stage: Use the exogenous explanatory variable(s) to estimate the
endogenous explanatory variable(s).
o Dependent variable: The endogenous explanatory variable(s), the
“problem” explanatory variable(s).
o Explanatory variable(s): All exogenous variables.
• 2nd Stage: In the original model, replace the endogenous explanatory
variable with its estimate.
o Dependent variable: Original dependent variable.
o Explanatory variables: 1st stage estimate of the endogenous
explanatory variable and the relevant exogenous explanatory
variables.
We shall now illustrate the two stage least squares (TSLS) approach by consider
the beef market.
Beef Market Data: Monthly time series data relating to the market for beef from
1977 to 1986.
Qt Quantity of beef in month t (millions of pounds)
Pt Real price of beef in month t (1982-84 cents per pound)
Inct Real disposable income in month t (thousands of chained 2005 dollars)
ChickPt Real rice of whole chickens in month t (1982-84 cents per pound)
FeedPt Real price of cattle feed in month t (1982-84 cents per pounds of corn cobs)
Consider the model for the beef market that we used in the last chapter:
Demand Model:QtD = β ConstD
+ β PD Pt + β ID Inct + etD
Supply Model: QtS = β Const
S
+ β PS Pt + β FP
S
FeedPt + etS
Equilibrium: QtD = QtS = Qt
Endogenous Variables: Qt and Pt
Exogenous Variables: FeedPt and Inct
10
The strategy for the first stage is similar to the strategy used by
instrumental variable (IV) approach. The endogenous explanatory variable is the
source of the bias; consequently, the endogenous explanatory variable is the
“problem” explanatory variable. The endogenous explanatory variable is the
dependent variable. The explanatory variables are all the exogenous variables. In
our example, price is the endogenous explanatory variable; consequently, price
becomes the dependent variable in the first stage. The exogenous variables,
income and feed price, are the explanatory variables.
1st Stage: Use the exogenous explanatory variable(s) to estimate the endogenous
explanatory variable(s).
o Dependent variable: The endogenous explanatory variable(s), the
“problem” explanatory variable(s). In this case, the price of beef, Pt, is the
endogenous explanatory variable.
o Explanatory variable(s): All exogenous variables. In this case, the
exogenous variables are FeedPt and Inct.
Using these regression results we estimate the price of beef based on the
exogenous variables, income and feed price.
The strategy for the second stage is also similar to the instrumental
variable (IV) approach. We return to the original model and replace the
endogenous explanatory variable with the estimate from Stage 1. The dependent
variable is the original dependent variable, quantity. The explanatory variables are
Stage 1’s estimate of the price and the relevant exogenous variables. In our
example, we have two models, one for demand and one for supply; accordingly,
we first apply the second stage to demand and then to supply.
11
2nd Stage: In the original model, replace the endogenous explanatory variable
with its estimate.
Demand Model
o Dependent variable: Original dependent variable. In this case, the
original explanatory variable is the quantity of beef, Qt.
o Explanatory variables: 1st stage estimate of the endogenous explanatory
variable and the relevant exogenous explanatory variables. In this case, the
estimate of the price of beef and income, EstPt and Inct.
Comparison of Reduced Form (RF) and Two Stage Least Squares (TSLS)
Estimates
Compare the estimates from the reduced form (RF) approach with the estimates
from the two stage least squares (TSLS) approach:
The estimates are identical. In this case, the reduced form (RF) estimation
procedure and the two stage least squares (TSLS) estimation procedure produce
identical results.
13
Note that these are the same estimates that we obtained when we generated the
estimates of the price on our own.
Taking Stock
Let us step back for a moment to review our beef market model:
Demand and Supply Models:
Demand Model:QtD = β ConstD
+ β PD Pt + β ID Inct + etD
Supply Model: QtS = β Const
S
+ β PS Pt + β FP
S
FeedPt + etS
Equilibrium: QtD = QtS = Qt
Endogenous Variables: Qt and Pt
Exogenous Variables: FeedPt and Inct
Reduced Form (RF) Estimates
Quantity Reduced Form (RF) Estimates: EstQ = aConst
Q
+ aFP
Q
FeedP + aIQ Inc
Price Reduced Form (RF) Estimates: EstP = aConst
P
+ aFP
P
FeedP + aIP Inc
We can use the coefficient interpretation approach to estimate the “slopes” of the
demand and supply in terms of the reduced form (RF) estimates:i
15
Intuition: Critical Role of the Exogenous Variable Absent from the Model
In each model there is one exogenous variable absent and one endogenous
explanatory variable. This one to one correspondence allows us to estimate the
coefficient of the endogenous explanatory variable, price.
• Demand Model:QtD = β Const
D
+ β PD Pt + β ID Inct + etD
Changes in the feed price, the exogenous variable absent from the demand
model, allow us to estimate the “slope” of the demand curve. The supply
curve shifts, but the demand curve remains stationary. Consequently, the
equilbria “trace out” the stationary demand curve.
• Supply Model: QtS = β Const
S
+ β PS Pt + β FP
S
FeedPt + etS
16
Key Point: In each case, changes in the exogenous variable absent in the model
allow us to estimate the value of the price coefficient, the model’s endogenous
explanatory variable.
Order Condition
The Order Condition formalizes this relationship:
Number of
Number of
exogenous
endogenous
explanatory
explanatory variables
variables absent
included in the model
from the model
Less Than Equal To Greater Than
ã ↓ é
Model Model Model
Underidentified Identified Overidentified
↓ ↓ ↓
No RF Estimate Unique RF Estimate Multiple RF Estimates
The reduced form estimation procedure for our beef market example is identified.
For both the demand model and the supply model, the number of exogenous
variable absent from the model equaled the number of endogenous explanatory
variables in the model:
Underidentification
We shall now illustrate the underidentification problem. Suppose that no income
information was available. Obviously, if we have no income information, we
cannot include Inc as an explanatory variable in our models:
Demand and Supply Models:
Demand Model:QtD = β Const
D
+ β PD Pt + β ID Inct + etD
Supply Model: QtS = β Const
S
+ β PS Pt + β FP
S
FeedPt + etS
Equilibrium: QtD = QtS = Qt
Endogenous Variables: Qt and Pt
Exogenous Variables: FeedPt and Inct
Reduced Form (RF) Estimates
Quantity Reduced Form (RF) Estimates: EstQ = aConst
Q
+ aFP
Q
FeedP + aIQ Inc
Price Reduced Form (RF) Estimates: EstP = aConst
P
+ aFP
P
FeedP + aIP Inc
Let us now apply the order condition by comparing the number of absent
exogenous variables and endogenous explanatory variables in each model:
There is both good news and bad news when we have feed price information but
no income information:
• Good news: Since we still have feed price information, we still have
information about supply curve shifts. The shifts in the supply curve cause
the equilibrium quantity and price to move along the demand curve. In
other words, shifts in the supply curve “trace out” the demand curve;
hence, we can still estimate the “slope” of the demand curve.
• Bad news: On the other hand, since we have no income information, we
have no information about demand curve shifts. Without knowing how the
demand curve shifts we have no idea how the equilibrium quantity and
price move along the supply curve. In other words, we cannot “trace out”
19
the supply curve; hence, we cannot estimate the “slope” of the supply
curve.
To use the reduced form (RF) approach to estimate the “slope” of the
demand curve, we first use ordinary least squares (OLS) to estimate the
parameters of the reduced form (RF) equations:
Table 23.9: OLS Regression Results – Quantity Reduced Form (RF) Equation
Then, we can estimate the “slope” of the demand curve by calculating the ratio of
the feed price estimates:
Q
aFP −821.85
Estimated "slope" of the demand curve = bP = P =
D
= −1,566.5
aFP .52464
20
Now, let us use the two stage least squares (TSLS) estimation procedure to
estimate the “slope” of the demand curve:
Again, let us now apply the order condition by comparing the number of absent
exogenous variables and endogenous explanatory variables in each model:
Number of
Number of
exogenous
endogenous
explanatory
explanatory variables
variables absent
included in the model
from the model
Less Than Equal To Greater Than
ã ↓ é
Model Model Model
Underidentified Identified Overidentified
↓ ↓ ↓
No RF Estimate Unique RF Estimate Multiple RF Estimates
Again, there is both good news and bad news when we have income
information, but no feed price information:
• Good news: Since we have income information, we still have information
about demand curve shifts. The shifts in the demand curve cause the
equilibrium quantity and price to move along the supply curve. In other
words, shifts in the demand curve “trace out” the supply curve; hence, we
can still estimate the “slope” of the supply curve.
• Bad news: On the other hand, since we have no feed price information,
we have no information about supply curve shifts. Without knowing how
the supply curve shifts we have no idea how the equilibrium quantity and
23
price move along the demand curve. In other words, we cannot “trace out”
the demand curve; hence, we cannot estimate the “slope” of the demand
curve.
To use the reduced form (RF) approach to estimate the “slope” of the
supply curve, we first use ordinary least squares (OLS) to estimate the parameters
of the reduced form (RF) equations:
Then, we can estimate the “slope” of the supply curve by calculating the ratio of
the income estimates:
aIQ 20.225
Estimated "slope" of the supply curve = bP = P =
S
= 2, 091.7
aI .009669
Once again, two stage least squares (TSLS) provide the same estimate:
24
Overidentification
While an underidentification problem arises when too little information is
available, an overidentification problem arises when, in some sense, too much
information is available. To illustrate this suppose that in addition to the feed
price and income information, the price of chicken is also available. Since beef
and chicken are substitutes, the price of chicken would appear as an exogenous
explanatory variable in the demand model. The simultaneous equation model and
the reduced form (RF) estimates would become:
Now we have two exogenous factors that shift the demand curve: income
and the price of chicken. Consequently, there are two ways to “trace out” the
supply curve. There are now two different ways to use the reduced form (RF)
estimates to estimate the “slope” of the supply curve:
Ratio of the reduced form (RF) Ratio of the reduced form (RF)
income coefficients chicken feed coefficients
↓ ↓
Estimated “slope” Estimated “slope”
of supply curve: of supply curve:
↓ ↓
a Q
aQ
bPS = IP bPS = CP
P
aI aCP
27
Then, we use the reduced form (RF) estimates to compute the estimates for the
“slopes” of the demand and supply curves:
Estimated “slope” Estimated “slope” Estimated “slope”
of demand curve of the supply curve of the supply curve
↓ ↓ ↓
Ratio of reduced Ratio of reduced Ratio of reduced
form (RF) feed price form (RF) income form (RF) chicken price
coefficient estimates coefficient estimates coefficient estimates
↓ ↓ ↓
a Q
−349.54 a Q
16.865 a Q
47.600
bPD = FPP
= = −366.0 bPS = IP = = 1051.2 bPS = CPP
= = 173.3
aFP .95501 aI .016043 aCP .27464
The reduced form (RF) estimation procedure produces two different estimates for
the “slope” for the supply curve. This is what we mean by overidentification.
The estimated “slope” of the demand curve is −366.0. This is the same estimate as
computed by the reduced form (RF) estimation procedure.
29
Two stage least squares (TSLS) provides a single estimate for the “slope” of the
supply curve:
bPS = 893.5
Table 23.18 compares the estimates that result when using the two
different estimation procedures:
Note that the demand model is not overidentified and both the reduced form (RF)
estimation procedure and the two stage least squares (TSLS) estimation procedure
provide the same estimate for the “slope” of the demand curve. On the other hand,
the supply model is overidentified. The reduced form (RF) estimation procedure
provides two estimates for the “slope” of the supply curve; the two stage least
squares (TSLS) estimation procedure provides only one.
30
1
Again, recall that the coefficients do not equal the slope of the demand curve,
but rather the reciprocal of the slope. They are the ratio of run over rise instead of
rise over run. This occurs as a consequence of the economist’s convention of
placing quantity on the horizontal axis and price on the vertical axis. To avoid the
awkwardness of using the expression “the reciprocal of the slope” repeatedly, we
shall we shall place the word slope within double quotes to indicate that it is the
reciprocal.