Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Panel Data
Regression Models
Example:
Example: Panel Data Models
The Christenson Associates
airline data constructed by
Christensen Associates of
Madison, Wisconsin are a
frequently cite dataset.
For this example, lets
consider the costs of six
airline firms for the period
1970-1984, for a total of 90
panel observations as in
Table 1 in the next slide.
The variable are defined as:
Where i = 1, 2, ,6;
t = 1, 2, ,15
What options do we have?
1. Pooled OLS
We simply pool all 90 observations and estimate a
grand regression, neglecting the cross-section and
time series nature of the data.
No distinction between airlines one airline is as
good as the other. Therefore,
C it = + 1 ln Q + 2 ln PFit + 3 LFit + u it
where i = 1,2,,6;
t = 1,2,,15
10
Pooled OLS
Step 1: Stack data
11
12
Pooled OLS
OLS consists of five core assumptions:
1. Linearity in parameter
2. Exogeneity: Disturbances are not correlated with any
regressors.
3. Disturbance s have constant variance
(Homoscedasticity) and are not related with each other
(no autocorrelation).
4. Variability in X values and X values are fixed in
repeating sampling.
13
Pooled OLS
5. No multicollinerity problem.
If individual effects are not zero, heterogeneity
(individual
specific characteristics
like
intelligence and personality that are not
captured in the regressors) may influence
Assumptions 2 and 3.
Hence, the OLS estimator is no longer best
unbiased linear estimator. The panel data
models provide a way to deal with these
problems.
14
Step 1: Stack
data
Step 2: Apply
OLS
17
where i = 1,2,,6;
t = 1,2,,15
LSDV
Now we can write the FEM
as
Cit = 1 + 2 D2 i + 3 D3i + 4 D4i + 5 D5i + 6 D6i
+ 1Qit + 2 PFit + 3 LFit + uit
19
LSDV estimates
22
2.
3.
Within estimator
26
27
Example: REM
Instead of treating i as
fixed, we assume that it is a
random variable: i = + i
where i is a random error
term with a mean of zero
and a variance of 2 .
28
Example: REM
Note that the error
component consists of two
part: i, which is the crosssection, or individualspecific, error component,
and uit, which is the
combined time series and
cross-section error
component and is sometimes
called the idiosyncratic term
because it varies over time
cross-section (i.e. subject) as
well as time.
29
30
31
Define: S = and
T
i =1
t =1
it
i = 1
2
it
t = 1
NT
=
1
2 ( T 1) S 2
Construct q = FE RE
and
V (q ) = V ( FE ) V ( RE )
Test statistic: m = q
[V (q )] 1 q
where q is distributed as a 2
statistic with k degree of freedom
(where k is the dimensionality of
).
The null hypothesis is that the
preferred model is a random
effects model and the alternative
is that the fixed effects model is
preferred.
33
34