Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Introduction
Increasing trends for healthy lifestyles make people rethink their health habits.
However, although a lot of people know that fruits and vegetables consumption as well
as avoiding unhealthy habits may improve their health, not so many are aware of the
effects of different health behaviors. The impact of health behaviors is useful not only
for preventive health medicine but also in economics, since, healthy people are more
productive.
Several research studies suggested that fruits consumption, physical exercises, good
social relationships have a significant positive impact on health, whereas smoking and
obesity have a negative impact on health [1-4]. We will estimate the effects of several
covariates by using panel data models. First, we will describe the data, then estimate
the coefficients by using both fixed and random effects estimators. And finally, we will
discuss the possible bias and limitations.
Data description
The data come from National longitudinal surveys of youth 1997 (the respondents were
first interviewed in 1997). The data set in its original form consists of the answers of
8984 individuals in 2002, 2007 and 2008. After dropping all variables which the
participants did not answer, the sample size is reduced to about 6000 individuals.
Furthermore, because of attrition in 2008, only answers in the years of 2002 and 2007
are analyzed.
The dependent variable is general health status (1-5 variable, 1 – “very poor health
status”, 5 – “perfect health status”). As we can see, health status of individuals changed
across two years. In 2007 less than 50% of respondents have the same health status
as in 2002.
Oleksandra Cheipesh 11720920
1 6 12 10 7 2 37
2 16 112 154 81 23 386
3 16 182 649 507 178 1,532
4 10 112 587 1,054 467 2,230
5 5 52 286 647 929 1,919
The sample size consists of about 50% of males, 25% of black-skinned people and
21% of Hispanics (table 3). In 2002 interviewers had more stressful events than in
2007 and the share of underweight people decreased by 4%. In contrast, there
were more overweight respondents in 2007: 47% in comparison to 36% (table 2)
unemp
2002 .0912516 .0036861 .0840262 .098477
2007 .072903 .0033278 .0663799 .0794261
div
2002 .0488204 .0027584 .0434135 .0542274
2007 .0286697 .0021361 .0244826 .0328568
over
2002 .3623853 .0061531 .3503243 .3744463
2007 .4670708 .0063864 .4545525 .4795891
under
2002 .1251638 .0042358 .1168611 .1334666
2007 .0853539 .0035766 .0783432 .0923645
Oleksandra Cheipesh 11720920
The values of the regressors vary more between than within the individuals.
However, there is enough variation within individuals, which means that we can use
a fixed effects estimator.
To estimate the coefficients, we include all available regressors and compare the
results of the fixed and random effects estimators. It can be shown (table 5) that
the coefficients of the fixed and random effects estimators are different, and the
Hausman test rejects the null of consistency of the random effects estimator (p-
value is close to zero). This means that we cannot estimate the values of time-
invariant regressors as gender and race, but we allow for time constant endogeneity
Oleksandra Cheipesh 11720920
across individuals. For our fixed effects model, we specify panel robust standard
errors.
Coefficients
(b) (B) (b-B) sqrt(diag(V_b-V_B))
fixed random Difference S.E.
As shown, only the effects of several covariates can be estimated, namely: being
over- and underweighted, time spent on sleep and doing exercises and the number
of eaten fruits (table 6). Therefore, being under- or overweighed decrease the
general health status by approximately 0.16 and 0.21 points respectively, while
every hour of sleep and every time per week when respondents do exercises
improve their health by about 0.028 points. And finally, every 2-3 fruits per week
have a positive impact of 0.04 points.
The control variables of stressful events are insignificant even at the 10%
confidence interval. Moreover, they are not correlated with sleep (correlation is less
than 3%). Therefore, as can be shown, if we drop out these variables in order to
gain efficiency, other coefficients will change only slightly (by less than 0.0001
points, appendix 2).
status can be improved by about 0.5 points. Considering that health is measured
from zero to five, the effects are not so small. However, since almost all
respondents sleep from 6 to 8 hours a day, the role of sleep in this case is not very
big.
F(11,6103) = 15.10
corr(u_i, Xb) = 0.1263 Prob > F = 0.0000
Robust
health_new Coef. Std. Err. t P>|t| [95% Conf. Interval]
sigma_u .75984703
sigma_e .7056016
rho .53696561 (fraction of variance due to u_i)
Although we use a fixed effects estimator, the results may be biased. First, the general
health status is measured subjectively. Therefore, people who started eating fruits and
doing exercises in 2007 might feel themselves better because of placebo effect.
Second, there may be an omitted variable bias of alcohol consumption, smoking, etc.
Sometimes, people who have bad habits, do not have a healthy diet and do not do
exercises. For this reason, the effects of fruits and exercises might me overestimated.
And finally, there may be a reverse causality between doing exercises and general
health status: people who are helathier, are more likely to do exercises.
Due to the fact that we analyze data only across two years, the lagged regressors
cannot be used as instruments. Analyzing the same dataset across more than two
Oleksandra Cheipesh 11720920
years could help to solve the problem of time-varying endogeneity. The same data set
is available for 2009, 2010 and 2011, however, in these years the respondents were
not asked about doing exercises. Nevertheless, if we drop out the variable of doing
exercises, other coefficients will change (appendix 3) and in such a way we will have
an omited variable bias, because exercises are correlated with diet habits and weight.
Conclusion
In order to estimate the impact of health behaviors, stressful events and weight on
health, panel data estimators are used. The cleared data set include 6000 individuals
in 2002 and 2007. As the Hausman test rejected the null of random effects model at
1% confidence interval, fixed effects estimator was used. The results suggested that
such covariates as sleep, fruits consumption and exercises have a positive effect on
health, while being over- and underweighed have a negative impact on health.
Since the data are analyzed only in 2002 and 2007, lagged regressors connot be used
as instruments, that’s why, the results may suffer from endogeneity, which is caused
by reverse causality between health and doing exercises, omited variables of other
health behaviors and estimation error (general health status is self reported). For this
reason, analyzing data across more than 2 years could be a possible improvement.
Reference List
4. James Tsai, Earl S. Ford, Chaoyang Li, Guixiang Zhao, William S. Pearson,
Lina S. Balluz, 2010.Multiple healthy behaviors and optimal self-rated health:
Findings from the 2007. Behavioral Risk Factor Surveillance System Survey.
Preventive Medicine 51, 268–274
Robust
health_new Coef. Std. Err. t P>|t| [95% Conf. Interval]
sigma_u .75961985
sigma_e .70543704
rho .5369329 (fraction of variance due to u_i)
Robust
health_new Coef. Std. Err. t P>|t| [95% Conf. Interval]
sigma_u .7646857
sigma_e .70772878
rho .53862487 (fraction of variance due to u_i)
under over div unemp dec sleep tel comp exer veg
under 1.0000
over -0.2887 1.0000
div -0.0011 0.0016 1.0000
unemp 0.0025 0.0390 0.0312 1.0000
dec 0.0096 0.0210 -0.0002 0.0495 1.0000
sleep 0.0144 -0.0560 0.0097 -0.0226 -0.0242 1.0000
tel 0.0169 0.0301 -0.0123 0.0135 0.0380 0.0597 1.0000
comp -0.0165 0.0476 0.0046 -0.0341 0.0031 -0.0902 -0.1289 1.0000
exer -0.0092 -0.1019 -0.0001 -0.0178 -0.0093 -0.0345 -0.1033 0.0449 1.0000
veg -0.0365 0.0032 -0.0043 -0.0155 0.0173 -0.0144 -0.0993 0.0940 0.1487 1.0000
fruit -0.0281 -0.0344 -0.0058 -0.0407 -0.0037 0.0131 -0.0876 0.1017 0.2067 0.4920
fruit
fruit 1.0000