
Copyright May 1st, 2014 by Alan Aragon
Home: www.alanaragon.com/researchreview
Correspondence: aarrsupport@gmail.com

Optimizing activity-based fat loss for aesthetic athletes:
Interval or steady-state training?
By Joel Minden, PhD, CSCS

How to manipulate research.
By James Heathers, PhD(c)

Changes in exercises are more effective than in loading schemes
to improve muscle strength [reviewed by Brad Schoenfeld, PhD,
CSCS, CSPS, FNSCA].
Fonseca RM, Roschel H, Tricoli V, de Souza EO, Wilson JM,
Laurentino GC, Aihara AY, de Souza Leão AR, Ugrinowitsch C.
J Strength Cond Res. 2014 May 14. [Epub ahead of print] [PubMed]

The effects of consuming a high protein diet (4.4 g/kg/d) on
body composition in resistance-trained individuals.
Antonio J, Peacock CA, Ellerbroek A, Fromhoff B, Silver T.
J Int Soc Sports Nutr. 2014 May 12;11:19. [PubMed]

An amino acid-electrolyte beverage may increase cellular
rehydration relative to carbohydrate-electrolyte and flavored
water beverages.
Tai CY, Joy JM, Falcone PH, Carson LR, Mosman MM, Straight JL,
Oury SL, Mendez C, Loveridge NJ, Kim MP, Moon JR. Nutr J 2014,
13:47 doi:10.1186/1475-2891-13-47 [PubMed]

Calorie shifting diet versus calorie restriction diet: a
comparative clinical trial study.
Davoodi SH, Ajami M, Ayatollahi SA, Dowlatshahi K, Javedan G,
Pazoki-Toroudi HR. Int J Prev Med. 2014 Apr;5(4):447-56. [PubMed]

Processed foods - are they really that bad for you?
By Chris & Eric Martinez

How can you get through to people who *think* they understand
the science behind a certain topic?
By Alan Aragon

Alan Aragon's Research Review May 2014


Optimizing activity-based fat loss for aesthetic athletes:
Interval or steady-state training?

By Joel Minden
__________________________________________________
For aesthetic athletes, such as dancers, gymnasts, and
bodybuilders, managing body mass and composition is just as
important as sport-specific training. At a selected body weight,
fat mass should be minimized, and dietary strategies, such as
caloric restriction or macronutrient manipulation, are frequently
used to achieve this. For those who prefer to emphasize
activity-based methods to reduce body fat, the optimal strategy is
unclear. Although increasing activity to create a negative energy
balance should be the primary goal, there is considerable debate
concerning the differential effectiveness of interval versus
steady-state training. Perhaps the lack of consensus is due to the
fact that empirical research in this area is compromised by
methodological limitations and an inability to control, either
physically or statistically, for the numerous contextual variables
that cloud interpretability.
For example, research on acute metabolic responses to exercise
is sometimes criticized for the artificiality of the experimental
setting, limited time course, and uncertain relation of measured
variables (i.e., substrate utilization, gas exchange, plasma, and
biopsy data) to long-term changes in body composition.
Similarly, research on chronic responses to exercise has its own
set of limitations: individual differences in protocol compliance,
nonexercise activity, and dietary behavior; unknown accuracy of
subjects' record keeping; and questionable reliability of
instruments used to track changes in body composition. Finally,
both acute and chronic outcome data should be interpreted
within the context of participant variables, including
demographic characteristics and fitness levels, and dimensions
of training protocols, such as modality, intensity, duration, and
frequency of exercise. In light of these factors, it's no surprise
the efficacy debate continues.
Despite the many challenges to interpretability, consistencies in
the literature can be identified, and tentative conclusions can be
made by directly comparing the effects of multi-week interval
and steady-state training programs on body mass and
composition. Given the enthusiasm for interval training in both
scientific and popular media, it's somewhat surprising that these
direct comparisons are limited. In the following section, I'll
present the results of these studies. For ease of interpretation,
data on strength training or diet-only conditions will not be
reported, nor will metabolic or cardiovascular outcome data.
Studies that compared interval training to no-exercise controls or
those that combined interval with steady-state training will also
be excluded. In all studies, interval training sessions, unless
otherwise noted, included 4 to 15 work intervals performed for
15 to 240 seconds, with each repetition followed by low- to
moderate-intensity periods of active recovery for up to 4
minutes.
The Research
In perhaps the earliest direct comparison, Thomas et al1 assigned
recreationally active male and female college students to
steady-state or interval running programs matched for energy
expenditure, 500 kcal per session. Exercise bouts were
performed 3 times per week for 12 weeks. After statistically
controlling for pre-intervention differences in body composition
(assessed through hydrostatic weighing), the data revealed that
subjects in both conditions experienced a reduction in body fat
percentage. There were, however, no differences between the
exercise conditions.
Following the emergence of research by Tremblay et al,2
steady-state endurance training as a fat loss strategy was dismissed by
many as inferior to intense but brief interval training. In this
classic study, adults with no previous exercise history completed
either a 20-week endurance training program or a 5-week
endurance training program followed by 15 weeks of interval
training bouts that varied in duration and intensity.
Heralded as a breakthrough study two decades ago, the results
appeared to demonstrate a paradoxical advantage of brief
interval work for fat loss despite an energy cost well below that
of endurance training. Although frequently noted for its finding
that subcutaneous fat loss was ninefold greater for those in the
interval condition, this estimate was made after statistically
correcting for the energy cost of each type of exercise. When
actual fat loss between the two conditions was compared, the
difference was nonsignificant. Other aspects of this heavily cited
study make firm conclusions about fat loss differences by
protocol difficult: the undetermined reliability of skinfold data,
the inclusion of an endurance training component (25 30-minute
sessions) to the interval training program, and no control for
dietary behavior.
Years after the release of this promising study, additional
evaluations of interval training began to emerge, the bulk of
which failed to demonstrate any reliable advantage of interval
training. For example, in Tjønna and colleagues' 16-week study
of metabolic syndrome patients,3 subjects exercised on inclined
treadmills, and work volume for the interval and endurance
conditions was equivalent. Both groups experienced reductions
in weight, BMI, and waist circumference, but no differences
between the groups were observed.
Trapp et al4 compared fat loss outcomes of a 20-minute interval
program and a 40-minute steady-state program, both performed
by young adult women on cycle ergometers 3 times per week for
15 weeks. Despite the difference in duration of exercise bouts,
estimated energy expenditure over the study period for the two
groups was equivalent. This was achieved by having subjects in
the interval condition perform 60 8-second intervals, followed
by 12-second recovery periods, in each session. The interval
training group, but not the steady-state group, experienced a
reduction in DEXA-measured fat mass (~2.5 kg) at the
completion of the study. This apparent intervention effect must,
however, be interpreted with caution due to pre-existing group
differences. At the beginning of the study, the mean fat mass for
the interval group was 3.8 kg greater than that of the steady-state
group, and follow-up analyses revealed that approximately of
the variance in fat loss was accounted for by level of body fat at
the beginning of the study.
Schjerve et al5 compared fat loss responses in obese adults to 12
weeks of interval or steady-state treadmill training performed 3
times per week. Conditions were equalized for energy
expenditure. Both groups experienced similarly small but
significant reductions in weight, BMI, and body fat percentage.
There were no differences between the conditions in these
outcomes.
Wallman et al6 examined the effects of 8 weeks of interval or
steady-state training performed by overweight and obese men
and women 4 times per week on a cycle ergometer. Energy
expenditure between the two conditions was equivalent. The
results yielded nonsignificant reductions in weight or fat mass
for both conditions.
Perhaps the greatest support for fat loss benefits of interval
training comes from MacPherson et al.7 In this study,
recreationally athletic college-aged men and women performed 3
weekly sessions of sprint interval training or steady-state
running for 6 weeks. Both groups experienced significant
reductions in body fat percentage and fat mass, as well as small
increases in lean mass. Although the interval group experienced
a larger total decrease in fat mass (1.7 kg vs. 0.8 kg), the
difference between the conditions was nonsignificant. In contrast
to the methods used in the aforementioned studies, MacPherson
et al. did not attempt to equalize work or energy expenditure,
which makes the difference in total exercise time across the
study period (13.5 and 0.75 hours for the steady-state and
interval conditions, respectively) noteworthy. Nevertheless,
subjects in the interval condition were encouraged to engage in
active rest on the treadmill for 4 minutes following each of
their maximal effort sprints, which resulted in a total activity
time commitment of 6.75 hours.

Recently, Keating et al8 compared fat loss outcomes for
overweight adults randomly assigned to either an interval or
steady-state cycle ergometer program. Both groups performed
exercise 3 days per week for 12 weeks. There was a significant
decrease in DEXA-measured body fat percentage for the
steady-state (-2.6%) but not the interval (-0.3%) group. The absence of
change for the interval group is somewhat unexpected, given
that the aforementioned studies found equivalent effects for the
two types of training. The authors indicated that this result may
be partially explained by the use of interval training bouts that,
to protect this clinical population, were less intense than those
used in previous studies. However, a comparison of protocols
shows the intensity of interval training for the Keating et al.
subjects (~120% of VO2peak and ~90% of maximal heart rate)
was consistent with those used in other studies (e.g., Schjerve,
Tjønna, Wallman and their colleagues) of overweight or obese
subjects.
An alternative explanation is that unmeasured subject variables
contribute to responsivity to exercise. Graphs from the Keating
et al. study show considerable within-group variability for both
exercise groups in body fat percentage change. In fact, some
subjects in both conditions actually gained body fat. This
highlights the importance of going beyond the aggregate data to
search for individual differences that distinguish responders
from non-responders.
Conclusion
Collectively, the data reveal that interval training offers no
reliable advantage over steady-state endurance training for fat
loss. In addition, the effectiveness of interval training is more
likely to be demonstrated when work or energy expenditure is
matched to that of steady-state protocols. This suggests that, in
spite of any acute metabolic or cardiovascular benefits of
interval training, intense but brief exercise is insufficient for
stimulating meaningful fat loss. This was indirectly highlighted
in Boutcher's recent review of research in this area.9 Of the 6
interval training studies in which fat loss outcomes were
identified, the two cited (Boudou et al,10 Mourier et al11) for
demonstrating the strongest effects included 2 days per week of
45-minute steady-state training bouts in a program with only one
interval-training day each week.

Regarding application, assuming energy intake is regulated,
activity-based fat loss programs should prioritize energy cost of
exercise and activity preference. For athletes already involved in
frequent and intense sport-specific training, activities that have a
negative impact on quality of practice and competitive
performance should be avoided. If interval training results in
poor program compliance, fatigue, overeating, and reduced daily
activity, alternative strategies should be explored. For aesthetic
athletes, a realistic fat-loss strategy might involve small dietary
changes combined with low- to moderate-intensity exercise,
such as uphill walking at a comfortable pace, performed for an
extended duration. In sum, although intense interval training has
value to the athlete, it may not be the best option for fat loss. In
the larger context of athletic training, a moderate, comfortable
approach offers the greatest chance for success.
____________________________________________________
Joel Minden, Ph.D., CSCS, is a lecturer in the psychology and
kinesiology departments at California State University, Chico.
He writes about strength and conditioning, nutrition, sport
psychology, and dance for his website www.joelminden.com.

____________________________________________________
References
1. Thomas, T. R., Adeniran, S. B., & Etheridge, G. L. (1984).
Effects of different running programs on VO2 max, percent fat,
and plasma lipids. Canadian Journal of Applied Sport Sciences,
9(2), 55-62. [PubMed]
2. Tremblay, A., Simoneau, J. A., & Bouchard, C. (1994). Impact
of exercise intensity on body fatness and skeletal muscle
metabolism. Metabolism, 43(7), 814-818. [PubMed]
3. Tjønna, A. E., Lee, S. J., Rognmo, Ø., Stølen, T. O., Bye, A.,
Haram, P. M., Loennechen, J. P., Al-Share, Q. Y., Skogvoll, E.,
Slørdahl, S. A., Kemi, O. J., Najjar, S. M., & Wisløff, U.
(2008). Aerobic interval training versus continuous moderate
exercise as a treatment for the metabolic syndrome: a pilot
study. Circulation, 118(4), 346-354. [PubMed]
4. Trapp, E. G., Chisholm, D. J., Freund, J., & Boutcher, S. H.
(2008). The effects of high-intensity intermittent exercise
training on fat loss and fasting insulin levels of young women.
International Journal of Obesity, 32(4), 684-691. [PubMed]
5. Schjerve, I. E., Tyldum, G. A., Tjønna, A. E., Stølen, T.,
Loennechen, J. P., Hansen, H. E., Haram, P. M., Heinrich, G.,
Bye, A., Najjar, S. M., Smith, G. L., Slørdahl, S. A., Kemi,
O. J., & Wisløff, U. (2008). Both aerobic endurance and strength
training programmes improve cardiovascular health in obese
adults. Clinical Science, 115(9), 283-293. [PubMed]
6. Wallman, K., Plant, L. A., Rakimov, B., & Maiorana, A. J.
(2009). The effects of two modes of exercise on aerobic fitness
and fat mass in an overweight population. Research in Sports
Medicine, 17(3), 156-170. [PubMed]
7. Macpherson, R. E., Hazell, T. J., Olver, T. D., Paterson, D. H.,
& Lemon, P. W. (2011). Run sprint interval training improves
aerobic performance but not maximal cardiac output. Medicine
and Science in Sports & Exercise, 43(1), 115-22. [PubMed]
8. Keating, S. E., Machan, E. A., O'Connor, H. T., Gerofi, J. A.,
Sainsbury, A., Caterson, I. D., & Johnson, N. A. (2014).
Continuous exercise but not high intensity interval training
improves fat distribution in overweight adults. Journal of
Obesity, 2014. [Journal of Obesity]
9. Boutcher, S. H. (2010). High-intensity intermittent exercise and
fat loss. Journal of Obesity, 2011. [Journal of Obesity]
10. Boudou, P., Sobngwi, E., Mauvais-Jarvis, F., Vexiau, P., &
Gautier, J. F. (2003). Absence of exercise-induced variations in
adiponectin levels despite decreased abdominal adiposity and
improved insulin sensitivity in type 2 diabetic men. European
Journal of Endocrinology, 149(5), 421-424. [PubMed]
11. Mourier, A., Gautier, J. F., De Kerviler, E., Bigard, A. X.,
Villette, J. M., Garnier, J. P., Duvallet, A., Guezennec, C. Y., &
Cathelineau, G. (1997). Mobilization of visceral adipose tissue
related to the improvement in insulin sensitivity in response to
physical training in NIDDM: effects of branched-chain amino
acid supplements. Diabetes Care, 20(3), 385-391. [PubMed]


How to manipulate research.


By James Heathers
_________________________________________________________________

Most of the audience for this article probably pays attention to
the broader scientific literature in exercise and musculoskeletal
physiology, strength and conditioning, nutrition, dietetics and
sports medicine. From this, you take the available evidence and
you slot it somewhere into an available framework of what's
already known. This, everyone is familiar with.
What people generally don't know is how to cheat.
Yes, cheat. Let me outline why you would: the way academic
funding presently works is that, in general, output is rewarded
over insight (two papers are better than one). So, the more you
write, the better off you're going to be. There are many problems
with this, and the environment it creates. One of those problems
is that it becomes very tempting to 'massage' results from
different research projects in order to achieve reportable
outcomes. I should mention here that the majority of the time
this isn't actually dishonesty; it's the fact that researchers have
convinced themselves that they've asked a good question, and
that if they just change a few key variables with the analysis and
reporting, suddenly they'll have the result that they know is
there. And when that result turns up, it was because the initial
analysis was wrong.
Unfortunately, science doesn't work like that. Much of my
academic work is in dealing with problems surrounding this
issue; I am a methodologist. This means I concentrate heavily on
how research should be conducted: essentially, research into
research. Methodologists develop new techniques in analysis,
and verify that old ones work in the manner we hope they do.
Think of the production of knowledge via academic outcomes as
a game of poker. Research, like poker, is an expensive,
stochastic process full of frustration, late nights, and alcohol,
but, also like poker, eventually if you're good enough, the
balance of probabilities favours you winning. Insight, like money,
is hard won.
This process comes to a scrunching halt when someone starts to
obscure the honest truth of what happened in a study, because
there is no skill or reason that can be applied. You literally can't
win, because the odds of something being supportable or
repeatable are being manipulated.
However, just like poker, there are 'tells': certain signs which
allow you to detect another process at work. This is a partial list
of those tells, illustrated with examples drawn liberally from the
medical and social sciences. I've tried to use exercise science
and nutrition studies where convenient, but the principles are the
same regardless; often I've simply chosen the most convenient
examples that have come to mind. Please bear in mind I don't
think these papers are guilty of any kind of conscious
dishonesty; they are merely convenient examples of the
principles involved.

This list is not comprehensive and is in no particular order;
some of them work over time across different papers, some of
them are specific to individual papers. These errors are both
common and uncommon, both serious and trivial. They have
various degrees of culpability (likely intent to deceive),
significance (the ability to influence the outcome of the study
overall), and detectability (how easy it is to spot from the article
text). All have the potential to be dishonest.
1. Altered endpoints, timepoints or measurement criteria
Murphy et al1 investigated the effect of beetroot consumption on
running performance due to its nitrate content. Eleven
participants (n=11) received a supplement of either beetroot
(standardised to contain 500mg nitrates) or cranberry puree, in a
double-blind cross-over fashion. Their heart rate, perceived
exertion and time to completion of a 5km run were recorded. In
the first mile, participants rated their perceived exertion
significantly higher in the cranberry condition. In the final
1.8km, participants were significantly faster in the beetroot
condition.
Why was a 5km run broken into miles (i.e. 0-1.6km, 1.6-3.2km,
3.2-5km)?
There are an infinite number of ways to divide a time interval
into pieces. This analysis could have been performed as single
kilometre intervals, or using a simple statistical model which
predicts the overall effect of time through the race on exertion,
and the overall difference between the groups by time. There is
no reason to use a unit of measurement invented by the ancient
Romans and formally defined in 1593.
Researchers are well aware that trying different assortments of
time intervals can uncover differences between timepoints due
to random variation. Say we split the data into 100m intervals:
there are now 50 separate comparisons over the 5km where we
can analyse the difference between our Beetroot and Cranberry
groups. We are essentially making so many comparisons that
one will come out significant simply due to the noise present in
the measurement.
(Of course, there are methods for statistically controlling
multiple comparisons,2 but researchers don't report all the
comparisons they used... in this case, the reader doesn't know
that these multiple comparisons need to be controlled.)
The other extreme is also a problem. Say we analyse the dataset
only over the whole 5km, but beetroot consumption improved
the finishing speed of the run. This would be a highly significant
finding, as we know that in middle/long distance races there is
already a pattern between laps or race phases (e.g., Tucker et
al3).
Culpability: low to medium
Significance: medium
Detectability: high
2. Conveniently one-sided significance testing
3. Methodological fiddles
(These are not always associated, but they've been so neatly
combined in a paper from a few years ago that I've put them
together here.)

Christian et al4 enrolled n=279 in a computer-support program
for weight loss at an American public hospital. All participants
were suffering from metabolic syndrome. Participants were
given a full health and blood screening 12 months apart, and
were assigned to either a computer-based tailored lifestyle
intervention or a standard package of information on weight
management. Participants were more likely to lose weight in the
intervention vs. control group (-3.3lbs vs 0.33lbs, p=0.002).
Participants who lost more than 10% BW had lower total
cholesterol (-14.9 vs -3.9, p=0.05) which appears to be driven by
the loss of LDL cholesterol (-14.0 vs. -4.1, p=0.04).
Why was the outcome of the program determined by a group of
people who lost 5% of bodyweight, which included members of
BOTH the intervention and the control group?
This is the methodological fiddle: there were n=46 participants
who lost more than 10% BW, and n=11 of them were from the
control group (and thus n=35 from the intervention). These were
lumped together to create the impression that the program was
effective. This is hardly the most honest conclusion when about
a quarter of the people with significant weight loss were from the
control group; it would be more accurate to say here that people
with metabolic syndrome who lose weight improve their serum
lipids regardless of how they do it. This is hardly evidence in
favour of the intervention.
The other fiddle is staring us in the face from the p-values
above:
Why was the above difference assessed with a one-sided t-test?
As we're all probably aware here, the p-value is the calculated
probability of getting a result at least as extreme as the one
observed if the null hypothesis is true. In this case, the null
hypothesis is that the standard intervention and the
computer-based intervention were identical. When a result is
sufficiently unlikely to have occurred on this basis, we accept
that the experimental hypothesis is true; in other words,
that our intervention has actually intervened.
One-tailed statistical tests assume that the effect will have a
direction (i.e. A will be higher than B). When the observed effect
falls in the predicted direction, this halves the p-value, which
gives you twice the statistical flexibility you would otherwise
have in a two-tailed test.
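The halving can be checked with a minimal sketch. It assumes a normal (z) approximation rather than the t-tests the paper used, and the statistic z = 1.75 is a made-up illustrative value, not a number taken from Christian et al.

```python
from statistics import NormalDist


def p_values(z: float) -> tuple[float, float]:
    """Return (one_tailed, two_tailed) p-values for a z statistic.

    The one-tailed value assumes the predicted direction is "greater than".
    """
    std_normal = NormalDist()  # mean 0, standard deviation 1
    one_tailed = 1.0 - std_normal.cdf(z)
    two_tailed = 2.0 * (1.0 - std_normal.cdf(abs(z)))
    return one_tailed, two_tailed


# z = 1.75 is a hypothetical test statistic chosen for illustration only.
one, two = p_values(1.75)
print(f"one-tailed p = {one:.3f}, two-tailed p = {two:.3f}")
```

With this hypothetical statistic the same data fall below the conventional 0.05 threshold under a one-tailed test and above it under a two-tailed test, which is exactly the temptation described here.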
There are a few situations where one-tailed tests are necessary.
Firstly, when we have a strong directional hypothesis: good
evidence that our intervention should be better than the control
group. In this case, we do not: the researchers mention previous
work with the same intervention being only somewhat effective
in a diabetic sample. Secondly, when we are using very few
t-tests to compare different values. In this case, we do not: the
researchers have around twenty individual tests.
However, these are not hard and fast rules, and researchers often
have another rule of thumb which simply goes like this: one-tailed
tests are what you use when you're trying to get something
to achieve a criterion of significance when it hasn't quite made it.
They have traditionally found refuge in questionable results, and
as we've just discussed, they're being used here to assess the
difference between 'did lose weight' and 'didn't lose weight'
regardless of group. A classic fiddle, and one the reviewers
really should have spotted.
Conveniently one-sided tests
Culpability: medium to high
Significance: low to medium
Detectability: high
Methodological fiddles
Culpability: medium
Significance: medium
Detectability: medium to high
4. Overly complicated or uninterpretable models
Another rather impressive looking technique is to take individual
measures which are quite complicated and roll them into a
model far more complicated than the average reader can
understand. Social scientists try this much more than exercise
physiologists, in my experience. But it does occur.
A recent paper5 studied the split times of 2 world-record
marathon runs, most recently Patrick Makau's Berlin Marathon
(2011) which was a scarcely believable 2 hours, 3 mins and 38
seconds. It describes several different curve fits possible to these
runs, combines headwind and gradient data with individual
kilometre time splits, and tries to find an optimal model or
pacing strategy.
Towards the conclusion it states:
"Oscillations at the micro-level overlay low-frequency,
macro-level oscillations or modes indicating that an athlete's
resulting pacing trace represents a potentially complex amalgam
of numerous signalling processes emanating from the brain,
each with their own activation frequency."
Of course, concluding that the best ever marathon times are
employing highly sub-optimal pacing strategies seems wildly
implausible. Because of the extraordinary amount of
competition over such a long period of time, one might assume
that either a) the best times ever were, in fact, fairly well paced
by definition or b) that an optimal strategy doesn't exist due to
individual differences that are impossible to predict (a stubbed
toe, a very slightly tight hamstring, a bad night's sleep, a
micro-change in gradient, and so on). An optimal pacing strategy
can't be followed, of course, if it's highly impractical. That is
essentially stating 'If X was possible, then it would be better' in
an environment where X can't practically be performed.
Culpability: low
Significance: low to medium
Detectability: high
5. Over-testing, a.k.a. random sifting
I've thought long and hard about how to get you an example of
this, and I'm not sure I can. Here's how random sifting works:

• We decide to measure the effect of a new training
regime of volume squats on short-course track times.
• We assign 30 experienced middle-distance runners
equally to three groups: no extra training, 1 extra
training day of squats per fortnight, and 3 extra training
days per fortnight.
• We take demographic variables to start with (age,
gender, race), anthropometry (height, weight, BMI,
body composition), bloods (C-reactive protein, cortisol)
and training readiness (neurological assessment, heart
rate variability).
• We take race variables (400m time, 3 times w. 5 mins
rest between races), and 1500m time (with lap split
times). Participants rate perceived effort and
pain/soreness after each race. Then we run the program
for 6 weeks, test all the above again (mid-line) and test
again at 12 weeks. Naturally, we record the poundages
moved in each session for the two training groups.

Not the worst design ever, right? Comprehensive, detailed?
Wrong. It's dreadful.
This is the most unholy octopus of impossible interlocking
variables you'll ever see. Any one of the above can be used to
control for, or combine with, any other. Variables you add to a
study are not additive: if I measure seven things at one
timepoint, I don't have seven potential comparisons in the data. I
have instead any combination of the presence or absence of
those variables, using a cut-off that I define (or choose from the
literature), or using the top or bottom standard deviation, or all
the values over the mean (or the median) to define groups.
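To put rough numbers on this, the sketch below counts the analyses available from seven measured variables and estimates the chance that at least one of them comes out 'significant' by luck alone. It assumes independent tests at a 0.05 threshold, which real, correlated analyses will not satisfy exactly; the function names and the choice of three grouping rules per variable are illustrative assumptions.

```python
from math import comb


def count_variable_subsets(n_variables: int) -> int:
    """Number of non-empty subsets of the measured variables (2**n - 1)."""
    return sum(comb(n_variables, k) for k in range(1, n_variables + 1))


def count_with_grouping_rules(n_variables: int, n_rules: int) -> int:
    """Analyses available when each variable is either left out or
    included under one of n_rules alternative grouping rules."""
    return (n_rules + 1) ** n_variables - 1


def familywise_error_rate(n_tests: int, alpha: float = 0.05) -> float:
    """Chance of at least one false positive across n_tests independent
    tests when no real effect exists: 1 - (1 - alpha)**n_tests."""
    return 1.0 - (1.0 - alpha) ** n_tests


subsets = count_variable_subsets(7)           # presence/absence only
with_rules = count_with_grouping_rules(7, 3)  # 3 rules is an assumption
for n_tests in (50, subsets, with_rules):
    print(n_tests, round(familywise_error_rate(n_tests), 4))
```

The first value in the loop, 50, is the number of 100m splits in the 5km example from the first section. Under these assumptions, presence or absence alone gives 127 candidate analyses, and the chance of at least one spurious hit among them is already above 99%.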
With full access to the above information in my hypothetical
study, there are so many ways to combine the outcomes that the
answers you find border on meaningless unless the results are
statistically very strong. Make no mistake: if I had the above
dataset, I am 100% entirely confident that I could produce a set
of statistical analyses which conclusively showed that our squat
intervention was effective. Even if our squat intervention did
literally nothing or even made performance worse.
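A small simulation makes the same point. This is a minimal sketch under stated assumptions: every outcome is pure noise (the intervention does literally nothing), each group has 10 runners as in the hypothetical design, and the p-value uses a normal approximation to a two-sample test, which is slightly liberal at this sample size. The function names, the 200 outcomes and the seed are all hypothetical choices.

```python
import random
from statistics import NormalDist, mean, stdev


def two_sided_p(group_a: list[float], group_b: list[float]) -> float:
    """Approximate two-sided p-value for a difference in means,
    using a normal approximation with unpooled variances."""
    standard_error = (
        stdev(group_a) ** 2 / len(group_a) + stdev(group_b) ** 2 / len(group_b)
    ) ** 0.5
    z = (mean(group_a) - mean(group_b)) / standard_error
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))


def count_false_positives(
    n_outcomes: int = 200, n_per_group: int = 10, alpha: float = 0.05, seed: int = 1
) -> int:
    """Test n_outcomes noise-only outcomes and count how many look 'significant'."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_outcomes):
        # Both groups are drawn from the same distribution: no true effect.
        control = [rng.gauss(0.0, 1.0) for _ in range(n_per_group)]
        squats = [rng.gauss(0.0, 1.0) for _ in range(n_per_group)]
        if two_sided_p(control, squats) < alpha:
            hits += 1
    return hits


print(count_false_positives(), "of 200 null comparisons fell below p = 0.05")
```

Any one of those chance hits, reported on its own, would read as evidence that the squat program worked.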
The only trick is to hide all the analyses that didn't work, then
write up the one analysis which worked by pure chance as being
predicted by specific research questions that we started with.
This is formally called post-hoc reasoning and is very hard to
detect. After you test hundreds or thousands of pathways
through the above variables and find that, say, any squat
intervention (1 or 3 sessions per fortnight) is effective on split
times in 1500m but not total times, and reduces perceived effort
but only in men, you then come up with a reason which
specifically addresses why you might find this (and you choose
past literature to reference accordingly).

The behavioural economist and statistical guru Uri Simonsohn
has a now-classic paper which conclusively proves that listening
to the song "When I'm 64" actually makes you older.6 Obviously,
this is a crazy conclusion because a song can't modify your age,
but it is borne out of the analysis that he conducted simply by
hiding all the analyses which didn't work.

He also has a great statement that he encourages reviewers to
send to every paper they peer-review which goes like this:

"I request that the authors add a statement to the paper
confirming whether, for all experiments, they have reported all
measures, conditions, data exclusions, and how they determined
their sample sizes."

In other words, if the researchers have tested hundreds or
thousands of models trying to find a result, they need to report
the fact that they did so. This statement forces researchers to
either a) assent to the statement and upgrade their untrustworthy
analysis to outright fraud or b) admit that over-testing occurred.
The best way of controlling for this rather insidious and
hard-to-detect method is study pre-registration: this is where the
researchers write and publish a formal prediction of their study
outcome before they start the research. It's not a perfect solution,
but it's much better than the alternative.
Culpability: medium to high
Significance: medium to high
Detectability: low

6. The creeping over-extrapolation

This fiddle is a little different to the others, as it involves the
external perception of the study. It's also very common, so
common that it took me about 45 seconds to find this example.
The science journalism site sciencealert.com.au ran this rather
bold headline a month or so ago.
"Depression can be detected with a blood test"
Interesting, right? Here's the subheadline, now that has your
attention.
"Doctors may soon be able to diagnose mental illness
with a simple blood test, new research suggests."
Sounds like a breakthrough, right? Not so fast. The title of the
article it's describing is:
"Platelet Serotonin Transporter Function Predicts Default-Mode
Network Activity"7
Here's the glossy and rather tortured logic that connects them:
The serotonin transporter protein removes serotonin from
extracellular space. The main method of this is via the
transporter protein on blood platelets. There is also a good
relationship between this platelet uptake and the synaptosomal
uptake (the uptake by areas of the brain).

Separately, there is a relationship between depression and the
activity of the "default-mode network" in the brain, a
coordinated system of activity which is active at rest and seems
to be implicated with receiving and processing information
which is self-referential. It is hypothesised that this network is
disrupted in depressed people, which is the hypothetical source
of intrusive thoughts and poor concentration in depression.

Finally, we know that serotonin is implicated in depression as
serotonin reuptake inhibitors are a frontline treatment for


depression. That is to say, like most psychotropic medication,
they work sometimes in some people. We also are well aware
that while they fairly straightforwardly increase free serotonin
levels, this is probably NOT their primary method of action
(otherwise, why would these drugs which raise serotonin in 20
minutes take weeks to start improving mood in depressed
patients?).
But anyway: if we can measure the blood platelet serotonin
reuptake velocity (related to the same function in the brain), it
might be related to the metabolic activity of the brain by the
default-mode network (impaired in depression; serotonin
implicated in function).

So the researchers took a sample of healthy people and found a
reasonable relationship between their blood platelet serotonin
uptake and the function of the default-mode network as
measured by blood-oxygen level dependent fMRI scan.
And finally, please recognise that the above is itself a
simplification.
This is what gives us "depression detected with a blood test".
I understand, of course, that journalism sensationalises
complicated topics like neurobiology. But the obvious caveat to
that is it really shouldn't simplify something so much that it isn't
reasonably true anymore. And why would they do such a thing?
Well, partly because it's their job, but also partly because the
researchers put out a press release with exactly the same
headline, containing wonderfully compelling but detail-poor
sentences such as "serotonin transporter regulates neural
depression networks".
This is creeping over-extrapolation. You start with a result
which, as far as I can tell, is a fairly solid piece of neurobiology
relating brain oxygen level uptake over certain cortical networks
to measured platelet serotonin uptake in the blood. Then you
write a paper discussion and abstract which extrapolates the
results somewhat, talking about what might be possible in future
(if several important caveats are true). About this, you write a
simplified press release which presents the results in a glowing
light and presents those extrapolations as the point of the paper.
Then you let a journalist with no formal science education write
about it.8
I'm including this as an error researchers make because it's the
21st century, and researchers have an obligation to ensure that
their research is correctly reported. It is common for researchers
trying to justify the external impact of their work in grant
applications to collect these lazy, overwhelmingly positive
stories and list them prominently on their CVs. Be cautious of
any academic who is proud of how many newspaper articles are
written about them.
Culpability: medium
Significance: medium to high
Detectability: high

7. Outlier forgetting
8. Outlier remembering

Again, these are together because they are closely related.

Outlier remembering
Reger et al9 matched a controlled dose of medium-chain
triglyceride (MCT) oil in n=20 Alzheimer's Disease patients, to
see if the presence of blood ketones had an immediate effect on
cognition. I have reproduced the graph of the central result here
on the left: it shows that an increase in performance on a
cognitive task was correlated with increase in blood ketones.
Or was it?


That point you can see on the left-hand side of the left-hand
graph with the big arrow represents a participant who performed
much more poorly on the cognitive task after MCT oil than after
placebo. This is the dead-set opposite of what was predicted, a
decrease in performance a few times bigger than the alleged
increases in performance observed in other people. There's no
good reason for this to happen, and it's both in the opposite of
the predicted direction and dramatically in excess of everyone
else's change scores.
Now, there are several tests which determine whether or not a
value is an outlier. Some researchers simply do this by feel,
but the more correct way is with a test which compares the value
to the rest of the sample. The most common version of this is
Grubbs' test,10 and this flags that value as being an outlier.
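As a minimal sketch of what "comparing the value to the rest of the sample" means, the Grubbs statistic G is the largest absolute deviation from the sample mean divided by the sample standard deviation. The helper names and the change scores below are invented for illustration and are not the Reger et al data. The critical value depends on sample size and alpha and has to be taken from a table or computed from the t distribution; the figure of about 2.71 for n = 20 at a two-sided alpha of 0.05 is quoted from memory of standard tables and should be checked before use.

```python
import math


def grubbs_statistic(values):
    """Return (G, index) for the value farthest from the sample mean."""
    n = len(values)
    if n < 3:
        raise ValueError("Grubbs' test needs at least three values")
    mean = sum(values) / n
    # Sample standard deviation (n - 1 in the denominator).
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
    if sd == 0:
        raise ValueError("all values are identical")
    index = max(range(n), key=lambda i: abs(values[i] - mean))
    return abs(values[index] - mean) / sd, index


def is_grubbs_outlier(values, critical_value):
    """Flag the most extreme value if G exceeds a caller-supplied critical value."""
    g, index = grubbs_statistic(values)
    return g > critical_value, index, g


if __name__ == "__main__":
    # Hypothetical change scores for n = 20, with one large negative value.
    change_scores = [1.0, 2.0, 0.5, 1.5, 3.0, 2.5, 0.0, 1.0, 2.0, 1.5,
                     0.5, 2.5, 1.0, 3.0, 2.0, 1.5, 0.5, 1.0, 2.0, -12.0]
    # 2.71 is an assumed table value for n = 20, two-sided alpha = 0.05.
    print(is_grubbs_outlier(change_scores, critical_value=2.71))
```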
Why was the outlier left in?
When this value is removed, the result weakens from p=0.02 to
p=0.08, and the r value (the correlation coefficient) falls from
0.5 to 0.42. In other words, removing it waters down the impact
of the central finding. While it isn't
actually a big difference, it does cast doubt on the central
result.11
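One way to check how much a single participant drives a correlation is to compute r with and without that point. This is a sketch under stated assumptions: the function names are hypothetical, the demo numbers are invented rather than taken from the paper, and it reports only r, not the p-value.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient for two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length, n >= 2")
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def r_with_and_without(xs, ys, index):
    """Return (r for all points, r with the point at `index` dropped)."""
    kept_x = list(xs[:index]) + list(xs[index + 1:])
    kept_y = list(ys[:index]) + list(ys[index + 1:])
    return pearson_r(xs, ys), pearson_r(kept_x, kept_y)


if __name__ == "__main__":
    # Invented data: the last point sits far from the rest on both axes.
    xs = [1, 2, 3, 4, 10]
    ys = [1, 2, 1, 2, 10]
    print(r_with_and_without(xs, ys, index=4))
```

Reporting both numbers, as in the demo, is the transparent option whichever way the result moves.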
As you can probably tell from this, outliers being included are
very easy to spot. Even when only the means and standard
deviations of numbers are reported, it's usually obvious when
something is off.
Outlier forgetting
It's hard to find an example of outlier forgetting (the removal of
extreme values which disagree with the theory to improve the
central result) for the simple reason that they aren't there to find!
There are some sophisticated methods you can try to determine
if there is enough variation in a sample, but until I'm writing for
Alan Aragon's Statistical Review, we'll have to let these slide.
Suffice to say, this can be a real problem. If you selectively
remove values which ruin your result, it very quickly runs the
risk of becoming straightforwardly dishonest. This is why I don't
have an example of one: all I have is an example of where
someone didn't do it.
You can see a good example of this recently. Kogan et al12
examined the relationship between heart rate variability (HRV,
the same kind we use for athletic monitoring) and depression /
social functioning. They found some values which were outliers,
and repeated the analysis with outliers both out and in, and then
reported the separate models. This is definitely the honest way
to do business: if you're removing values, the fact that you're
doing it, what the values are, and what this changed about the
analysis should ALL be reported in the paper.
Remembering:
Culpability: medium
Significance: medium
Detectability: very high
Forgetting:
Culpability: high
Significance: high
Detectability: low
9. 'Cute' covariates

Arai et al13 looked at the inter-relationship between heart rate
variability (the same kind we use for athletic monitoring) and
QT-interval (another metric of health/autonomic outflow which
we get out of the electrocardiogram) with a sea of possible
covariates in n=150 young participants.
If I criticised everything in this paper which I didn't like, I
would bore you more than is strictly necessary and wear my
fingers down to stumps. So let's leave the criticisms like the
incorrect use of the analysis of covariance to one side, and just
concentrate on what might be useful for you: how to spot
dodgy covariates.
There are several tells here. Firstly, the presence of a lot of
covariates and models for a simple question. Here, 9 measures of
different heart rate indices are compared with seven possible
covariates. As before with our hypothetical squat study, a lot of
possible comparisons is a red flag.
Secondly, the use of covariates which are not statistically
independent. For instance, there are models in the paper which
use BMI in the same model as body fat percentage as measured
by impedance. These numbers will obviously be related, and
inter-relationships between these variables complicate our ability
to understand the study outcomes dramatically.

Lastly, the use of broad appeals. The paper justifies adding
covariates of BMI and fat mass into the sample because it was
relevant elsewhere (because obese and overweight people often
have impaired HRV, for instance). But this sample Arai et al use
is drawn from Japanese students at a school of medicine: the
female sample has a mean BMI of 20.1 and a standard deviation
of 2.1. This means that of the 86 women, it is likely that only one
participant or even absolutely no participants at all were even
overweight (let alone obese). Their comparison paper was drawn
from a sample in Mexico which, as you might be aware, holds
the dubious honour of being the world's most obese country.
Regression is a complicated topic, and it's very easy to hide
dodgy techniques behind a wall of metrics and numbers.
Researchers and reviewers fail to understand the implications
of what they're doing with a concerning regularity.
Culpability: very high
Significance: very high
Detectability: low

10. Conflating statistical and practical effects
DeWall et al14 tested n=93 undergraduates on the Intimate
Partner Violence scale and Trait Physical Aggression scales in
two groups, who received either a placebo or an intranasal dose
of the hormone oxytocin and a priming condition where they
underwent painful / stressful tasks. The paper strongly concluded
that oxytocin increased intimate partner violence inclinations in
participants who were high in trait physical aggression.
Now, this may be strictly true in the statistical sense: the results
are probably calculated correctly.
But does "X is mathematically different to Y" have any meaning
in this context?
The Intimate Partner Violence scale is a series of charming items
where people are asked to score their likelihood of slapping,
shoving, hitting, kicking etc. their current romantic partner. It is
ranked from 1 "not at all likely" through to 5 "extremely likely",
and then averaged. The problem here is the whole group in the
study had an average of 1.13 (SD = 0.39).
I tried to model this, and it's impossible to predict well
(remember that no scores can be below 1). Probably two-thirds
of the entire sample in ALL groups put "not at all likely" for every
single possible answer. The entire sample could be driven by
some combination of a) the very few people who reported some
vague likelihood of violence, and b) the fact that some of the
groups have no mathematical variability AT ALL (everyone
put the same answer). In psychology this is called a floor effect,
and it has the potential to make analyses do awfully strange
things.
As this is a social science example, let's cast the same scenario
into a hypothetical exercise science study:

Say we have a new supplement which is designed to decrease
post-exercise pain. N=80 participants firstly take either our
supplement or a matched placebo, then all perform a
high-volume high-intensity deadlift program, doing sets of 85% 1RM
to concentric failure, and then 80%, 75%, etc. until total


concentric failure with 50% 1RM is reached. They then rate their
lower back and hamstring pain 48 hours after exercise. More or
less everyone writes 10 "I am in the maximum amount of
imaginable exercise discomfort", mainly because this is an
insane protocol which shouldn't be attempted. But a few people
in our supplement group write 8 "I am in a very, very large
amount of pain".
Now, can we accept that this is a meaningful difference? Well,
with hundreds and hundreds of participants, maybe. But it is far
more likely to be semantic: we hurt our participants very badly
and seem to only be fiddling at the margins of the value of
interest. What we were looking for was the absence of pain, and
not the presence of very slightly less.
Our domestic violence questionnaire is the other way around:
statistical significance or not, the change from, say, "extremely
unlikely" to "quite unlikely" may not be particularly useful at
telling us about actual aggressive tendencies.
Culpability: high
Significance: low to medium
Detectability: high
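A quick way to see how tight the floor effect in the Intimate Partner Violence example is: on a scale that cannot go below 1, a group mean of 1.13 caps how many people can be anywhere above the floor. The sketch below applies Markov's inequality to the distance above the floor; the function name is hypothetical, and the bound is a worst case rather than an estimate of the real distribution.

```python
def max_share_at_or_above(mean, floor, threshold):
    """Upper bound on the fraction of scores >= threshold, given the mean and the scale floor.

    Markov's inequality applied to (score - floor), which is never negative.
    """
    if threshold <= floor:
        raise ValueError("threshold must be above the floor")
    if mean < floor:
        raise ValueError("mean cannot be below the floor")
    return min(1.0, (mean - floor) / (threshold - floor))


if __name__ == "__main__":
    # Mean of 1.13 on a 1-to-5 scale, as reported for the whole group.
    print(max_share_at_or_above(1.13, floor=1, threshold=2))    # share averaging 2 or more
    print(max_share_at_or_above(1.13, floor=1, threshold=1.5))  # share averaging 1.5 or more
```

With a mean of 1.13, no more than about 13% of participants can have averaged 2 or higher, so any group difference is carried by a small minority of respondents.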

Bonus: Making up data

I have to include this although it isn't really a manipulation in
the way other things are: it's fraud! Shang and Hasenberg15
investigated the effect of exercise training subsequent to
Roux-en-Y gastric bypass (i.e. stomach surgery). N=60 morbidly
obese participants were randomised to receive either once or
twice-weekly exercise training. Significantly more body weight
and fat mass was lost in the multiple-exercise group, who also
showed significant improvement in co-morbidities.
The problem here is that none of this actually happened.
Someone from either the hospital or associated research group
noticed that in the location where the data was reported from
only n=21 patients had actually undergone any procedure at all
in the period the paper was written over: the data, as it stood,
couldn't exist! On questioning, Dr. Shang couldn't produce any
of the raw data and had no answer for where it had come from.
Naturally, this paper is retracted.
Culpability: very high
Significance: very high
Detectability: very low

Conclusions:

Please keep in mind firstly that researchers aren't science-robots
from an alternate dimension, they're people. They're people
with children and mortgages, and research programs which have
to work out so they can continue to be funded, in highly
competitive jobs, often competing against people who are
willing to bend publication requirements to look better.
Research isn't by any means a hotbed of fraud and deceit.

That being said, researchers even from famous and venerable
institutions can also be stunningly ignorant of the sub-structure
of the research methodology they need to understand, can make
basic mistakes in analysis, can deceive themselves, and can
cheat, manipulate or defraud the process of producing scientific
knowledge.
The thing that we have in our favour in trying to ascertain the
presence of the above is that science is the pursuit of knowledge
on the public record. Anything that's fiddled, or dishonest, or
under-handed, or incorrect, can only ever be hidden in plain
sight, and in general the ideas that everyone agrees are the most
important receive the most scrutiny. This might sound laudable,
but it is anything but straightforward. Progress lurches along
quite slowly. There are a few things that you, the interested
reader (or perhaps peer-reviewer) can do to help, and to satisfy
your own curiosity.

1. Contact the researchers. Ask for data.
Researchers, in general, like to talk about their work. Generally
the person who is on the paper as the "corresponding author" is
the right person to ask about it. However, be aware when the last
author in the list of authors is listed as corresponding: this
generally means the most senior person on the project is also the
person you're contacting, who is also often the busiest.
(In situations like this, I generally Google the first author and
ask them if they can help...)
Researchers can be notoriously precious about sending their data
to other people. This isn't just because they're afraid of scrutiny
or persecution (they often are). It's also because data files can be
a complete mess after the completion of a study, in three
different files (with different versions) only comprehensible to a
co-author, and squirreled away on a university server with a
password known only to the research assistant who quit 9
months ago. What you're asking could represent a big
investment of time on the part of the researchers. But you can
always ask.

2. Support efforts to put data in the public domain
This is a big component of what's called "open science": the
trend towards publishing datasets with experiments, as well as
analytical tools etc. that are used. Remember that people who do
this are extending what until now has been a privilege, which is
the ability to look under the hood of how a study works. I feel
strongly that researchers who publish data earn an extra degree
of trust.

3. Post on PubPeer or PubMed Commons
These are both websites where you can leave comments for the
public record on published research. If you want answers for
questions that you have, they are very useful. To get access, I
believe you need either an academic email address (i.e. one from
a tertiary institution) or an invitation from an existing user.

4. Start a conversation
A few years ago, I was very amused when Alan was arguing
with Dr. Robert Lustig of "sugar is evil" fame, and was told


rather huffily that academics do not have head-to-head
confrontations on blogs, social media, forums, etc. I was amused
because they damn well do, all the time, and at great volume.
There are plenty of outlets for legitimate questions about
research which aren't the old, formal methods: if you know
someone with a public blog, ask them to start a conversation for
you. Or start one yourself. Invite the researchers to comment.
Remember with all of the above to be courteous and show
interest, rather than trying to storm the ramparts. Everyone is
looking for answers, but some are looking better than others.
____________________________________________________
James is just about to finish a
PhD in cardiac electrophysiology.
In his spare time, he breaks
things for money. Everything else
you need to know is here:
jamesheathers.com

____________________________________________________
References:
1. Murphy, M., K. Eliot, et al. (2012). "Whole beetroot
consumption acutely improves running performance." J Acad
Nutr Diet 112(4): 548-552. [PubMed]
2. http://en.wikipedia.org/wiki/Bonferroni_correction
3. Tucker, R., M. I. Lambert, et al. (2006). "An analysis of
pacing strategies during men's world-record performances in
track athletics." Int J Sports Physiol Perform 1(3): 233-245.
[PubMed]
4. Christian, J. G., T. E. Byers, et al. (2011). "A computer
support program that helps clinicians provide patients with
metabolic syndrome tailored counseling to promote weight
loss." J Am Diet Assoc 111(1): 75-83. [PubMed]
5. Angus, S. D. (2014). "Did recent world record marathon
runners employ optimal pacing strategies?" J Sports Sci
32(1): 31-45. [PubMed]
6. Simmons, J. P., L. D. Nelson, et al. (2011). "False-positive
psychology: undisclosed flexibility in data collection and
analysis allows presenting anything as significant." Psychol
Sci 22(11): 1359-1366. [PubMed]
7. Scharinger, C., U. Rabl, et al. (2014). "Platelet serotonin
transporter function predicts default-mode network activity."
PLoS One 9(3): e92543. [PubMed]
8. And then someone who doesn't even understand the
journalism uses it in an argument on the internet!
9. Reger, M. A., S. T. Henderson, et al. (2004). "Effects of
beta-hydroxybutyrate on cognition in memory-impaired adults."
Neurobiol Aging 25(3): 311-314. [PubMed]
10. http://en.wikipedia.org/wiki/Grubbs'_test_for_outliers
11. In statistical terminology, this is only an outlier on the x-axis
and it's in the right place, so technically it's a point of
leverage, not an outlier.
12. Kogan, A., J. Gruber, et al. (2013). "Too much of a good
thing? Cardiac vagal tone's nonlinear relationship with
well-being." Emotion 13(4): 599-604. [PubMed]
13. Arai, K., Y. Nakagawa, et al. (2013). "Relationships between
QT interval and heart rate variability at rest and the covariates
in healthy young adults." Auton Neurosci 173(1-2): 53-57.
[AN/BC]
14. DeWall, C.N., O. Gillath, et al. (2014). "When the Love
Hormone Leads to Violence: Oxytocin Increases Intimate
Partner Violence Inclinations Among High Trait Aggressive
People." Soc Psych Pers Sci, Published online Feb 12th.
[SPPS]
15. Shang, E. and T. Hasenberg (2010). "Aerobic endurance
training improves weight loss, body composition, and
co-morbidities in patients after laparoscopic Roux-en-Y gastric
bypass." Surg Obes Relat Dis 6(3): 260-266. [PubMed]


Changes in exercises are more effective than in
loading schemes to improve muscle strength [reviewed
by Brad Schoenfeld, PhD, CSCS, CSPS, FNSCA].

Fonseca RM, Roschel H, Tricoli V, de Souza EO, Wilson JM,
Laurentino GC, Aihara AY, de Souza Leo AR, Ugrinowitsch C.
J Strength Cond Res. 2014 May 14. [Epub ahead of print]
[PubMed]
____________________________________________________

BACKGROUND/PURPOSE: This study investigated the
effects of varying strength exercises and/or loading scheme on
muscle cross-sectional area (CSA) and maximum strength after
four strength training loading schemes: constant intensity and
constant exercise (CICE), constant intensity and varied exercise
(CIVE), varied intensity and constant exercise (VICE), varied
intensity and varied exercise (VIVE). METHODS: Forty-nine
individuals were allocated into five groups: CICE, CIVE, VICE,
VIVE, and control group (C). Experimental groups underwent a
twice a week training for 12 weeks. Squat 1RM was assessed at
baseline and after the training period. Whole quadriceps muscle
and its heads' CSA were also obtained pre- and post-training.
RESULTS: The whole quadriceps CSA increased significantly
(p<0.05) in all of the experimental groups from pre- to post-test
in both the right and left legs: CICE: 11.6% and 12.0%; CIVE:
11.6% and 12.2%; VICE: 9.5% and 9.3% and VIVE: 9.9% and
11.6%, respectively. The CIVE and VIVE groups presented
hypertrophy in all of the quadriceps muscle heads (p<0.05),
while the CICE and VICE groups did not present hypertrophy in
the vastus medialis and rectus femoris (RF), and in the RF
muscles, respectively (p>0.05). The CIVE group had greater
strength increments than the other training groups (Effect size
confidence limit of the difference, ESCLdiff: CICE: 1.41 - 1.56;
VICE: 2.13 - 2.28; VIVE: 0.59 - 0.75). CONCLUSIONS: Our
findings suggest: a) CIVE is more efficient to produce strength
gains for physically active individuals; b) as long as the training
intensity reaches an alleged threshold, muscle hypertrophy is
similar regardless of the training intensity and exercise variation.
SPONSORSHIP: None listed.
____________________________________________________

This study was designed to assess the effect of varying intensity
(i.e. loading at a percent of 1RM) and/or exercise selection on
muscle strength and hypertrophy. A total of 49 untrained male
subjects completed the study. Subjects were divided into 4
different training groups with each group comprised of 8-12
subjects: a constant intensity/constant exercise group; a constant
intensity/varied exercise group; a varied intensity/constant
exercise group; and a varied intensity/varied exercise group.
The group that varied intensity performed between 6-10 reps per
set over the course of the study whereas the group that held
intensity constant performed 8 reps per set. With respect to
exercise selection, the varied exercise group performed a
combination of the squat, leg press, deadlift and lunge whereas
the constant exercise group performed only the squat. The
resistance training program was carried out twice a week for 12
weeks. Training volume (i.e. reps x sets) was equated between
groups. There also was a control group that performed no
resistance exercise. Maximal strength was assessed by the Smith
machine squat; hypertrophy was assessed by MRI.

The study produced some interesting findings. All of the groups
showed significant increases in both strength and hypertrophy
from baseline to study's end. That said, there were important
differences between groups in muscular adaptations, some of
them expected while others rather surprising.

With respect to hypertrophy, no differences were seen in the
overall increase in muscle cross sectional area of the quads
between groups. However, the groups that varied their exercises
displayed much greater uniformity of hypertrophy between the
various quadriceps muscles. Specifically, the groups that
performed only the squat exercises failed to significantly
increase muscle growth in the vastus medialis and/or rectus
femoris. This finding provides compelling evidence that varying
exercise selection is essential to achieve complete muscular
development. It is well documented that different exercises
provide the ability to target different aspects of a given muscle.
EMG studies show alterations in activation between exercises,
and these differences have been shown to correlate with muscle
growth. Changes in training angle, degrees of freedom,
length-tension, and other factors contribute to the differences in
site-specific hypertrophy.
Assessment of maximal strength showed that the groups that
varied exercises demonstrated greater gains in 1RM squat
compared to the group that held exercise selection constant. On
the surface this seems counterintuitive as the constant exercise
group performed only the squat over the course of the training
program. Given the principle of specificity, it could be
speculated that the best way to enhance maximal squat strength
would be to squat as much as possible. However, the results of
this study suggest that as long as squats are included as a regular
component of the exercise program over time (as they were
here) then substituting some of the sets with alternative
movements actually enhances results.
Although not directly studied, this would also seem to support
the concept that incorporating so-called "non-functional"
exercises such as machine-based movements into a functional
training routine would not have any detrimental effects on
functional outcomes and may very well enhance the response.
Intriguingly, the greatest increases in 1RM squat were achieved
by the group that varied exercises but kept the intensity constant.
The authors speculated the variation in rep range might have led
to an attenuation of neural drive in this subject population. This
hypothesis warrants further investigation.

As is the case with a majority of resistance training studies, the
primary limitation of the study was that it was carried out using
untrained subjects. The early response to lifting is different from
that once you have been training consistently for a fair amount
of time. This limits the generalizability of results. We cannot say


for sure that the same findings would be seen in a well-trained
population. Indeed, if the authors' hypothesis that changing the
rep range had a negative effect on neural drive is in fact correct,
it could alternatively be hypothesized that this detriment would
not occur in more experienced subjects since neural adaptations
would already be well-ingrained.
One issue that can be raised with the design is that the rep range
employed for the varied intensity groups (6-10 reps per set) was
fairly narrow. It would be difficult to imagine that changes in
muscle growth would have been significantly different using
such a narrow range over the course of a few months. What
would have been more interesting from a hypertrophy
standpoint, IMO, is if the rep range had encompassed a low
rep condition (i.e. 5 reps), moderate rep condition (10 reps) and
a high rep condition (15 reps). Based on the concept of the
strength-endurance continuum, comparing a constant intensity of
10 reps per set versus a varied intensity of 5-10-15 reps per set
would have made more sense to see if muscle hypertrophy
differs along this continuum.
Ultimately the study provides intriguing findings that have
practical implications for training. Most importantly, it
reinforces the need to vary exercise selection to maximize
muscular symmetry as well as strength. It also suggests that,
from a maximal strength standpoint, limiting variation in
intensity of load is beneficial during the early stages of training.
Ideally this study should be replicated, perhaps with wider
intervals in rep range, in well-trained subjects to provide better
generalizability for those with lifting experience.
____________________________________________________
Brad Schoenfeld, PhD, CSCS, CSPS, FNSCA, is a
lecturer in the exercise science department for
Lehman College and is the head of their
human performance laboratory. His primary
research interests focus on elucidating the
mechanisms of muscle hypertrophy and their
application to resistance training. He has
published over 40 peer-reviewed journal
articles and currently serves on the Board of
Directors for the NSCA. He is author of the
book, "The M.A.X. Muscle Plan" which is
available at all major bookstores and on
Amazon.com. He maintains an active blog on his website:
http://www.lookgreatnaked.com/


The effects of consuming a high protein diet (4.4
g/kg/d) on body composition in resistance-trained
individuals.
Antonio J, Peacock CA, Ellerbroek A, Fromhoff B, Silver T. J
Int Soc Sports Nutr. 2014 May 12;11:19. [PubMed] [Full Text]
BACKGROUND: The consumption of dietary protein is
important for resistance-trained individuals. It has been posited
that intakes of 1.4 to 2.0 g/kg/day are needed for physically
active individuals. Thus, the purpose of this investigation was to
determine the effects of a very high protein diet (4.4 g/kg/d) on
body composition in resistance-trained men and women.
METHODS: Thirty healthy resistance-trained individuals
participated in this study (mean ± SD; age: 24.1 ± 5.6 yr; height:
171.4 ± 8.8 cm; weight: 73.3 ± 11.5 kg). Subjects were randomly
assigned to one of the following groups: Control (CON) or high
protein (HP). The CON group was instructed to maintain the
same training and dietary habits over the course of the 8 week
study. The HP group was instructed to consume 4.4 grams of
protein per kg body weight daily. They were also instructed to
maintain the same training and dietary habits (e.g. maintain the
same fat and carbohydrate intake). Body composition (Bod
Pod), training volume (i.e. volume load), and food intake were
determined at baseline and over the 8 week treatment period.
RESULTS: The HP group consumed significantly more protein
and calories pre vs post (p < 0.05). Furthermore, the HP group
consumed significantly more protein and calories than the CON
(p < 0.05). The HP group consumed on average 307 ± 69 grams
of protein compared to 138 ± 42 in the CON. When expressed
per unit body weight, the HP group consumed 4.4 ± 0.8 g/kg/d
of protein versus 1.8 ± 0.4 g/kg/d in the CON. There were no
changes in training volume for either group. Moreover, there
were no significant changes over time or between groups for
body weight, fat mass, fat free mass, or percent body fat.
CONCLUSION: Consuming 5.5 times the recommended daily
allowance of protein has no effect on body composition in
resistance-trained individuals who otherwise maintain the same
training regimen. This is the first interventional study to
demonstrate that consuming a hypercaloric high protein diet
does not result in an increase in body fat. SPONSORSHIP: JA
is the CEO of the International Society of Sports Nutrition. The
protein powder was provided by MusclePharm and Adept
Nutrition (Europa Sports Products brand); both are sponsors of
the ISSN conferences.
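As a rough check on the abstract's figures, the sketch below recomputes the "5.5 times the recommended daily allowance" multiple and the protein energy gap between the two groups. It assumes an adult RDA of 0.8 g/kg/d and about 4 kcal per gram of protein, neither of which is stated in the abstract; the function and variable names are illustrative only.

```python
# Rough arithmetic check on the abstract's protein figures.
# ASSUMPTIONS (not stated in the abstract): adult protein RDA = 0.8 g/kg/d,
# and protein supplies about 4 kcal per gram.
RDA_G_PER_KG = 0.8
KCAL_PER_G_PROTEIN = 4.0


def rda_multiple(intake_g_per_kg: float) -> float:
    """Return protein intake expressed as a multiple of the assumed RDA."""
    return intake_g_per_kg / RDA_G_PER_KG


def protein_kcal(grams_per_day: float) -> float:
    """Return the energy content (kcal/day) of a daily protein intake."""
    return grams_per_day * KCAL_PER_G_PROTEIN


hp_multiple = rda_multiple(4.4)    # HP group, reported mean g/kg/d
con_multiple = rda_multiple(1.8)   # CON group, reported mean g/kg/d
extra_kcal = protein_kcal(307) - protein_kcal(138)  # reported mean g/day

print(f"HP: {hp_multiple:.2f}x RDA, CON: {con_multiple:.2f}x RDA")
print(f"Reported protein energy gap: {extra_kcal:.0f} kcal/day")
```

Under these assumptions the HP group's reported intake works out to 5.5 times the RDA, in line with the abstract's conclusion, and the reported intakes differ by roughly 676 kcal/day of protein.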
Study strengths
A big strength of this study is the underlying concept, and the
interesting question investigated. It's one of the fun studies that
pushes the "what if we tried this crazy idea" factor, examining a
highly experimental and exploratory protocol. And, it happened to
yield some intriguing results. Overfeeding studies have thus far
focused on carbohydrate and/or fat,1-7 with a glaring scarcity of
studies on protein overfeeding.8 Furthermore, the majority of
overfeeding trials are short, ranging from a few days to less than
a month, whereas this one ran for 8 weeks. Subjects were
resistance-trained, which minimizes the
"respond-strongly-to-anything" tendency of novices.
Study limitations
Air displacement plethysmography (ADP, or Bod Pod) was used
to assess body composition. A comprehensive review by Fields
et al states:9 "In conclusion, the BOD POD is a reliable and
valid technique that can quickly and safely evaluate body
composition in a wide range of subject types, including those
who are often difficult to measure, such as the elderly, children,
and obese individuals." However, it should be noted that the
majority of studies on the Bod Pod have compared it to
hydrostatic weighing. Ball and Altena10 compared Bod Pod to
dual X-ray absorptiometry (DXA) in a large sample of men
(n=160) and found that although the results from the two
methods were highly correlated, the difference increased as
bodyfat increased. Quoting their conclusion (which I feel is
hugely important):10 "Practitioners should be aware that even
with the use of technologically sophisticated methods (i.e., Bod
Pod, DXA), differences between methods exist and the
determination of body composition is at best, an estimation."
Another limitation is the questionable reliability of self-reported
dietary intake (and activity output). Research that immediately
comes to mind is Lichtman et al, who found that obese subjects
with a reported history of diet resistance under-reported food
intake by an average of 47%, and over-reported physical activity
by 51%.11 In the case of the present study, there was a massive
amount of protein assigned to the experimental group (4.4 g/kg
or 307 g/day). The investigators were aware of the inherent
difficulty in carrying this out, hence their purposely uneven
randomization: 20 subjects were assigned to the high-protein
(HP) group, and 10 subjects to the control group. It's not out of
the question that over-reporting occurred, since it's human
nature to avoid admitting failure to fully follow the program.
Aside from the limitations inherent with self-reported intake,
there was no objective measure of energy expenditure. An
attempt to control for training volume was made via daily
journaling. There thus was the reliance upon the accuracy of the
subjects' records, instead of an objective measure of energy
expenditure such as the doubly labeled water (DLW) technique.
The use of DLW has been called the "gold standard" of assessing
energy expenditure, particularly in non-confined conditions.12
However, it's rare to see DLW used in sports nutrition studies
(or most any type of research, for that matter). This is because
it's expensive and requires specifically trained personnel. Thus,
we're left with open questions about how the experimental
protein overfeeding affected non-exercise activity thermogenesis
(NEAT). One of the most memorable examples of DLW use
capturing the impressive extent of NEAT was in 1999 when
Levine et al13 found that the metabolic response to a 1000-kcal
surplus ranged from a 98 kcal decrease to a 692 kcal increase in
NEAT. The group's mean increase in NEAT was 336 kcal. The
authors' summation is worth quoting directly:13
"Thus, activation of NEAT can explain the variability of fat gain
with overeating. As humans overeat, those with effective
activation of NEAT can dissipate excess energy so that it is not
available for storage as fat, [...] The maximum increase in NEAT
that we detected (692 kcal/day, volunteer 5) could be
accounted for by an increased strolling-equivalent activity of 15
min/hour during waking hours."
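To put Levine et al's numbers in proportion, the following sketch expresses each NEAT change as a share of the 1000-kcal surplus, and converts the quoted "15 min/hour during waking hours" into an implied per-minute cost. The 16-hour waking day is an assumption of this example, not a figure from the paper, and the variable names are illustrative.

```python
# Express the NEAT changes reported by Levine et al as a share of the
# 1000 kcal/day surplus (figures as quoted in the text above).
SURPLUS_KCAL = 1000

neat_change_kcal = {"minimum": -98, "mean": 336, "maximum": 692}

share_of_surplus = {
    label: change / SURPLUS_KCAL for label, change in neat_change_kcal.items()
}

for label, share in share_of_surplus.items():
    print(f"{label:>7}: {neat_change_kcal[label]:+5d} kcal/day "
          f"= {share:+.1%} of the surplus")

# The quoted 692 kcal/day was likened to strolling 15 min of every waking hour.
# ASSUMPTION: a 16-hour waking day (not given in the quote).
WAKING_HOURS = 16
strolling_min_per_day = 15 * WAKING_HOURS
kcal_per_strolling_min = 692 / strolling_min_per_day

print(f"{strolling_min_per_day} min/day of strolling, "
      f"about {kcal_per_strolling_min:.1f} kcal per minute")
```

On these figures the average subject dissipated about a third of the surplus through NEAT, while the top responder dissipated more than two thirds.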

Comment/application
The most salient finding was the lack of significant change in
body composition in either group over the 8-week period.

Surprisingly, the HP group's body composition showed no
significant changes despite the assignment of an additional 800
kcal (in protein) above and beyond that assigned to the control
group. But, unlike Levine et al's overfeeding study, the present
study based overfeeding on protein exclusively. The HP group's
consumption of ~307 g protein versus the control group's ~138 g
without a doubt had a higher thermic effect. As reported by
Jéquier,14 the thermic effect of protein (expressed as a
percentage of energy content) is 25-30%, carbohydrate is 6-8%,
and fat is 2-3%. However, not all of the literature is in precise
agreement. Halton and Hu reported greater variability, with the
thermic effect of protein being 20-35%, carbohydrate at 5-15%,
and fat being subject to debate since some investigators found a
lower thermic effect than carbohydrate while others found no
difference.15 Despite relative variations in carbohydrate and fat,
protein has consistently shown a markedly higher thermic effect
than either of them. When the thermic effect of protein is
combined with a liberal presumption of NEAT, the majority of
the dissipated protein energy is accounted for. The remainder is
plausibly attributable to reporting error.

On a practical note, the following detail should be weighed into
consideration: "...every subject in the high protein group
consumed protein powder in order to meet the requirements for
the study. Otherwise, it would have been virtually impossible or
highly unlikely that one could consume a 4.4 g/kg/d via food
alone." Protein supplements (in this case, whey and casein
powders) only contain trace amounts of fat and carbohydrate.
Those who want to experiment with higher protein intakes
should keep in mind that inadvertent addition of fat and
carbohydrate along with the extra protein (i.e., via
mixed-macronutrient dishes and/or fatty meats) would not mimic
the protocol nor the effects seen in the present study.

In a recent study that made waves for being the first of its kind,
Bray et al16 compared the overfeeding effects of a low-protein
(5%), normal-protein (15%), and high-protein (25%) diet.
Carbohydrate was kept the same across the treatments, with fat
filling in the remainder. Among this study's design strengths
was the use of DLW to assess energy expenditure. A 40%
energy surplus (954 kcal) was imposed for 8 weeks. The
low-protein group lost lean mass, while the normal- and
high-protein groups gained lean mass, with the latter gaining
more by a small margin. All groups increased fat mass equally.
The low-protein group gained significantly less total bodyweight
than the higher-protein groups, but this was due to differences
in lean mass gain.

In the present study, no lean mass was gained despite an
increased protein intake in the HP group. This can be attributed
to the advanced resistance-trained status of the subjects (they
trained an average of 8.5 hours/week for the past 8.9 years), and
to their already high baseline protein intake (~1.9-2.3 g/kg). In
contrast, Bray et al's subjects were untrained, and their protein
intake at baseline was 1.2 g/kg, and this was raised to 1.8 g/kg in
the high-protein treatment, essentially crossing the threshold
from sub-optimal to optimal. Another point made by the authors
of the present study was that the subjects were instructed to
maintain their habitual training program, thus precluding any
novel or greater training stimulus that might elicit further gains.

Taking the results at face value, it almost seems surplus calories
don't count since NEAT will save you as long as the surplus is
from protein. However, it's worth reiterating that not all aspects
of this study were tightly controlled, and reporting error could
have played a significant confounding role in the results.

On the other hand, there is still the possibility that relatively
advanced, resistance-trained subjects have a heightened
capability of dissipating surplus energy from dietary protein
through involuntary, non-exercise means. This possibility also
holds potentially important implications for meal planning of
dieting individuals as well as those who are striving to maintain
weight loss, but also need to control appetite.
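The energy accounting described in this section can be laid out explicitly. The example below applies the thermic-effect ranges cited from Jéquier and from Halton and Hu to the ~800 kcal protein surplus, then adds the 336 kcal mean NEAT increase from Levine et al as the "liberal presumption." Borrowing that NEAT figure from a mixed-diet overfeeding study is an assumption made here for illustration, and all names are hypothetical.

```python
# Rough energy accounting for the ~800 kcal/day protein surplus.
# Inputs are figures quoted in the review; applying Levine et al's mean NEAT
# increase to this study is an ASSUMPTION made purely for illustration.
PROTEIN_SURPLUS_KCAL = 800
ASSUMED_NEAT_KCAL = 336  # mean NEAT increase in Levine et al (mixed overfeeding)

# Thermic effect of protein as a fraction of its energy content (low, high).
TEF_RANGES = {
    "Jequier": (0.25, 0.30),
    "Halton & Hu": (0.20, 0.35),
}


def unaccounted_kcal(tef_fraction: float) -> float:
    """Surplus left after subtracting thermic effect and assumed NEAT."""
    thermic_kcal = PROTEIN_SURPLUS_KCAL * tef_fraction
    return PROTEIN_SURPLUS_KCAL - thermic_kcal - ASSUMED_NEAT_KCAL


# For each source: (smallest remainder, largest remainder) in kcal/day.
remainder = {
    source: (unaccounted_kcal(high), unaccounted_kcal(low))
    for source, (low, high) in TEF_RANGES.items()
}

for source, (least, most) in remainder.items():
    print(f"{source}: {least:.0f} to {most:.0f} kcal/day unaccounted for")
```

Under these assumptions roughly 180 to 300 kcal/day of the surplus remains unexplained, which is the portion the review attributes to reporting error.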

An amino acid-electrolyte beverage may increase cellular
rehydration relative to carbohydrate-electrolyte and flavored
water beverages.
Tai CY, Joy JM, Falcone PH, Carson LR, Mosman MM,
Straight JL, Oury SL, Mendez C, Loveridge NJ, Kim MP,
Moon JR. Nutr J 2014, 13:47 doi:10.1186/1475-2891-13-47
[PubMed]
BACKGROUND: In cases of dehydration exceeding a 2% loss of
body weight, athletic performance can be significantly
compromised. Carbohydrate and/or electrolyte containing beverages
have been effective for rehydration and recovery of performance,
yet amino acid containing beverages remain unexamined. Therefore,
the purpose of this study is to compare the rehydration capabilities
of an electrolyte-carbohydrate (EC), electrolyte-branched chain
amino acid (EA), and flavored water (FW) beverages. METHODS:
Twenty men (n = 10; 26.7 +/- 4.8 years; 174.3 +/- 6.4 cm; 74.2 +/-
10.9 kg) and women (n = 10; 27.1 +/- 4.7 years; 175.3 +/- 7.9 cm;
71.0 +/- 6.5 kg) participated in this crossover study. For each trial,
subjects were dehydrated, provided one of three random beverages,
and monitored for the following three hours. Measurements were
collected prior to and immediately after dehydration and 4 hours
after dehydration (3 hours after rehydration) (AE = -2.5 +/- 0.55%;
CE = -2.2 +/- 0.43%; FW = -2.5 +/- 0.62%). Measurements
collected at each time point were urine volume, urine specific
gravity, drink volume, and fluid retention. RESULTS: No
significant differences (p > 0.05) existed between beverages for
urine volume, drink volume, or fluid retention for any time-point.
Treatment x time interactions existed for urine specific gravity
(USG) (p < 0.05). Post hoc analysis revealed differences occurred
between the FW and EA beverages (p = 0.003) and between the EC
and EA beverages (p = 0.007) at 4 hours after rehydration. Wherein,
EA USG returned to baseline at 4 hours post-dehydration (mean
difference from pre to 4 hours post-dehydration = -0.0002; p > 0.05)
while both EC (-0.0067) and FW (-0.0051) continued to produce
dilute urine and failed to return to baseline at the same time-point (p
< 0.05). CONCLUSION: Because no differences existed for fluid
retention, urine or drink volume at any time point, yet USG returned
to baseline during the EA trial, an EA supplement may enhance
cellular rehydration rate compared to an EC or FW beverage in
healthy men and women after acute dehydration of around 2% body
mass loss. SPONSORSHIP: MusclePharm Corporation.

Study strengths
This study is innovative since it's the first to compare the
hydrating effects of a BCAA-electrolyte (AE) beverage with that
of a carbohydrate-electrolyte (CE) beverage. Furthermore, the
protocol involved a more realistic fluid dose than the typically
massive fluid doses given in previous research examining
rehydration beverages. Subjects were required to have a
minimum of one year of endurance and resistance training
experience, which minimized the chance of confounding
"newbie effects." This investigation is of relevance to trainees
aiming to economize caloric intake, which is often hiked by the
carbohydrate content of conventional recovery beverages.

Study limitations
While this study may have relevance to those seeking to
economize carbohydrate intake, such a population would be very
sparse among trainees seeking to improve endurance-type
performance. This is because the inclusion of carbohydrate
would serve the dual purpose of driving better exercise
performance, as well as faster glycogen resynthesis post-exercise
(both of which can benefit competitive endurance sports,
especially those with multiple glycogen-depleting events per
day). Missing from the comparison was a condition containing
amino acids, carbohydrate, and electrolytes. However, the
authors duly cite research by Lambert et al,17 who found that in
the absence of electrolytes, no significant differences in
rehydration were seen between beverages containing versus
omitting carbohydrate. This implicates electrolytes as the critical
factor in rehydration (rather than carbohydrate, whose function
would be limited to glycogen resynthesis). Still, potentially
interactive or synergistic effects of a combination of carbs,
electrolytes, and amino acids on hydration would have been a
worthy condition to investigate in the present study's
comparison. For example, chocolate milk has demonstrated
effectiveness for rehydration, glycogen resynthesis, and muscle
recovery, and is more nutrient-dense than typical commercial
recovery drinks.18 A final limitation was that the treatments were
not equal in terms of potassium content (AE had the most).

Comment/application
The findings only partially agreed with the authors' hypothesis
going into the experiment. They originally predicted that AE and
CE beverages would rehydrate similarly, yet to a greater extent
than the flavored water (FW) beverage. Interestingly, quoting
the authors: "The AE and CE beverages rehydrated about
equally; however, they were also equal to the FW beverage."
However, they go on to mention a subtle detail that separated the
AE beverage from the other two, rendering it superior. CE and
FW yielded more diluted urine than AE, as indicated by urine
specific gravity (USG).

At 4 hours post-dehydration, USG in the CE and FW trials was
significantly lower than pre-testing, while AE's USG was the
same as pre-testing at this time point. This suggests greater
urinary diuresis and less cellular retention in CE & FW
compared to AE. It should be noted that the measured
differences in fluid retention between conditions were not
statistically significant (43.5% in AE, 40.8% in CE, and 42.2%
in FW). This raises the question of how clinically relevant these
small differences really are, and how necessary or beneficial this
special rehydration product is despite its inclusion of BCAA
and absence of carbohydrate.

Calorie shifting diet versus calorie restriction diet: a
comparative clinical trial study.
Davoodi SH, Ajami M, Ayatollahi SA, Dowlatshahi K, Javedan
G, Pazoki-Toroudi HR. Int J Prev Med. 2014 Apr;5(4):447-56.
[PubMed]
BACKGROUND: Finding new tolerable methods in weight loss
has largely been an issue of interest for specialists. Present study
compared a novel method of calorie shifting diet (CSD) with classic
calorie restriction (CR) on weight loss in overweight and obese
subjects. METHODS: Seventy-four subjects (body mass index
≥25; ≤37) were randomized to 4 weeks control diet, 6 weeks CSD or
CR diets, and 4 weeks follow-up period. CSD consisted of three
phases each lasts for 2 weeks, 11 days calorie restriction which
included four meals every day, and 4 h fasting between meals
follow with 3 days self-selecting diet. CR subjects receive
determined low calorie diet. Anthropometric and metabolic
measures were assessed at different time points in the study.
RESULTS: Four weeks after treatment, significant weight, and fat
loss started (6.02 and 5.15 kg) and continued for 1 month of
follow-up (5.24 and 4.3 kg), which was correlated to the restricted
energy intake (P < 0.05). During three CSD phases, resting
metabolic rate tended to remain unchanged. The decrease in plasma
glucose, total cholesterol, and triacylglycerol were greater among
subjects on the CSD diet (P < 0.05). Feeling of hunger decreased
and satisfaction increased among those on the CSD diet after 4
weeks (P < 0.05). CONCLUSIONS: The CSD diet was associated with
a greater improvement in some anthropometric measures. Adherence
was better among CSD subjects. Longer and larger studies are
required to determine the long-term safety and efficacy of CSD diet.
SPONSORSHIP: None listed.

Study strengths
This is the first study to compare the effects of this particular
permutation of a calorie shifting diet (CSD: 11 days restricted, 3
days unrestricted) pattern with a linear calorie-restricted (CR)
diet. The investigation is an important one given the generally
unimpressive weight loss and weight loss maintenance from
conventional caloric restriction.19-21 The sample size (n=74) was
fairly large, especially for diet research, which is notorious for
its small subject numbers. More subjects translate to greater
statistical power and less likelihood of by-chance occurrences.

Study limitations
The design included an intervention period as well as a
follow-up period, which is a good thing; it's just that both
periods were short (6-week intervention, 4-week follow-up). This
essentially gives us hypothesis-generating pilot data rather than
long-term data that we can lean on with greater confidence.
Another limitation is the diet construction; these are your
typical, crappy research diets. Protein intake during the
intervention phase was actually less than the subjects' habitual
intakes at baseline in both groups. The deficits were rather
severe, but strangely, they were not equated. CSD's reduction was
set at 45% of baseline reported maintenance, and CR's was set at
55% of maintenance. The more severe deficit in CSD may have
imparted an advantage. Furthermore, the results of this study
might be limited to the subject profile (obese, untrained).
Finally, bioelectrical impedance analysis (BIA) was used to
assess body composition.

Comment/application
CSD outperformed CR on several parameters:

• CSD yielded greater weight loss at the end of the follow-up
period but not the intervention period (full body
composition change details here).
• CSD yielded greater fat loss at the end of the follow-up
period but not at the end of the intervention period.
• CSD's decrease in RMR was less than that of CR (which
ended up being lower than baseline by the end of the
study).
• CSD yielded greater decreases in glucose, total cholesterol,
and triacylglycerol by the end of the study.
• CSD tended to yield lesser subjective feelings of hunger
toward the end of the trial.
• CSD had a much higher subject retention rate; 36.8%
dropped out of CR, and 15.6% dropped out of CSD.

Overall, CSD trumped CR, especially by the end of the 4-week
follow-up period. However, it's very important to view these
results in the proper perspective. It can't be overemphasized that
this was a short intervention (6 weeks plus a 4-week follow-up).
I'll also reiterate that the diets imposed upon both groups were
far from optimal in terms of protein intake. Baseline protein
intake of CSD was ~1.1 g/kg, and this dropped to ~0.9 g/kg
during the intervention. Baseline protein intake in CR was ~1.1
g/kg, and this dropped to ~0.8 g/kg during the intervention.
These protein intakes are approximately half of what has
repeatedly been shown to be a favorable and effective target for
optimizing muscular adaptations to hypocaloric conditions.22-24
Nevertheless, perhaps a case can be made for CSD over CR
under conditions of subpar protein intake.

Still, the biological plausibility of there being inherent
advantages to the CSD pattern is questionable beyond its
potential to bolster compliance, at least in the short-term. CSD
had more rules and structuring, particularly during the 11-day
cycles where 4 meals were strictly spaced 4 hours apart with no
between-eating allowed. This may have raised subjects'
awareness and focus on the protocol, keeping them more
compliant. In contrast, the linear calorie-restricted group could
have been lulled into a monotonous grind conducive to the
loosening of adherence over time. Meanwhile, the CSD group
essentially took a 3-day diet break (consuming
maintenance-level calories) after every 11-day block of dieting.

The weight and fat loss benefits of CSD were not clearly
apparent until the end of the follow-up period. It's thus easy to
speculate that the 6 weeks of linear, aggressive caloric restriction
may have been met with deprivation backlash during the
follow-up period where the objective was to consume
maintenance-level calories. Remember that in CR, 55% of baseline
intake was subtracted, leaving subjects with 6 weeks of consuming
1186 kcal/day (down from 2432 kcal at baseline). An important
indicator of the CSD's effectiveness was the more than twofold
higher dropout rate in CR. The more favorable biochemical
changes in CSD can be attributed primarily to the greater weight
and fat loss by the end of the follow-up. The relative success of
the 11/3 CSD model raises the possibility that other more
convenient and realistic non-linear models could also be
effective. For example, a 5/2 model, with 5 calorie-restricted
days followed by 2 self-selected days, would mirror a
weekday/weekend cycle which could potentially fit better into the
common work schedule.
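The 11/3 pattern, and the hypothetical 5/2 variant floated above, can be compared with a small scheduling sketch. It lays out restricted and self-selected days over the 6-week intervention and counts each type. The function name and day labels are illustrative, not taken from the study.

```python
from itertools import cycle, islice
from typing import List


def shifting_schedule(restricted_days: int, free_days: int, total_days: int) -> List[str]:
    """Repeat a block of restricted days followed by self-selected days."""
    one_cycle = ["restricted"] * restricted_days + ["self-selected"] * free_days
    return list(islice(cycle(one_cycle), total_days))


INTERVENTION_DAYS = 6 * 7  # 6-week intervention, as in the study

for restricted, free in [(11, 3), (5, 2)]:
    plan = shifting_schedule(restricted, free, INTERVENTION_DAYS)
    n_restricted = plan.count("restricted")
    print(f"{restricted}/{free}: {n_restricted} restricted days, "
          f"{len(plan) - n_restricted} self-selected days "
          f"({n_restricted / len(plan):.0%} restricted)")
```

Over six weeks the 5/2 pattern gives 30 restricted days versus 33 for the 11/3 pattern, so the two differ more in rhythm than in how often restriction applies. Whether a 5/2 pattern would reproduce the study's results is untested.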

1. Lecoultre V, Egli L, Carrel G, Theytaz F, Kreis R, Schneiter
P, Boss A, Zwygart K, Lê KA, Bortolotti M, Boesch C,
Tappy L. Effects of fructose and glucose overfeeding on
hepatic insulin sensitivity and intrahepatic lipids in healthy
humans. Obesity (Silver Spring). 2013 Apr;21(4):782-5.
[PubMed]
2. Sobrecases H, Lê KA, Bortolotti M, Schneiter P, Ith M,
Kreis R, Boesch C, Tappy L. Effects of short-term
overfeeding with fructose, fat and fructose plus fat on
plasma and hepatic lipids in healthy men. Diabetes Metab.
2010 Jun;36(3):244-6. [PubMed]
3. Ngo Sock ET, Lê KA, Ith M, Kreis R, Boesch C, Tappy L.
Effects of a short-term overfeeding with fructose or glucose
in healthy young males. Br J Nutr. 2010 Apr;103(7):939-43.
[PubMed]
4. Claesson AL, Holm G, Ernersson A, Lindström T, Nystrom
FH. Two weeks of overfeeding with candy, but not peanuts,
increases insulin levels and body weight. Scand J Clin Lab
Invest. 2009;69(5):598-605. [PubMed]
5. Lê KA, Faeh D, Stettler R, Ith M, Kreis R, Vermathen P,
Boesch C, Ravussin E, Tappy L. A 4-wk high-fructose diet
alters lipid metabolism without affecting insulin sensitivity
or ectopic lipids in healthy humans. Am J Clin Nutr. 2006
Dec;84(6):1374-9. [PubMed]
6. McDevitt RM, Bott SJ, Harding M, Coward WA, Bluck LJ,
Prentice AM. De novo lipogenesis during controlled
overfeeding with sucrose or glucose in lean and obese
women. Am J Clin Nutr. 2001 Dec;74(6):737-46. [PubMed]
7. Lammert O, Grunnet N, Faber P, Bjørnsbo KS, Dich J,
Larsen LO, Neese RA, Hellerstein MK, Quistorff B. Effects
of isoenergetic overfeeding of either carbohydrate or fat in
young men. Br J Nutr. 2000 Aug;84(2):233-45. [PubMed]
8. Bray GA, Smith SR, de Jonge L, Xie H, Rood J, Martin CK,
Most M, Brock C, Mancuso S, Redman LM. Effect of
dietary protein content on weight gain, energy expenditure,
and body composition during overeating: a randomized
controlled trial. JAMA. 2012 Jan 4;307(1):47-55. [PubMed]
9. Fields DA, Goran MI, McCrory MA. Body-composition
assessment via air-displacement plethysmography in adults
and children: a review. Am J Clin Nutr. 2002
Mar;75(3):453-67. [PubMed]
10. Ball SD, Altena TS. Comparison of the Bod Pod and dual
energy x-ray absorptiometry in men. Physiol Meas. 2004
Jun;25(3):671-8. [PubMed]
11. Lichtman SW, Pisarska K, Berman ER, Pestone M,
Dowling H, Offenbacher E, Weisel H, Heshka S, Matthews
DE, Heymsfield SB. Discrepancy between self-reported and
actual caloric intake and exercise in obese subjects. N Engl J
Med. 1992 Dec 31;327(27):1893-8. [PubMed]
12. Pinheiro Volp AC, Esteves de Oliveira FC, Duarte Moreira
Alves R, Esteves EA, Bressan J. Energy expenditure:
components and evaluation methods. Nutr Hosp. 2011
May-Jun;26(3):430-40. [PubMed]
13. Levine JA, et al. Role of nonexercise activity thermogenesis
in resistance to fat gain in humans. Science. 1999 Jan
8;283(5399):212-4. [PubMed]
14. Jéquier E. Pathways to obesity. Int J Obes Relat Metab
Disord. 2002 Sep;26 Suppl 2:S12-7. [PubMed]

15. Halton TL, Hu FB. The effects of high protein diets on
thermogenesis, satiety and weight loss: a critical review. J
Am Coll Nutr. 2004 Oct;23(5):373-85. [PubMed]
16. Bray GA, Smith SR, de Jonge L, Xie H, Rood J, Martin CK,
Most M, Brock C, Mancuso S, Redman LM. Effect of
dietary protein content on weight gain, energy expenditure,
and body composition during overeating: a randomized
controlled trial. JAMA. 2012 Jan 4;307(1):47-55. [PubMed]
17. Lambert CP, Costill DL, McConell GK, Benedict MA,
Lambert GP, Robergs RA, Fink WJ. Fluid replacement after
dehydration: influence of beverage carbonation and
carbohydrate content. Int J Sports Med. 1992
May;13(4):285-92. [PubMed]
18. Pritchett K, Pritchett R. Chocolate milk: a post-exercise
recovery beverage for endurance sports. Med Sport Sci.
2012;59:127-34. [PubMed]
19. Rosenbaum M, Hirsch J, Gallagher DA, Leibel RL. Long-term
persistence of adaptive thermogenesis in subjects who
have maintained a reduced body weight. Am J Clin Nutr.
2008 Oct;88(4):906-12. [PubMed]
20. Redman LM, Heilbronn LK, Martin CK, de Jonge L,
Williamson DA, Delany JP, Ravussin E; Pennington
CALERIE Team. Metabolic and behavioral compensations
in response to caloric restriction: implications for the
maintenance of weight loss. PLoS One. 2009;4(2):e4377.
[PubMed]
21. Camps SG, Verhoef SP, Westerterp KR. Weight loss,
weight maintenance, and adaptive thermogenesis. Am J Clin
Nutr. 2013 May;97(5):990-4. [Epub ahead of print]
[PubMed]
22. Layman DK, Evans EM, Erickson D, Seyler J, Weber J,
Bagshaw D, Griel A, Psota T, Kris-Etherton P. A
moderate-protein diet produces sustained weight loss and long-term
changes in body composition and blood lipids in obese
adults. J Nutr. 2009 Mar;139(3):514-21. [PubMed]
23. Layman DK, Evans E, Baum JI, Seyler J, Erickson DJ,
Boileau RA. Dietary protein and exercise have additive
effects on body composition during weight loss in adult
women. J Nutr. 2005 Aug;135(8):1903-10. [PubMed]
24. Layman DK, Boileau RA, Erickson DJ, Painter JE, Shiue H,
Sather C, Christou DD. A reduced ratio of dietary
carbohydrate to protein improves body composition and
blood lipid profiles during weight loss in adult women. J
Nutr. 2003 Feb;133(2):411-7. [PubMed]


Processed foods - are they really that bad for you?


By Chris & Eric Martinez
____________________________________________________
Introduction
The term "processed foods" carries a strong negative
connotation similar to the term "cheat meals" or "dirty foods."
Much of this slander doesn't have scientific evidence to back it
up. It's usually spawned from hunches and hearsay. These
oversimplifications seem to be self-perpetuating since people
rarely dig further into the details, definitions, and wide-ranging
applications of food processing. To process food means to use a
series of mechanical or chemical operations to change or
preserve it. A review by Floros et al1 described processing as one
or more of a range of operations, including:

Washing
Grinding
Mixing
Cooling
Storing
Heating
Freezing
Filtering
Fermenting
Extracting
Extruding
Frying
Drying
Concentrating
Irradiating
Microwaving
Packaging

There are a multitude of foods in the diet like bread, cheese,
beer, and wine that are highly processed, yet aren't looked at as
"processed foods" by consumers. By the way, it's true that in
some cases processed foods create problems, specifically in
cases of abusing and overconsuming highly refined food
products while neglecting whole and minimally refined foods.
However, food technology has also contributed significantly to
the improvement of human health and specifically the ability to
optimize nutrition for populations such as:

Infants
Pregnant mothers
People with food allergies
Athletes
The elderly

Filling in the critical gaps


Many populations have nutrient deficiencies, and if it weren't
for food technology these populations would be at high risk of
disease and possibly even death. Micronutrient shortcomings are
highly variable, and are strongly population-specific. As an
example on one end of the broad continuum, iodine deficiency is
the world's most prevalent, yet most easily preventable cause of
brain damage. This deficiency is most prevalent in developing
countries, especially those with iodine-deficient soil in Africa
and Asia. The solution to this problem, iodized salt, is
described by the World Health Organization as follows:2 "A
spectacularly simple, universally effective, wildly attractive
and incredibly cheap technical weapon - that's iodized salt!"
Similarly, vitamin A deficiency is a serious public health crisis
in Africa and South-East Asia, particularly in young children and
pregnant women in low-income countries.
One micronutrient deficiency that seems to cross all
socioeconomic and geographic boundaries is iron deficiency. A
little-known fact is that iron deficiency is the most common and
widespread micronutrient deficiency in the world,3 and is
exacerbated by infectious diseases such as malaria, HIV/AIDS,
hookworm infestation, and tuberculosis. Iron deficiency is the
most common nutrient deficiency and leading cause of anemia in
the United States.4 Those at highest risk are young children
between 3-6 months of age, adolescent girls and women who are
young enough to be menstruating, and pregnant women. Iron
deficiency is caused by either increased iron needs that are not
met, decreased intake or absorption, or a combination of the
aforementioned.
In contrast to those at-risk populations, it's easy to see how iron
deficiency anemia is not likely to be common among men who
consume copious amounts of meat, along with a reasonable
variety of foods across the food groups. While deficiencies in
iron, iodine, vitamin A, and zinc are the most common
deficiencies in developing countries,5 affluent and athletic
populations have a tendency to create their own unique set of
problems and special needs. Food technology has played a
significant role in the enhancement of athletic performance
through sports beverages and easy-to-digest foods. Let's take
NFL players for example: when training camp starts, they usually
perform two-a-days. Do you really think after each training
session they can gorge down a chicken breast and sweet potato
and then go back at it for another session? The early stages of
mechanical digestion are not conducive to performance.
Did we forget to mention elite professional-level bodybuilders in
the off-season who have high caloric intakes? This population
needs food technology in order to reach their high caloric intakes
and higher-than-average macronutrient allotments. For example,
you simply can't eat 500 g of carbs worth of potatoes, oats,
and veggies without experiencing gastrointestinal issues. As for
those who are averse to preparing or consuming animal flesh,
this is where protein shakes come in handy.
Food technology pros and cons
A recent review by van Boekel et al lists the most important
benefits of food processing:6
Food safety (pathogens): The main benefit of food
processing is inactivation of food-borne pathogens, as is
normally required by food safety legislation.
Food safety (other aspects): Inactivation of natural toxins
and enzymes, prolongation of shelf life.
[Back to Contents]

Page 19

Nutritional value: Improved digestibility, bioavailability
of nutrients.
Sensory quality: Taste, texture, and flavor. Functional
health benefits such as probiotics, prebiotics, etc.
Maillard reaction products, flavonoids, other food
constituents and their reaction products.
Convenience: Availability of ready-to-eat and semi-prepared
foods such as microwavable frozen meals.
Cost: Economies of scale.
Diversity: Independence from the seasonal availability
of foods, and introduction of a global food supply chain.
Quality of life: Improved because less time is required for
food supply and preparation.
On the other hand, food processing can also damage food
quality, leading to undesired consequences, such as:
Losses of certain (essential) nutrients due to chemical
reactions (e.g. vitamin C, available lysine).
Formation of undesired compounds, e.g. acrylamide,
acrolein, chloropropanediols, heterocyclic amines, etc.
In some cases, formation of compounds that have a
negative effect on flavor perception (for instance,
sulphur compounds formed during heating of milk).
Loss of texture, discoloration, etc.
Moreover, processed foods can also lead to an increase in dietary
components that may need to be limited, such as salt, sugars, and
saturated fats. So, it's not all that simple: there are both pros
and cons to processed foods.
The fear of processed foods
Consumer research by the International Food Information
Council (IFIC) shows that 43% of consumers are concerned
about some aspects of processed foods.7 The many issues
currently being debated include views on the following:

Nutritional quality
Freshness
Safety
Origin (locally grown compared with grown elsewhere)
Healthfulness
Sustainability
Techniques used for raising them (organic compared
with non-organic and genetically modified organisms)
Perceived ethical aspects of production
One of the major issues with food processing is that commercial
food processing techniques are poorly understood by the general
public. These techniques are difficult for the public to
understand, not to mention out of their control. This leads
people to look at things in a vacuum, so to speak, with
black-and-white thinking, and generates suspicion and concerns
about safety. This is very similar to those who bash research
studies without even knowing how difficult it is to conduct a
study.
However, that's not to say that there's anything wrong with
trying to be as healthy as you possibly can and putting safety at
the forefront of your health. But there's something
fundamentally wrong when the preponderance of scientific data
is ignored. A recent large-scale systematic review of the research
by Nicolia et al8 failed to find any cause for alarm regarding
genetically engineered (GE) crops compared to the non-GE stuff.
To quote their conclusion:
"We have reviewed the scientific literature on GE crop safety
for the last 10 years that catches the scientific consensus
matured since GE plants became widely cultivated worldwide,
and we can conclude that the scientific research conducted so
far has not detected any significant hazard directly connected
with the use of GM crops."

This body of literature is controversial, but the trends lean
towards safety, and you have to consider that biotechnology is
necessary and life-saving; it really makes the world
turn. We would have innumerable diseases and deaths without
food technology and genetic modification to produce a wide
range of foods in a sustainable fashion. Also consider more
subtle but important advances in food technology such as protein
powders for athletes and liquid nutrition for clinical use such as
enteral and parenteral nutrition.
Conclusions
Let's not beat around the bush here: consumers' perception of
processed foods is rather negative, mainly due to the large
amount of attention given to the formation of undesired
compounds, but it's pretty clear from the data that food
processing also leads to the formation of compounds with
beneficial properties.
Rather than completely limiting processed foods in a nutrition
program, it may be more productive to encourage the best
available options, namely, those that provide fewer components
that need to be limited and more essential nutrients for the
calories consumed. If we become smart consumers and keep our
goals, body type, activity levels, metabolic variances,
and health in mind, we can certainly have a diet predominantly
containing whole and minimally refined foods along with
room for some processed foods.
This controversial topic of processed vs. unprocessed foods may
just come down to the consumer's goals. For example, if one is
trying to lose weight or diet down for a show, it would be a
better choice to choose whole and minimally refined foods instead
of processed, packaged foods. The reason is that the more packaged
foods there are in the diet, the greater the chance of error on
the manufacturer's side (labeling). We don't have any data to back
up this claim, but anecdotally we have seen this with clients. The
minute we have them remove low-carb products, bars, and other
diet-friendly packaged foods, they drop weight.
On the other hand, if someone isn't dieting, is in a hypercaloric
state, or is a high-level athlete with an extremely vigorous
activity level and high macronutrient allotment, they can
probably get away with having more processed foods in their
nutrition program. Sure, they still have to have nutrient-dense
foods to perform at a high level, but they have more wiggle room
to work with than dieting or relatively sedentary populations.
We will end with a quote from a recent review by Weaver et al:9
"We conclude that processed foods are nutritionally important
to American diets. They contribute to both food security
(ensuring that sufficient food is available) and nutrition security
(ensuring that food quality meets human nutrient needs)."

____________________________________________________
Chris and Eric Martinez, CISSN, CSCS, CPT, BA, also known as
the "Dynamic Duo," operate a world-class online training and
nutrition consulting business, "Dynamic Duo Training."
They're also fitness and nutrition writers, Diet Doc
permanent weight loss coaches, and exclusive Team K Peaking
Directors who love helping people reach their goals.

http://dynamicduotraining.com/

____________________________________________________
References
1. Floros et al. Feeding the world today and tomorrow: the
importance of food science and technology. IFT scientific
review. 2010 [IFT]
2. World Health Organization. Micronutrient deficiencies:
iodine deficiency disorders. [WHO]
3. World Health Organization. Micronutrient deficiencies: iron
deficiency anaemia. [WHO]
4. Centers for Disease Control and Prevention. Iron and iron
deficiency. Updated February 23, 2011. [CDC]
5. Müller O, Krawinkel M. Malnutrition and health in
developing countries. CMAJ. 2005 Aug 2;173(3):279-86.
[PubMed]
6. van Boekel M, Fogliano V, Pellegrini N, Stanton C, Scholz
G, Lalljie S, Somoza V, Knorr D, Jasti PR, Eisenbrand G. A
review on the beneficial aspects of food processing. Mol
Nutr Food Res. 2010 Sep;54(9):1215-47. [PubMed]
7. International Food Information Council Foundation. 2011
Food and health survey; consumer attitudes toward food
safety, nutrition, and health. 2012. [Full PDF]
8. Nicolia A, et al. An overview of the last 10 years of
genetically engineered crop safety research. Crit Rev
Biotechnol. 2014 Mar;34(1):77-88. [PubMed]
9. Weaver CM, et al. Processed foods: contributions to
nutrition. Am J Clin Nutr. 2014 Apr 23;99(6):1525-1542.
[Epub ahead of print] [PubMed]

How can you get through to people who *think* they
understand the science behind a certain topic?
By Alan Aragon
I recently had the pleasure of meeting Greg Nuckols, a
super-sharp strength coach and accomplished athlete. He also has been
a student of my work for a number of years. He asked me a
question that I think many science-based fitness professionals
regularly ponder:
____________________________________________________
"Most people are happy to reconsider their opinions about
diet and nutrition when they're exposed to what the scientific
literature has to say about a particular topic because they
realize a scientific explanation is usually better than an
unscientific one.
However, things get a little stickier when talking to someone
who *thinks* they understand the science behind a certain
topic when, in fact, they have a very limited and skewed
picture of what's going on, usually filtered through the biases
and marketing of some guru or another promising a shortcut.
These folks are usually a lot more resistant to changing their
positions because they think their position is already
well-supported.
How can you get through to the latter group to make them
realize their understanding is, in fact, incomplete when
they're sold on the fact that it's solidly supported?"

____________________________________________________
In the beginning, people have to be ready to learn. They need to
have a certain minimum level of teachability; otherwise they
won't even hear you, let alone listen. In the context of this
discussion, a person's readiness to learn exists on a continuum.
There's a broad range of roles or positions that people have
assumed at the time of your communication with them:1
Teacher/Authority Role
Assumes authority on the topic.
Harbors the primary objective of teaching/preaching, and defending his position.
Is not open to the possibility that he's incorrect or unaware of data pertaining to the topic.
Strong emotional or commercial attachment to his views prevents any consideration of counter-evidence.

Neutral Role
Assumes no authority on the topic.
Harbors no particular objective or opinion of the topic.
Has no emotional or commercial attachment to his views.
May or may not be particularly interested in the topic; may listen out of politeness or placation.

Student/Learner Role
Assumes limited understanding of the topic.
Is actively seeking, gathering, and learning new information.
Is open to the possibility that he's incorrect or unaware of data pertaining to the topic.
Has no strong emotional or commercial attachment to his views, welcomes counter-evidence.

Getting through to people who think they understand the
science behind a given topic requires that they be in either the
neutral or student/learner role. There is no penetrating the barrier
of someone entrenched in the teacher/authority role (at least not
immediately). I sometimes leverage debates with the latter cases
as a means to educate lurkers listening in on the discussion. If
there's no audience, then there's a greater chance that you're not
benefitting anyone other than yourself (by getting practice
listening to opposing views and digging up counterpoints).
I will say, however, that there occasionally are instances where
people who seemingly were unteachable ended up approaching
me years later and telling me that our debate forced them to
re-evaluate their position and seek further education. For this, they
expressed sincere gratitude. Thus, I've come to learn that even
the apparently worthless pissing matches can still prove to be
productive at some delayed point in time.
A prime example of a debate I had with someone deep in the
teacher/authority role occurred recently on social media. Some
of you may be familiar with Fred Hahn, a trainer and author
who's notorious for his dogmatic attachment to low-carb/high-fat
diets and super-slow training tempo. He exemplifies someone
who thinks he understands the science behind things, yet
regularly spews misinformation to the public. Naturally, this
results in debates with folks who actually do know the science
behind various topics of debate that surround Fred.
In one of many disagreements in this particular discussion, Fred
claimed that the results of a 1999 study by Golay et al
support the low-carb diet's superiority to the high-carb diet,
despite the authors' explicit conclusion that neither one showed
a significant weight loss advantage over the other. Fred pointed
out that, as shown in table 4 (full text PDF here), the low-carb
group lost 1 kg more total bodyweight and 2 kg more fat mass
than the high-carb group. He contended that this difference is
important despite not reaching statistical significance. Well,
here's what he missed: the low-carb group began the study with
5 kg more total bodyweight and 6 kg more total body fat than the
high-carb group. After a research-oriented guy named Mark
Germaine attempted to explain the importance of proportional
change but did not get through to Fred, I decided to weigh in and
make my own attempt:
Let me explain something that's apparently not getting across.
Percent change (or proportional change) is ultimately what
matters since it's nearly impossible for people to start off with
identical weight & body composition between the groups to be
compared. It's precisely because the groups have different BW
& composition (for example, the LC group in the study in
question started off with more BF) that we need to look at
proportional rather than net change. Folks with more fat to lose
will lose it faster regardless of protocol. Why do you think the
Biggest Loser contest is judged on percent of weight lost from
baseline instead of merely net pounds lost? Because percent
change is what matters among people beginning a trial with
different BWs. The higher BW folks would have an unfair
advantage if only net weight loss was judged. Hopefully that
makes sense.

One more example. Let's imagine a 300 lb person with 50% BF
(150 lbs of fat) loses 25% of his fat mass over a 3-month period,
which equals a loss of 37.5 lbs. By comparison, another person
starting off at 200 lbs with 25% BF (50 lbs of fat) loses 50% of
his fat mass over the same 3 months, and this equals a loss of
25 lbs. Did the 300 lb subject have better results since he lost a
higher amount of NET pounds? Of course not, because
PROPORTIONAL change is what matters.
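
The arithmetic in the two-person example above can be checked with a short Python sketch. The function and variable names here are illustrative assumptions and are not part of the original discussion; the inputs are the figures given in the example.

```python
def fat_loss(body_weight_lb, body_fat_fraction, fraction_of_fat_lost):
    """Return (starting fat mass, net fat lost), both in pounds."""
    fat_mass = body_weight_lb * body_fat_fraction
    net_loss = fat_mass * fraction_of_fat_lost
    return fat_mass, net_loss


# Figures from the example: 300 lb at 50% BF losing 25% of fat mass,
# and 200 lb at 25% BF losing 50% of fat mass.
heavier = fat_loss(300, 0.50, 0.25)
lighter = fat_loss(200, 0.25, 0.50)

for label, (fat_mass, net_loss) in (("300 lb subject", heavier),
                                    ("200 lb subject", lighter)):
    proportional = 100 * net_loss / fat_mass  # percent of starting fat mass
    print(f"{label}: started with {fat_mass:.1f} lb fat, "
          f"lost {net_loss:.1f} lb ({proportional:.0f}% of fat mass)")
```

The heavier subject comes out ahead on net pounds lost, while the lighter subject comes out ahead on proportional change, which is the comparison the example argues for.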

I sat back and reviewed my explanation and was satisfied with
the examples I gave. I felt that they were ideal learning tools for
the context of the discussion. Heck, I was even able to include
an example of a hit television show with super-high financial
stakes riding on the fairness factor in assessing weight loss
progress. To my disappointment (but not necessarily surprise),
Fred responded with the following dismissal:
Nice try Alan on the proportion issue. But like most anti-low
carb zealots like yourself, you'll say anything - even something
as ridiculous as what u said to support your religion. Had the
results been opposite, you'd be singing a different song and u
know it. Priceless BS.

The above quote clearly indicates that Fred neither understands
nor cares about the concept I relayed. His low-carb views have
become closely guarded dogma that he is sworn to defend
despite the evidence. Combine this with Fred's presumed
authoritative position on matters of diet and nutrition, and you
have a recipe for a brick wall in terms of learning.
So, how do you get through to folks who think they know what
they're talking about but actually don't? First, consider whether
or not they are positioned to listen and learn. If they aren't, then
consider debating with them for the learning benefit of the
audience, knowing that your chances of educating the audience
are far greater than educating the authority on the matter.
Folks in the neutral zone of the learning readiness continuum are
variable in their receptiveness and teachability. Those who are
actively seeking further education are a pleasure to deal with,
especially if they ask questions that challenge you to substantiate
what you're teaching. The self-perceived authority types will
rarely ask questions. They are perfectly content to preach their
gospel while shutting their eyes and ears to anything else.
Reference
1. This is an original schematic of mine. If you decide to use it
or adapt it for teaching purposes, please cite it as such:
Aragon AA. Continuum of scientific learning readiness. Alan
Aragon's Research Review, May 2014.

"America is the only place where people go hunting on a full
stomach."
Chris Rock

If you have any questions, comments, suggestions, bones of
contention, cheers, jeers, guest articles you'd like to submit, or
any feedback at all, send it over to aarrsupport@gmail.com.