CC3 GW MPitcu FVIYYi 3 QJJ 6 Ee D3 P 8 NKGZ

Team Physician’s Corner
Reliability, Validity, and Injury Predictive M

Value of the Functional Movement Screen
A Systematic Review and Meta-analysis
Nicholas A. Bonazza,* MD, Dallas Smuin,y BS, Cayce A. Onks,z DO, MS, ATC,
Matthew L. Silvis,z MD, and Aman Dhawan,*§ MD
Investigation performed at the Milton S. Hershey Medical Center,
Penn State Health, Pennsylvania State University College of Medicine, Hershey, Pennsylvania, USA
Background: The Functional Movement Screen (FMS) is utilized by professional and collegiate sports teams and the military for
the prevention of musculoskeletal injuries.
Hypothesis: The FMS demonstrates good interrater and intrarater reliability and validity and has predictive value for musculoskel-
etal injuries.
Study Design: Systematic review and meta-analysis.
Methods: A systematic review and meta-analysis were conducted using a computerized search of the electronic databases
MEDLINE and ScienceDirect in adherence with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses
(PRISMA) guidelines. Extracted relevant data from each included study were recorded on a standardized form. The Cochran
Q statistic was utilized to evaluate study heterogeneity. Pooled quantitative synthesis was performed to measure the intraclass
correlation coefficient (ICC) for interrater and intrarater reliability, along with 95% CIs, and odds ratios with 95% CIs for the injury
predictive value for a score of 14.
Results: Eleven studies for reliability, 5 studies for validity, and 9 studies for the injury predictive value were identified that met
inclusion and exclusion criteria; of these, 6 studies for reliability and 9 studies for the injury predictive value were pooled for quan-
titative synthesis. The ICC for intrarater reliability was 0.81 (95% CI, 0.69-0.92) and for interrater reliability was 0.81 (95% CI, 0.70-
0.92). The odds of sustaining an injury were 2.74 times with an FMS score of 14 (95% CI, 1.70-4.43). Studies for validity dem-
onstrated flaws in both internal and external validity of the FMS.
Conclusion: The FMS has excellent interrater and intrarater reliability. Participants with composite scores of 14 had a signifi-
cantly higher likelihood of an injury compared with those with higher scores, demonstrating the injury predictive value of the test.
Significant concerns remain regarding the validity of the FMS.
Keywords: injury prevention; Functional Movement Screen
Professional and National Collegiate Athletic Association

§
Address correspondence to Aman Dhawan, MD, Department of
(NCAA) sports teams and the United States (US) military
Orthopaedics and Rehabilitation, Penn State Health, 30 Hope Drive, rely on physically healthy people to compose their respec-
Building B, Suite 2400, Hershey, PA 17033-0850, USA (email: adhawan tive work forces. Musculoskeletal injuries are a major
@hmc.psu.edu). source of lost participation time, lost income, and medical
*Department of Orthopaedics and Rehabilitation, Milton S. Hershey
resources for the care of these injuries.35 The Functional
Medical Center, Penn State Health, Pennsylvania State University College
of Medicine, Hershey, Pennsylvania, USA. Movement Screen (FMS) is a screening test that was devel-
y
Pennsylvania State University College of Medicine, Hershey, Penn- oped with the goal of identifying deficits in movements that
sylvania, USA. may predispose an otherwise healthy person to injuries
z
Department of Family and Community Medicine, Milton S. Hershey during activity.6-9 Preparticipation examinations have
Medical Center, Penn State Health, Pennsylvania State University College
of Medicine, Hershey, Pennsylvania, USA.
long been used to assess a person’s ability to safely partic-
One or more of the authors has declared the following potential con- ipate in physical activity at the time of examination, but no
flict of interest or source of funding: A.D. is a consultant for Smith & existing screening test has been shown to predict a person’s
Nephew and is on the speakers’ bureau for Smith & Nephew and Biomet. risk of injury while participating in future activities. Kiesel
et al21 were the first to explore the possible predictive
The American Journal of Sports Medicine, Vol. 45, No. 3
DOI: 10.1177/0363546516641937
value of the FMS when they found that lower FMS scores
Ó 2016 The Author(s) were predictive of a significantly higher risk of injury in
725
726 Bonazza et al The American Journal of Sports Medicine
Initial Search Terms: MEDLINE Search

Functional Movement Screen Science Direct Search
Functional Movement Screening Total Articles Found Duplicates Removed
111
AND [Predict OR Prediction OR Injury

Predict OR Injury Prediction OR Predictive AND [Valid OR Validity
AND [Reliable Value or Injury Predictive Value] OR Effective or
OR Reliability] Effectiveness]
Articles Screened Articles Screened Articles Screened

30 45 20 33 33 43
Full Text Screened Full Text Screened Full Text Screened

4 15 4 13 5 10
Inclusion/Exclusion Inclusion/Exclusion Inclusion/Exclusion
Qualitative Analysis Qualitative Analysis Qualitative Analysis

5 11 9 5
Quantitative Analysis Quantitative Analysis

6 9
Figure 1. Search methodology.
professional football players in a 2007 study. The value of searched for relevant studies with the primary search terms
such a screening test was quickly realized, and the FMS ‘‘Functional Movement Screen’’ OR ‘‘FMS’’ and secondary
was widely adopted in organizations such as the National search terms ‘‘reliable’’ OR ‘‘reliability,’’ ‘‘valid’’ OR ‘‘validity,’’
Football League (NFL), the National Hockey League ‘‘effective’’ OR ‘‘effectiveness,’’ ‘‘predict’’ OR ‘‘prediction,’’
(NHL), and the US military.20,21,28,33,35 ‘‘injury predict’’ OR ‘‘injury prediction,’’ and ‘‘predictive
Subsequent studies regarding the FMS, however, have value’’ OR ‘‘injury predictive value.’’ Inclusion criteria
produced varied results in regard to the injury predictive value included (1) English-language studies in peer-reviewed jour-
as well as the validity of the FMS as a screening nals and (2) use of the FMS to assess uninjured people before
test.4,11,14,15,20,39,40 Analyses of the structure of the FMS have participation in their respective activities. Any reviews, case
questioned the ambiguity inherent in its grading structure reports, technique articles, or abstracts were excluded. The
and its ability and sensitivity to identify functionally relevant references of articles that met inclusion/exclusion criteria
movement limitations.1,2,5,14,15,40 Interrater and intrarater reli- were also hand reviewed to ensure that any additional rele-
ability were also introduced as possible sources for error in the vant studies were not missed.
FMS, although some early studies found high reliability Duplicates were removed from the results of each of the 3
among examiners with varying levels of experience.27 separate searches. The titles and abstracts for all of the
Given the implementation of the FMS in numerous studies were then screened by the senior author (A.D.) to
organizations and the growing body of literature examin- remove studies that did not involve the FMS. Each relevant
ing the FMS, we performed a systematic review of the lit- study was then reviewed by the senior author and assessed
erature and meta-analysis to determine whether (1) the using our inclusion and exclusion criteria for appropriate-
FMS is a reliable screening tool; (2) the FMS is a valid ness for qualitative and quantitative analyses in our study.
tool to identify functional asymmetries; and (3) if a lower Qualitative and quantitative analyses were then performed
score, and what specific score, on the FMS correlates by all authors. Quantitative analysis was performed for all
with a higher risk of musculoskeletal injuries. We hypoth- articles in which data were sufficient to be included with
esized that the FMS demonstrates both interrater and other studies in our meta-analysis as described below. A dia-
intrarater reliability with validity that can be used to iden- gram of our search methodology can be found in Figure 1.
tify people at a higher risk of an injury during activity.
Data Extraction and Statistics

METHODS
Interrater and Intrarater Reliability. Interrater and
A written protocol was developed in adherence to the Pre- intrarater reliability were assessed using the intraclass
ferred Reporting Items for Systematic Reviews and Meta- correlation coefficient (ICC). Where variance was not
Analyses (PRISMA) guidelines to conduct a systematic directly reported, the confidence interval was used to
review and meta-analysis of the available literature.24 The determine the variance using the Fisher method.13,19 An
MEDLINE and ScienceDirect electronic databases were evaluation for heterogeneity using the I2 statistic
AJSM Vol. 45, No. 3, 2017 Functional Movement Screen Meta-analysis 727
TABLE 1
Reliability Studiesa
Study (Year) Level of Evidence Study Population ICC or k (95% CI)
Elias12 (2013) 3 Physical therapists (N = 20) ICCInter = 0.906b

Gribble et al17 (2013) 3 Athletic training faculty and students (N = 38) ICCIntra = 0.754 (0.526-0.872)
Gulgin and 3 Physical therapy students (n = 3); expert ICCInter = 0.88 (0.767-0.948)
Hoogenboom18 (2014) FMS rater (n = 1)
Minick et al27 (2010) 3 Novice FMS raters (n = 2); expert FMS raters (n = 2) kInter = 0.79-1.0
Onate et al29 (2012) 3 Experienced FMS rater (n = 1); rater with no ICCInter = 0.98b
experience with FMS (n = 1) ICCIntra = 0.92b
Parenteau et al30 (2014) 3 FMS-certified physical therapists (N = 4) ICCInter = 0.96 (0.92-0.98)
ICCIntra = 0.96 (0.92-0.98)
Schneiders et al34 (2011) 3 Noncertified FMS researchers (N = 2) ICCInter = 0.971b
Shultz et al36 (2013) 3 Undergraduate student (n = 1); physical therapist (n = 1); ICCInter = 0.29 (0.03-0.55)
athletic trainers (n = 2); strength and conditioning ICCIntra = 0.6 (0.35-0.77)
coaches (n = 2)
Smith et al37 (2013) 3 Rater with experience with FMS (no certification; n = 1); ICCInter = 0.89 (0.85-0.94)
rater with experience with FMS (certification; n = 1);
athletic training faculty (no certification; n = 1);
physical therapy student (no certification; n = 1)
Teyhen et al38 (2012) 3 Physical therapy students (N = 8; interrater: n = 4, ICCInter = 0.76 (0.63-0.85)
intrarater: n = 4) ICCIntra = 0.74 (0.60-0.83)
Wright et al41 (2015) 3 Researchers (n = 2) kInter = 0.11-0.83
kIntra = 0.23-0.87
a
FMS, Functional Movement Screen; ICC, intraclass correlation coefficient; Inter, interrater; Intra, intrarater.
b
95% CI not reported.
suggested significant heterogeneity between studies. A addressing more than 1 aspect of the FMS. Inclusion and
meta-analysis was performed using the DerSimonian and exclusion criteria resulted in 11 studies evaluating reliabil-
Laird10 random-effects model. A sensitivity analysis was ity for qualitative analysis, which can be found in Table 1,
performed for the inclusion of the study of Smith et al,37 and 6 studies for quantitative analysis/data synthesis.
which reported 4 separate analyses of intrarater reliability Results that did not include the ICC with associated
(1 for each subtype of observer). As Smith et al37 included 95% CIs were unable to be included in the meta-analysis.
the individual ICC for each rater, the data were pooled to Inclusion and exclusion criteria resulted in 9 studies eval-
reduce the analysis to a single ICC, which was then uating the injury predictive value for qualitative analysis
included in the meta-analysis of the ICC. and 9 for quantitative analysis, which are described in
Injury Predictive Value. The pooled effect measure that Table 2. Inclusion and exclusion criteria resulted in 5 stud-
was subjected to the meta-analysis was the odds ratio (OR) ies evaluating validity for qualitative analysis (Table 3).
for failure of functional movement. The numerical cutoff of Given the variability of these studies for validity, a quanti-
the FMS was evaluated at a value of 14, which was the tative analysis could not be performed.
only value consistently utilized in all of the studies that
met inclusion/exclusion criteria. Again, there was signifi-
cant heterogeneity between studies based on the Mantel- Reliability
Haenszel Q statistic. Therefore, the DerSimonian and
Interrater Reliability. Ten studies evaluated interrater
Laird10 random-effects model was used to estimate the
reliability of the FMS. All studies examined the reliability
pooled OR.26 All analyses were performed using R (version
of scores of more than 1 examiner grading the same partic-
3.1.3) statistical software32 and the rmeta package.25
ipants. Significant differences were seen in the character-
Validity. Because of insufficient reporting and heteroge-
istics of the raters included in the studies. Five of the 10
neity of the data, a pooled quantitative analysis of validity
studies included examiners of various levels of experience
could not be performed. Any reported data are included as
with the FMS. Only a few studies included raters specifi-
part of the results and discussion of the articles from which
cally certified in the FMS.
they are extracted as part of our qualitative review.
Nine of the 10 studies found acceptable interrater reli-
ability, with ICC values of 0.76 to 0.98. Shultz et al36
RESULTS was the only study to report poor interrater reliability
with a Krippendorff alpha (a) value of only 0.38, which is
The initial search using the primary search terms resulted well below the 0.8 cutoff considered acceptable. The Cohen
in 111 articles. Inclusion of the secondary search terms and kappa (k) coefficient was reported as a measure of reliabil-
removal of duplicates resulted in 45 articles addressing ity for each individual FMS test component in 6 studies.
reliability, 43 articles addressing validity, and 33 address- These values varied, despite most finding overall scores
ing the injury predictive value, with several articles had acceptable interrater reliability. Of the individual
TABLE 2
Injury Predictive Value Studiesa
Level of FMS
Study (Year) Study Design Evidence Study Population Injury Definition Cutoff Value OR (95% CI)
3
Butler et al (2013) Prospective cohort 3 Firefighter trainees (N = 108) Missed time 14 8.31 (3.2-21.6)
Chorba et al4 (2010) Prospective cohort 3 NCAA Division II athletes Required medical 14 3.85 (0.980-15.130)
(N = 38; all female) attention
Dossa et al11 (2014) Prospective cohort 3 Major junior hockey players Missed time 14 2.33 (0.37-14.61)
(N = 20; all male)
Garrison et al16 (2015) Prospective cohort 3 NCAA Division I athletes Required medical 14 5.61 (2.73-11.51)
(N = 160) attention
Kiesel et al21 (2007) Prospective cohort 3 Professional football players Missed time 14 11.67 (2.47-54.52)
(N = 46; all male)
Kiesel et al22 (2014) Prospective cohort 3 Professional football players Missed time 14 2.33 (1.14-4.77)
(N = 238; all male)
Knapik et al23 (2015) Prospective cohort 3 Coast Guard cadets Required medical 14 1.42 (1.05-1.93)
(N = 1045; 770 male, 275 female) attention
28
O’Connor et al (2011) Prospective cohort 3 Marine officer candidates Required medical 14 2.00 (1.29-3.08)
(N = 874; all male) attention
Warren et al39 (2015) Prospective cohort 3 NCAA Division I athletes Required medical 14 1.01 (0.53- 1.92)
(N = 167; 89 male, 78 female) attention
a
FMS, Functional Movement Screen; NCAA, National Collegiate Athletic Association; OR, odds ratio.
TABLE 3
Validity Studiesa
Level of External/Internal
Study (Year) Study Design Evidence Study Population Validity Significance
1
Beach et al (2014) Case control 3 Male firefighters External FMS movements are difficult to replicate
and question external validity.
Clifton et al5 (2013) Prospective cohort 2 General population Internal Postural fatigue after exercise did not
(male/female) alter FMS scores.
Frost et al14 (2013) Prospective cohort 2 Firefighters Internal Participants were able to increase composite
(male/female) scores with knowledge of scoring criteria.
Kazman et al20 (2014) Cohort 3 Marine officers Internal FMS movements are not interrelated as a
(male/female) unitary sum. Individual movement scores
may be more informative.
Whiteside et al40 (2015) Cross-sectional 3 NCAA Division I External Manual grading is not sensitive enough
basketball players to detect cues for defects in joint angles
(male/female) and is subjective to raters.
a
FMS, Functional Movement Screen; NCAA, National Collegiate Athletic Association.
test components, the in-line lunge, rotary stability, and the Intrarater Reliability. Six studies evaluated intrarater
hurdle step were all implicated as the least reliable compo- reliability of the FMS. All studies examined the reliability
nent by at least 1 study.27,29,30,34,38,41 of scores of the same examiners grading the same partici-
There were 5 studies that measured interrater reliabil- pants at 2 points separated by times ranging from 48 hours
ity among raters with varying levels of FMS experience. to 4 weeks. Five of 6 studies included multiple examiners of
Only 1 of those found unacceptable interrater reliability. various levels of experience. Four of 6 studies had exam-
Additionally, Gulgin and Hoogenboom18 and Shultz iners evaluate video-recorded FMS tests, while 3 had
et al36 found that overall scores did not differ significantly examiners evaluate participants in real time.
between raters of different experience levels. Shultz et al36 All studies found acceptable intrarater reliability, with
found overall unacceptable interrater reliability, but fur- ICC values ranging from 0.6 to 0.96. As with interrater
ther analysis of their data did not show that experience reliability, the level of experience did not consistently
had any affect. Both interrater reliability of raters with affect intrarater reliability. Gribble et al17 showed that
less than 1 year of experience (ICC, 0.44; 95% CI, 0.12 to intrarater reliability increased with experience. However,
0.67) and that of raters with more than 2 years of experi- Smith et al37 found no difference in their 4 raters and actu-
ence (ICC, 0.177; 95% CI, –0.15 to 0.46) were unacceptable. ally found that the only certified FMS rater among their
Five studies were finally included in the meta-analysis group had the lowest intrarater reliability. Individual com-
for interrater reliability as the remaining 5 studies did not ponents of the FMS showed significant variability with
have sufficient data for inclusion as previously described. regard to intrarater reliability, again similar to the find-
Pooled quantitative analysis demonstrated that the mean ings for interrater reliability.
ICC was found to be 0.81 (95% CI, 0.70-0.92) (Figure 2), A meta-analysis of intrarater reliability was performed,
indicating acceptable interrater reliability. and a pooled mean ICC of 0.77 (95% CI, 0.58-0.96) (Figure
Study ICC (95% CI) Smith et al37 ICC (95% CI)

Gulgin et al 18 2014 0.88 (0.80-0.96) Rater 1 0.90 (0.81-0.99)
Parenteau et al30 2014 0.96 (0.94-0.98) Rater 2 0.81 (0.66-0.96)
36
Shultz et al 2013 0.29 (0.03-0.55) Rater 3 0.91 (0.83-0.99)
37
Smith et al 2013 0.89 (0.82-0.96)
38 Rater 4 0.88 (0.78-0.98)
Teyhen et al 2012 0.76 (0.69-0.83)
Summary 0.89 (0.84-0.94)
Summary 0.81 (0.70-0.92)
0.5 0.6 0.7 0.8 0.9 1 1.1 1.2
0.2 0.4 0.6 0.8 1 1.2
Figure 2. Analysis of interrater reliability. ICC, intraclass cor- Figure 4. Analysis of intrarater reliability of raters per Smith
relation coefficient. et al.37 ICC, intraclass correlation coefficient.
Study ICC (95% CI)

Study ICC (95% CI) Gribble et al 2013 0.75 (0.45-1.06)
Gribble et al17 2013 0.75 (0.45-1.06) Parenteau et al30 2014 0.96 (0.94-0.98)
30
Parenteau et al 2014 0.96 (0.94-0.98) Shultz et al36 2013 0.60 (0.47-0.73)
36
Shultz et al 2013 0.60 (0.47-0.73) Smith et al37 2013 0.89 (0.84-0.94)
38
Teyhen et al38 2012 0.74 (0.67-0.81) Teyhen et al 2012 0.74 (0.67-0.81)
Summary 0.77 (0.58−0.96) Summary 0.81 (0.69-0.92)
0.2 0.4 0.6 0.8 1 1.2 0.2 0.4 0.6 0.8 1 1.2
Figure 3. Analysis of intrarater reliability excluding Smith et al.37 Figure 5. Analysis of total intrarater reliability. ICC, intraclass
ICC, intraclass correlation coefficient. correlation coefficient.
3) was obtained, which signified acceptable intrarater reli-

ability. A synthesis of the data presented by Smith et al37 Study OR (95% CI)
(Figure 4) was included in the analysis, and an ICC of 0.81 Butler et al3 2013 8.31 (3.20-21.59)
(95% CI, 0.69-0.92) was observed, which also signified Chorba et al4 2010 3.85 (0.98-15.13)
acceptable intrarater reliability (Figure 5). Dossa et al11 2014 2.33 (0.37-14.61)
16
Injury Predictive Value. Nine studies with a total of Garrison et al 2015 5.61 (2.73-11.52)
21
2696 participants were identified that evaluated the injury Kiesel et al 2007 11.67 (2.48-54.81)
predictive value of the FMS. All studies, per the inclusion Kiesel et al22 2014 2.33 (1.14-4.77)
Knapik et al23 2015 1.42 (1.05-1.93)
criteria, evaluated healthy people before participation in
O'Connor et al28 2011 2.00 (1.29-3.08)
their respective activities. As seen in the studies evaluat- 39
Warren et al 2015 1.01 (0.53-1.92)
ing reliability, a significant variation was seen in the
raters and in the populations being tested. Summary 2.74 (1.70-4.43)
Definitions of injury were based on either time lost or
0 2 4 6 8 10
treatment sought. The Kiesel et al,21,22 Butler et al,3 and
Dossa et al11 studies used time lost from activity or sport,
Figure 6. Analysis of injury predictive value. OR, odds ratio.
while Chorba et al,4 Garrison et al,16 Knapik et al,23
O’Connor et al,28 and Warren et al39 based their definitions
of injury on seeking medical care. for women. Additionally, the risk ratio was optimized at a cut-
All 9 studies included in the quantitative synthesis com- off of 12 for men and 15 for women.
pared participants who scored 14 with those who scored Six of the 9 studies found that participants with an FMS
.14. This cutoff was first established by Kiesel et al21 in score of 14 had a statistically significant higher risk of
a 2007 study in which a receiver operating characteristic injury during subsequent activity than those with scores
(ROC) curve of their data showed that a cutoff of 14 maxi- of .14. Studies consisted of mostly male participants,
mized sensitivity and specificity and was affirmed in studies although Garrison et al16 and Knapik et al23 included
by Butler et al3 and Garrison et al.16 Chorba et al,4 Dossa female participants. The ORs ranged from 1.42 to 11.67.
et al,11 and Kiesel et al22 used a cutoff of 14 based on the pre- Three studies with a total of 225 participants did not find
vious studies. O’Connor et al28 and Warren et al39 developed a statistically significant correlation between FMS scores
ROC curves with their data but found no value that opti- and the risk of injuries. Chorba et al,4 the only study
mized sensitivity and specificity and thus used 14 as a cutoff with all female participants, as well as Dossa et al11 and
per prior studies as well. Knapik et al23 reported data using Warren et al,39 found ORs that ranged from 1.01 to 3.85
14 as a cutoff but found that sex affected optimal values. In but were not statistically significant.
their analysis of 1045 Coast Guard cadets, they found that A pooled quantitative synthesis using all 9 studies was
the FMS score cutoff that maximized sensitivity and specific- performed using a score of 14 points as a cutoff cumulative
ity, determined by the Youden index, was 11 for men and 14 score. An OR of 2.74 (95% CI, 1.70-4.43) was found (Figure
6), indicating that participants who scored 14 on the FMS of pressure standard deviation in the medial-lateral and
had a 2.74 times greater probability of sustaining an injury anterior-posterior directions. These measures had previ-
during subsequent activity than those who scored .14 on ously been validated to measure fatigue, which is a risk
the FMS. factor for musculoskeletal injuries.5 They tested active peo-
ple, defined as participants aged 18 to 50 years who exer-
Validity cised 3 times per week for at least 30 minutes per
workout, both before and after exercise. They hypothesized
Ten studies were identified in the initial search that evalu- that fatigue would lead to a decrease in static balance after
ated the validity of the FMS. We defined validity based on exercise and would correlate with changes in FMS scores.
the described purpose of the screen to identify deficiencies Although the static balance measurements decreased after
in movements that may contribute to a higher risk of inju- exercise as expected, they found that FMS scores did not
ries.6 The application of the screen as an injury prediction change. They concluded that the FMS was not a useful pre-
tool is evaluated separately. The full text of all 10 studies dictor of who will experience greater balance deficits after
was then reviewed independently by 2 of the authors exercise. This finding does not support the idea that the
(C.A.O., M.L.S.), who agreed independently on the appropri- FMS can be used in all settings as an injury predictor.
ateness of 5 of the 10 studies for inclusion in the qualitative Beach et al1 hypothesized that if the FMS identified defi-
analysis. One study did not meet inclusion/exclusion criteria ciencies in functional movements, scores may correlate with
as it was an abstract only and was excluded. One study more activity-specific parameters. A total of 30 firefighters
administered the FMS differently than it has been described were evaluated: 15 who scored .14 on the FMS and 15
in the literature and was excluded as well. Two studies were who scored 14. The participants were height and weight
excluded as they sought to correlate scores with athletic per- matched. They were asked to perform the standardized
formance, and the last excluded study looked at changes in task of lifting a box. Lumbar spine loading magnitudes
FMS scores with training intervention, all outside our focus and lumbar spine angles were measured and compared
on the validity of the screen itself. A pooled quantitative with FMS scores. They did not find a statistically significant
synthesis could not be performed on these 5 studies because difference between the 2 groups, which calls into question
of the absence of a standard quantitative value, which is the ability of the FMS to measure core stability.
used to assess the validity of the FMS.
Kazman et al20 used the FMS results of 934 Marine offi-
cer candidates to evaluate the factor structure of the FMS DISCUSSION
using the Cronbach alpha value and exploratory factor
analysis (EFA). The Cronbach alpha value was found to The popularity and utilization of the FMS have grown rap-
be 0.39, less than 0.5, which signifies unacceptable inter- idly since its development, bolstered by evidence in the lit-
nal consistency of the test. EFA additionally showed that erature to suggest its injury predictive value. Its adoption
the different test components within the FMS differ in at the highest level of athletics as well as the military and
their contributions to the total score, suggesting that the other public service organizations has further contributed
FMS is not unidimensional and cannot be used in its cur- to its rise in popularity, despite the existence of conflicting
rent form as a unitary construct to predict injuries. literature evaluating not only the injury predictive value
Two articles evaluated the grading used in the FMS. but also the validity and reliability of the FMS. Given
Frost et al14 hypothesized that instruction on grading cri- this, we sought to assess and assimilate the relevant litera-
teria could improve the performer’s score on the FMS. ture where appropriate to determine whether the FMS is
Twenty-one firefighters completed the FMS first and a reliable, valid screening test with an injury predictive value.
then were educated on the specific grading criteria used It is essential that limitations in the screen are understood to
to assess each move. After the instruction, the participants eliminate to the extent possible inaccurate evaluations that
were tested again, and the score was found to be signifi- can result in significant consequences for both the screened
cantly higher for 4 of the 6 moves. Their conclusion was participant and his or her respective organization.
that additional performance instruction significantly The reliability of the examiners has been thoroughly
changed FMS scores, not only functional asymmetries, investigated, especially given the theoretical concern that
speaking to potential flaws in the validity of the FMS. varying levels of experience as well as the presence or
Whiteside et al40 compared manual real-time testing to absence in certification would result in significant differen-
objective kinematic grading utilizing motion capture in ces in examinations. The variety in examiners evaluating
the evaluation of 11 NCAA Division I athletes. They found participants undergoing the FMS and the methodology for
significant differences between the 2 types of grading for how participants were examined are clearly evident in our
all of the FMS exercises tested (only 6 of the 7 moves included studies for both reliability and the injury predic-
were tested), pointing to the ambiguous criteria for grad- tive value, introducing numerous biases that could affect
ing. This study also highlighted the difficulty in assessing the results of the respective individual studies and, ulti-
multiple aspects of a movement from one vantage point. mately, the results of the meta-analysis. Overall, there is
Two studies evaluated the FMS against other measures significant evidence that the composite FMS test is reliable
of functional movement. Clifton et al5 sought to validate and can be replicated by raters with varying degrees of
the FMS by attempting to correlate scores with measure- experience with the FMS. Gribble et al17 had the only study
ments in static balance during a single-leg stance: center evaluating interrater or intrarater reliability that found
of pressure velocity, center of pressure area, and center that increasing experience led to increased reliability. Smith
et al37 was the only other study to report the ICC for inter- measures of movement and balance also did not find a corre-
rater reliability of multiple raters who were both certified lation to FMS scores, again questioning its accuracy or at
and not certified with different experience levels, and they least sensitivity in detecting physical abnormalities. Because
found no difference. None of the 5 studies evaluating inter- of the absence of any gold-standard comparison and signifi-
rater reliability with raters of different experience levels cant heterogeneity of the existing data, it is difficult to derive
found any effect on reliability. Only Shultz et al36 found any definitive conclusions from the current literature as to
poor interrater reliability, but as stated above, further anal- whether the FMS is a valid tool for the measurement of func-
ysis found that dividing more and less experienced raters tional limitations and asymmetries.
did not improve their interrater reliability. The results of Limitations of our meta-analysis of both reliability and
our meta-analysis, which showed high interrater and intra- the injury predictive value included insufficient reporting
rater reliability, suggest that level of experience and formal of raw data, P values, and variance. Other observed poten-
certification by Functional Movement Screen Inc have little tial biases in analysis include the need to determine vari-
effect on scoring of the FMS. ance using the Fisher method in situations where these
Interrater and intrarater reliability for individual tests data were not directly reported.13 This has been well
of the FMS did vary greatly among all types of raters, described previously in the literature.13
which may indicate a lack of specificity in the grading cri- Although all studies included either time missed from
teria or simply a level of difficulty in grading certain sub- sport or activity because of an injury or that which
tests that could be a source of error in the screen. This required medical attention, the variability in the definition
may be an area for further research. While the ICC is adds an element of heterogeneity to our analysis. As these
used as a quantitative measure of reliability, individual studies include a variety of participants including athletes,
test subsection analysis was often reported using the military personnel, and public servants such as fire-
kappa coefficient. While it would have been ideal to ana- fighters, much of this variability is inherent to their vary-
lyze each individual subtest within the FMS, a degree of ing respective activities and demands.
variance was often not reported with the kappa coefficient. Based on the results of this systematic review and meta-
Because of these restraints in statistical methodology, we analysis, the FMS as a composite score has excellent inter-
were unable to calculate a P value for our calculations rater and intrarater reliability and can be effectively admin-
based on insufficient reporting of raw data and/or variance. istered by raters of varying levels of experience with the
Our study demonstrates that the historical pass/fail cutoff FMS both with and without formal certification. Partici-
of 14 points is valid in predicting those at a higher risk of pants who score 14 on the FMS have greater than twice
injury. Evidence supporting 14 points as the optimal cutoff, the odds of sustaining a musculoskeletal injury than those
however, is limited as only 2 studies replicated the finding with scores of .14. However, the FMS lacks validation of
of Kiesel et al21 via independent ROC curves. These findings its structure as a composite score of multiple subtest scores
were also limited to only male participants, which may be and of its ability to accurately and sensitively measure def-
a limitation. Knapik et al23 found that the optimal cutoff icits in posture and balance. Despite this demonstrated
may differ by sex. O’Connor et al28 and Warren et al39 inde- injury predictive value of the FMS, the clinical application
pendently found no optimal value in their data. Studies of the FMS should be exercised with caution until further
with mixed-sex populations did find statistically significant studies can confirm the screen’s validity.
results, but the effect of sex and other population characteris-
tics on a cutoff that optimizes the injury predictive value of
the FMS may be an area of further research.
An online CME course associated with this article is avail-
Garrison et al16 found that a history of injuries alone
able for 1 AMA PRA Category 1 CreditTM at http://www
could identify those at a higher risk of future injuries
.sportsmed.org/aossmimis/Members/Education/AJSM%20
and that combining a history of injuries with an FMS score
Current%20Concepts%20Store.aspx. In accordance with
of 14 suggested a significantly higher risk of future inju-
the standards of the Accreditation Council for Continuing
ries, increasing their ORs from 5.61 (95% CI, 2.73-11.51) to
Medical Education (ACCME), it is the policy of The Amer-
15.11 (95% CI, 6.60-34.61). This is consistent with the prior
ican Orthopaedic Society for Sports Medicine that
finding that a history of injuries is associated with lower
authors, editors, and planners disclose to the learners all
FMS scores.31 Further research may identify other factors
financial relationships during the past 12 months with
that, combined with the FMS, significantly affect its injury
any commercial interest (A ‘commercial interest’ is any
predictive value.
entity producing, marketing, re-selling, or distributing
The included studies regarding the validity of the FMS
health care goods or services consumed by, or used on,
point to several concerns about its structure and its ability
patients). Any and all disclosures are provided in the
to detect abnormal movement patterns. Kazman et al20
online journal CME area which is provided to all partici-
showed that the composite score of the FMS is not valid as
pants before they actually take the CME activity. In accor-
a unitary construct as it is often used. Grading may be flawed
dance with AOSSM policy, authors, editors, and planners’
by somewhat ambiguous criteria, and Frost et al14 showed
participation in this educational activity will be predicated
that educating those being screened on the criteria can signif-
upon timely submission and review of AOSSM disclosure.
icantly affect scores, suggesting that scores may be reflecting
Noncompliance will result in an author/editor or planner
more than just the physical characteristics that they intend
to be stricken from participating in this CME activity.
to assess (ie, learned behavior). Comparisons with other
REFERENCES 21. Kiesel K, Plisky PJ, Voight ML. Can serious injury in professional foot-
ball be predicted by a preseason functional movement screen? N Am
1. Beach TA, Frost DM, Callaghan JP. FMS scores and low-back load- J Sports Phys Ther. 2007;2(3):147-158.
ing during lifting: whole-body movement screening as an ergonomic 22. Kiesel KB, Butler RJ, Plisky PJ. Prediction of injury by limited and
tool? Appl Ergon. 2014;45(3):482-489. asymmetrical fundamental movement patterns in American football
2. Beach TA, Frost DM, McGill SM, Callaghan JP. Physical fitness players. J Sport Rehabil. 2014;23(2):88-94.
improvements and occupational low-back loading: an exercise inter- 23. Knapik JJ, Cosio-Lima LM, Reynolds KL, Shumway RS. Efficacy of
vention study with firefighters. Ergonomics. 2014;57(5):744-763. functional movement screening for predicting injuries in Coast Guard
3. Butler RJ, Contreras M, Burton LC, Plisky PJ, Goode A, Kiesel K. cadets. J Strength Cond Res. 2015;29(5):1157-1162.
Modifiable risk factors predict injuries in firefighters during training 24. Liberati A, Altman DG, Tetzlaff J, et al. The PRISMA statement for
academies. Work. 2013;46(1):11-17. reporting systematic reviews and meta-analyses of studies that eval-
4. Chorba RS, Chorba DJ, Bouillon LE, Overmyer CA, Landis JA. Use of uate healthcare interventions: explanation and elaboration. BMJ.
a functional movement screening tool to determine injury risk in 2009;339:B2700.
female collegiate athletes. N Am J Sports Phys Ther. 2010;5(2):47-54. 25. Lumley T. rmeta: meta-analysis. R package version 2.16. 2012. Avail-
5. Clifton DR, Harrison BC, Hertel J, Hart JM. Relationship between able at: http://CRAN.R-project.org/package=rmeta. Accessed July
functional assessments and exercise-related changes during static 16, 2015.
balance. J Strength Cond Res. 2013;27(4):966-972. 26. Mantel N, Brown C, Byar DP. Tests for homogeneity of effect in an
6. Cook G, Burton L, Hoogenboom B. Pre-participation screening: the epidemiologic investigation. Am J Epidemiol. 1977;106(2):125-129.
use of fundamental movements as an assessment of function, part 27. Minick KI, Kiesel KB, Burton L, Taylor A, Plisky P, Butler RJ. Interrater
1. N Am J Sports Phys Ther. 2006;1(2):62-72. reliability of the functional movement screen. J Strength Cond Res.
7. Cook G, Burton L, Hoogenboom B. Pre-participation screening: the 2010;24(2):479-486.
use of fundamental movements as an assessment of function, part 28. O’Connor FG, Deuster PA, Davis J, Pappas CG, Knapik JJ. Func-
2. N Am J Sports Phys Ther. 2006;1(3):132-139. tional movement screening: predicting injuries in officer candidates.
8. Cook G, Burton L, Hoogenboom BJ, Voight M. Functional movement Med Sci Sports Exerc. 2011;43(12):2224-2230.
screening: the use of fundamental movements as an assessment of 29. Onate JA, Dewey T, Kollock RO, et al. Real-time intersession and
function, part 1. Int J Sports Phys Ther. 2014;9(3):396-409. interrater reliability of the functional movement screen. J Strength
9. Cook G, Burton L, Hoogenboom BJ, Voight M. Functional movement Cond Res. 2012;26(2):408-415.
screening: the use of fundamental movements as an assessment of 30. Parenteau GE, Gaudreault N, Chambers S, et al. Functional move-
function, part 2. Int J Sports Phys Ther. 2014;9(4):549-563. ment screen test: a reliable screening test for young elite ice hockey
10. DerSimonian R, Laird N. Meta-analysis in clinical trials. Control Clin players. Phys Ther Sport. 2014;15(3):169-175.
Trials. 1986;7:177-188. 31. Peate WF, Bates G, Lunda K, Francis S, Bellamy K. Core strength:
11. Dossa K, Cashman G, Howitt S, West B, Murray N. Can injury in a new model for injury prediction and prevention. J Occup Med Tox-
major junior hockey players be predicted by a pre-season functional icol. 2007;2:3.
movement screen: a prospective cohort study. J Can Chiropr Assoc. 32. R Foundation for Statistical Computing. R: A Language and Environ-
2014;58(4):421-427. ment for Statistical Computing [computer program]. Vienna: R Foun-
12. Elias JE. The inter-rater reliability of the functional movement screen dation for Statistical Computing; 2015.
within an athletic population using untrained raters [published online 33. Rowan CP, Kuropkat C, Gumieniak RJ, Gledhill N, Jamnik VK. Inte-
July 8, 2013]. J Strength Cond Res. doi:10.1519/JSC.0b013e3182a1ff1d. gration of the functional movement screen into the National Hockey
13. Fisher RA. Statistical Methods for Research Workers. 14th ed. Edin- League Combine. J Strength Cond Res. 2015;29(5):1163-1171.
burgh: Oliver and Boyd; 1970. 34. Schneiders AG, Davidsson A, Horman E, Sullivan SJ. Functional
14. Frost DM, Beach TA, Callaghan JP, McGill SM. FMS scores change movement screen normative values in a young, active population.
with performers’ knowledge of the grading criteria: are general Int J Sports Phys Ther. 2011;6(2):75-82.
whole-body movement screens capturing ‘‘dysfunction’’ [published 35. Shah VM, Andrews JR, Fleisig GS, Mcmichael CS, Lemak LJ. Return
online November 20, 2013]? J Strength Cond Res. doi:10.1519/ to play after anterior cruciate ligament reconstruction in National
Jsc.0b013e3182a95343. Football League athletes. Am J Sports Med. 2010;38(11):2233-2239.
15. Frost DM, Beach TA, Callaghan JP, McGill SM. Using the functional 36. Shultz R, Anderson SS, Matheson GO, Marcello B, Besier T. Test-
movement screen to evaluate the effectiveness of training. J Strength retest and interrater reliability of the functional movement screen.
Cond Res. 2012;26(6):1620-1630. J Athl Train. 2013;48(3):331-336.
16. Garrison M, Westrick R, Johnson MR, Benenson J. Association 37. Smith CA, Chimera NJ, Wright NJ, Warren M. Interrater and intrarater
between the functional movement screen and injury development reliability of the functional movement screen. J Strength Cond Res.
in college athletes. Int J Sports Phys Ther. 2015;10(1):21-28. 2013;27(4):982-987.
17. Gribble PA, Brigle J, Pietrosimone BG, Pfile KR, Webster KA. Intra- 38. Teyhen DS, Shaffer SW, Lorenson CL, et al. The functional move-
rater reliability of the functional movement screen. J Strength Cond ment screen: a reliability study. J Orthop Sports Phys Ther.
Res. 2013;27(4):978-981. 2012;42(6):530-540.
18. Gulgin H, Hoogenboom B. The functional movement screening 39. Warren M, Smith CA, Chimera NJ. Association of the functional
(FMS): an inter-rater reliability study between raters of varied experi- movement screen with injuries in Division I athletes. J Sport Rehabil.
ence. Int J Sports Phys Ther. 2014;9(1):14-20. 2015;24(2):163-170.
19. Ip EH, Wasserman R, Barkin S. Comparison of intraclass correlation 40. Whiteside D, Deneweth JM, Pohorence MA, et al. Grading the func-
coefficient estimates and standard errors between using cross- tional movement screen: a comparison of manual (real-time) and
sectional and repeated measurement data: the safety check cluster objective methods. J Strength Cond Res. 2015;29(1):254-261.
randomized trial. Contemp Clin Trials. 2011;32(2):225-232. 41. Wright MD, Portas MD, Evans VJ, Weston M. The effectiveness of 4
20. Kazman JB, Galecki JM, Lisman P, Deuster PA, O’Connor FG. Factor weeks of fundamental movement training on functional movement
structure of the functional movement screen in Marine officer candi- screen and physiological performance in physically active children.
dates. J Strength Cond Res. 2014;28(3):672-678. J Strength Cond Res. 2015;29(1):254-261.
For reprints and permission queries, please visit SAGE’s Web site at http://www.sagepub.com/journalsPermissions.nav.

CC3 GW MPitcu FVIYYi 3 QJJ 6 Ee D3 P 8 NKGZ

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

CC3 GW MPitcu FVIYYi 3 QJJ 6 Ee D3 P 8 NKGZ

Caricato da

Copyright:

Formati disponibili

Team Physician’s Corner

Reliability, Validity, and Injury Predictive M

Professional and National Collegiate Athletic Association

Initial Search Terms: MEDLINE Search

AND [Predict OR Prediction OR Injury

Articles Screened Articles Screened Articles Screened

Full Text Screened Full Text Screened Full Text Screened

Inclusion/Exclusion Inclusion/Exclusion Inclusion/Exclusion

Qualitative Analysis Qualitative Analysis Qualitative Analysis

Quantitative Analysis Quantitative Analysis

Figure 1. Search methodology.

Data Extraction and Statistics

Study (Year) Level of Evidence Study Population ICC or k (95% CI)

Elias12 (2013) 3 Physical therapists (N = 20) ICCInter = 0.906b

Study ICC (95% CI) Smith et al37 ICC (95% CI)

Study ICC (95% CI)

Summary 0.77 (0.58−0.96) Summary 0.81 (0.69-0.92)

3) was obtained, which signified acceptable intrarater reli-

Potrebbero piacerti anche