4./s10551 017 3688 y

J Bus Ethics
DOI 10.1007/s10551-017-3688-y
ORIGINAL PAPER
Wanting More, Getting Less: Gaming Performance Measurement

as a Form of Deviant Workplace Behavior
Laura Graf1 • Wiebke S. Wendler1 • Jutta Stumpf-Wollersheim2 • Isabell M. Welpe1,3
Received: 29 March 2017 / Accepted: 24 August 2017

Springer Science+Business Media B.V. 2017
Abstract Investigating the causes of unethical behaviors organizational norms that harms the university). We dis-
in academia, such as scientific misconduct, has become a cuss that gaming performance measurement has not been
highly important research subject. The current performance considered as a type of deviant workplace behavior in the
measurement practices (e.g., equating research perfor- previous literature. Furthermore, we draw from research on
mance with the number of publications in top-tier journals) deviant workplace behavior and goal setting to discuss
are frequently referred to as being responsible for scien- psychological processes that may underlie gaming perfor-
tists’ unethical behaviors. We conducted qualitative semi- mance measurement. Our results indicate the importance of
structured interviews with different stakeholders of the connecting literature on deviant workplace behavior and
higher education system (e.g., professors and policy mak- goal setting to advance our understanding of gaming per-
ers; N = 43) to analyze the influence of performance formance measurement.
measurement on scientists’ behavior. We followed a three-
step coding procedure and found (1) that the participants Keywords Academia Deviant workplace behavior
described a variety of positive behavioral consequences Counterproductive work behavior Gaming performance
(e.g., higher productivity) but mainly negative behavioral measurement Higher education Qualitative interviews
consequences (e.g., questionable publishing practices) of Performance measurement Production deviance
current performance measurement practices in academia; Scientific misconduct
(2) that scientists’ behavior can be described as gaming
performance measurement (i.e., achieving performance
goals by reducing performance quality and focusing on Introduction
those tasks that are measured); and (3) that gaming per-
formance measurement shares the same characteristics as Scientific misconduct, such as strategically manipulating
deviant workplace behavior (i.e., a voluntary violation of experimental settings, not only violates ethical rules but
also puts a university’s reputation for research at risk.
Recent scandals illustrate how scientists’ behavior can
Laura Graf and Wiebke S. Wendler have contributed equally to this
work and they are listed in alphabetical order. cause severe organizational damage for the employing
university (Christian and Ellis 2014; Gross 2016; Stroebe
& Laura Graf et al. 2012). Due to negative media coverage, cases of
laura.graf@tum.de scientific misconduct have an impact on the image of the
1
Chair for Strategy and Organization, TUM School of
entire scientific community (Stroebe et al. 2012). Honig
Management, Technical University of Munich, Arcisstraße et al. (2017) call attention to the danger of unethical
21, 80333 Munich, Germany practices in the production of knowledge by scientists.
2
Technische Universität Bergakademie Freiberg, Freiberg, Even less-severe behaviors than scientific misconduct, such
Germany as carelessness, ‘cooking’ data in order to find significant
3
Bavarian State Institute for Higher Education Research and results, or harking (i.e., hypothesizing after the results are
Planning, Munich, Germany known; Butler et al. 2017) can lead to severe impacts, such
123
L. Graf et al.
as less-reliable scientific findings or a wasted funding (Robinson and Bennett 1995, p. 565). Thus, the study
(Steneck 2006). Preventing such damage to universities makes two important theoretical contributions: First, based
requires a better understanding of this phenomenon as well on the results of our interviews, we propose that gaming
as its antecedents and consequences. performance measurement is a type of task-related deviant
Several authors suggest that an important antecedent of workplace behavior that is missing in current frameworks
scientists’ unethical behaviors may be the current perfor- and typologies of deviant workplace behavior. Second, we
mance measurement practices in academia (e.g., Barsky embed our findings in both the literature on goal setting and
2008; Clor-Proell et al. 2015; Gross 2016; Ordónez et al. the literature on deviant workplace behavior, and we
2009). Performance measurement in academia is based on demonstrate that both streams of literature offer potential
managerial principles that originated from the private explanations for the psychological processes underlying
sector (e.g., Osterloh 2010; Walker et al. 2011). This so- gaming performance measurement. We claim that con-
called ‘new managerialism’ (Deem and Brehony 2005; necting these research fields will contribute to a better
Deem et al. 2007) is related to the principles of goal setting understanding of gaming performance measurement.
theory, which proposes that setting goals and measuring the The study also has important practical implications for
attainment of these goals leads to an increase in perfor- university management. Including gaming performance
mance (Latham and Locke 1990, 2007; Locke and Latham measurement in frameworks of deviant workplace behavior
2002). Moreover, new managerialism implies a heightened helps universities to understand the unintended conse-
accountability, for example, to third parties such as funding quences of their performance measurement practices on
organizations (Melo et al. 2008). Consequently, there is a scientists’ behavior. Our finding that scientists who engage
need to measure and monitor the performance of scientists in gaming performance measurement can severely harm
and higher education institutions in an objective and time- the university should motivate universities to question their
efficient manner (Deem and Brehony 2005; Melo et al. current performance measurement practices, such as
2008). For this reason, current performance measurement focusing on one-sided performance goals (i.e., publications
practices in academia focus on the outcome of scientists’ and third-party funding) and neglecting the multidimen-
work (Aguinis et al. 2012, 2014; Hood 1991; Osterloh sionality of scholarly performance.
2010; Sousa et al. 2010). With regard to research, for The rest of this paper is structured as follows. We start
example, performance is mainly based on the number of by defining gaming performance measurement and suc-
publications, particularly in high-impact journals, and on cessively describe which characteristics of performance
the amount of acquired funding. Critical voices suggest that measurement can lead to gaming behavior. The method
these performance measurement practices do not lead to and the results are structured in a three-step process: First,
the intended increase in performance but encourage sci- we identify scientists’ negative and positive behavioral
entists to behave unethically (e.g., Osterloh 2010; Schubert reactions to current performance measurement practices in
2009; van Thiel and Leeuw 2002). More specifically, academia. Second, we show that these behavioral conse-
previous research indicates that performance measurement quences can be characterized as gaming performance
practices that focus on the outcome of employees’ work measurement. Third, we illustrate that gaming performance
and directly link success to goal attainment can motivate measurement shows clear parallels to deviant workplace
employees to game the performance measurement prac- behavior. In the discussion, we provide explanations for
tices (i.e., fulfilling the performance measures while gaming performance measurement and situate it within
behaving dysfunctional; Barsky 2008; Jaworski 1988; Saini current frameworks of deviant workplace behavior.
and Krush 2008). What is missing so far is a comprehen-
sive and extensive exploration of scientists’ behavioral
adaptations to current performance measurement practices. Theoretical Background
To address this research gap, we conducted qualitative
interviews with stakeholders of the higher education sys- Scientific Misconduct and Other Bad Behaviors
tem (e.g., professors and policy makers) and identified of Scientists
positive as well as negative behavioral consequences of
current performance measurement practices in academia. Following the National Science Foundation,1 scientific
Our results not only show that scientists engage in gaming misconduct encompasses three types of behaviors: (1)
performance measurement but also demonstrate that gam- falsification (i.e., manipulation of the research material,
ing performance measurement is a type of deviant work- process, data, or results), (2) fabrication of data or results,
place behavior, that is, a voluntary behavior that violates and (3) plagiarism (i.e., not citing another person’s work
significant organizational norms and in so doing threatens
1
the well-being of an organization, its members, or both Source: https://www.nsf.gov/oig/_pdf/cfr/45-CFR-689.pdf.
123
Wanting More, Getting Less
but presenting it as one’s own; see also Fanelli 2009). summarizes frequently reported difficulties associated with
However, besides scientific misconduct, there is behavior performance measurement indicators in public manage-
that might not be directly related to fraud but is inherently ment as a form of tunnel vision, gaming, or measure fix-
unethical and opposed to organizational norms (Martinson ation (p. 11). In the context of budgeting decisions, gaming
et al. 2005; Steneck 2006). Examples are questionable performance measurement takes the form of financial
research practices such as processing data as much as misconduct. Jensen (2003), for example, states that setting
possible in order to achieve significant results (Babbage particular performance goals and tying the compensation to
1830; Ghysels 2000) or failing to represent data that con- these goals, while at the same time giving employees full
tradict one’s own previous research (Martinson et al. 2005, freedom in the realization of these goals leads to gaming in
p. 737). These behaviors often include ‘cooking’ numbers the planning and realization of goals. Jensen (2003)
and models or building hypotheses after having conducted therefore assumes that pay-for-performance encourages
the research (Butler et al. 2017). Such questionable cheating and claims that setting strong performance goals
research methods are referred to as being in the ‘gray can lead to extreme losses for the company. Similarly,
zone’; however, they present a form of wrongdoing (Butler Goebel and Weißenberger (2016) determined that very
et al. 2017). In a meta-analysis of surveys conducted on strong financial control leads to employee behavior that
this matter, Fanelli (2009) found that 1.97% (N = 7, does not align with organizational goals. Instead, employee
95%CI .86–4.45) of scientists admitted to scientific misbehavior is focused on meeting the performance goals even
conduct, and 33.7% reported engaging in questionable though this behavior might be harmful to the organization.
research methods. Similarly, Martinson et al. (2005) argue A severe example of setting the wrong performance goals
that the ‘bad behavior’ of scientists should not be limited to and how this encourages gaming performance measure-
actual scientific misconduct but that other behaviors need ment is the company Enron (i.e., a large energy company
to be taken into account as well. As a result, previous that failed in 2001). Because their executives’ pay-for-
research has addressed a multitude of different question- performance was tied to the revenue the company gener-
able research methods. While they range from severe cases ated and not its profit, managers focused on the revenue,
(i.e., scientific misconduct) to minor transgressions, they while losing sight of the profit2 (Niven and Healy 2016;
all have in common that they are unethical. Therefore, in Ordónez et al. 2009). Thus, due to a lack of fit between the
this study, we do not limit our analysis to research per- performance measurement criteria and the actually desired
formance and scientific misconduct but take other tasks employee behavior, Enron encouraged the gaming of per-
into account as well, such as teaching, grant acquisition, formance measurement.
and administrational tasks. Taken together, gaming performance measurement
There are several widely discussed explanations for seems to be prevalent in diverse work contexts in which
scientific misconduct and similar unethical behaviors, employee behavior is subject to performance measure-
both in universities and other organizations. To begin ment. Jaworski (1988) concludes that gaming ‘‘refers to
with, several studies analyze individual differences in situations in which employees behave in ways that look
scientific misconduct (e.g., individual improperty; see good in terms of the control system measures but are
Sovacool 2008). Other authors describe scientists’ dysfunctional for the firm’’ (p. 34; see also Saini and
behavior as ‘playing the game’ or ‘gaming the system’ Krush 2008). Similarly, Butler and Spoelstra (2012)
(e.g., Butler et al. 2017; Osterloh and Frey 2015), arguing derive from previous research that gaming individuals or
that their unethical behavior can be explained as reactions institutions ‘‘seek to skew the system of which they are a
to current performance measurement practices in acade- part’’ (p. 900). These definitions and the examples above
mia (e.g., Fanelli 2009; Martinson et al. 2005; Steneck all describe the essence of gaming performance mea-
2003). surement: The focus of employee behavior is not on
showing good performance but on fulfilling the perfor-
mance measurement criteria. Cases of financial miscon-
Gaming Performance Measurement duct, such as the example of Enron, further suggest that
gaming performance measurement is not only dysfunc-
Gaming performance measurement is not only a problem- tional but also very harmful to organizations. In the fol-
atic behavior of scientists but a phenomenon that has been lowing, we will review further studies on performance
discussed in other work contexts as well. For example, measurement and goal setting to describe which
several studies have found that public sector employees
adapt their performance to clearly specified performance
indicators (Butler and Spoelstra 2012; De Bruijn 2002; 2
Ackman (2002) Pay madness at Enron. Forbes. Source: https://
Smith 1995; van Thiel and Leeuw 2002). Johnsen (2005) www.forbes.com/2002/03/22/0322enronpay.html.
123
L. Graf et al.
characteristics of the current performance measurement emphasizes that one tactic that is frequently used to
practices in academia might lead to scientists’ gaming antagonize scientific misconduct is to decrease the
behavior. emphasis on the number of publications and reinforce
quality over quantity. When the quality of the research
Characteristics of Performance Measurement process is rewarded instead of the quantity of publications
Practices That Can Lead to Gaming (i.e., high-result focus), this might be an incentive to
behave in an ethically correct way. This assumption finds
Barsky (2008) specifies under which circumstances per- support in the finding of Tijdink et al. (2014) that there is a
formance measurement may lead to unintended unethical strong correlation between scientists’ perception of publi-
consequences. Similarly, Aguinis (2013) derives several cation pressure and cases of scientific misconduct.
factors from prior studies on performance measurement Third, if performance goals do not represent all impor-
that are necessary prerequisites for effective performance tant job tasks, employees tend to concentrate solely on
measurement practices. However, the current performance attaining the performance goals and neglect those tasks of
measurement practices in academia do not fulfill all of their job that are not measured (Smith 1995). A so-called
these prerequisites, because the practices do not fit the type ‘tunnel vision’ occurs (Smith 1995) because being focused
of work that is conducted in science. on particular content draws attention away from other
First, the performance goals in academia may be very aspects (Polzer and Neale 1995). Scientists, for example,
difficult for scientists to achieve (see Aguinis 2013; Barsky may direct their attention toward research and publishing
2008). Freeman et al. (2001) describe the work environ- papers at the expense of providing high-quality teaching.
ment of scientists as influenced by extremely high demands As described above, most previous studies have focused on
in addition to particularly intense competition. If scientists bad behaviors in the research setting. Few studies have
are faced with very high expectations, they may lack the addressed misbehaviors with regard to teaching. However,
ability to successfully complete their tasks. Therefore, to when it comes to reactions to current performance mea-
meet these high performance goals, scientists might adapt surement practices in academia, the disregard of teaching
their behavior to the performance measurement criteria and duties for the benefit of better performance in research
resort to questionable research methods. In line with this seems like a logical consequence of such tunnel vision.
argument, Clor-Proell et al. (2015) found that employee
fraud is more likely when employees are faced with diffi-
cult compared to easy goals. Method
Second, if the attainment of performance goals is not
directly linked to behavior but only to the outcome, Design and Procedure
employees are not rewarded for behaving in an ethical way
(Aguinis 2013; Barsky 2008). In other words, performance We collected data in the years 2014 and 2015 by con-
criteria should be an appropriate representation of ducting semi-structured interviews with stakeholders of the
employees’ actual performance. In a knowledge-intensive German higher education system. The main purpose of
context, such as in academia, linking work outcomes to these interviews was to analyze the criteria that are cur-
employee behavior is challenging because knowledge work rently used to measure performance in academia. The
is difficult to analyze and measure (Daft et al. 2010). An interviews were conducted by the first two authors and
example is the work of a scientist who derives well-in- research assistants, who received an interview guideline
formed hypotheses and conducts a properly planned study training. Sixty percent of the interviews were conducted by
but nevertheless yields insignificant results. The scientist’s telephone, and 40% were conducted face-to-face. On
actual performance may be good, but the outcome (i.e., an average, the interviews lasted 31 min (SD = 9.15). In the
insignificant result that is difficult to publish) may be beginning of each interview, we asked the interviewees for
insufficient. Consequently, the scientist might resort to consent to record the interviews, which all interviewees
violating the standards of good scientific practice in order provided. Prior to analyzing the interviews, we made a
to achieve the goal to produce publications. This example verbatim transcript of each interview.
illustrates how performance measurement practices that are
outcome-related and not behavior-related (Barsky 2008) Interview Guideline
reward unethical behavior. Instead of rewarding scientists’
behavior during the research process (i.e., the quality of The purpose of the interviews was to improve our under-
performance), current performance measurement practices standing of current performance measurement practices in
encourage scientists to publish as many publications as academia. We sought to discover advantages and disad-
possible (i.e., the quantity of performance). Gross (2016) vantages of the prevalent criteria for measuring scientific
123
performance in order to derive suggestions for amelioration 20,129 (SD = 14,592.74), the universities in our sample
and future development. To clarify the participants’ per- are slightly larger compared to the total sample of 110
spectives on these questions, we first asked whether they German universities with an average number of 15,682.35
perceived any advantages or disadvantages of the currently students (SD = 14,747.37).3 The vast majority of univer-
used performance criteria and whether they see any pos- sities follow Humboldt’s ideal of unity of research and
sibilities for improvement (questions 1–3, see Table 1). teaching and thus have a strong tradition in both research
Second, we asked whether the participants thought that the and teaching. The policy makers were affiliated with three
currently used performance criteria have an effect on the different large German political parties.
behavior of scientists (question 4) and, if yes, whether they We asked the participants to evaluate their experience
thought that this influence was rather positive or negative with science on a scale of 0 (no experience) to 6 (extensive
(question 5, see Table 1). Further, we asked the participants experience) and ensured that all participants, including
to elaborate on this influence and how the behavior of policy makers and students, had practical experience with
scientists changes due to performance measurement science. Four of the 47 participants were excluded because
(question 6, see Table 1). they were unable to answer questions related to the current
performance measurement practices in higher education,
Sample resulting in a final sample size of 43. The participants rated
their experience with a 4.50 (SD = 1.40) and all partici-
The target population of the interviews included stake- pants had at least some experience with science. On
holders of the German higher education system (N = 47). average, the interviewees had 13.96 years of work expe-
We asked the following stakeholder groups to participate in rience (SD = 10.98). The participants possessed the fol-
the interviews: (1) Professors (n = 10), because as scien- lowing academic degrees: high school diploma (n = 6),
tists, they are directly affected by performance measure- bachelor degree (n = 4), master’s degree or equivalent
ment, but are also involved in the performance (n = 10), Ph.D. (n = 17), and professorship (n = 10).
measurement of their Ph.D. students; (2) junior scientists
(n = 12), and among those, six Ph.D. students and six Coding Procedure and Scheme
postdoctoral fellows, because they are affected by perfor-
mance measurement and must meet the performance We used MAXQDA (version 11) to conduct a qualitative
measurement indicators in order to earn their Ph.D./a content analysis (Krippendorff 2013) and obtain coding
professorship; (3) policy makers who are related to the frequencies. The KALPHA Macro (Hayes and Krippen-
higher education system (n = 5), because they are dorff 2007) for IBM SPSS Statistics was used to calculate
involved in distributing money to universities; (4) univer- Krippendorff’s Alpha (Krippendorff 2013). To code the
sity managers (n = 10), because they are involved in uni- interviewees’ answers as objectively as possible, we used a
versity leadership and governance and therefore, also in the stepwise coding process. In Step 1, we identified negative
decisions regarding the performance measurement prac- and positive consequences of the current performance
tices of their universities; and (5) students (n = 10), measurement practices in academia. We explain the ratio-
because they are an important segment of universities and nale for Step 2 and Step 3 of the coding process in more
because of the unity of teaching and research, are directly detail in the results section because we based our decision
affected by performance measurement practices. Partici- to reanalyze the interview data on the results of the pre-
pants were recruited through conferences and an internet ceding coding steps. We defined the coding unit as the
search (particularly for policy makers). In alignment with description of a specific behavioral consequence resulting
the approach of theoretical saturation (see, e.g., Guest et al. from current performance measurement practices in aca-
2006; Sandelowski 1995), we thoroughly conducted inter- demia. It includes one or more sentences from the inter-
views until saturation was observed within each subgroup viewee’s answers.
and within the entire sample.
Fifty-five percent (n = 26) of the sample were female,
and the mean age was 36.87 years (SD = 12.36). The
students, professors, junior scientists, and university man- 3
We downloaded information on the number of students at all higher
agers were from 22 different higher education institutions education institutions in Germany from the website of the German
in Germany, including 19 state-funded universities, two Rector’s Conference (http://www.hs-kompass2.de/kompass/xml/
private universities, and one privately funded university of download/hs_liste.txt). Because the university of corporate educa-
tion was not included in this data set, we could not obtain reliable data
corporate education (i.e., a higher education institution
on the number of students currently enrolled at this university.
offering dual courses that combine tertiary education and Therefore, the average numbers reported for the number of students
vocational training). With an average student population of do not include this university.
123
L. Graf et al.
Table 1 Interview questions

Block 1: Advantages and disadvantages
1. In your opinion, what are the most important advantages and disadvantages of currently used indicators for performance measurement?
(open question)
2. If you (at least partially) perceive disadvantages in the currently used indicators for performance measurement, do you see any
possibilities to address these issues? (yes/no)
3. Which changes would have to be implemented to address these disadvantages? (open question)
Block 2: Behavioral consequences
4. Do you think that currently used criteria for performance measurement affect scientists’ behavior? (yes/no)
5. If you think that currently used indicators for performance measurement influence the behavior of scientists: Do you think this influence is
rather positive or negative? (0 = very negative to 6 = very positive)
6. Please provide a concrete insight into the influence that currently used indicators for performance measurement could have on scientists’
performance. (open question)
For the present study, we analyzed the subset of questions presented in this table. The interview guideline included additional questions regarding
current performance measurement practices in academia and several related issues, such as the need for performance-based funding or the
characteristics of research and teaching quality
Step 1: Behavioral Consequences an explorative, inductive content analysis (Krippendorff

2013).
In the development of the coding scheme ‘behavioral The final coding scheme for the behavioral conse-
consequences,’ we followed a two-step, inductive proce- quences of current performance measurement practices is
dure. First, the first two authors of this article separately displayed in Table 2, including the code descriptions, the
analyzed all interviews and coded the questions about the intercoder agreement (a), and coding frequencies (k).
behavioral consequences of the currently used performance Coding frequencies represent the number of codings, that
criteria (questions 1–3, see Table 1). Coding differences is, the number of times a behavioral consequence was
were discussed and resolved, resulting in an initial coding mentioned in the interviews, across all 47 participants. The
scheme. Second, two additional coders used this initial coding frequency for one of the ten behavioral conse-
coding scheme to repeat the analysis on all interviews. In quences is not identical to the number of interviewees who
addition, we extended the analysis to the questions reported the behavioral consequence, because in rare cases,
regarding advantages and disadvantages of the currently an interviewee referred to different behaviors that were
used performance indicators (questions 4–6, see Table 1) later aggregated to a more general level. For example, if an
because we realized that many of the participants reported interviewee reported both ‘disseminating scientific find-
behavioral consequences when answering these questions. ings’ and ‘conducting more research projects and gener-
After inspection of the coding differences by the first two ating more publications,’ we counted it as two codings of
authors, the code descriptions and coding guidelines were the code ‘higher productivity.’ After coding, the negative
revised and discussed with the two subsequent coders. consequences were further categorized into similar
These two subsequent coders reanalyzed the interviews and behaviors; for example, all negative consequences related
revised their codings according to the new guidelines. to a decrease in research quality were grouped into ‘de-
Because intercoder agreement was not sufficient for some crease in research quality.’ For these categories of behav-
of the codes (amin = .33, amax = 1.00), we modified the ioral consequences, the coding frequencies represent the
coding scheme. In particular, we aggregated some of the sum of coding frequencies of all subordinate codes.
codes to a more general level. For example, all behavioral
consequences related to questionable methods, such as Step 2: Gaming Performance Measurement
‘violation of norms and standards’ and ‘use of simple
experimental designs,’ were aggregated to the code ‘use of After Step 1 of the coding process, we noticed that many of
questionable methods.’ These modifications resulted in a the negative behavioral consequences seemed to describe
final coding scheme that included ten codes (seven nega- the phenomenon ‘gaming performance measurement,’ that
tive consequences and three positive consequences). The is, scientists ‘‘behave in ways that look good in terms of the
intercoder agreement on the basis of this coding control system measures but are dysfunctional for the firm’’
scheme was greater than .75 for all codes (amin = .75, (Jaworski 1988, p. 34). The interviewees’ responses that
amax = 1.00, M = .82, SD = .05), which is acceptable for resembled gaming performance measurement described
123
Table 2 Coding scheme for behavioral consequences of current performance measurement practices in academia
Code Description aa kb
Positive behavioral consequences 56

1. Increase in quality Increase in research quality .77 17
Increase in teaching quality
2. Higher productivity Disseminating scientific findings (e.g., at conferences or in the media) .75 22
Conducting more research projects and generating more publications
3. Motivation Higher motivation due to positive pressure .79 17

Participating in professional development courses
Revising one’s teaching
Individuals with wrong motives (i.e., obtain an academic title) are sorted out
Negative behavioral consequences 181

1. Decrease in quality 113
1.1. Decrease in research quality 101
1.1.1. Questionable publishing Orientation toward top-tier journals .84 46
practices Generating more publications but of lower quality
Slicing (i.e., publishing the results of a research project in numerous separate
publications)
Publishing fewer books
Publication bias (i.e., researchers do not publish nonsignificant results)
Publishing more single-authored articles
Other issues related to publishing behavior that lead to a decrease in research quality
1.1.2. Selective choice of research Only topics that quickly lead to results are chosen .87 34
topics Only popular and easily publishable topics are chosen
Risks are not taken even if the topic would be a better fit
1.1.3. Use of questionable methods Violation of norms and standards .84 12

Use of undemanding/modest statistics
Use of experimental designs that are likely to generate significant results
Use of simple experimental designs that can be quickly tested
Other methodological issues that lead to a decrease in research quality
1.1.4. Research quality decreases The quality of research decreases (not further specified) .75 9
1.2. Decrease in teaching quality The quality of teaching decreases: .91 12

Specialization on research topic too high and no time to prepare lessons
Supervising too many Ph.D. students
2. Prioritization Effort is focused on a specific aspect, while others are disregarded: .87 55
Prioritization of research
Disregard of teaching
Concentration on grant acquisition instead of research
3. Demotivation and pressure Pressure leads to a decrease in motivation and effort .80 13
Leads to the wrong individuals conducting research due to the wrong incentives
a
a = Krippendorff’s Alpha
b
k = coding frequency (number of codings)
two different aspects of scientists’ behavior: (1) focusing expense of other tasks that are not defined as performance
on attaining specific performance goals while actually goals but that nevertheless are important for the univer-
producing lower-quality work and (2) putting more effort sity’s success. The second coders used this definition of
into tasks that are defined as performance goals at the gaming performance measurement to reanalyze the
123
L. Graf et al.
interviews. More specifically, they coded all questions that neglecting other important performance aspects; k = 55),
they had also analyzed during Step 1 of the coding process and (3) demotivation and pressure (k = 13).
(i.e., the identification of positive and negative conse- To obtain a clear understanding of the behavioral con-
quences; see Table 1). They decided independently which sequences of performance measurement practices that are
of the interviewees’ descriptions of the consequences of the currently used in academia, we begin by illustrating
current performance measurement practices fulfill one or examples of positive behavioral consequences and con-
both aspects of gaming performance measurement as trasting them with negative behavioral consequences.
defined above. Intercoder agreement after the first coding Table 3 includes a list of sample codings of Step 1 (be-
was already very high (a = .89). After the two first authors havioral consequences), which we will use in the para-
examined the differences and discussed the code descrip- graphs below to describe the different positive and negative
tions with the research assistants, the intercoder agreement behavioral consequences.
increased to a = .94 after a second round of coding.
Positive Behavioral Consequences
Step 3: Deviant Workplace Behavior
With regard to category (1), increase in quality (k = 17),
The codes for deviant workplace behavior were deduc- most interviewees indeed addressed an increase in research
tively derived from Robinson and Bennett’s (1995) concept quality (14 of 17 codings of ‘increase in quality’ were
of organizational deviance, which includes (1) voluntary related to research). For example, they described incen-
behavior and (2) violation of significant organizational tivizing publication in internationally highly ranked jour-
norms that threatens the well-being of the organization. nals as a positive influence because it enhances the
During this process, we specifically reanalyzed the 74 motivation to improve the quality of their research papers
codings of gaming performance measurement we had (see Table 3). Regarding category (2), higher productivity
identified in Step 2. Intercoder agreement was very high (k = 22), most answers (12 of 22 codings) referred to a
(a = .93 for the code ‘voluntary’ and a = .95 for the code better dissemination of research results, emphasizing that
‘violates norms and threatens the organization’). research results are published and thus distributed as a
After the final rounds of coding were conducted, the first consequence of the requirements of performance mea-
two authors of this article examined the remaining differ- surement. The statement of a master’s student who was
ences and chose the codings that fit better. This resulted in involved in research exemplifies category (3), motivation
a final set of codings that formed the basis for an in-depth (k = 17). He mentioned that the positive effect of the time
qualitative analysis and descriptive analyses of coding pressure that was generated by performance measurement
frequencies. was an incentive to publish results, but he also implied that
this time pressure could have negative behavioral conse-
quences in the form of a decrease in the quality of scientific
Results knowledge (see Table 3). Similarly, other interviewees
described the international orientation toward top-tier
The results were analyzed in a stepwise procedure (see journals as a positive effect because it sets high standards.
method section): The described positive reactions refer to an improvement in
quality as a positive behavioral reaction to the provided
Step 1: Behavioral Consequences incentives. However, many interviewees described positive
and negative reactions as mutually occurring.
Overall, the participants reported 56 positive behavioral Even more interviewees described particularly negative
consequences and 181 negative behavioral consequences of reactions to incentives that resulted from the performance
current performance measurement practices.4 The positive measurement practices:
behavioral consequences can be clustered in three cate-
gories, namely (1) increase in quality (k = 17), (2) higher Negative Behavioral Consequences
productivity (k = 22), and (3) motivation (k = 17). Simi-
larly, the negative behavioral consequences can be clus- With regard to category (1), decrease in quality (k = 113),
tered in three categories and represent behaviors such as (1) most statements of behavioral consequences referred to a
decrease in quality (k = 113), (2) prioritization (i.e., decrease in research quality (101 of 113 codings). We
grouped the behavioral consequences related to a decrease
4 in research quality in four groups, described below:
The results are simultaneously reported for all five groups of
interviewees. There were no meaningful differences in the answers A decrease in research quality was often connected to
the participants provided. questionable publishing practices (k = 46). So-called
123
Table 3 Sample codings of ‘behavioral consequences’ (Step 1)

Code Sample coding
Positive behavioral consequences

1. Increase in quality Among other things, it led the German research community to give themselves a motivational kick, which
led to more rigor in research, better methodological knowledge, to more international comparability. This
means it really led to an enhancement of research quality on average. […] This is also not surprising, when
you look at it from the point of view of controlling […]. (professor)
2. Higher productivity It has something really positive, because it makes sense to not only work by yourself. […] I think, it truly has
something positive, to make people publish and maybe also kind of force them a little bit to publish. […]
(junior scientist)
3. Motivation When my performance as a researcher is measured by how many research papers I published in a particular
period of time, in particular research journals, then I can concentrate on conducting and publishing a
certain amount of studies and I would not wait endlessly to do so. So, it [talking about performance
measurement] is virtually an inducement to be very productive. But it is possible that the knowledge that
can be retrieved from each single publication does not contribute significantly to the advancement of
scientific knowledge, because the researchers do not invest a sufficient amount of time. (student)
Negative behavioral consequences
1. Decrease in quality
1.1. Decrease in research quality
1.1.1. Questionable Considering research […] due to the criteria that are in use at the moment, there is a clear trend toward short-
publishing practices term projects that quickly lead to publications. Further, there is a trend toward a publication strategy that is
as compartmentalized as possible. Whereas long-term projects are being rather avoided due to the existing
incentives. (professor)
1.1.2. Selective choice of My impression while reading many papers is that they are published to be published and not in order to
research topics address a relevant, a really relevant problem. As a result, I would reconsider whether rankings and the
quantity of publications really are the only suitable metric. [Explains that research fellows already have to
publish very early and quickly] […] And with this, they are being forced to publish something that is easily
publishable at the moment. (junior scientist)
1.1.3.Use of questionable What happens again and again, particularly with experimental research, and whenever I come across this I
methods am always asking myself how this is even possible, is that publications are retracted, because the data has
been plagiarized. And I am just left thinking: How is this possible? This happened in a renowned journal.
(university manager)
1.1.4. Research quality More and more unqualified people are brought in, especially with publications and acquisition, which is
decreases shuffled off onto students or doctoral candidates, and professors do not conduct research themselves
anymore. Accordingly, the quality goes down. (junior scientist)
1.2. Decrease in teaching A frequently used criterion of pay-for-performance, for example, is the number of doctoral candidates who
quality are being supervised. And some hazard the consequences that it takes some doctoral candidates an
extremely long time to finish their Ph.D. and that many also break off. (university management)
2. Prioritization A very straightforward incentive, for example, is not to advise master theses or only a few… because you
won’t get any extra points for that and nobody appreciates it. So I will refrain from that and rather put my
time into a paper that I can get published. (professor)
3. Demotivation and pressure Well, then there is of course a high amount of pressure for everybody who is included in the system.
Particularly for the professors and I think it has negative implications when they have too much pressure.
Because we should not forget that we are talking about knowledge generation and you cannot do this under
time pressure. (junior scientist)
slicing (i.e., artificially slicing a research project in order to the use of questionable methods (k = 12). The intervie-
generate several publications instead of only one) was wees described a range of different practices that fall under
described by several interviewees (see Table 3) as lower- the category ‘use of questionable methods,’ of which six
ing the quality of research. Another prominent example of referred to the violation of methodological norms and
a decrease in research quality is the selective choice of standards, that is, a particularly severe threat against clean
research topics (k = 34), that is, selecting only popular scientific methods (see Table 3). A few statements that
topics for research projects without considering their rele- were related to a decrease in research quality were coded as
vance for the advancement of knowledge. The third group ‘research quality decreases’ (k = 9) because they did not
of behaviors that lead to a decrease in research quality is fit into one of the three categories above.
123
L. Graf et al.
Some interviewees also reported a decrease in teaching We went through the interviewees’ responses related to
quality (12 of 113 codings of ‘decrease in quality’). They negative behavioral consequences and derived two differ-
provided different reasons for this consequence. A problem ent aspects of scientists’ behavior that describe the phe-
that the interviewees described several times was the nomenon of gaming performance measurement: (1)
supervision of too many Ph.D. students (see Table 3). focusing on attaining specific performance goals while
Answers that fell into category (2), prioritization actually producing lower-quality work and (2) putting
(k = 55), emphasized that high pressure (e.g., pressure to more effort into tasks that are defined as performance goals
publish or to acquire research grants) leads scientists to at the expense of other tasks that are not defined as per-
focus on some tasks at the expense of others. Many par- formance goals but that nevertheless are important for the
ticipants reported that scientists neglect their teaching university’s success. To confirm our initial impression, we
duties to concentrate on research and publishing, whereas reanalyzed the same segments of the interviews that we had
others noted that they put less effort into research because also analyzed during Step 1 of the coding process (i.e., the
they focus on writing grant proposals instead. For category identification of positive and negative consequences; see
(3), demotivation and pressure (k = 13), most examples Table 1). The purpose of Step 2 was to identify all
described negative effects on scientists’ performance due descriptions of the consequences of the current perfor-
to the pressure from performance measurement, for mance measurement practices that classify as gaming
example, because research is about knowledge generation performance measurement. Altogether, ‘gaming perfor-
and ‘‘you cannot do this under time pressure’’ (junior sci- mance measurement’ was coded 74 times across all 43
entist; see Table 3). interviews. Table 4 includes sample codings of gaming
The results of the first coding step suggest that current performance measurement that exemplify the two different
performance measurement practices have both positive and aspects we identified.
negative effects on scientists’ behavior. These practices
may increase the motivation to perform, leading to higher Step 3: Deviant Workplace Behavior
productivity. A few participants also reported an increase
in the quality of research and teaching. Conversely, most During the analysis of the 74 codings of gaming perfor-
participants reported a negative effect on the quality of mance measurement, it became apparent that employees
research and teaching, the selective prioritization of tasks, who engage in gaming performance measurement may
and decreased motivation. cause serious damage to universities. The behaviors
describe situations in which scientists adapt their behavior
Step 2: Gaming Performance Measurement to incentives that are favored by performance measurement
practices to meet the demands. These observations led to
When closely analyzing the negative behavioral conse- the idea that the behaviors that are reactions to the incen-
quences we had identified, we were intrigued by the tives provided by performance measurement practices may
impression that some scientists seem to engage in gaming in fact be a serious threat to the university and its goals.
performance measurement. Some participants openly dis- Consequently, we hypothesized that gaming perfor-
cussed how scientists strategically adapt their behavior to mance measurement is similar to deviant workplace
the performance measurement criteria: behavior. To confirm our assumption that gaming perfor-
mance measurement is a form of deviant workplace
The moment you create an incentive—and that’s
behavior, we conducted a third coding step (Step 3). This
what it is—it leads smart people to adapt to it. […]
additional step enabled us to decide, for each coding of
you adopt useless criteria for your own part, just for
gaming performance measurement, whether it also fulfilled
the sake of one’s own success, no matter whether it
the definition of deviant workplace behavior. For Step 3 of
really makes sense or not (policy maker)
the coding process, we used the definition of organizational
[A] disadvantage [of the current performance mea-
workplace deviance proposed by Robinson and Bennett
surement practices] is that it leads scientists to opti-
(1995), that is, deviant workplace behavior that is targeted
mize everything, like in the automobile industry,
at the organization. Specifically, we coded whether the
where cars are optimized to fulfill the EU [European
examples of gaming performance measurement complied
Union] test cycles. (junior scientist)
with the two defining characteristics of organizational
Apparently, scientists are aware that performance cri- workplace deviance: (1) voluntary behavior and (2) a
teria do not reflect their true performance and that opti- violation of significant organizational norms that threatens
mizing their performance in terms of the criteria may the well-being of the organization (see Table 5 for a list of
contradict the goals of universities. sample codings of Step 3).
123
Table 4 Sample codings of ‘gaming performance measurement’ (Step 2)

Code Sample coding
Focusing on attaining specific performance goals 1. You find something interesting and you don’t put it in one paper but water it down
while actually producing lower-quality work somehow; you stretch it to like four (university manager)
2. I think it is disadvantageous that scientists produce plenty of papers and pursue
topics that may make it easy to publish but that are rarely relevant […] There are
also a lot of papers that are really not so great or important. More quality instead of
quantity would be desirable, but that is not what is rewarded nowadays. It is all
about counting how many papers you produce and how much money you acquire.
(junior scientist)
3. What results [from the current performance measurement practices] is that
scientists try to generate data as quickly as possible so that they can get a paper out
of it and publish it […] And this also leads to worse experimental designs and
statistical analyses. (university management)
4. Due to the pressure to publish, scientists publish differently […] the results in
their pure form are high quality, but the process of producing the results has some
severe undocumented flaws (student)
5. A frequently used criterion of pay-for-performance, for example, is the number of
doctoral candidates who are being supervised. And some hazard the consequences
that it takes some doctoral candidates an extremely long time to finish their Ph.D.
and that many also break off. (university management)
6. Researchers publish everything, no matter how the quality is. […] Researchers
simply publish very much and this leads to a decrease in quality, and sometimes it
also makes researchers not adhere to certain ethical standards. (university
management)
7. I think there’s something to be said for making people publish and maybe also
forcing them to do so a little bit. But on the other hand, it also results in people
doing only this anymore and me reading some publications and thinking: ok, 80%
of research is basically irrelevant because it is only done to be done. (junior
scientist)
8. Well, I think it is a disadvantage [of the current performance measurement] that
some projects are conducted just to collect data that can then be published.
Regardless of whether this advances our knowledge in a scientific discipline or
field. And I think this is not expedient anymore. (junior scientist)
Putting more effort into tasks that are defined as 9. If I am being evaluated solely on the basis of publications, I will try to cut down
performance goals at the expense of other tasks all other tasks, for example, teaching and administrative duties, as much as
that are not defined as performance goals but that possible, which might negatively impact my students. (junior scientist)
nevertheless are important for the university’s 10. Teaching is extremely underweighted. […] And I think it does not help to
success improve the education at universities if you set such a focus on research (junior
scientist)
11. As a result [of the current performance measurement practices], no scientist will
agree to take the job of academic dean, for example, because it does not benefit
your performance in any way (professor)
12. I think it is a disadvantage [of the current performance measurement practices]
[…] that many researchers subordinate teaching because it does not contribute to a
positive performance evaluation (junior scientist)
13. The criterion of third-party funding is quite difficult for many because it is so
easily measureable and difficult to manipulate. And, of course, as a consequence,
you want to fulfill it by any means and maybe put less time into something else.
(professor)
Eighty-two percent of the 74 codings of gaming per- as a threat to the organization, for example, a bad reputa-
formance measurement (k = 61) were coded as ‘a violation following a scandal of scientific misconduct by one of
tion of significant organizational norms that threatens the the university’s employees.
well-being of the organization.’ In particular, a threat to the Numerous participants explained that disregarding
university was defined as deterioration of a university’s important tasks and a decrease in the quality of teaching
performance with regard to its main tasks: research and and research threaten the organizational performance of
teaching. Any type of damage to the university was coded universities. An extreme example that was described by a
123
L. Graf et al.
Table 5 Sample codings of ‘deviant workplace behavior’ (Step 3)

Code Sample coding
Violation of significant organizational norms that 1. Usually, the professors are smarter than those who come up with the [performance]
threatens the well-being of the organization criteria. They know exactly how to cheat. I will give you an example. The moment we
decided to use the number of graded exams as an indicator of teaching load, a colleague
instantly says: ‘So the failure rate does not apply anymore? Because then I will make all
students fail so I will have twice as many exams to grade. (professor)
2. Well, I think it is a disadvantage [of the current performance measurement] that some
projects are conducted just to collect data that can then be published. Regardless of
whether this advances our knowledge in a scientific discipline or field. And I think this is
not expedient anymore. (junior scientist)
3. I think this all [the current performance measurement] has a disadvantage from the
viewpoint of science in general. Researchers publish everything, no matter how the
quality is. Recently, there are critical newspaper articles about this, too. […] they are very
critical of the trend to not look into the question ‘is this really worthy of publication?’
[…] Researchers simply publish very much and this leads to a decrease in quality, and
sometimes it also makes researchers not adhere to certain ethical standards. So, scientific
reality does not work anymore. (university manager)
Voluntary behavior 4. A very straightforward incentive, for example, is not to advise master theses or only a
few… because you won’t get any extra points for that and nobody appreciates it. So I will
refrain from that and rather put my time into a paper that I can get published. (professor)
5. Considering the objectives of universities, teaching is as important as research. However,
as long as this is not implemented in performance measurement, it is not very rational to
put a lot of effort into teaching. You would have to be highly intrinsically motivated to do
good teaching because […] from the outside, you are rewarded for research. (junior
scientist)
6. When researchers feel pressured to acquire as much grant money as possible, it
influences the choice of topics. When they rather apply for a well-endowed research
project that somehow relates to the their interests and not exactly the topic they see as an
important research gap, this is how the criteria ‘research grants’ influences the topics and
even the results of research. […] It is not without reason that scientists themselves show
concern that there will soon be only mainstream research. (policy maker)
7. I would say that about one half [of scientists] is totally independent from any kind of
evaluations. You couldn’t even tempt them with bonuses, they just do what they think is
important and what is right. […] They are based within their peer community and they do
whatever is valued there. […] And there is one part who instantly reacts to the smallest
incentivizing with ‘how can I better present myself?’ And I personally think this is very
negative. (professor)
professor demonstrated that some scientists would go as far organizational norms to strategically adapt one’s work
as purposefully harming the teaching quality at their uni- behavior to the performance measurement practices.
versity in order to improve their own performance (Coding Although performance measurement encourages scien-
1 in Table 5). With regard to research, a common example tists to engage in gaming performance measurement and
of gaming performance measurement was publishing more puts them under pressure to perform, participants spoke of
papers of a lower quality. The participants indicated that gaming performance measurement as a voluntary behavior
this behavior leads to a decrease in knowledge production (99% of the codings, k = 73). It became clear that scien-
and occasionally triggers unethical behavior and scientific tists can choose to adapt their behavior to the performance
misconduct (Codings 2 and 3 in Table 5). criteria but often decide to behave in a strategic manner
Scientists who engage in gaming performance mea- because it is incentivized. For example, several participants
surement contribute to a decrease in the overall perfor- stressed that prioritizing research at the expense of teaching
mance of their university, whether in research or in duties is a rational choice for scientists because they are not
teaching. Thus, gaming performance measurement runs rewarded for putting extra effort into teaching (Codings 4
counter to the mission of universities to perform both and 5 in Table 5).
research and teaching with a high level of quality. Con- Although the participants made clear that it is the per-
sequently, this behavior represents a violation of formance measurement practices that induce the gaming of
performance measurement, it remains the scientist’s
123
decision of whether to game the performance criteria or work and (2) putting more effort into tasks that are defined
not. Deciding not to do so, however, is perceived to be very as performance goals at the expense of other tasks that are
difficult and requires intrinsic motivation. not defined as performance goals but that nevertheless are
The trend toward mainstream research is yet another important for the university’s success.
example that emphasizes that universities do not directly Reanalyses of the behaviors that the interviewees had
influence scientists to change their research topics in a reported showed that gaming performance measurement
strategic manner. Rather, scientists decide to study main- exhibits the same characteristics as deviant workplace
stream topics because they try to fulfill the high expectations behavior (cf. Robinson and Bennett 1995). First, gaming
of their university management or department (Coding 6 in performance measurement is a voluntary behavior. The
Table 5). The voluntary nature of gaming performance current performance measurement practices in academia
measurement was also evident when the participants set strong incentives for scientists to strategically adapt
emphasized that not all scientists adapt their behavior to the their behavior to improve their performance outcomes at
performance measurement practices (Coding 7 in Table 5). the expense of work quality and job tasks that are not
In summary, participants reported a wide range of measured. Nevertheless, scientists choose whether to
behavioral consequences that result from performance engage in gaming performance measurement. Second,
measurement. Clearly, scientists are perceived to engage in gaming performance measurement is a violation of orga-
various forms of gaming performance measurement, that nizational norms that causes harm to universities. Gaming
is, they strategically adapt their behavior to performance performance measurement threatens the well-being of
criteria and optimize their outcomes accordingly. Further- universities because scientists who focus on attaining per-
more, gaming performance measurement can be classified formance goals while reducing the quality of their work
as both a voluntary behavior of scientists and a violation of and who focus on tasks that are measured while neglecting
organizational norms that is harmful to universities. other tasks contribute to a decrease in their universities’
organizational performance in research and teaching.
Characterizing gaming performance measurement as
Discussion deviant workplace behavior has several implications for
both future research on gaming performance measurement
This study identified how scientists adapt their behavior to and current frameworks of deviant workplace behavior.
current performance measurement practices in academia. First, connecting the literature on goal setting and gaming
Qualitative interviews with different stakeholders of the performance measurement with the literature on deviant
higher education system revealed a high prevalence of workplace behavior will allow for a much more detailed
negative behavioral consequences as a reaction to perfor- explanation of gaming performance measurement. Second,
mance measurement. Most of all, interviewees reported a we propose that gaming performance measurement is a
decrease in research quality due to questionable publishing type of behavior that is not included in current frameworks
practices, a selective choice of research topics, and ques- and typologies of deviant workplace behavior. In the fol-
tionable research methods. Current performance measure- lowing, we will elaborate on both of these implications.
ment practices also lead to a decrease in teaching quality,
but these behaviors were not reported as often as behaviors Theoretical Implications
that lead to a decrease in research quality. Another negative
behavioral consequence of performance measurement is Psychological Processes Underlying Gaming Performance
prioritizing research at the expense of teaching and grant Measurement
acquisition at the expense of research. In addition to these
negative behavioral consequences, some interviewees Research on goal setting and deviant workplace behavior
reported positive effects of performance measurement, provides valuable input to understand the underlying psy-
including higher productivity and motivation as well as an chological processes of gaming performance measurement.
increase in research and teaching quality. Altogether, In the following, we will first discuss how our findings
however, the negative behavioral consequences clearly relate to previous research on deviant workplace behavior
outweighed the positive ones. and how explanations from this stream of literature can
During the analysis, we realized that many of the enrich future research on goal setting and gaming perfor-
reported behavioral reactions resemble the ‘gaming per- mance measurement. Afterwards, we will draw from the
formance measurement’ phenomenon. Two different literature on goal setting and unethical behavior to explain
aspects of gaming performance measurement emerged in greater depth the cognitive processes that may underlie
from the interview data: (1) focusing on attaining specific gaming performance measurement: moral disengagement
performance goals while actually producing lower-quality and rationalization.
123
L. Graf et al.
Interaction of Situational Variables and Individual Dif- by performance goals may be more likely to react to per-
ferences In general, deviant workplace behavior is formance measurement practices with gaming behavior
assumed to result from an interaction of situational vari- than other employees. Taken together, research on gaming
ables (e.g., competitive environment) and individual dif- performance measurement could benefit from examining in
ferences (e.g., integrity; see, e.g., Colbert et al. 2004; greater depth the interaction of causes that lie within the
Martinko et al. 2002; Spector and Fox 2002; Vardi and organization (i.e., performance measurement practices) and
Wiener 1996). In a similar vein, authors have claimed that individual differences between employees (e.g., perfor-
deviant workplace behavior is an adaptation to the working mance goal orientations).
environment (Dalal 2005; Raelin 1994). Previous research
has identified a multitude of antecedents—both individual Organizational Goals Our study adds to the growing
and organizational—that can induce deviant workplace literature on the influence of an organization’s goals and
behavior (see, e.g., Bolin and Heatherly 2001; Lasson and policies on deviant workplace behavior (e.g., Barsky 2008;
Bass 1997; Martinko et al. 2002; Peterson 2002; Thoms Pierce and Aguinis 2015; Umphress and Bingham 2011;
et al. 2001). In line with this thinking, our findings suggest Vardi and Wiener 1996). For example, Pierce and Aguinis
that gaming performance measurement results from an (2015) argue that organizations may trigger employee
interaction of situational factors and individual differences. behavior that does not truly advance organizational goals.
Stakeholders of the higher education system have explicitly In particular, the authors propose the concept of detri-
described gaming performance measurement as a conse- mental citizenship behavior, which refers to employee
quence of the current performance measurement practices in behavior that is intended to advance an organization’s
academia. Thus, performance measurement practices may goals but actually causes harm to the organization or one of
be one of the situational variables that encourage employees its stakeholder groups (e.g., environmentally harmful
to engage in gaming performance measurement. To date, behavior to save costs for the organization). Similarly,
only a few authors have discussed performance measure- gaming performance measurement is influenced by orga-
ment as a potential cause of deviant workplace behavior. nizational goals. In contrast to detrimental citizenship
Litzky et al. (2006), for example, claim that compensation behavior, employees who engage in gaming performance
and reward structures may motivate good employees to measurement primarily intend to enhance their own per-
engage in deviant workplace behavior, such as manipulating formance and not the performance of the organization.
sales data to attain performance goals. However, the authors More research is needed to examine the question of how
limit their discussion to commission and gratuity-based organizational goals and policies, such as performance
reward systems and do not consider more-complex forms of measurement practices, affect employee behavior and
performance measurement systems that are less based on under what circumstances employees engage in which type
pay-for-performance. Our findings suggest that performance of deviant workplace behavior (gaming performance
measurement per se, without financial rewards, also has the measurement versus detrimental citizenship behavior).
potential to cause deviant workplace behavior. Accordingly,
future studies on the organizational antecedents of deviant Perception of Disequilibria Martinko et al. (2002) pro-
workplace behavior should include an organization’s per- posed that deviant workplace behavior occurs because
formance measurement system. employees perceive some kind of disequilibrium in the
Although our interviewees clearly linked gaming per- workplace. For example, scientists who recognize a mis-
formance measurement to current performance measure- match between their job performance (i.e., what is needed
ment practices in academia, individual differences are to perform well and what they achieve) and the perfor-
likely to play an important role as well. Many of our mance criteria that are used to evaluate their performance
interviewees noted that some scientists engage in gaming may perceive this situation as unfair. The perception of this
performance measurement and others do not, although they disequilibrium (i.e., feeling of unfairness due to the mis-
are subject to the same performance measurement prac- match) can induce deviant workplace behavior such as
tices. This finding underlines that deviant workplace gaming performance measurement because scientists adapt
behavior is considered voluntary employee behavior inso- their behavior to the performance measurement criteria.
far as employees are motivated to engage in deviant This argument aligns well with Aguinis (2013) and Barsky
behavior or to refrain from doing so (Griffin and Lopez (2008), who stress the negative consequences that can
2005; Kaplan 1975; Robinson and Bennett 1995; Spector result from a mismatch between employee performance
and Fox 2002). Louw et al. (2016) found that employees and the criteria that are used to measure employee per-
whose behavior is motivated by performance goals engage formance. A mismatch can also result because scientists
in a higher level of deviant workplace behavior than other working in universities are faced with multiple contradic-
employees do. Accordingly, employees who are motivated tory expectations by their universities. On the one hand,
123
most universities have published ‘codes of conduct’ or rationalization of unethical behavior. Employees whose
‘principles of good scientific practice.’ These explicit performance is measured through strict performance goals
norms state a university’s expectations of what constitutes might easily find a way to morally disengage from uneth-
good research and teaching. On the other hand, perfor- ical behavior because high performance goals serve as a
mance criteria also strongly signal what universities regard way to rationalize such behavior. Mazar et al. (2008) found
as good performance. Accordingly, scientists perceive a in a series of six experiments support for the assumption
disequilibrium (i.e., mismatch) between the expectation of that people act sufficiently dishonestly to profit from their
both adhering to the university’s codes of conduct and behavior. However, they also act honestly enough to justify
fulfilling the performance criteria. The notion of perceived their behavior and continue to maintain their self-concept
disequilibria or mismatches may help describe how scien- of being honest. This explanation might be applicable to
tists perceive the current performance measurement prac- scientists’ adaptation to performance measurement prac-
tices in academia and why this perception can lead to tices: They adapt their behavior enough (e.g., cooking of
gaming performance measurement. We propose that it is data) to be able to publish their research in a top-tier
not solely organizational goals per se that can be ante- journal. Nevertheless, they do not feel dishonest by reason
cedents of deviant workplace behavior, as other authors that they do not engage in more serious scientific
have already claimed (Barsky 2008; Pierce and Aguinis misconduct.
2015; Umphress and Bingham 2011; Vardi and Wiener Our short review of previous research on deviant
1996). Rather, the existence of contradictory goals or workplace behavior and goal setting demonstrates that
expectations can also result in the perception of disequi- there are already several possible explanations for gaming
libria and cause deviant workplace behavior, such as performance measurement. At the same time, it highlights
gaming performance measurement. the need for a closer integration of these two streams of
Although the literature on deviant workplace behavior literature. In addition, more empirical studies on gaming
offers fruitful directions for future research that will performance measurement are needed to shed light on the
enhance our understanding of gaming performance mea- psychological processes that explain why employees
surement, such as the interaction of situational variables sometimes react to performance measurement practices
and individual differences, a closer look into the interme- with gaming behavior.
diate psychological processes is warranted. There is a need
to explain in greater depth the question of why employees Integrating Gaming Performance Measurement
decide to engage in gaming performance measurement. To into Frameworks of Deviant Workplace Behavior
that end, we will now draw from the literature on goal
setting, which provides several explanations for gaming Our finding that gaming performance measurement con-
performance measurement. forms to the definition of deviant workplace behavior also
has implications for current frameworks of deviant work-
Moral Disengagement Barsky (2008) introduces moral place behavior. In the following, we briefly review several
disengagement as the mechanism that links the attributes of typologies that have been most prevalent in the literature
goals and the relative practices with unethical behavior. on workplace deviance (cf., Berry et al. 2007; Griffin and
The author argues that performance goals, such as pub- Lopez 2005; Gruys and Sackett 2003; Klotz and Buckley
lishing a certain number of top-tier publications, might 2013; Marcus et al. 2002; Robinson and Bennett 1995;
direct the cognitive resources of the person from critical Spector 2006; Vardi and Wiener 1996; Warren 2003), and
ethical thinking to the achievement of the goal. Conse- we demonstrate that they do not take into account behav-
quently, a scientist does not recognize questionable pub- iors of the category gaming performance measurement. To
lishing practices as unethical (Barsky 2008). This effect has this end, we refer to those sub-dimensions of the frame-
been shown in experiments by Niven and Healy (2016), works that are most similar to gaming performance mea-
who found that although only participants who were high surement in that they are directly linked to performing core
in moral disengagement behaved unethically in a normal job tasks.5 Table 6 gives an overview of the sub-dimen-
task (i.e., anagram task), all individuals behaved signifi- sions of the frameworks, including sample behaviors and
cantly more unethically in an ethical dilemma task with items.
specific goals. Similarly, competition has been found to
have a dysfunctional effect on performance in goal setting
5
conditions (Campbell and Furrer 1995). From our review of frameworks we specifically exclude deviant
workplace behavior (1) that causes harm to the organization but that is
unrelated to an adaptation of core job tasks (e.g., stealing company
Rationalization Barsky (2008) describes a second mech- goods) or (2) that causes harm to other members of the organization
anism related to moral disengagement, namely the (e.g., sexual harassment).
123
L. Graf et al.
Gaming performance measurement seems to most clo- be incorporated into current frameworks of deviant work-
sely fit the category ‘production deviance’ in the sense that place behavior, which could be accomplished by extending
‘‘production deviance refers to behaviors that directly the definition of existing sub-categories such as production
interfere with work being performed in the organization’’ deviance (Robinson and Bennett 1995) or poor quality
(Everton et al. 2007, p. 119). This type of deviant work- work (Gruys and Sackett 2003). In addition, items referring
place behavior was first introduced by Hollinger and Clark to gaming performance measurement need to be added to
(1982) and was incorporated into several other frameworks measures of deviant workplace behavior.
of deviant workplace behavior (cf. Bennett and Robinson
2000; Robinson and Bennett 1995; Spector et al. 2006). Limitations and Future Research
Whereas the definition of production deviance and similar
categories (e.g., ‘poor quality work’ in the framework of Our study provides initial evidence that gaming perfor-
Gruys and Sackett 2003) suggests that it also includes mance measurement should be treated as a deviant act that
gaming performance measurement, the sample behaviors causes damage to an organization. This result raises
and items of the respective subscales demonstrate that the questions that provide fruitful avenues for future research,
authors of the frameworks did not consider this type of which will deepen our understanding of gaming perfor-
employee behavior. For example, Hollinger and Clark mance measurement as a form of deviant workplace
(1982) define production deviance as ‘‘behaviors which behavior.
violate the formally proscribed norms delineating the First, gaming performance measurement should be
quality and quantity of work to be accomplished’’ (p. 98). studied in different work contexts and types of organiza-
The sample behaviors the authors use to describe produc- tions to enhance the generalizability of our findings. The
tion deviance (‘‘tardiness, sloppy or slow workmanship, or results of this study may not be generalizable to less-
the use of alcohol or drugs while at work,’’ Hollinger and knowledge-based contexts because strategically adapting
Clark 1982, p. 98) show that they did not consider one’s work behavior to performance criteria could be
employees who adapt the way they perform their job task particularly prevalent in the case of non-routine tasks that
to try to fulfill the performance criteria and thereby lower are difficult to analyze and measure (cf. Daft et al. 2010).
the quality of their work performance. Spector et al. (2006) Rather, the findings of this study are most likely to be
define production deviance as ‘‘the purposeful failure to applicable to scientists in other types of knowledge-inten-
perform job tasks effectively the way they are supposed to sive organizations, because tasks performed in the uni-
be performed’’ (p. 449). However, the items of the pro- versity setting are similar to tasks in other knowledge-
duction deviance subscale refer to ‘doing work incor- intensive organizations (Chung and Jackson 2013; Keller
rectly,’ ‘working slowly when things needed to get done,’ 2012). The latter assumption is supported by the fact that
and ‘failing to follow instructions.’ Thus, the subscale does universities represent expert organizations (Pellert 1999)
not consider an employee’s intention underlying the and, thus, are similar to other knowledge-intensive firms
adaptation of job behavior (i.e., achieving higher work (Glauber et al. 2015). The assumption that our finding can
results in terms of measureable performance). Contrary to be generalized to knowledge-intensive contexts outside
most other frameworks of deviant workplace behavior, the academia should be investigated by future research. In
framework of Vardi and Wiener (1996) uses employees’ particular, future research on gaming performance mea-
intention to distinguish different forms of deviance. In this surement should analyze other knowledge-intensive con-
framework, gaming performance measurement is most texts such as research and development teams in private
similar to organizational misbehavior ‘Type S,’ which is organizations.
defined as ‘‘misbehaviors that are intended to benefit the Because our results are based on qualitative interviews,
self’’ (Vardi and Wiener 1996, p. 155) and which is tar- we cannot draw conclusions about the actual consequences
geted at ‘‘the work itself (e.g., distorting data)’’ (Vardi and or causal effects of performance measurement on scien-
Wiener 1996, p. 155). Interestingly, the authors use ‘career tists’ behavior. Similar to most previous studies, we rely on
considerations’ as an example for an intention that can reports of the perceived effects of performance measure-
underlie Type S behavior. However, this intention would ment on scientists’ behavior. Future research could use
also apply to behaviors that are not related to performing objective data for measuring both scientific performance
job tasks, such as gossiping about colleagues. (e.g., publication performance) and the prevalence of per-
The short review of current conceptualizations of task- formance measurement practices in order to draw causal
related deviant workplace behavior shows that gaming inferences about intended and unintended effects of per-
performance measurement has not been explicitly consid- formance measurement in academia, such as gaming per-
ered in the literature on deviant workplace behavior. Thus, formance measurement behavior of scientists.
we propose that gaming performance measurement should
123
Table 6 Sub-dimensions of current frameworks of deviant workplace behavior

Framework Sub-dimension and definition Sample behaviors (items)
Hollinger and Production deviance Long lunch or coffee breaks

Clark (1982) ‘‘behaviors which violate the formally proscribed Slow or sloppy work
norms delineating the quality and quantity of Influence of alcohol or drugs
work to be accomplished (e.g., tardiness,
Come late/leave early from work
sloppy or slow workmanship, or the use of
alcohol or drugs while at work)’’ (p. 98) Sick leave when not sick
Robinson and Production deviance Leaving early

Bennett (1995) ‘‘relatively minor but still organizationally Taking excessive breaks
harmful deviant acts’’ (p. 565) Intentionally working slow
Wasting resources
Bennett and Organizational deviance Taken property from work without permission
Robinson ‘‘deviant behaviors directly harmful to the Spent too much time fantasizing or daydreaming instead of working
(2000)a organization’’ (p. 349) Falsified a receipt to get reimbursed for more money than you spent on
business expenses
Taken an additional or longer break than is acceptable at your
workplace
Come in late to work without permission
Littered your work environment
Neglected to follow your boss’s instructions
Intentionally worked slower than you could have worked
Discussed confidential company information with an unauthorized
person
Used an illegal drug or consumed alcohol on the job
Put little effort into your work
Dragged out work in order to get overtime
Spector et al. Production deviance Purposely did your work incorrectly.

(2006) ‘‘the purposeful failure to perform job tasks Purposely worked slowly when things needed to get done.
effectively the way they are supposed to be Purposely failed to follow instructions.
performed’’ (p. 449)
Gruys and Sackett Poor quality work Intentionally perform your job below acceptable standards.
(2003) (no definition) Intentionally do work badly or incorrectly
Intentionally do slow or sloppy work.
Vardi and Wiener Organizational misbehavior Type S Distorting data (target: the work itself)
(1996) ‘‘misbehaviors that are intended to benefit the Stealing and selling manufacturing secrets (target: the organization’s
self’’ (p. 155) property, resources, symbols or regulations)
Harassing peers (target: other members)
a
In the original framework of Robinson and Bennett (1995), the authors distinguished severe cases of organizational deviance (property
deviance) from behaviors that result in less-serious consequences for an organization (production deviance). Later, the authors dropped the
severity dimension (Bennett and Robinson, 2000). The items reported for the framework of Bennett and Robinson (2000) are from the subscale
‘organizational deviance,’ which includes the behaviors of the former category ‘production deviance’
Despite the limitation that we cannot draw causal con- our interviewees about the advantages and disadvantages
clusions, our study adds to previous research on the of performance measurement in academia and possible
intended and unintended effects of performance measure- consequences for scientists’ behavior. In sum, they repor-
ment in academia. Our study complements previous find- ted more negative than positive behavioral consequences of
ings in that we interviewed different stakeholders of the current performance measurement practices in universities.
higher education system. Thus, we provide a more com- Other studies that examined the effects of performance
prehensive perspective on the perceived consequences of measurement in academia yielded mixed results. Ringelhan
performance measurement than previous studies. We asked et al. (2013), for example, found that extrinsic motivation
123
L. Graf et al.
is negatively related to scholarly performance, whereas universities have increasingly come under pressure to be at
Lewis (2014) concludes that there is no relationship the top of international rankings and to outperform other
between scientists’ attitudes toward performance mea- universities in competing for public funding (Auranen and
surement and their publication performance. Overman Nieminen 2010; Hicks 2012; Melo et al. 2008; Rabovsky
et al. (2016) found a positive relationship between the 2014; van Thiel and Leeuw 2002). There is a strong focus
specificity of performance contracts (i.e., the extent to on quantitative measures of research performance in the
which there are concrete and detailed agreements on per- evaluation of universities’ performance, foremost being the
formance targets regarding results and rewards) and the number of publications in top-tier journals and the amount
level of integrity (i.e., conforming to norms) that aca- of grant money a university acquires (Aguinis et al. 2014;
demics perceive at their workplace. Both studies, Lewis Aguinis et al. 2012; Harmon 2006; Ringelhan et al. 2015;
(2014) and Overman et al. (2016), focused on the positive Schmoch et al. 2010). From this perspective, scientists who
consequences of increasing the use of performance mea- adapt their behavior in such a way as to fulfill the perfor-
surement practices in academia. This viewpoint is not only mance criteria (e.g., by focusing on easily publishable
consistent with our finding that performance measurement research topics and neglecting teaching duties) appear to
practices have positive as well as negative effects on sci- contribute to their university’s overall performance instead
entists’ behavior, but it is also in line with the extensive of harming it. Notwithstanding this argument, we claim
research on the positive effects of goal setting on perfor- that other factors need to be taken into account in deter-
mance (e.g., Latham and Locke 1990, 1991; Locke and mining the harm that gaming performance measurement
Latham 2002, 2006). Furthermore, finding both positive can cause for an organization. If cases of scientific mis-
and negative effects of performance measurement practices conduct, such as falsifying results, are made public, they
stresses the point that performance measurement is a can lead to a severe loss of a university’s reputation (e.g.,
double-edged sword. the Staple fraud, cf. Stroebe et al. 2012). Thus, it is
More research is also needed to examine (1) the role of important to consider not only short-term consequences
different organizational norms that are related to gaming and immediate effects but also risks that may threaten an
performance measurement and (2) the negative as well as organization’s well-being in the long run. Furthermore, all
positive consequences this behavior can have for employ- stakeholders of the organization should be taken into
ees and their organizations. First, we propose that gaming account in deciding whether gaming performance mea-
performance measurement is a form of deviant workplace surement is harmful to an organization. For example, pri-
behavior because it is a significant violation of organiza- oritizing research tasks and putting less effort into teaching
tional norms that causes harm to the organization. It vio- threatens the teaching quality at a university and thus
lates the organizational norm to produce high-quality work affects students’ quality of education.
results and to fulfill all tasks that belong to an employee’s Therefore, this study contributes to the ongoing dis-
job. However, one could also argue that the performance cussion of which norms need to be violated by employee
measurement criteria an organization uses constitute a form behavior and who is being harmed in order for the activity
of organizational norm that employees are expected to to fall under the definition of deviant workplace behavior.
adhere to. In that sense, employees who engage in gaming Some authors refer only to damage to the organization as a
performance measurement do not violate but adhere to defining element of deviant workplace behavior (e.g.,
organizational norms and act the way they are told to, for Robinson and Bennett 1995). Classifying gaming perfor-
example, publish as many papers as possible. As discussed mance measurement as a form of deviant workplace
earlier, however, many universities have published official behavior reinforces the proposition of other authors to
‘codes of conduct’ or ‘principles of good scientific prac- broaden the concept of deviant workplace behavior by also
tice.’ Clearly, gaming performance measurement consti- considering damage to any of the organization’s stake-
tutes a violation of these official university norms holders, including society (e.g., Vardi and Wiener 1996).
regarding the quality of research and teaching.
Second, we conclude from our results that gaming per- Practical Implications for Universities’ Performance
formance measurement classifies as deviant workplace Measurement
behavior because employees who produce lower-quality
results and put less effort into tasks that are not measured The present study sheds light on the negative consequences
contribute to an overall loss in organizational performance. of new managerialism in academia. New managerialism
It must be taken into account, however, that an organiza- implies the general tendency to control scientists’ perfor-
tion’s choice of specific performance criteria is usually mance through performance measurement in Germany but
influenced by the question of how the performance of the also in other countries. However, even if—as the inter-
organization itself is measured. In the last few decades, viewees’ answers in this study show—a tight performance
123
measurement focusing on publications and third-party References

funding also has a positive impact on scientists’ perfor-
mance, the negative consequences should not only be kept Aguinis, H. (2013). Performance management (3rd ed.). Boston, MA:
Pearson.
in mind but also be taken into account in universities’
Aguinis, H., Shapiro, D. L., Antonacopoulou, E. P., & Cummings, T.
performance measurement practices. In particular, we add G. (2014). Scholarly impact: A pluralist conceptualization.
to growing concerns that the current performance mea- Academy of Management Learning and Education, 13(4),
surement practices do not always lead to the intended 623–639.
Aguinis, H., Suárez-González, I., Lannelongue, G., & Joo, H. (2012).
increase in scientists’ performance (e.g., Barsky 2008;
Scholarly impact revisited. Academy of Management Perspec-
Honig et al. 2017; Osterloh 2010). They rather tempt sci- tives, 26(2), 105–132.
entists to game the performance measurement practices by Auranen, O., & Nieminen, M. (2010). University research funding
reducing the quality of their work and neglecting tasks that and publication performance: An international comparison.
Research Policy, 39(6), 822–834.
are not measured (e.g., teaching). Scientists’ gaming per-
Babbage, C. (1830). Reflections on the decline of science in England,
formance measurement poses a threat to the university and and on some of its causes. London, U.K.: Fellowes.
their stakeholders. Understanding the underlying psycho- Barsky, A. (2008). Understanding the ethical cost of organizational
logical processes of gaming performance measurement can goal-setting: A review and theory development. Journal of
Business Ethics, 81(1), 63–81.
help university management to scrutinize their perfor-
Bennett, R. J., & Robinson, S. L. (2000). Development of a measure
mance measurement practices. Most of all, there is a need of workplace deviance. Journal of Applied Psychology, 85(3),
to take the multidimensionality of scholarly performance 349–360.
into account (Aguinis et al. 2014, 2012). Performance Berry, C. M., Ones, D. S., & Sackett, P. R. (2007). Interpersonal
deviance, organizational deviance, and their common correlates:
measurement practices should not focus solely on the
A review and meta-analysis. Journal of Applied Psychology,
number of publications in top-tier journals and the amount 92(2), 410–424.
of acquired funding. They should also consider, for Bolin, A., & Heatherly, L. (2001). Predictors of employee deviance:
example, the quality and creativity of research ideas or The relationship between bad attitudes and bad behavior.
Journal of Business and Psychology, 15(3), 405–418.
scientists’ activities of transferring their ideas to the sci-
Butler, N., Delaney, H., & Spoelstra, S. (2017). The gray zone:
entific community as well as to students and society. Questionable research practices in the business school. Academy
Otherwise, measuring scientists’ performance can induce of Management Learning and Education, 16(1), 94–109.
deviant workplace behavior and thereby impair universi- Butler, N., & Spoelstra, S. (2012). Your excellency. Organization,
19(6), 891–903.
ties’ true mission of generating and transmitting
Campbell, D. J., & Furrer, D. M. (1995). Goal setting and competition
knowledge. as determinants of task performance. Journal of Organizational
Behavior, 16(4), 377–389.
Acknowledgements The authors would like to express their sincere Christian, J. S., & Ellis, A. P. J. (2014). The crucial role of turnover
gratitude to the German Federal Ministry of Education and Research, intentions in transforming moral disengagement into deviant
to the participants of the study, and to Margit Osterloh, who provided behavior at work. Journal of Business Ethics, 119(2), 193–208.
very valuable feedback on the interview guideline. Additionally, the Chung, Y., & Jackson, S. E. (2013). The internal and external
authors thank the student assistants who were involved in the tran- networks of knowledge-intensive teams: The role of task
scription of the interviews and partly in the coding of the interviews, routineness. Journal of Management, 39(2), 442–468.
especially, Miriam Rosentritt, Carl Schade, and Falk Serbser. Further Clor-Proell, S. M., Kaplan, S. E., & Proell, C. A. (2015). The impact
thanks go to the two anonymous reviewers of the Journal of Business of budget goal difficulty and promotion availability on employee
Ethics and to the editor. Their constructive comments and sugges- fraud. Journal of Business Ethics, 131(4), 773–790.
tions, which enabled us to improve the manuscript, were extremely Colbert, A. E., Mount, M. K., Harter, J. K., Witt, L. A., & Barrick, M.
helpful and supportive. R. (2004). Interactive effects of personality and perceptions of
the work situation on workplace deviance. Journal of Applied
Funding This study was funded by the German Federal Ministry of Psychology, 89(4), 599–609.
Education and Research (Grant No. FKZ 01PY13012). Daft, R. L., Murphy, J., & Willmott, H. (2010). Organization theory
and design. Andover: South-Western Cengage Learning.
Compliance with Ethical Standards Dalal, R. S. (2005). A meta-analysis of the relationship between
organizational citizenship behavior and counterproductive work
Conflict of interest The authors declare that they have no conflict of behavior. Journal of Applied Psychology, 90(6), 1241–1255.
interest. De Bruijn, H. (2002). Performance measurement in the public sector:
Strategies to cope with the risks of performance measurement.
Ethical Approval All procedures performed in studies involving International Journal of Public Sector Management, 15(7),
human participants were in accordance with the ethical standards of 578–594.
the institutional and/or national research committee and with the 1964 Deem, R., & Brehony, K. J. (2005). Management as ideology: The
Declaration of Helsinki and its later amendments or comparable case of ‘new managerialism’ in higher education. Oxford Review
ethical standards. of Education, 31(2), 217–235.
Deem, R., Hillyard, S., & Reed, M. (2007). Knowledge, higher
Informed Consent Informed consent was obtained from all indi- education, and the new managerialism: The changing manage-
vidual participants included in the study. ment of UK universities. Oxford: Oxford University Press.
123
L. Graf et al.
Everton, W. J., Jolton, J. A., & Mastrangelo, P. M. (2007). Be nice Krippendorff, K. (2013). Content analysis: An introduction to its
and fair or else: Understanding reasons for employees’ deviant methodology (3rd ed.). Los Angeles, CA: Sage.
behaviors. Journal of Management Development, 26(2), Lasson, E. D., & Bass, A. R. (1997). Integrity testing and deviance:
117–131. Construct validity issues and the role of situational factors.
Fanelli, D. (2009). How many scientists fabricate and falsify Journal of Business and Psychology, 12(2), 121–146.
research? A systematic review and meta-analysis of survey data. Latham, G. P., & Locke, E. (1990). A theory of goal setting and task
PLoS ONE, 4(5), e5738. performance. Eaglewood Cliffs, NJ: Prentice Hall.
Freeman, R., Weinstein, E., Marincola, E., Rosenbaum, J., & Latham, G. P., & Locke, E. A. (1991). Self-regulation through goal
Solomon, F. (2001). Competition and careers in biosciences. setting. Organizational Behavior and Human Decision Pro-
Science, 294(5550), 2293–2294. cesses, 50(2), 212–247.
Ghysels, E. (2000). Some econometric recipes for high-frequency Latham, G. P., & Locke, E. A. (2007). New developments in and
data cooking. Journal of Business and Economic Statistics, directions for goal-setting research. European Psychologist,
18(2), 154–163. 12(4), 290–300.
Glauber, J., Wollersheim, J., Sandner, P., & Welpe, I. M. (2015). The Lewis, J. M. (2014). Research productivity and research system
patenting activity of German universities. Journal of Business attitudes. Public Money and Management, 34(6), 417–424.
Economics, 85(7), 719–757. Litzky, B. E., Eddleston, K., & Kidder, D. L. (2006). The good, the
Goebel, S., & Weißenberger, B. E. (2016). The dark side of tight bad, and the misguided: How managers inadvertently encourage
financial control: Causes and remedies of dysfunctional deviant behaviors. Academy of Management Perspectives, 20(1),
employee behaviors. Schmalenbach Business Review, 17(1), 91–103.
69–101. Locke, E. A., & Latham, G. P. (2002). Building a practically useful
Griffin, R. W., & Lopez, Y. P. (2005). ‘‘Bad behavior’’ in theory of goal setting and task motivation: A 35-year odyssey.
organizations: A review and typology for future research. American Psychologist, 57(9), 705–717.
Journal of Management, 31(6), 988–1005. Locke, E. A., & Latham, G. P. (2006). New directions in goal-setting
Gross, C. (2016). Scientific misconduct. Annual Review of Psychol- theory. Current Directions in Psychological Science, 15(5),
ogy, 67, 693–711. 265–268.
Gruys, M. L., & Sackett, P. R. (2003). Investigating the dimension- Louw, K. R., Dunlop, P. D., Yeo, G. B., & Griffin, M. A. (2016).
ality of counterproductive work behavior. International Journal Mastery approach and performance approach: The differential
of Selection and Assessment, 11(1), 30–42. prediction of organizational citizenship behavior and workplace
Guest, G., Bunce, A., & Johnson, L. (2006). How many interviews are deviance, beyond HEXACO personality. Motivation and Emo-
enough? An experiment with data saturation and variability. tion, 40(4), 566–576.
Field Methods, 18(1), 59–82. Marcus, B., Schuler, H., Quell, P., & Hümpfner, G. (2002).
Harmon, M. M. (2006). Business research and Chinese patriotic Measuring counterproductivity: Development and initial valida-
poetry: How competition for status distorts the priority between tion of a German self-report questionnaire. International Journal
research and teaching in U.S. business schools. Academy of of Selection and Assessment, 10(1–2), 18–35.
Management Learning and Education, 5(2), 234–243. Martinko, M. J., Gundlach, M. J., & Douglas, S. C. (2002). Toward an
Hayes, A. F., & Krippendorff, K. (2007). Answering the call for a integrative theory of counterproductive workplace behavior: A
standard reliability measure for coding data. Communication causal reasoning perspective. International Journal of Selection
Methods and Measures, 1(1), 77–89. and Assessment, 10(1–2), 36–50.
Hicks, D. (2012). Performance-based university research funding Martinson, B. C., Anderson, M. S., & De Vries, R. (2005). Scientists
systems. Research Policy, 41(2), 251–261. behaving badly. Nature, 435(7043), 737–738.
Hollinger, R., & Clark, J. (1982). Employee deviance: A response to Mazar, N., Amir, O., & Ariely, D. (2008). The dishonesty of honest
the perceived quality of the work experience. Work and people: A theory of self-concept maintenance. Journal of
Occupations, 9(1), 97–114. Marketing Research, 45(6), 633–644.
Honig, B., Lampel, J., Siegel, D., & Drnevich, P. (2017). Special Melo, A. I., Sarrico, C. S., & Radnor, Z. (2008). Research design for
section on ethics in management research: Norms, identity, and analysing the relationship between governance structures and
community in the 21st century. Academy of Management performance management systems in universities. Reflecting
Learning and Education, 16(1), 84–93. Education, 4(2), 68–81.
Hood, C. (1991). A public management for all seasons? Public Niven, K., & Healy, C. (2016). Susceptibility to the ‘dark side’ of
Administration, 69(1), 3–19. goal-setting: Does moral justification influence the effect of
Jaworski, B. J. (1988). Toward a theory of marketing control: goals on unethical behavior? Journal of Business Ethics, 137(1),
Environmental context, control types, and consequences. The 115–127.
Journal of Marketing, 52(3), 23–39. Ordónez, L. D., Schweitzer, M. E., Galinsky, A. D., & Bazerman, M.
Jensen, M. C. (2003). Paying people to lie: The truth about the H. (2009). On good scholarship, goal setting, and scholars gone
budgeting process. European Financial Management, 9(3), wild. Academy of Management Perspectives, 23(3), 82–87.
379–406. Osterloh, M. (2010). Governance by numbers. Does it really work in
Johnsen, Å. (2005). What does 25 years of experience tell us about research? Analyse and Kritik, 32(2), 267–283.
the state of performance measurement in public policy and Osterloh, M., & Frey, B. S. (2015). Ranking games. Evaluation
management? Public Money and Management, 25(1), 9–17. Review, 39(1), 102–129.
Kaplan, H. B. (1975). Self-attitudes and deviant behavior. Pacific Overman, S., Akkerman, A., & Torenvlied, R. (2016). Targets for
Palisades, CA: Goodyear. honesty: How performance indicators shape integrity in Dutch
Keller, R. T. (2012). Predicting the performance and innovativeness higher education. Public Administration, 94(4), 1140–1154.
of scientists and engineers. Journal of Applied Psychology, Pellert, A. (1999). Die Universität als Organisation: Die Kunst
97(1), 225–233. Experten zu managen. Vienna: Böhlau.
Klotz, A. C., & Buckley, M. R. (2013). A historical perspective of Peterson, D. K. (2002). Deviant workplace behavior and the
counterproductive work behavior targeting the organization. organization’s ethical climate. Journal of Business and Psychol-
Journal of Management History, 19(1), 114–132. ogy, 17(1), 47–61.
123
Pierce, J. R., & Aguinis, H. (2015). Detrimental citizenship Spector, P. E. (2006). Method variance in organizational research
behaviour: A multilevel framework of antecedents and conse- truth or urban legend? Organizational Research Methods, 9(2),
quences. Management and Organization Review, 11(1), 69–99. 221–232.
Polzer, J. T., & Neale, M. A. (1995). Constraints or catalysts? Spector, P. E., & Fox, S. (2002). An emotion-centered model of
Reexamining goal setting within the context of negotiation. voluntary work behavior: Some parallels between counterpro-
Human Performance, 8(1), 3–26. ductive work behavior and organizational citizenship behavior.
Rabovsky, T. M. (2014). Using data to manage for performance at Human Resource Management Review, 12(2), 269–292.
public universities. Public Administration Review, 74(2), Spector, P. E., Fox, S., Penney, L. M., Bruursema, K., Goh, A., &
260–272. Kessler, S. (2006). The dimensionality of counterproductivity:
Raelin, J. A. (1994). Three scales of professional deviance within Are all counterproductive behaviors created equal? Journal of
organizations. Journal of Organizational Behavior, 15(6), Vocational Behavior, 68(3), 446–460.
483–501. Steneck, N. H. (2003). The role of professional societies in promoting
Ringelhan, S., Wollersheim, J., & Welpe, I. M. (2015). I like, I cite? integrity in research. American Journal of Health Behavior,
Do Facebook likes predict the impact of scientific work? PLoS 27(1), S239–S247.
ONE, 10(8), e0134389. Steneck, N. H. (2006). Fostering integrity in research: Definitions,
Ringelhan, S., Wollersheim, J., Welpe, I. M., Fiedler, M., & Spörrle, current knowledge, and future directions. Science and Engineer-
M. (2013). Work motivation and job satisfaction as antecedents ing Ethics, 12(1), 53–74.
of research performance: Investigation of different mediation Stroebe, W., Postmes, T., & Spears, R. (2012). Scientific misconduct
models. In A. Dilger (Ed.), Performance Management im and the myth of self-correction in science. Perspectives on
Hochschulbereich (pp. 7–38). Wiesbaden: Springer. Psychological Science, 7(6), 670–688.
Robinson, S. L., & Bennett, R. J. (1995). A typology of deviant Thoms, P., Wolper, P., Scott, K. S., & Jones, D. (2001). The
workplace behaviors: A multidimensional scaling study. Acad- relationship between immediate turnover and employee theft in
emy of Management Journal, 38(2), 555–572. the restaurant industry. Journal of Business and Psychology,
Saini, A., & Krush, M. (2008). Anomie and the marketing function: 15(4), 561–577.
The role of control mechanisms. Journal of Business Ethics, Tijdink, J. K., Verbeke, R., & Smulders, Y. M. (2014). Publication
83(4), 845–862. pressure and scientific misconduct in medical scientists. Journal
Sandelowski, M. (1995). Sample size in qualitative research. of Empirical Research on Human Research Ethics, 9(5), 64–71.
Research in Nursing and Health, 18(2), 179–183. Umphress, E. E., & Bingham, J. B. (2011). When employees do bad
Schmoch, U., Schubert, T., Jansen, D., Heidler, R., & von Görtz, R. things for good reasons: Examining unethical pro-organizational
(2010). How to use indicators to measure scientific performance: behaviors. Organization Science, 22(3), 621–640.
A balanced approach. Research Evaluation, 19(1), 2–18. van Thiel, S., & Leeuw, F. L. (2002). The performance paradox in the
Schubert, T. (2009). Empirical observations on New Public Manage- public sector. Public Performance and Management Review,
ment to increase efficiency in public research—Boon or bane? 25(3), 267–281.
Research Policy, 38(8), 1225–1234. Vardi, Y., & Wiener, Y. (1996). Misbehavior in organizations: A
Smith, P. (1995). On the unintended consequences of publishing motivational framework. Organization Science, 7(2), 151–165.
performance data in the public sector. International Journal of Walker, R. M., Brewer, G. A., Boyne, G. A., & Avellaneda, C. N.
Public Administration, 18(2–3), 277–310. (2011). Market orientation and public service performance: New
Sousa, C. A. A., de Nijs, W. F., & Hendriks, P. H. J. (2010). Secrets of Public Management gone mad? Public Administration Review,
the beehive: Performance management in university research 71(5), 707–717.
organizations. Human Relations, 63(9), 1439–1460. Warren, D. E. (2003). Constructive and destructive deviance in
Sovacool, B. K. (2008). Exploring scientific misconduct: Isolated organizations. Academy of Management Review, 28(4), 622–632.
individuals, impure institutions, or an inevitable idiom of
modern science? Journal of Bioethical Inquiry, 5(4), 271–282.
123

4./s10551 017 3688 y

Caricato da

Informazioni sul documento

Titolo originale

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

4./s10551 017 3688 y

Caricato da

Copyright:

Formati disponibili

J Bus Ethics

Wanting More, Getting Less: Gaming Performance Measurement

Received: 29 March 2017 / Accepted: 24 August 2017

Table 1 Interview questions

Step 1: Behavioral Consequences an explorative, inductive content analysis (Krippendorff

Positive behavioral consequences 56

3. Motivation Higher motivation due to positive pressure .79 17

Negative behavioral consequences 181

1.1.3. Use of questionable methods Violation of norms and standards .84 12

1.2. Decrease in teaching quality The quality of teaching decreases: .91 12

Table 3 Sample codings of ‘behavioral consequences’ (Step 1)

Positive behavioral consequences

Table 4 Sample codings of ‘gaming performance measurement’ (Step 2)

Table 5 Sample codings of ‘deviant workplace behavior’ (Step 3)

Table 6 Sub-dimensions of current frameworks of deviant workplace behavior

Hollinger and Production deviance Long lunch or coffee breaks

Robinson and Production deviance Leaving early

Spector et al. Production deviance Purposely did your work incorrectly.

measurement focusing on publications and third-party References

Potrebbero piacerti anche