Computer Games
Inger Ekman, inger.ekman@tml.hut.fi
Department of Media Technology,
Helsinki University of Technology
P.O.Box 5400
FIN-02015 HUT
Abstract. One main function of sound in games is to create and enhance emotional impact. The expressive model for game sound
has its tradition in sound design for linear audiovisual media: animation and cinema. Current theories on emotional responses to
fiction are mainly concerned with linear media, and are only partly applicable to interactive systems like games. The interactivity
inherent to games introduces new requirements for sound design, and suggests a break in perception compared with linear media.
This paper reviews work on emotional responses to fiction and applies it to the area of game sound. The synthesis is
interdisciplinary, combining information and insights from a number of fields, including psychology of emotion, film sound theory,
experimental research on music perception and philosophy. The paper identifies two competing frameworks for explaining fictional
emotions, each with specific requirements and signature techniques for sound design. The role of sound is examined in both cases. The
result is a psychologically motivated theory of sound perception capable of explaining the emotional impact of sound in film, as well
as identifying the similarities and differences in emotional sound design for these two media.
Whereas the protagonist can also provide a vessel for empathetic emotions, the two frameworks for emotional
evaluation, empathetic and gameplay, are to some extent competing. As Perron [21] notes, even in games, both the
main story line and individual plot elements and narrative turning points are indeed furthered from time to
time through filmic means, using stills, predetermined animation sequences, dialogue or cut scenes. During these
moments, the player is stripped of control and, effectively, reduced to a witness position. Empathetic emotion
comes with loss of activity. A similar point is made by Lankoski [15] when he suggests that the empathetic
capability of the player is inversely related to the cognitive challenge of action: in the heat of a battle,
there is little time to ponder the protagonist's feelings.

Soundtrack CD sales give strong evidence that artefact emotions function with regard to both film and game
music. Beautiful pieces of music obviously lend themselves to being appreciated as such, regardless of whether
the viewer is attending to the story. Further, there is reason to broaden the spectrum of artefact emotions to
include negative effects as well. Considering how visual material is used for shock effects (e.g. displaying
blood and entrails), it is easy to imagine a similar process in which unpleasant sounds, regardless of story,
could produce negative affect. Both in film and games, sound provokes sensory pleasure and displeasure.

3.1 The Realism Fallacy
The above categories identify the main roles of sound in creating and steering the emotional experience. However, as
we shall soon see, there are unresolved contradictions hiding within these categories. Namely, the effects
within the two latter categories of sound (gameplay and non-empathetic) seem to contradict the traditional
sound design goal of narrative realism.

Appraisal theory of emotion holds that empathetic emotional processing and value judgement are guided by
conscious attention to a story, and that this cognitive investment is heightened by the perceived realism of
portrayal. On the other hand, it allows emotional experiences that arise from the appreciation of the artefact:
the pictures and sounds of the film/game as such. Similar appreciation is present in games, a medium where
technological artistry is elaborately showcased, often even used in promotion.

Tan’s view is that artefact emotions detach the viewer from the story, drawing attention away from the
narrative towards the film as artefact, thus making the actions within the narrative less consequential for
the viewer. This position raises a complicated question, namely how to interpret sounds that appear to
transgress the borders of realism, despite experientially supporting the narrative.

For example, while discussing the effect of sentiment, Tan and Frijda [28, p. 62] mention sound, especially
orchestral music, as one possible source of the awe-inspiring. Awe requires a sense of overwhelming power, and
this role is partly played by sound. The effect, according to Tan and Frijda, is the emotional function of
total submission, a feeling underlying e.g. crying. Sound is thus a tool in portraying power and heightening
sentiment. The question is by which channel this non-empathetic heightening of sentiment is capable of
influencing the (empathetic) evaluation of narrative events.

The problem of border transgression is most apparent in two sound conventions that would seem to deviate from
the aim of realism. In what seems like a blatant contradiction, they invoke a sense of realism through highly
unrealistic sounds. One is the use of musical scoring. The other is the use of sound effects that mismatch what
is seen on screen. Similar breaches are omnipresent in games as well, where sound elements effortlessly
transgress borders, allowing objects within the story world (diegesis) to refer to non-diegetic space, and vice
versa [8][12].

The above concerns are about where to draw the line of realism and about how emotional effects communicate
across different categories of judgement. Upon closer scrutiny, both questions generalize to the way
non-empathetic affect influences other sources of emotion (empathetic or gameplay-related appraisal). To
proceed further, we need an explanation of how non-empathetic emotions arise, and a way of predicting when and
how artefact emotions lend emotional meaning to other evaluative processes.

3.2 Unconscious Affective Processes
The theoretical frameworks dealt with above have considered affect by means of perceived experience.
Nevertheless, it seems that many of the associations and effects of sound in both film and games work on an
unconscious level.

Several findings attest to the fact that at least some evaluations of stimuli are made precognitively. Zajonc
was among the first to point this out in his essay, famously entitled “Feeling and Thinking: Preferences Need
No Inferences” [31]. This view is further fortified by subsequent results from experimental psychology, leading
researchers to suggest that there exists such a thing as unconscious emotion [30]. For example, Öhman [32] has
demonstrated fear reactions in people who were presented with spider and snake pictures subliminally; that is,
people became frightened of pictures they never even realized they had seen.

In light of unconscious value judgements, it is easy to read Tan's non-empathetic emotions as but one example
of these: the sounds of the film invoke emotion by nature of their perceptual properties, unrelated to the
story at hand. This also opens up a way to understand and predict what would result in a positive (or negative,
for that matter) value judgement. One especially interesting factor is the importance of familiarity and
perceptual fluency for eliciting positive value judgements [23]. By this account, beauty is defined by the ease
with which a stimulus can be processed.

3.3 Misattribution and Making Sense of Emotions
Affective responses include a paraphernalia of bodily responses (pounding heart, sweaty palms) and may also
bring with them a certain action tendency (e.g. fight-or-flight). In fact, it has been suggested that one
possible way in which we do consciously attend to our affective processes is by noticing abrupt changes in the
felt background state, or what Russell [24] has called core affect. This change would lead us to seek a cause
for our altered state, leading us via a cognitive process to attribute the jolt to the most plausible event in
our environment.

Whether or not we are willing to accept unconscious affect as the source of emotions, we can agree that when
consciously attended to, emotions tend to have an object. Emotions are evaluations of something. To be able to
function properly, we must be able to determine an object for our emotions, something to be afraid of, or
pleased by. This distinguishes emotions from moods, which are long-term affective states without an object.
However, and here comes the catch, the events we cognitively allocate as objects do not necessarily have to be
the true cause of our initial affective response. In fact, when it comes to reasoning about why we feel as we
do, we are prone to make misattributions and to erroneously ascribe our affects to something quite different
from their real causes, even in everyday life.

A classical example of misattribution is a study by Schachter and Singer [26]. They injected subjects with
doses of adrenaline, a hormone associated with an excited body state. Depending on the situation that followed
the injection, subjects judged their aroused state as either anger or elation. It can be argued that in most
cases, our appreciation of our emotional state is at least partly determined by context.

3.4 Misattribution and Emotion in Film and Games
Misattribution is a process wherein the contextual appraisal of perceived emotional ‘raw material’ lends
emotional meaning to an outside cause, irrelevant to that particular emotional stir. Now consider a similar
process at work during film viewing or when playing a game, with music, sounds, pictures and actions all
mingling to create emotional impacts. Is it not probable that at some point, the true causes of our feelings
might remain unknown to us? Is it not possible that we, stirred by our passions, unwittingly, in deciphering the
cause of our emotions, take them to be caused by whatever the film serves to us on a silver (screen) plate?
Could it be that we just happen to attend to a game event, and assume our emotions are caused by that event,
when in fact they are not?

This is, essentially, what Annabel Cohen proposes. Cohen [5][6] has dealt extensively with the difficult
question of why, and how, something as obviously constructed as the film score does not completely destroy the
sense of realism in a film. On the contrary, as many composers will confirm, a carefully chosen (or composed)
piece of music will actually heighten the sense of reality in a film. Music also seems to lend a great deal of
emotion to events, in ways other than those proposed by cognitive appraisal theory. Cohen's answer lies in a
congruence-associationist model of film viewing [18], whereby music focuses attention on those objects in the
film that are congruent with the sound. At the same time, conscious attention is directed away from
non-associated sounds, and attention is suppressed for stimuli irrelevant to ongoing cognitive processes.

The emotional impact comes from the fact that sounds, even when unattended to, will nevertheless affect the
perception of objects in the film. Cohen [6] highlights the importance of temporal unity as a binding factor
and predictor of which parts of the sound will draw attention. She calls our attention to animation and the
technique called mickey-mousing, whereby sound effects are replaced by short musical motifs. Their temporal
matching allows these music snippets to replace the original sounds of the events, at the same time imbuing
both the events and the objects that are part of the action with specific characteristics.

The account of musical meaning in film provides an equally useful tool for approaching the question of other
film sounds as well. Applied to object sound, the theory suddenly appears much less mysterious. Consider how we
find out the properties of objects in real life. What we do is handle the object: tap it, stroke it, bang it
against something. By perceiving synchronous sounds, we find out the normal sound of a chair, a balloon or a
mandolin. Now, turn that process around and we have precisely what sounds do in a film (and, to a great extent,
in games as well): now the temporal unity of event and sound defines the object through the sound it makes.
Longer chains of events, if temporally matched, cause similar perceptions. When approached from this angle, it
is not so odd that sounds in fiction may deviate somewhat from their real-life counterparts without seeming
false or unrealistic. What may seem surprising is that a whole range of temporally congruent sounds may become
involved in the same process, from simple Foley through more elaborate layers of sound effects all the way to
music.

3.5 Where Do the Emotions Come From?
So far we have established that unconscious emotional processes may ‘contaminate’ temporally congruent events
through misattribution, and shown how this may influence the perception of events in film. The big question
remains: where do the emotions come from?

The most frequently researched category of emotional sound is music. Huron [10] describes musical emotions in
terms of fulfilling expectations: the interplay between the anticipated and the sounded musical progression
creates patterns of dynamic tension and relaxation. Musical expectations arise from several sources, most of
them cultural, but according to some studies at least part of the functions of musical meaning appear to be
universal [19].

The existence of a universal function of musical emotion suggests there may be other sources of emotion that
humans draw from in their interpretations of music. The obvious case is that music invokes memories and
connotations awakened by that music. However, those would not be universal, even less so than culturally
learned expectations. Van Leeuwen [17] turns to the human body, suggesting that the most primitive link between
sound and emotion, and one common to all humans, is the perception of our own bodies. The vocal system
especially sets a reference point through the simultaneous experience of how it feels and what it requires to
produce a certain sound.

Another suggestion is that the evaluation of some sounds has a biological motivation. This appears to be the
case with the startle response (the phenomenon in which we jump when someone shouts ‘boo’ at us), which aside
from providing for pranks also makes us more alert to dangers and automatically directs our attention toward
potentially harmful events. However, there may well be other ways in which our perception of sound is
evolutionarily determined. For example, Huron [11] suggests that the perceived cuteness of sounds may be an
evolutionary adaptation that promotes parenting.

Value judgements (which are the raw stuff of emotion) seem to be going on even at the lowest level of
perception. A classical example within music (and other perceptual) research is the mere exposure effect,
wherein a stimulus is judged as likeable merely as a function of familiarity. Investigations into a phenomenon
called perceptual fluency suggest that emotional processes are influenced by the very ease of processing [23].
These findings would imply that such things as differences in the perceptual clarity of audio (think
signal-to-noise ratio) influence the emotional impact of sound, such as perceived beauty or likeability.

The critical requirement for musical expectations to arise is that the sound is attended to as music. Further,
there appear to be boundaries in our listening schemes that separate different styles of listening. Notably,
musical listening, in which sounds are perceived as sounds, is not the only form of attending to events. An
illustrative deviation from this frame is found in compositional techniques that invoke other listening
styles, like musique concrète, where the use of real-world sounds provokes listening not to sounds and
patterns, but for causes; Chion aptly refers to this as causal listening [4]. This is a special case of music,
perhaps seldom used in film, but appearing more and more in games. In these cases, the framework of listening is
perhaps determined more by evolutionary and low-level perceptual processes of meaning-making than by musical
listening modes.

3.6 Realism Revisited
We should now attempt a new understanding of realism. Within fiction, realism is not an absolute, but a name
for a certain level of fit, an apparent realism or credibility. On the narrative level, realism allows taking
the story seriously enough to allow emotions of empathetic quality. Good fit is determined by whether the sound
is credible as (or illustrative of) a certain sound source [1, p. 190]. When the Foley artist (the person
responsible for creating sounds for on-screen events) smashes pumpkins in his studio, he does so in order to
produce sounds that have a good fit with on-screen events. Many times, the sounds produced have little to do with the
actual event seen on the film screen; indeed, non-realistic sounds are often purposefully used to make the
action sound better. It is, for example, recognized that walking on cornstarch sounds much 'more real' on film
than the actual sounds of walking on snow.

A possible explanation underlying the perceived realism of some Foley sounds is the notion of prototypicality.
A prototype is an object that embodies the central perceptual characteristics of a given category. Prototypes
do not necessarily exist in reality; they are mental constructs of our perceptual system. The prototypical
chair is the average of all chair perceptions of your brain, and by definition, it will be the 'chairest' chair
of them all. Experimental psychology has established that people perceive prototypes as more easily recognized
[27], and also as more beautiful and trustworthy [23], than other category members.

Similarly, narrative reality determines how sound behaves within the diegesis, and how the source sounds should
sound when listened to from different Points of Audition¹, such as listening behind a wall or under water. At
the core, then, immediacy too is but one way of creating a sense of realism. In film, it serves to reinforce the
witness position, being present but out of control. Tan [27, p. 25] considers this in his analysis of the
camera point of view, mentioning how even in first-person view the camera is often a bit off, making space for
someone to ‘look over the shoulder’. In the case of sound, the heightened, focussed sound, including over-clear
dialogue, can be considered realistic if we view it as a portrayal not of the scene, but of the experience of
listening to the scene. Consider this: while our environment usually contains a multitude of sounds, we only
attend to a select few at a time. We are also exemplary at picking out and following these sounds. A person with
normal hearing has no difficulty in following a single conversation in a room filled with people, the
phenomenon so aptly named the ‘cocktail-party’ effect. Thus, what is presented as the film's sound is not the
scene itself from a given point in space, but the scene as heard if listened to by attentive ears.

¹ Similar to Point of View in camera techniques, but using sound.

The narrative realism of a sound thus lies not in the faithful reproduction of sound sources, nor of their
environments. The apparent realism of a sound in the context of narrative is defined by how representative the
sound is of a certain event. Sounds that are highly representative have good narrative fit. High narrative fit
supports empathetic emotion.

Importantly, however, the evaluation of sounds spans several layers. Below the level of narrative meaning are
(partly) unconscious processes whereby sounds are judged emotionally. For example, a sound can have good fit
narratively, but poor legibility, because the signal-to-noise ratio is so low. Importantly, breaches at this
level are disruptive to the perception of sound. We have seen that perceptual fluency is also capable of
influencing affective evaluations [23]. Thus, as with narrative fit, the unconscious processing of sound
influences emotional judgements. These affects are unrelated to the narrative content of the sound, but tap into
the notion of artefact emotions. Depending on their nature, they can cause pleasure or displeasure, which can
then be attributed to other temporally congruent events.

Finally, in interactive systems, interpretations of sound take on a new role, conveying functional information
[29]. By this task, sounds are evaluated on a third level: how well they serve a functional value. Jørgensen
[12, p. 49] refers to this as a sound's functional fidelity. At this stage, emotional evaluations are no longer
determined only by the sound itself, but by the utility of a sound for the higher goal of performing
goal-related actions. The value of functional sound depends on how the functional aspect supports game
progress, the utility of sound. The utility of sound is connected to goal-related cognitive evaluations. High
utility reinforces gameplay emotion.

4 Comparison of Emotional Sound in Film and Games

The cognitive appraisal framework provides two alternatives for emotional relation to fictive events: the
passive witness position allows empathetic emotion, while the active player draws emotional meaning from
goal-related evaluation. These two frameworks ask the viewer/gamer to take on different attitudes towards the
fiction and appear to be competing. They also rely on different strategies for sound design.

For film sound, the emphasis is usually on heightening narrative reality. This is achieved through detailed
attention to narrative fit, often striving for high apparent reality. The focus is on advancing the narrative,
heightening and clarifying those specific actions that are necessary for following the story’s progress
(usually the top priority is dialogue).

For games, the focus is different, as games have to support player action. In games, the task of many sounds is
primarily to provide feedback about actions. Hence, narrative fit is often sacrificed for utility. To the extent
that auditory cues are used to guide actions, they are treated with the utmost respect for legibility. For
example, even in the case of instructions with a diegetic source (a non-player character, voice mail, etc.) it
is common that auditory instructions remain audible even if the character runs away from their diegetic source.

An interesting avenue for sound design in games is to shift focus from music to the emotional impact of Foley
and sound effects. A possible alternative for emotionality in games lies in environmental sounds, which are
already used in many games, where ambient sounds are beautifully merged with musically suggestive elements and
event sounds into a sonic landscape in the spirit of musique concrète. However, for this approach to be
systematically explored, there is a need for better understanding of how everyday sounds influence emotions. In
these investigations, theories of unconscious emotion may prove especially informative.

References
[1] Bordwell, D. and Thompson, K. 1985. Fundamental Aesthetics of Sound in the Cinema. In Weis, E. and Belton, J. (eds.) Film Sound: Theory and Practice. Columbia University Press. 181-199.
[2] Brandon, A. 2005. Audio For Games. Planning, Process and Production. New Riders.
[3] Bridgett, B. 2007. Audio Postmortem: Scarface: The World is Yours. Available at http://www.gamasutra.com/features/20070322/bridgett_pfv.htm [Accessed 29.5.2008]
[4] Chion, M. 1994. Audio-Vision. Sound on Screen. (Translation by Claudia Gorbman). Columbia University Press.
[5] Cohen, A. 1990. Understanding musical soundtracks. Empirical Studies of the Arts 8, 111-124.
[6] Cohen, A. 2001. Music as the source of Emotion in Film. In Juslin, P. and Sloboda, J. (eds.) Music and Emotion. Oxford University Press. 249-272.
[7] Damasio, A. 2005. Descartes' Error: Emotion, Reason, and the Human Brain. Penguin, paperback reprint (1994).
[8] Ekman, I. 2005. Understanding Sound Effects in Computer Games. In Proc. Digital Arts and Cultures 2005, Copenhagen, Denmark.
[9] Frijda, N. H. 1986. The emotions. Cambridge University.
[10] Huron, D. 2007. Sweet Anticipation: Music and the Psychology of Expectation. MIT Press. Paperback reprint (2006).
[11] Huron, D. 2005. The Plural Pleasures of Music. Proc. 2004 Music and Music Science Conference. Kungliga Musikhögskolan & KTH (Royal Institute of Technology), 1-13.
[12] Jørgensen, K. 2007. ‘What are Those Grunts and Growls Over There?’ Computer Game Audio and Player Action. Ph.D. dissertation, Copenhagen University.
[13] Kutay, S. 2006. Bigger Than Big: The Game Audio Explosion. A Guide to Great Game Sound. Available at http://www.gamedev.net/reference/articles/article2317.asp [Accessed 29.5.2008]
[14] Lacan, J. 1951. Some reflections on the ego. (Read by Lacan to the British Psycho-Analytical Society on 2 May 1951.) Available at: http://aejcpp.free.fr/lacan/1951-05-02.htm [Accessed 29.5.2008]
[15] Lankoski, P. 2007. Goals, affects, and empathy in games. Paper presented at Philosophy of Computer Games, Reggio Emilia, Italy. Available at: http://www.mlab.uiah.fi/~plankosk/blog/?p=53. [Accessed 29.5.2008]
[16] Lazarus, R. 1991. Emotion and adaptation. Oxford University Press.
[17] Leeuwen, T. van. 1999. Speech, Music, Sound. Macmillan.
[18] Marshall, S. and Cohen, A. 1988. Effects of Musical Soundtracks on Attitudes toward Animated Geometric Figures. Music Perception 6, 95-112.
[19] Narmour, E. 1990. The analysis and cognition of basic melodic structures: The implication-realization model. University of Chicago Press.
[20] Oatley, K. and Jenkins, J. 1996. Understanding Emotion. Blackwell Publishing.
[21] Perron, B. 2005. A Cognitive Psychological Approach to Gameplay Emotions. Proc. DiGRA 2005 Conference: Changing Views – Worlds in Play.
[22] Prince, B. Tricks and Techniques for Sound Effect Design. Computer Game Developers Conference 1996. Available at http://www.gamasutra.com/features/sound_and_music/081997/sound_effect.htm [Accessed August 20, 2008.]
[23] Reber, R.; Schwarz, N. and Winkielman, P. 2004. Processing Fluency and Aesthetic Pleasure: Is Beauty in the Perceiver's Processing Experience? Personality and Social Psychology Review, 8 (4). 364-382.
[24] Russell, J. 2003. Core Affect and the Psychological Construction of Emotion. Psychological Review 110 (1), 145-172.
[25] Sanger, G. 2003. The Fat Man on Game Audio: Tasty Morsels of Sonic Goodness. New Riders.
[26] Schachter, S. and Singer, J. 1962. Cognitive, Social, and Physiological Determinants of Emotional State. Psychological Review, 69, 379-399.
[27] Tan, E. 1994. Film-induced affect as a witness emotion. Poetics 23, 7-32.
[28] Tan, E. and Frijda, N. 1999. Sentiment in Film Viewing. In Plantinga, C. and Smith, G. (eds.) Passionate Views. Film, Cognition, and Emotion. Johns Hopkins. 48-64.
[29] Tuuri, K.; Mustonen, M.-S.; Pirhonen, A. 2007. "Same sound - Different meanings: A novel scheme for modes of listening." Proc. AudioMostly 2007, Ilmenau, Germany.
[30] Winkielman, P. and Berridge, K. 2004. Unconscious Emotion. Current Directions in Psychological Science, 13 (3). 120-123.
[31] Zajonc, R. B. 1980. Feeling and Thinking: Preferences Need No Inferences. American Psychologist, 35, 151-175.
[32] Öhman, A. 2005. The role of the amygdala in human fear: Automatic detection of threat. Psychoneuroendocrinology 30, 953-958.