Summary
Music in video games serves functions that overlap with but diverge from film scoring. The designer and composer must account for the variable length of each player’s experience, the number of times the game will be replayed, non-linear story paths, and the fact that the player — not a director — controls pacing. These constraints demand techniques beyond simple linear scoring. Understanding the types and functions of music in games is the starting point for any audio design decision.
(Sweet, Writing Interactive Music for Video Games, see source-writing-interactive-music)
Types of music within games
Extra-diegetic music (underscore)
Extra-diegetic music, or non-diegetic music, is music added purely to enhance the emotional experience. The player does not expect to see its source — the musical ensemble is not part of the game world. This is the majority of what most players call “the game’s soundtrack.”
Underscore works on a subconscious level to:
- Reinforce story elements through themes and motifs
- Intensify the emotional context of scenes
- Foreshadow events or indicate off-screen information
- Establish a sense of time and place
A classic film example that translates directly to games: John Williams’s two-note motif in Jaws (1975). Players and audiences learn to associate specific musical ideas with story events through repeated pairing. Williams famously breaks the motif late in the film — the shark appears without warning music, making it more terrifying. This deliberate subversion of learned musical expectations is a technique equally available to game composers.
“It is commonly said that the best film scores are not noticed by the audience or viewer. More obvious (and clumsier) scores take the viewer out of the experience of watching a movie by bringing what should be an unconscious element to the forefront for the listener.” — Sweet, Ch. 1
Examples in games:
- Red Dead Redemption (2010): Ennio Morricone–inspired score that changes dynamically as the player moves between scenes and plot points
- BioShock (2007): Garry Schyman uses aleatoric (chance-based) techniques and solo violin to create a score that is simultaneously terrifying and beautiful
Diegetic music (source music)
Diegetic music is music that a character in the game world would actually hear — music that exists within the fiction. This is called “source music” in film terminology. When the player can see or infer the source (a band on stage, a car radio, a nightclub), the music is diegetic.
In 3D game worlds, diegetic music requires real-time spatial audio: as the player moves, the apparent position of the sound source must pan, attenuate, and change reverb characteristics dynamically to simulate a real physical space.
Examples:
- BioShock Infinite (2013): a barbershop quartet visible on screen, singing
- Mass Effect 2 (2010): nightclub music heard from outside; the music shifts as the player enters
- Grand Theft Auto V (2013): in-car radio with multiple stations the player can switch between
- Assassin’s Creed III (2012): an opera performance in progress within the game world
Diegetic music is frequently licensed — pre-existing recordings obtained through music licensing agreements. This creates different budget and production considerations from composed underscore.
Music as gameplay
In some games, music is not background support but the core mechanic. Guitar Hero (2005), Beat Saber, Crypt of the NecroDancer, and Thumper are examples where player input directly produces or drives the music. This is sometimes distinguished from adaptive music by the term “interactive music” — the player has direct, immediate control over what is heard.
Player-customised music
Some games allow players to import their own music (early Grand Theft Auto games, DJ Hero). This removes the composer’s control over emotional context but maximises player agency and personalisation.
Music functions in games
Music in games serves multiple overlapping purposes:
| Function | Description |
|---|---|
| Emotional underscore | Enhancing the emotional register of a scene |
| Narrative continuity | Using recurring motifs to remind players of story elements across long play sessions |
| Spatial and temporal orientation | Signalling location, time of day, faction territory |
| Tension and release | Building anticipation, then resolving it when goals are achieved |
| Game state communication | Signalling to the player that something has changed (enemy nearby, health low, objective complete) |
| Immersion | Sustaining a believable world that the player inhabits |
“The primary function of music in video games is to create tension. The resolution of that tension amplifies the gamer’s euphoria when finishing a goal, feeding his or her desire to keep playing.” — Bear McCreary (composer perspective, Sweet Ch. 1)
Cues, motifs, and themes
Three core vocabulary terms for game composers:
- Cue: A complete piece of music with a beginning, middle, and end, written for a specific scene or group of related scenes. In games, cues are often associated with emotional states (explore cue, battle cue, victory cue) or specific locations or characters.
- Motif: A short, recurring melody or musical passage associated with a character, emotion, or place. Motifs are reused throughout the game to maintain emotional continuity and remind the player of story relationships.
- Theme: A longer, more fully developed musical idea typically associated with a main character or the game’s central emotional concept.
The Jaws two-note motif is one of the clearest examples of motif construction: simple, distinctive, and immediately learnable by audiences.
The repetition problem
The central challenge separating game scoring from film scoring is repetition. A film composer controls exactly how many times a musical idea is heard. A game composer does not — the player may spend an hour in the same area, hearing the same cue loop dozens of times.
“Our brains have evolved to filter out information that has no meaning… A creature constantly focused on the sound of a babbling brook may not notice the new sound of rustling reeds that hail the arrival of a deadly lion preparing to pounce.” — Bear McCreary (Sweet, Ch. 1)
The brain habituates to repeated stimuli: a scary cue that creates genuine tension on first encounter becomes background noise after five repetitions. This is not a failure of the music — it is a fundamental property of human perception that favours attention to change.
The primary design response to the repetition problem is interactive music: adaptive techniques that prevent the score from looping identically. See interactive-music-techniques for horizontal resequencing, vertical remixing, and related approaches.
Serendipitous sync
Serendipitous sync occurs when the music happens to align with a game event in a way that feels intentional but was not planned — a musical swell coinciding with an explosion, or a beat landing on a kill. This is a natural by-product of continuous music playback over interactive gameplay. Skilled composers can increase the likelihood of serendipitous sync through compositional choices (writing loops with strong metrically regular beats, composing music that works across many emotional contexts) without being able to guarantee it.
Music conceptualisation process (Sweet’s framework)
Sweet outlines a seven-step process for approaching a game score from scratch:
- Gather and assess materials — script, design documents, art direction, reference games
- Prioritise primary music objectives — what is the most important emotional and narrative role for music in this game?
- Create an asset list — what cues, loops, stingers, and ambient layers will be required?
- Define interactive elements — which parts of the score must adapt to gameplay, and how?
- Create a supporting audio style guide — instrumentation, harmonic language, tempo ranges, references
- Create an audio design document — the formal specification that guides the team
- Revise — the plan will change as the game is built
What makes game scoring unique
Sweet identifies several dimensions in which game scoring differs from linear media:
| Dimension | Film/TV | Games |
|---|---|---|
| Experience length | Fixed | Variable (minutes to hundreds of hours) |
| Number of plays | Typically once | Many replays |
| Story path | Linear | Branching or emergent |
| Synchronisation | Frame-accurate | Approximate / event-triggered |
| Repetition control | Complete | None |
| Audience control | Passive | Active |
These differences are not limitations to work around — they are the defining constraints of the medium. Interactive music techniques exist specifically to leverage player control rather than fight it.
Implications
- Every game needs an audio design document as early as possible in production — decisions about music type (extra-diegetic/diegetic), interactivity, and asset volume affect pipeline, budget, and programming scope.
- The tension/release cycle is the core emotional engine of game music. Sustaining tension without resolution produces fatigue; resolving too quickly prevents investment.
- Diegetic music requires spatial audio programming support. This must be planned with the engineering team, not assumed.
- Repetition is the enemy of emotional impact. Any game with significant play time needs interactive music or a large enough cue library to prevent habituation.
Open questions
- Where is the boundary between diegetic music design and sound design? In games like Nier: Automata or Returnal, the distinction blurs deliberately.
- Player-customised music removes the composer entirely. Is this a legitimate design choice for emotional engagement, or does it signal an abdication of design responsibility?
Related
- interactive-music-techniques — how adaptive/interactive music is technically implemented
- sound-design-basics — synthesis, DSP effects, the audio production layer
- narrative-design — music as a narrative tool; motifs and thematic coherence
- presence-and-immersion — music as a presence-sustaining mechanism
- game-feel — audio feedback as a component of game feel
- unity-audiosource — Unity implementation: playing, looping, and mixing audio at runtime
- source-writing-interactive-music