Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
The Fast Fourier Transform (FFT) is an efficient implementation of the discrete Fourier transform
that is basic operation in audio processing systems. The FFT is applied on relatively small windows
of contiguous audio samples. For the FFT, the size of the window is a power of 2, typically on the
order of 2048 or 4096 audio samples. The relatively small window size focuses the transform on a
short segment of time in order to determine what the frequency components are in that period.
Figure 2 shows the choices of FFT window size in Adobe Audition, ranging from 32 to 65536.
When the FFT is applied, it is implicitly being assumed that the windowed audio segment is
periodic – that is, that the window contains an integral number of cycles of a repeating pattern. If
this were actually the case, then the amplitude of the audio sample at the end of each window
would match up with the one at the beginning. But this rarely is this true. If you were to attach the
end of the window to the beginning, there would likely be sudden a jump from one level to another,
as shown in the figure below. This manifests itself in the output of the Fourier transform as the
detection of a frequency components that aren't really there – called spurious frequencies. You
can see the presence of spurious frequencies, called spectral leakage, in Figure 1. Although there
really is just one frequency in the time domain audio signal, the FFT indicates a small amount of
other frequencies all across the audible spectrum.
Four commonly-used windowing functions are given in the table below. Windowing functions are
easy to apply. The segment of audio data being transformed is simply multiplied by the windowing
function before the transform is applied.
( ) [ ( )]
( ) { }
( ) ( ) ( ) ( ) ( )
t is time.
Application of Windowing functions with the FFT is illustrated in the Chapter 2 of the book
associated with this exercise. Using that example, your assignment is to compare the results of the
four window functions of a windowed segment of music that you read in from a WAV file. You can
use the WAV file stored with this exercise if you like.