Spectral Envelope
EN | FR
Spectral Envelope
by
Timbre Lingo | Timbre and Orchestration Writings
Published: November 6, 2023 | How to cite
In the process of analyzing the spectrum of a sound, it is sometimes useful to describe its spectral properties in terms of energy distribution rather than individually mapping all its components. In so doing, we invoke the concept of spectral envelope, a curve that can be obtained by successively connecting the peaks of the partials shown in the frequency representation of the sound (i.e., with frequency on the x-axis and energy or amplitude on the y-axis). Spectral envelopes are important factors in timbre perception. They reflect the acoustic properties of an object that produces sound in terms of the energy distribution across the frequency spectrum. Here is an example showing the spectral envelope of an oboe sound next to a clarinet sound, with amplitude measured in decibels (dB):
Oboe: (play sample)
Clarinet: (play sample)
Spectral envelopes can also be described with other descriptors, which include :
Spectral Spread
Spectral spreads represent the standard deviation of amplitude distribution around a sound’s spectral centroid, calculated using the square root of its second-order moment (see this video about moments in statistics, if you want to know more). A sound with a high spectral spread will look “larger” and “flatter” (literally more “spread”) than a sound with a lower spectral spread value.
Crash cymbals: (play sample)
Clarinet: (play sample)
Spectral Skewness
Spectral skewness measures the symmetry of the spectrum around its spectral centroid. The spectral skewness is calculated using the third-order moment of the amplitude distribution. A positive value indicates more energy in the frequencies that are below the spectral centroid (i.e., the “tail”/inclination of the spectrum is longer on the right, hence the attribution of a positive value), while a negative value means that the energy is greater in frequencies that are above the spectral centroid (i.e., the left “tail” of the spectrum is longer).
Pink noise: (play sample)
Blue noise: (play sample)
Pink noise and blue noise are two common types of filtered noise, as discussed in this article.
Spectral Kurtosis
Spectral kurtosis quantifies the “peakedness” or “tailedness” of the spectrum around its spectral centroid (calculated using the fourth-order moment of the amplitude distribution). A higher kurtosis value means that more energy is concentrated in the region of the peak (which will be more “pointy”) while a lower value usually describes a flatter distribution. At first, this seems very similar to what the spectral spread is used for. However, one can change the spectral kurtosis without modifying the spectral spread value, and vice-versa.
Filtered crash cymbals (lower kurtosis): (play sample)
Filtered crash cymbals (higher kurtosis): (play sample)
Spectral Slope
The spectral slope indicates the tendency of amplitude decrease as the frequency increases. It is calculated using a linear regression, that is, finding a straight line that best fits the general tendency of the spectrum and getting its gradient.
Spectral Rolloff
The spectral rolloff point is the limit frequency under which 95% of the energy is located.
Spectral spread, skewness, kurtosis, slope, and rolloff enable us to characterize and classify sounds, which can be useful in the study of timbre perception. For instance, these are the kinds of descriptors looked upon when finding acoustic correlates in the study of timbre space. They are also very often used in phonetics (e.g., the sibilant fricatives /sh/ and /s/ mainly differ in spectral centroid, skewness and kurtosis [2]) and in automatic instrumental timbre recognition for machine learning [3].
References
[1] Peeters, G. (2004). A large set of audio features for sound description (similarity and classification) in the CUIDADO project. IRCAM.
[2] Jongman, A., Wayland, R. and Wong, S. (2000). Acoustic characteristics of English fricatives. Journal of the Acoustical Society of America, 108(3), 1252–1263.
[3] Fujinaga, H. (1999). Machine Recognition of Timbre Using Steady-State Tone of Acoustic Musical Instruments. Peabody Conservatory of Music, Johns Hopkins University.