As long as several simple tones of a sufficiently different pitch enter the ear together the sensation due to each remains undisturbed in the ear probably because entirely different bundles of [auditory] nerve fibers are affected. But tones of the same, or of nearly the same pitch, which therefore affect the same nerve fibers, do not produce a sensation which is the sum of the two they would have separately excited, but new and peculiar phenomena arise which we term interference… and beats… Rapidly beating tones are jarring and rough… the sensible impression is also unpleasant. Consonance is a continuous, dissonance an intermittent tone sensation. The nature of dissonance is simply based on very fast beats. These are rough and annoying to the auditory nerve.Hermann von Helmholtz (1863 and 1885) On the Sensations of Tone as a Physiological Basis for the Theory of Music
Above is a quote by a German physician and physicist who made significant contributions to the our understanding of the perception of sound. The research presented below concerning the perception of consonances and dissonances confirms von Helmholtz's assumptions almost exactly.
A Breif Explanation of the Overtone Series
Before we continue on, there is a concept we need to understand in order to fully comprehend the research presented below. A pitch produced by a musical instrument or voice is "composed of a fundamental frequency and harmonics or overtones."( Doering [link]) These components of sounds establish the timbre of a particular instrument, based on the relative strength and number produced by the instrument.(Jones [link]) The reason a trumpet sounds different from a clarinet or a human voice, even when all are producing the same note, is due to the relative strength and number of harmonics (or overtones) present in the sound. Despite the presence of harmonics the listener perceives only a single tone, but the presence of these harmonics greatly affects our perception of harmonic intervals and chords as being consonant or dissonant.
Examining the Neurological Representation of Consonance and Dissonance
The study by Tramo et al. (2001) compares the acoustic representation of several intervals and the trains of action potentials produced by the auditory nerve fibers of a population of 100 cats hearing the same intervals. It is important to keep in mind the information presented about the ascending auditory pathway . Auditory nerve fibers are the central axons of the spiral ganglion cells that transmit synaptic information to the cochlear nucleus neurons in the brainstem. When an interval is sounded the nerve fibers corresponding to the frequencies present in that interval will fire. Virtually all information about sound is transmitted to the brain through trains of action potentials produced by these synapses.[3]
First the researchers examined the acoustic waveforms and their corresponding autocorrelations (the cross-correlation of a signal with itself, to find disguised repeating patterns) for four different intervals. Two of the intervals, the perfect fourth and fifth, are considered consonant, while the other two, the minor second and tritone, are considered dissonant. When looking at the acoustic representations of the consonant harmonic intervals there is a clear pattern of peaks and the autocorrelation is perfectly periodic. The period of the autocorrelation corresponds to the fundamental of bass of the interval. Yet with the dissonant intervals, there was no true periodicity. Instead the acoustic spikes occurred inconsistently.
![]() |
Using 100 cats, the experimenters measured the firings of the auditory nerve fibers on the cat's cochlear nucleus using electrodes implanted in the brainstem. Similar to the best frequencies in the section on tonotopic organization there is frequency selectivity present with the spiral ganglion cells and the nerve fiber "will only increase the number of action potentials it fires if it is sensitive to the frequencies present in the interval."[3]
Using the data collected, the researchers measured the the intervals between the spikes and then constructed an all-order histogram for the interspike intervals (ISIs). This all-order histogram is equivalent to the autocorrelation of the acoustic waveforms. In examining the histogram for the consonant intervals, it is obvious that the major peaks appear with clear periodicity corresponding to the fundamental bass. Dissonant harmonic intervals' ISIs are irregular and contain little or no representation of pitches corresponding to notes in the interval or the fundamental bass.[3] As was seen in the autocorrelation, there is again no clear periodicity of spikes. The neural response appears to mirror the acoustic representation of the intervals.
![]() |
The reason that the consonant intervals' autocorrelations and histograms exhibit a clear periodicity and the dissonant intervals fail to demonstrate this relates back to the overtone series. Existing with each note of an interval are the inherent harmonics. The notes in consonant intervals, such as the perfect fourth and fifth, have some of the same overtones. Thus, the harmonic series of each note reinforces the other. However, the harmonics of the notes in the dissonant intervals, such as the tritone and minor second, do not reinforce each other in this way. This results in periodic amplitude fluctuations, known as beats, which make the tone combination sound rough or unpleasant. The audio example below presents a comparison of a perfect fifth followed by another fifth with the top note slightly lowered resulting in a clear example of beats.
Lesion Study
In another study, a patient, MHS, who had bilateral lesion of the auditory cortices, was examined. In this study the experimenter presented the patient with either a major triad or a major triad with the top note slightly flattened, similar to the audio example presented above. After the chord was presented the patient was asked to indicate whether the chord was out of tune or properly tuned. The patient answered with an accuracy of 56% which is two standard deviations below the norm of 13 controls. [3]
There are two possible interpretations of MHS's results.
- MHS was having difficulty extracting the pitches of chord frequency components and analyzing their harmonic relationships
- He heard more roughness in the chords than the controls because his effective critical bandwidths were wider. (less precision of pitch perception )[3]
In addition to his inability to recognize the tuning of the chords, in another study he also demonstrated an impairment in his ability to detect the direction of pitch change between two notes. This was determined using a pitch discrimination test where the subject must identify whether the pitch is going up or down. The changes in frequency get progressively smaller. MHS "performed poorly in the final third of the test where the ΔFs [changes in frequency] were smallest."[3]
Conclusion
Drawing on the evidence presented by these studies we can conclude that representation of roughness or dissonance exists in the patterns of neural activity at both the level of the auditory nerve fibers, as seen in the trains of action potentials of the spiral ganglion cells, and in the cerebral cortex, specifically the auditory cortices and the planum temporale. In addition there is a clearly inverse relationship between the amount of temporal fluctuation in amplitude and perception of consonance, or put more simply, the more beat present in a sound, the less consonant it seems. Finally, bilateral lesions to the auditory cortices can lead to severe impairments in perception of consonance with a particular bias to perceive well-tuned chords as out of tune.








