Article Text


Electrophysiological evidence for a defect in the processing of temporal sound patterns in multiple sclerosis
  1. S J Jones,
  2. L Sprague,
  3. M Vaz Pato
  1. The National Hospital for Neurology and Neurosurgery, London, UK
  1. Correspondence to:
 Dr S J Jones, Department of Clinical Neurophysiology, The National Hospital for Neurology and Neurosurgery, Queen Square, London WC1N 3BG, UK;


Objectives: To assess the processing of spectrotemporal sound patterns in multiple sclerosis by using auditory evoked potentials (AEPs) to complex harmonic tones.

Methods: 22 patients with definite multiple sclerosis but mild disability and no auditory complaints were compared with 15 normal controls. Short latency AEPs were recorded using standard methods. Long latency AEPs were recorded to synthesised musical instrument tones, at onset every two seconds, at abrupt frequency changes every two seconds, and at the end of a two second period of 16/s frequency changes. The subjects were inattentive but awake, reading irrelevant material.

Results: Short latency AEPs were abnormal in only 4 of 22 patients, whereas long latency AEPs were abnormal to one or more stimuli in 17 of 22. No significant latency prolongation was seen in response to onset and infrequent frequency changes (P1, N1, P2) but the potentials at the end of 16/s frequency modulations, particularly the P2 peaking approximately 200 ms after the next expected change, were significantly delayed.

Conclusion: The delayed responses appear to be a mild disorder in the processing of change in temporal sound patterns. The delay may be conceived of as extra time taken to compare the incoming sound with the contents of a temporally ordered sensory memory store (the long auditory store or echoic memory), which generates a response when the next expected frequency change fails to occur. The defect cannot be ascribed to lesions of the afferent pathways and so may be due to disseminated brain lesions visible or invisible on magnetic resonance imaging.

  • auditory cortex
  • auditory evoked potential
  • mismatch negativity
  • multiple sclerosis
  • AEP, auditory evoked potential
  • ERP, event related potential
  • MIDI, musical instrument digital interface
  • MRI, magnetic resonance imaging
  • MS, multiple sclerosis

Statistics from

A large proportion of patients with established multiple sclerosis (MS) have lesions of central auditory pathways, which can be shown by using short latency auditory evoked potentials (AEPs).1 Abnormalities are also often observed in long latency event related potentials (ERPs). These, however, cannot usually be ascribed to afferent pathway lesions but presumably to localised or diffuse lesions within the brain.2 Most patients in the advanced stages of MS exhibit some degree of cognitive impairment.3 Minor cognitive deficits have also been noted in the early stages, for example, in the performance of verbal memory and abstract reasoning tasks.4 In a group of mildly affected MS patients whose clinical symptoms were entirely attributable to disease of the spinal cord, the P3 component of the ERP was yet found to be significantly altered as compared with control subjects during performance of auditory and visual working memory tasks.5

An earlier ERP component reflecting short term memory processes is the mismatch negativity.6 This potential is elicited automatically (that is, not requiring the conscious attention of the subject) by discrete sounds that differ in some respect from the preceding sequence of sounds. The mismatch negativity appears to depend on retention of the sounds in a sensory memory store, the long auditory store or echoic memory, which decays over a period of a few seconds.7 The chief distinction between the long auditory store and verbal working memory is that in the former sounds are represented in a precategorical state, effectively as an acoustic image. It is arguable, however, that verbal working memory may depend to some degree on the “rehearsal” of material through precategorical levels of acoustic storage.

From our previous studies of long latency AEPs to complex harmonic tones (synthesised musical instrument sounds)8–10 we have distinguished at least two neuronal populations of the supratemporal cortex, responsive to different types of change in the spectrotemporal structure of the sound. A negative potential peaking at approximately 90 ms and a positivity at 160 ms are produced by sudden, infrequently occurring changes in the distribution of energy across the audible frequency spectrum. These were termed “change-type” N1 (CN1) and CP2 components. Similar responses were described in the early literature to continuous pure tones (reviewed by Näätänen and Picton11) and an apparently similar “acoustic change complex” has recently been described in the context of speech sounds.12 The C potentials are highly sensitive to the rate at which spectral energy changes occur. When the frequencies of the tone were modulated by approximately ± 12% at a rate of 16 changes/s the C potentials were made almost completely refractory but superficially similar N1 and P2 potentials of slightly longer latency, and more anterior distribution on the scalp, were produced on resumption of a steady tone.9,13 These were termed “mismatch-type” N1 (MN1) and MP2 components, on account (we have argued13) of their likely close relation to the mismatch negativity. The amplitudes of the CN1 and the MN1 were found to depend to comparable degrees on the time for which the preceding sound pattern (steady for the CN1, rapidly changing for the MN1) had been present, increasing with the duration of the sound for at least 4.5 seconds.9 This suggested that responses of maximal amplitude may be generated by the appropriate change when the long auditory store is “full” with a particular sound pattern.

In the present study we examined the C and M potentials, together with the conventional P1, N1, and P2 components evoked at the onset of the tone, in a group of patients with relatively mild MS. We consider how the findings may suggest a defect in the processing of temporal sound patterns and may be related to the working memory impairment.


The study was approved by the local ethical committee and all subjects gave their informed consent according to the Declaration of Helsinki. Recordings were initially obtained from 38 patients with definite or suspected MS, referred to the Department of Clinical Neurophysiology of the National Hospital for Neurology and Neurosurgery for investigative tests. The patients in the final group were 22 whose condition, after neurological examination and laboratory tests (including evoked potentials and magnetic resonance imaging (MRI)), was diagnosed as definite MS.14 Twelve female patients and 10 male patients were aged 21–58 years, mean 42.0 (9.1) years. The diagnosis was of relapsing/remitting MS in 15 cases, primary progressive MS in 5, and secondary progressive MS in 2. The duration of symptoms ranged from 5 months to 39 years. None of the patients had any significant history of hearing impairment. The control group comprised 15 volunteers, 8 women and 7 men aged 21–49 years, mean 35.7 (7.3) years, with no significant history of hearing impairment or neurological disease. Long latency AEPs were recorded in a quiet room while the subjects sat in a reclining chair and read a magazine or book. The duration of each recording was approximately one hour. Conventional short latency (< 10 ms) AEPs were also recorded to clicks delivered at 10/s to either ear and compared with results from age and sex matched laboratory controls.

The complex harmonic tones were created by a Yamaha MU10 tone generator (Yamaha Corporation, Hamamatsu, Japan) and controlled by an IBM compatible PC. The stimulus sequences were created using Midisoft Recording Studio software (Diamond Recording Systems, Bellevue, Washington, USA), stored as general musical instrument digital interface (MIDI) files and played using Cubasis (Steinberg Soft and Hardware, Hamburg, Germany). The left and right output channels were split, one being used for the stimulus signal (presented to the left and right ears in separate runs, with a fixed intensity 45–50 dB above the threshold of the control subjects) and the other converted to a square wave pulse to trigger the recording apparatus. The frequency spectra of the tones were determined by fast Fourier transform (Pico Technology, Hardwick, UK; fig 1 upper) and their temporal envelopes were examined. The rise time of the tones was approximately 15 ms and their decay was roughly exponential with a half life of approximately 15 ms (fig 1). Consecutive tones overlapped so as to produce a smooth transition without any perceptible click or bulge.

Figure 1

(Upper) Frequency spectrum of clarinet tone, fundamental frequency 440 Hz, and envelope showing extent of overlap between consecutive tones. (Middle) Recording montage. Ref, reference electrode. (Bottom) Stimulus conditions: 1, onset every two seconds (fundamental 440 Hz, tone on for one second, off for one second); 2, frequency change every two seconds (fundamental 440 Hz to 494 Hz and back); 3, end of two seconds of 16/s frequency oscillation (440 Hz to 494 Hz and back) followed by one second of steady tone.

Silver/silver chloride disc electrodes were attached to the scalp with paste at locations Fpz, Fz, Cz, and Pz on the sagittal midline and lateral locations C3, C4, F7, and F8 of the 10–20 System (fig 1 middle). The reference electrode was on the dorsum of the neck at the base of the skull. Amplification was by CED 1902 amplifiers (Cambridge Electronic Design, Cambridge, UK) and analogue to digital conversion was by a CED 1401 Plus. The recording bandwidth was between 1 Hz and 200 Hz (corner frequencies). Amplified signals were digitised at 1000 points/s for 500 ms starting 50 ms before each stimulus change. There were three stimulus conditions, each repeated three times for either ear (fig 1 lower). The first condition was the onset of a tone of clarinet timbre with a fundamental frequency of 440 Hz. Each tone was presented for one second and was followed by a silent interval of one second. In the second condition, every two seconds a continuous tone of clarinet timbre abruptly changed all its frequencies in geometric proportion, the fundamental changing from 440 Hz to 494 Hz and back. In the third condition a continuous tone of clarinet timbre oscillated between the same two frequencies at a rate of 16 changes/s for two seconds before coming to rest on the higher tone for one second (in this condition the trigger occurred 50 ms before the next expected change). Fifty responses were averaged in each run and the three averaged responses for each stimulus condition were combined into grand average waveforms. P1, N1, and P2 peaks were determined by visual inspection of the waveforms at Fz, their latencies measured from the midpoint of the rise time of the tones. The amplitudes of P1 and N1 were measured from the prestimulus baseline, while P2 amplitude was measured from the peak of N1. Normal limits were defined as the mean (3 SD) of the control group, calculated directly for absolute latency and interaural latency difference measurements and after logarithmic transformation for amplitudes. Normal limits were also calculated for left to right amplitude ratios and latency differences between the three response types. Groups were compared by the Mann-Whitney U test.


Routine evoked potential and MRI findings

Abnormalities of routine visual or somatosensory evoked potentials were seen in 18 of 22 patients. Short latency AEPs, however, were abnormal in only 4 of 22. Two additional patients had equivocally abnormal short latency AEPs, in one of them suggesting a mild peripheral hearing defect. On T2 weighted MRI, disseminated brain abnormalities compatible with demyelination were seen in 20 of 22 patients. In cases 2 and 21 (both with relapsing/remitting MS diagnosed on clinical and evoked potential criteria) the brain images were reported to be inconclusive and normal, respectively. In no instances did the MRI abnormalities appear specifically to involve auditory structures of the brain.

Long latency AEPs

Normative data

Reproducible responses to each of the three stimulus types were recorded in every subject. A P1 potential peaking at approximately 50 ms was present in the responses to onset (OP1) and frequency change (CP1) but not at the end of oscillatory frequency changes. N1 potentials at 65–145 ms and P2 potentials at 120–260 ms were present to all three stimuli, termed ON1/OP2 (onset), CP1/CP2 (frequency change), and MN1/MP2 (mismatch-type at the end of frequency oscillation). The C potentials were on average 5–15 ms longer in latency than the O potentials and the M potentials were 15–25 ms longer in latency than the C potentials (table 1). The MN1 and MP2 peaks were consistently largest at Fz, while the O and C potentials were of comparable magnitude at Fz and Cz (the amplitudes used for analysis were all measured at Fz; table 2). No significant differences were found between the responses to left and right ear stimulation, nor were there any significant amplitude asymmetries between corresponding left and right sided electrodes (however, for left ear stimulation the N1 and P2 amplitudes were on average larger at right sided electrodes in 12 of 12 comparisons (3 stimulus conditions × 2 components × 2 electrode pairs), while for right ear stimulation the N1 and P2 amplitudes were larger on the right in 9 of 12 comparisons).

Table 1

Latency measures (ms) in each group and normal latency limits

Table 2

Amplitude measures (μV) and normal limits. The N1 peaks were measured from the prestimulus baseline and the P2 peaks from N1

Individual patient data

Measurable responses were also recorded to all three stimulus types in every patient. When compared with the limits of the control group (mean (3 SD); table 1 and table 2), 17 of the patients with MS had one or more long latency AEP measures that were outside the normal range, in absolute terms, in terms of the interaural latency difference or amplitude ratio, or in terms of the latency difference between the three response types (table 3). Abnormalities of the O potentials were seen in 10 patients, of the C potentials in 6, and of the M potentials in 10. Latency abnormalities were present in 1, 4, and 9 patients, respectively.

Table 3

Auditory evoked potential abnormalities in the patient group, with criteria of low amplitude, prolonged latency, and increased left to right and interresponse latency differences. All other potentials were within normal limits (mean ±3SD)

In no patients were the long latency AEPs abnormal to all three stimulus types in the same ear. Among the four patients with abnormal short latency AEPs, one with bilateral involvement had abnormal long latency responses to onset on both sides and to frequency change on the left. Two patients had abnormal long latency AEPs only to stimuli delivered to the ear contralateral to that from which the short latency AEPs were abnormal. There was no apparent association, therefore, between long latency AEP abnormalities and lesions of the afferent auditory pathways suggested by short latency AEPs.

Intergroup latency comparisons

When the two groups were statistically compared using the Mann-Whitney U test, no significant latency differences were found for either the onset responses or the responses to changes of frequency (table 1). Significant differences were observed, however, in the responses at the end of frequency oscillation, the MP2 being significantly delayed in the patient group to stimulation of either ear, while the MN1 was significantly delayed on the left and just non-significantly on the right. After Bonferroni correction for 16 intergroup latency comparisons, the effect on the MP2 remained significant for both sides. When the group mean waveforms are superimposed (fig 2), the ON1, OP2, CN1, and CP2 peaks are seen to correspond closely in latency between patients and controls, while a small latency disparity is suggested for the MN1 and a more pronounced disparity is evident for the MP2.

Figure 2

Superimposition of group mean responses of the patients with multiple sclerosis (darker line) and the controls (paler line) to left and right ear stimulation, sagittal midline electrodes. Note the close correspondence between peak latencies for the two groups in the responses to onset (O) and frequency change (C) but the slight delay at the end of frequency oscillation (M) of the MN1 in the multiple sclerosis group and the more pronounced delay of the MP2.

The latency differences between O, C, and M potentials were also compared between groups (table 4). No significant intergroup effects were observed for the C-O differences. However, the MP2-OP2 and MP2-CP2 differences were all significantly larger in the patient group (p < 0.01 or better) while the MN1-CN1 latency difference was significantly larger in the patient group for responses to stimulation of the left ear only (p < 0.05).

Table 4

Comparison of interresponse latency differences between the control and patient groups

Intergroup amplitude comparisons

All AEP amplitudes were on average slightly lower in the patients than in the controls (table 2). Before Bonferroni correction this reached significance (p < 0.05) for the left and right ear OP2 but after Bonferroni correction the differences were all non-significant.

Effect of duration and classification of disease

No significant differences were found when seven patients with disease duration longer than six years were compared with the remainder of the patient group. Most latencies were longer on average in seven patients classified as primary or secondary progressive MS, as compared with the remainder classified as having relapsing/remitting MS, but the differences were all non-significant.


We have previously described the properties of long latency AEPs to spectrotemporal modulation of complex tones in normal subjects.8–10,13 One possible clinical application of these techniques has been explored in a group of patients with brain injury15 to establish the degree to which cortical mechanisms for processing complex sounds may be preserved and to what degree this is correlated with behavioural responsiveness.

In the present study as in previous ones, the M potentials were distributed slightly more anteriorly on the scalp than the C and O potentials, whose distributions were similar to one another. On average most responses were slightly larger on the right, although left-right differences were not significant. Our interpretation of the C potentials is that they reflect a process for analysing the distribution of energy across the spectral envelope, responses being generated when the spectral profile of the sound abruptly changes causing an immediate change in the impression of pitch or timbre (intensity changes have not yet been investigated).10 From their similar morphology and scalp distribution, it seems likely that the O potentials at the onset of the tone from silence largely reflect the same process. The M potentials, on the other hand, are elicited either when an unexpected change occurs in a rapid sequence16 or at the moment an expected change fails to occur.13 Therefore, whereas the C process seems to be governed purely by the spectral composition over the few seconds preceding the change, the M process is sensitive to the temporal structure.

Somewhat paradoxically, in view of the high incidence of abnormal short latency AEPs in the MS population (although not in the relatively mildly affected patients of the present study), the N1 and P2 potentials to clicks and the onset of tone bursts are seldom found to be delayed.17–19 One study in which the N1 and P2 potentials were reportedly affected is that of Giesser et al,20 but this was in demented as compared with non-demented patients. More recently, Hendler et al21 found long latency AEPs to be abnormal only in conjunction with bilaterally abnormal short latency potentials. If lesions of the afferent auditory pathway tend to cause conduction delays of only a few milliseconds, it is understandable that this may be insufficient to cause long latency potentials to exceed the normal latency range. It may also be significant that the pathways to the auditory cortex are at least partially bilateral with several decussations, such that input may arrive by more than one route.

The evidence for cognitive dysfunction in MS includes patients in the early stages of the disease.4,22 Some studies have noted a tendency for certain forms of memory to be more affected than others, the emphasis usually being on working memory and retrieval of long term memory.23–30 Others, however, have concluded that all memory domains are likely to be impaired31 and that visual and auditory modalities are not differentially affected.32 Specific deficits noted by some authors include accelerated forgetting33 and increased short term memory scanning time.34 Follow up studies have observed that cognitive and neurological deficits do not necessarily develop in parallel,4 suggesting that the former may be caused by disseminated cerebral lesions that are neurologically silent. Some correlation has been noted between the degree of cognitive impairment and the lesion load seen in MRIs obtained using conventional spin echo sequences.35 Others, however, have found that such lesions account for the severity of cognitive impairment only partially36,37 and that it may be necessary also to postulate dysfunction of the normal appearing white matter. A closer association between MRI lesion load and cognitive performance (as assessed by an auditory verbal learning test) was obtained using a fast fluid attenuated inversion recovery technique,38 which resolved many more small perivascular lesions. Functional imaging studies have suggested a hypometabolism of structures including the thalamus and the deep grey matter of the temporal lobe to account for impaired episodic memory performance.39

Cognitive ERPs are frequently abnormal in MS,17,20,40,41 even in the early stages.2 In relatively mildly affected patients a tendency has been noted for responses reflecting auditory working memory to be more impaired than those to similar tasks in the visual modality, the defect being ascribed to dysfunction of the “phonological loop”,42 which may conceivably involve rehearsal of verbal material through precategorical levels of acoustic storage. However, deficits in ERPs during acquisition and recall of both auditory verbal and visuospatial material have been reported in patients whose clinical presentation was confined to symptoms implicating the spinal cord.5 Although the N2 ERP component is frequently reported to be affected in addition to the P3, most ERP studies have noted no significant involvement of the N1 or P2.17,19,41,43

It is remarkable that patients with MS rarely complain of auditory symptoms. In psychoacoustic tests, subtle defects have been recognised that mostly appear to reflect abnormalities of the subcortical auditory pathways.44 Abnormalities of interaural time difference discrimination have been frequently reported, particularly in patients with known brainstem lesions,45 and one study46 found a large proportion of patients with MS who experienced difficulties with a task designed to test the temporal precedence effect in sound localisation. It is understandable that these defects are frequently associated with abnormalities of short latency AEPs, in which the measured conduction delays are of the same order as the temporal discrimination deficit.

Very few attempts have been made to identify the kinds of deficits that may be associated with lesions of higher auditory structures in MS. In one study,47 abnormal binaural masking level differences (a test of the perception of interaural phase relations) were always found to be associated with abnormal short and middle latency AEPs but increased temporal gap detection thresholds for monaural sounds (seen in only 2 of 15 cases) appeared to be specifically related to abnormalities of long latency AEPs. Perhaps more directly relevant to the present study is a report by Rappaport et al48 describing a selective impairment of speech sound recognition in the presence of interrupted background noise. This was interpreted as being predominantly due to damage to the auditory pathways of the forebrain. In normal subjects we have found the latency of the MN1 to vary with the strength of perceptual “streaming”, a relatively short latency being associated with wider frequency separations between consecutive tones and thus a greater tendency for them to be perceived as forming separate streams.16 A defect of this mechanism may therefore cause difficulties in the analysis of changing sound patterns such as speech, particularly when interfering sounds are intermittently present.

In conclusion, among a group of patients in the relatively early stages of MS (most of them with a recent diagnosis and not severely physically or cognitively impaired) we have shown a subtle but consistent delay in the cortical responses to the cessation of rapid, repetitive frequency changes but not to the onset of tones or to infrequent changes of frequency. This cannot be attributed to lesions of the afferent auditory pathways and may therefore result from a defect of corticothalamic or corticocortical circuits concerned with analysing the temporal structure of sound sequences. The likelihood that the affected long latency AEPs depend on the retention of sounds in a short term sensory memory store suggests that this disorder may reflect at a lower physiological level the impaired mnemonic precesses that are a common feature of MS.


View Abstract


  • Competing interests. none declared

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.