Communication sounds across all mammals consist of multiple frequencies repeated in sequence. The onset and offset of vocalizations are potentially important cues for recognizing distinct units, such as phonemes and syllables, which are needed to perceive meaningful communication. The superior paraolivary nucleus (SPON) in the auditory brainstem has been implicated in the processing of rhythmic sounds. Here, we compared how best frequency tones (BFTs), broadband noise (BBN), and natural mouse calls elicit onset and offset spiking in the mouse SPON. The results demonstrate that onset spiking typically occurs in response to BBN, but not BFT stimulation, while spiking at the sound offset occurs for both stimulus types. This effect of stimulus bandwidth on spiking is consistent with two of the established inputs to the SPON from the octopus cells (onset spiking) and medial nucleus of the trapezoid body (offset spiking). Natural mouse calls elicit two main spiking peaks. The first spiking peak, which is weak or absent with BFT stimulation, occurs most consistently during the call envelope, while the second spiking peak occurs at the call offset. This suggests that the combined spiking activity in the SPON elicited by vocalizations reflects the entire envelope, that is, the coarse amplitude waveform. Since the output from the SPON is purely inhibitory, it is speculated that, at the level of the inferior colliculus, the broadly tuned first peak may improve the signal-to-noise ratio of the subsequent, more call frequency-specific peak. Thus, the SPON may provide a dual inhibition mechanism for tracking phonetic boundaries in social-vocal communication.