Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA 02139

A procedure has been developed that automatically extracts pulse excitation information from the speech signal. This time‐domain algorithm operates on the transduction stage outputs of an auditory model developed by Seneff [this meeting]. Specifically, robust inflection points are determined from a waveform obtained by summing the channel outputs. This pulselike representation provides information about periodic and aperiodic excitation, and can be used to extract periodicities in voiced regions. Thus the information provided by this signal complements the spectral information provided by mean‐rate response outputs of the auditory model. Furthermore, this signal can be used to sample channel outputs pulse synchronously. This produces a spectral representation with sharper temporal onsets and offsets, and with an enhanced dynamic range in the frequency dimension. [Work supported by DARPA under Contract N00039‐85‐C‐0254, monitored through Naval Electronics Systems Command.]
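The processing chain described above (sum the channel outputs, locate inflection points, then sample the channels at those instants) might be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not specify how an inflection point is judged "robust", so the slope threshold used here, the function names, and the parameter `rel_threshold` are all hypothetical and are not the authors' criterion or implementation.

```python
def sum_channels(channels):
    """Sum the per-channel outputs sample by sample."""
    return [sum(samples) for samples in zip(*channels)]


def rising_inflection_points(wave, rel_threshold=0.5):
    """Return indices where the slope of `wave` reaches a local maximum.

    ASSUMPTION: an inflection point counts as "robust" when its slope is
    positive and at least `rel_threshold` times the largest slope in the
    waveform. This test is a placeholder chosen for illustration only.
    """
    slope = [b - a for a, b in zip(wave, wave[1:])]
    if not slope:
        return []
    floor = rel_threshold * max(slope)
    pulses = []
    for i in range(1, len(slope) - 1):
        is_slope_peak = slope[i - 1] < slope[i] >= slope[i + 1]
        if is_slope_peak and slope[i] > 0 and slope[i] >= floor:
            pulses.append(i)
    return pulses


def pulse_intervals(pulses):
    """Intervals (in samples) between consecutive pulses."""
    return [b - a for a, b in zip(pulses, pulses[1:])]


def sample_pulse_synchronously(channels, pulses):
    """Read every channel at each pulse index: one spectral slice per pulse."""
    return [[channel[t] for channel in channels] for t in pulses]


# Synthetic stand-in for two channel outputs: a pattern repeating every 10 samples.
pattern = [0, 0, 0, 1, 4, 5, 5, 3, 1, 0]
channel_a = pattern * 4
channel_b = [2 * x for x in pattern] * 4
channels = [channel_a, channel_b]

summed = sum_channels(channels)
pulses = rising_inflection_points(summed)
intervals = pulse_intervals(pulses)
slices = sample_pulse_synchronously(channels, pulses)
```

In this toy case the pulse intervals are constant, which is the kind of regularity one would expect in a voiced region; irregular intervals would correspond to aperiodic excitation.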