Abstract

In a speech segment detection method, a sequence of speech samples is provided from an input speech signal and a sequence of feature vectors is provided from the speech samples, the feature vectors having respective speech power levels. A minimum speech power among the speech power levels in the feature vector sequence is detected. Normalized speech power levels are computed based on the speech power levels and the minimum speech power. Each of the normalized speech power levels is compared with a predetermined threshold value to detect speech segments in the input speech signal. Further, a speech recognition system and method and a computer-readable medium, using the speech segment detection method, are also disclosed.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call