Abstract

In this paper, we propose competing models based on text-prompted speaker independent verification algorithm for an intelligent surveillance guard robot, wherein a robot prompts a code (i.e. word or phrase) for user entrance authentication. The proposed system requires the text-prompted speaker independent verification. In addition, it does not require a speaker dependent model and extra trained model as an alternative hypothesis for log-likelihood ratio test because of memory limitation. This is due to the given application scenario that an administrator changes the voice code every day for security reasoning and the targeting domain is unlimited. To resolve these issues, we propose to exploit the sub-word based anti-models for log-likelihood normalization through reusing an acoustic model and competing with voice code model. Anti-models using automatic production rules are set up in an initial time by using the statistical distance of phonemes against a voice code. The proposed system uses a two-pass strategy using a SCHMM-based recognition and verification step. In addition, a harmonics-based spectral subtraction algorithm is applied for a noisy robustness on an outdoor environment. The performance evaluation is done by using a common Korean database, PBW452DB, which consists of 63,280 utterances of 452 isolated words recorded in silent environment.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call