Abstract

In order to improve movie audio scene (MAS) recognition accuracy, weighted finite-state transducer (WFST) is proposed to recognize MAS in this paper. WFST is introduced firstly, how to construct WFST is introduced secondly, WFST is used to recognize MAS using FBANK, MFCC and PLPCC, separately. The experimental results on twenty MASs using the three features shows that WFST can recognize MAS well, FBANK feature performs better than MFCC and PLPCC, which can reach 79.9%.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call