Abstract

This study aimed to automatically extract frequent words from speech data in Japanese. The length of words that can be extracted by the previous method was up to 2 symbols length. So, in this paper, we aimed to extract sub-sequences longer than 3 symbols. To extract sub-sequences longer than 3 symbols length, we proposed a new structure based on the neural network of the previous method. The new structure can extract longer sub-sequences by repeatedly stacking the structure which extracts sub-sequences of two symbol length. In order to confirm that the proposed method can extract frequent words from speech data by real time processing, we gave reading aloud data of Japanese to the proposed neural network and confirmed that the neural network can extracted frequent words. We confirmed that this neural network extracted a frequent word of 4 symbol length.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.