Research on Chinese Audio and Text Alignment Algorithm Based on AIC-FCM and Doc2Vec

Keliang Chen,Weizheng Ren,Yansong Cui,Jianming Huang

doi:10.1145/3532852

Keliang Chen, Weizheng Ren + Show 2 more

Open Access

PDF Available

https://doi.org/10.1145/3532852

Copy DOI

Export

Save

Cite

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

‘‘Audiobook” is a multimedia-based reading technology that has emerged in recent years. Realizing the alignment of e-book text and book audio is the most important part of its processing. This article describes an audio and text alignment algorithm using deep learning and neural network technology to improve the efficiency and quality of audiobook production. The algorithm first uses dual-threshold endpoint detection technology to segment long audio into short audio with sentence dimensions and recognizes it as short text. The threshold is calculated by AIC-FCM optimized based on simulated annealing genetic algorithm. Then the algorithm uses Doc2vec optimized by the threshold prediction method based on the average length of the short text to calculate the text similarity. Finally, proofread and output the text sequence and audio segment aligned in the time dimension to meet the needs of audiobook production. Experiments show that compared to traditional audio and text alignment algorithms, the proposed algorithm is closer to the ideal segmentation result in long audio segmentation, and the alignment effect is basically the same as Doc2vec and the time complexity is reduced by about 35%.

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Research on Chinese Audio and Text Alignment Algorithm Based on AIC-FCM and Doc2Vec

Abstract

Published Version (Free)

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: Mar 31, 2023
Citations: 2

Similar Papers

Deep Learning and Blockchain for Electronic Health Record in Healthcare System
Ch Sravanthi ... Smitha Chowdary
-
Ch Sravanthi, et. al.Ch Sravanthi ... Smitha Chowdary
28 Oct 2022
28 Oct 2022

Deep Learning for Autonomous Driving System
Karuppasamy Pandiyan M ... Sreenatha Reddy S
-
Karuppasamy Pandiyan M, et. al.Karuppasamy Pandiyan M ... Sreenatha Reddy S
04 Aug 2021
04 Aug 2021

Research on power monitoring network attack detection technology based on deep learning
Zhihua Wang ... Jian Zhou
-
Zhihua Wang, et. al.Zhihua Wang ... Jian Zhou
19 May 2022
19 May 2022

Deep PHM: IoT-Based Deep Learning Approach on Prediction of Prognostics and Health Management of an Aircraft Engine
R Mohammed Harun Babu ... M Shebana
-
R Mohammed Harun Babu, et. al.R Mohammed Harun Babu ... M Shebana
01 Nov 2022
01 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Research on Chinese Audio and Text Alignment Algorithm Based on AIC-FCM and Doc2Vec

Abstract

Published Version (Free)

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing