Abstract

Spelling errors are common in our daily life and the industrial application, caused by automatic speech recognition, optional character recognition and human writing. Because of lack of robustness, Text classification models trained on clean datasets tend to perform poorly on the datasets with spelling errors. We conduct experiments to find out the influence of spelling errors on the performance of Chinese text classification and solve the Chinese text classification task with spelling errors by multi-task fine-tuning on BERT. We use spelling errors correction task to assist the text classification task. The results on four Chinese text classification datasets show that our method can effectively improve the robustness of the classification model which decrease the influence of spelling errors and prove the effectiveness of multi-task fine-tuning on BERT.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.