Robust Word Vectors: Context-Informed Embeddings for Noisy Texts

V Malykh, T Khakhulin, V Logacheva

doi:10.1007/s10958-023-06523-w

Open Access

Journal: Journal of Mathematical Sciences

Publication Date: Jun 22, 2023

Affiliation: Moscow Institute of Physics and Technology, Steklov Mathematical Institute, Institute for Systems Analysis, Russian Academy of Sciences, Skolkovo Institute of Science and Technology

#Natural Language Processing #User-generated Content

Abstract

We suggest a new language-independent architecture of robust word vectors (RoVe). It is designed to alleviate the issue of typos and misspellings, common in almost any user-generated content, which hinder automatic text processing. Our model is morphologically motivated, which allows it to deal with unseen word forms in morphologically rich languages. We present the results on a number of natural language processing (NLP) tasks and languages for a variety of related architectures and show that the proposed architecture is robust to typos.
