Resource Creation for Training and Testing of Normalisation Systems for Konkani-English Code-Mixed Social Media Text

Akshata Phadte

doi:10.1007/978-3-319-91947-8_26

Resource Creation for Training and Testing of Normalisation Systems for Konkani-English Code-Mixed Social Media Text

Akshata Phadte

https://doi.org/10.1007/978-3-319-91947-8_26

Copy DOI

Publication Date: Jan 1, 2018

Citations: 2

Affiliation: Goa University

#Code-mixed Social Media #Social Media Data + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Code-Mixing is the mixing of two or more languages or language varieties in speech. Apart from the inherent linguistic complexity, the analysis of code-mixed content poses complex challenges owing to the presence of spelling variations and non-adherence to a formal grammar. However, for any downstream Natural Language Processing task, tools that are able to process and analyze code-mixed social media data are required. Currently there is a lack of publicly available resources for code-mixed Konkani-English social media data, while the amount of such text is increasing everyday. The lack of a standard dataset to evaluate these systems makes it difficult to make any meaningful comparisons of their relative accuracies.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.