Diachronic Semantic Tracking for Chinese Words and Morphemes over Centuries

Yang Chi,Fausto Giunchiglia,Hao Xu,Fausto Giunchiglia,Hao Xu,Fausto Giunchiglia

doi:10.3390/electronics13091728

Abstract

Lexical semantic changes spanning centuries can reveal the complicated developing process of language and social culture. In recent years, natural language processing (NLP) methods have been applied in this field to provide insight into the diachronic frequency change for word senses from large-scale historical corpus, for instance, analyzing which senses appear, increase, or decrease at which times. However, there is still a lack of Chinese diachronic corpus and dataset in this field to support supervised learning and text mining, and at the method level, few existing works analyze the Chinese semantic changes at the level of morpheme. This paper constructs a diachronic Chinese dataset for semantic tracking applications spanning 3000 years and extends the existing framework to the level of Chinese characters and morphemes, which contains four main steps of contextual sense representation, sense identification, morpheme sense mining, and diachronic semantic change representation. The experiment shows the effectiveness of our method in each step. Finally, in an interesting statistic, we discover the strong positive correlation of frequency and changing trend between monosyllabic word sense and the corresponding morpheme.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Diachronic Semantic Tracking for Chinese Words and Morphemes over Centuries

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Journal: Electronics	Publication Date: Apr 30, 2024
License type: CC BY 4.0

Similar Papers

NLP methods in host-based intrusion detection systems: A systematic review and future directions
Zarrin Tasnim Sworna ... Muhammad Ali Babar
Journal of Network and Computer Applications | VOL. 220
Zarrin Tasnim Sworna, et. al.Zarrin Tasnim Sworna ... Muhammad Ali Babar
06 Oct 2023
Journal of Network and Computer Applications | VOL. 220

Survey of Natural Language Processing Techniques in Bioinformatics.
Zhiqiang Zeng ... Yun Wu
Computational and Mathematical Methods in Medicine | VOL. 2015
Zhiqiang Zeng, et. al.Zhiqiang Zeng ... Yun Wu
01 Jan 2015
Computational and Mathematical Methods in Medicine | VOL. 2015

Learning Relevant Models using Symbolic Regression for Automatic Text Summarization
Eder Vazquez Vazquez ... Yulia Ledeneva
Computación y Sistemas | VOL. 23
Eder Vazquez Vazquez, et. al.Eder Vazquez Vazquez ... Yulia Ledeneva
30 Mar 2019
Computación y Sistemas | VOL. 23

Dynamic-automatic pipelines for finding topic-specific information clusters using NLP methods in connection with a model-driven approach
Tobias Dorrn ... Achim Kuwertz
-
Tobias Dorrn, et. al.Tobias Dorrn ... Achim Kuwertz
28 Oct 2022
28 Oct 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Diachronic Semantic Tracking for Chinese Words and Morphemes over Centuries

Abstract

Talk to us

Similar Papers

More From: Electronics