Abstract

The act of comparing is a comprehensive human cognitive process based on judgments about similarities and differences in value and degree. Inheriting its complex nature, Chinese comparative structure shows various aspects worth researching in the study of Artificial Intelligence and Linguistics. In response to this, this paper uncovers the structural, distributional features of the main Chinese equative constructions (等比句) and its semantic roles, namely Comparative Elements (CEs), based on which implemented automatic CEs extraction.<BR> First, based on comparative prepositional phrase, the equative constructions are classified into four main categories: A. [和······相同]; B. [和/像······一样]; C. [和/像······一般]; D. [有/像······这麽(那麽)]. Next, structural and semantic differences, collocational features, and usage patterns for each type were revealed. For example, type A functions as a central predicate, while type D is mainly used as an adverbial phrase. ‘一般’ in type C generally has stronger metaphorical implication than ‘一样’ in type D, and tends to co-occur with preposition ‘像’. Finally, we established a rule-based CEs extraction model employing structural and lexical features, and measured its accuracy. As a result, we found a blind spot in which the model could not correctly recognize the comparative elements of SUB and DIM that are likely to appear remote from comparative prepositional phrases. This suggests that probabilistic approach is required to identify CEs that deviates from the marked construction.<BR> With these findings, this paper established the basis for automatic Comparative Elements extraction by suggesting the main types and structural features of Chinese equative constructions through quantitative analysis of the actual corpus. Besides, the extraction model presented in this paper can be used to retrieve and summarize comparative information in large-scale texts such as newspapers, emails, product reviews, and social network data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call