Abstract

Chinese authors indirectly portray real social relationships in society by incorporating their own experiences and feelings about politics, economic life, and cultural habits. Social and natural knowledge are mixed and hidden in natural literary language. Therefore, by well defining the type of original text, the pipeline for processing the text and the rules to build the graph, we can extract enough valid and useful information. Based on higher-level information, we can build a literary character graph to promote computers' comprehension of literature. The identification of opposite characters is a challenging topic because the character relationships are intricate and complex. The cryptic expression in literary works can enhance the readability of the plot, but it is obvious that it increases the cost of understanding. Even for human readers different types of characters are difficult to classify and identify, and it usually needs careful reading for several times to get the subtle differences among the characters. Though extracting knowledge from Chinese literature is a complicated and difficult topic, this paper models the graph of polar literary characters and divides polar communities by extracting polar vertices and polar edges based on the concept of literary sentiment polarity. The results show that using a long-sentence window is a good trade-off. The experiments of modern Chinese polar literature show that the accuracy of the community division method for the integrated graph polarity is obviously better than the method based on the co-occurrence network, and it can automatically match the positive and negative communities as well as build the complete graph structure. It clearly reflects the meaning of literary works from the perspective of polarity. This systematic model describes general standard operation steps for the machine to understand complex Chinese literature. The experiments conducted on seven benchmark Chinese novel datasets demonstrate that the method based on emotional polarity shows a significant improvement compared to baseline performance. Though we use Chinese datasets in this paper, the model and methods are important references for literature analysis and graph extraction in other languages.

Highlights

  • Literature is an important form for people to spread culture and exchange ideas by shaping images and reflectingThe associate editor coordinating the review of this manuscript and approving it for publication was Bin Liu .social life through language [1]

  • The searching process can be substituted by deterministic finite automaton (DFA) or Bayesian spam filter (BSF) to extract graph knowledge from the huge amount of literature materials

  • The literary space entropy clearly defines the process of extracting information from the natural text in long texts, and the literary interaction window defines the range of nodes and polar morpheme indicators

Read more

Summary

INTRODUCTION

Literature is an important form for people to spread culture and exchange ideas by shaping images and reflecting. Regardless of the emotional polarity information, the .literary character set V and the link set E are extracted from TV , and the weights of the links w[e(vi, vj)] in E indicate the interaction strength between nodes In this way, a simple community division result is obtained, that is, two community subnets. Since E does not contain emotional polarity information, these two subnets are the final division results, and the positive and negative polarities need to be manually matched This method has flaws because it only considers the co-occurrence network structure and does not consider the context semantics of the co-occurrence of character nodes (i.e. co-occurring words, the emotional tendency of co-occurrence paragraphs) [37]. The nodes are directly classified into the community, and the result does not include the graph structure of the network

COMMUNITY CLASSIFICATION BASED ON THE LINK POLARITY
COMMUNITY CLASSIFICATION BASED ON THE POLARITY OF THE ENTIRE NETWORK
CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.