Abstract
Commonsense knowledge graphs (CSKG) are crucial for artificial intelligence systems to understand natural language. Recently, with the construction of COMET (Commonsense Transformer) and ATOMIC2020, a comprehensive coverage commonsense reasoning knowledge graph, CSKG research is increasingly vital in natural language understanding and reasoning. Since sentiment commonsense knowledge is understudied yet, our work focuses on improving the generation of sentiment commonsense in ATOMIC2020. We first show a problem in natural language generation that degrades the accuracy due to the unbalanced sentiment distribution in the dataset. Next, we propose the EDA (Easy Data Augmentation) and UDA(Unsupervised Data Augmentation) based methods that improve the accuracy through biased mitigation of the unbalanced dataset. Our experimental results show that EDA method has little effect on the accuracy, while UDA-based method has some accuracy improvements in ROUGE-I, ROUGE-2, and ROUGE-L.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.