Abstract

Document-level event argument extraction (EAE), a critical task for event knowledge acquisition, seeks to find event arguments beyond the sentence level. Previous approaches for this task adopt supervised learning, which suffers from the data scarcity issue, especially in low-resource situations. To address this challenge, we propose a new method for document-level EAE that addresses the data scarcity from two perspectives: self-augmentation and cross-domain joint training. On the one hand, our method can generate additional training samples from an existing dataset using pre-trained language models, considerably expanding the training set; on the other hand, our method can learn with out-of-domain datasets with varying event schema, using a cross-domain joint training framework. To address data quality problems and domain mismatches, we also provide a new noise filtering method based on a teacher–student framework. The extensive experiments demonstrate that our approach delivers state-of-the-art performance on benchmark datasets, and that it is especially effective in low-resource situations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.