Joint entity and relation extraction refers to the extraction of entities and their corresponding relationships in the given sentence, which has gained increasing attention in recent years. Some joint extraction models utilize a shared encoder to model the interactions between named entity recognition and relation extraction subtasks. Despite achieving decent performance, they inevitably face the issue of error propagation. One-step exhaustive methods can mitigate the error propagation problem to some extent, but they suffer from issues such as huge computation complexity and a proliferation of negative samples. Therefore, addressing the problems mentioned above, we propose a Greedy Filter Network that combines Greedy-NER and Filter-RE. GFN employs the Greedy-NER with a redundancy strategy to prioritize recall, thereby reducing error propagation between subtasks. To reduce the computational complexity, we design an innovative approach to represent and store spans in Greedy-NER. In Filter-RE, we traverse all pairwise combinations of candidate entities. To address the issue of widespread negative samples, we design a denoising strategy with two filters, effectively filtering out entity pairs without relations, which can eliminate noise and alleviate the issue of negative sample proliferation. Finally, to enable flexible control over the redundancy strategy, we design two misclassifying penalty parameters for each module. The experimental results indicate that GFN achieves the state-of-the-art F1-score on the CoNLL04 and NYT datasets, with a notable 2.0% improvement observed specifically on CoNLL04.
Read full abstract