Addressing Class Imbalance for Improved Recognition of Implicit Discourse Relations

  • Abstract
  • Literature Map
  • Similar Papers
Abstract
Translate article icon Translate Article Star icon

In this paper we address the problem of skewed class distribution in implicit discourse relation recognition. We examine the performance of classifiers for both binary classification predicting if a particular relation holds or not and for multi-class prediction. We review prior work to point out that the problem has been addressed differently for the binary and multi-class problems. We demonstrate that adopting a unified approach can significantly improve the performance of multi-class prediction. We also propose an approach that makes better use of the full annotations in the training set when downsampling is used. We report significant absolute improvements in performance in multi-class prediction, as well as significant improvement of binary classifiers for detecting the presence of implicit Temporal, Comparison and Contingency relations.

Similar Papers
  • Conference Article
  • 10.18653/v1/w14-4320
Addressing Class Imbalance for Improved Recognition of Implicit Discourse Relations
  • Jan 1, 2014
  • Junyi Jessy Li + 1 more

In this paper we address the problem of skewed class distribution in implicit discourse relation recognition.We examine the performance of classifiers for both binary classification predicting if a particular relation holds or not and for multi-class prediction.We review prior work to point out that the problem has been addressed differently for the binary and multi-class problems.We demonstrate that adopting a unified approach can significantly improve the performance of multi-class prediction.We also propose an approach that makes better use of the full annotations in the training set when downsampling is used.We report significant absolute improvements in performance in multi-class prediction, as well as significant improvement of binary classifiers for detecting the presence of implicit Temporal, Comparison and Contingency relations.

  • Research Article
  • Cite Count Icon 16
  • 10.1145/3028772
Implicit Discourse Relation Recognition for English and Chinese with Multiview Modeling and Effective Representation Learning
  • Mar 17, 2017
  • ACM Transactions on Asian and Low-Resource Language Information Processing
  • Haoran Li + 2 more

Discourse relations between two text segments play an important role in many Natural Language Processing (NLP) tasks. The connectives strongly indicate the sense of discourse relations, while in fact, there are no connectives in a large proportion of discourse relations, that is, implicit discourse relations. Compared with explicit relations, implicit relations are much harder to detect and have drawn significant attention. Until now, there have been many studies focusing on English implicit discourse relations, and few studies address implicit relation recognition in Chinese even though the implicit discourse relations in Chinese are more common than those in English. In our work, both the English and Chinese languages are our focus. The key to implicit relation prediction is to properly model the semantics of the two discourse arguments, as well as the contextual interaction between them. To achieve this goal, we propose a neural network based framework that consists of two hierarchies. The first one is the model hierarchy, in which we propose a max-margin learning method to explore the implicit discourse relation from multiple views. The second one is the feature hierarchy, in which we learn multilevel distributed representations from words, arguments, and syntactic structures to sentences. We have conducted experiments on the standard benchmarks of English and Chinese, and the results show that compared with several methods our proposed method can achieve the best performance in most cases.

  • PDF Download Icon
  • Research Article
  • Cite Count Icon 16
  • 10.1109/access.2019.2954988
Implicit Discourse Relation Recognition via a BiLSTM-CNN Architecture With Dynamic Chunk-Based Max Pooling
  • Jan 1, 2019
  • IEEE Access
  • Fengyu Guo + 2 more

Implicit discourse relation recognition is a serious challenge in discourse analysis, which aims to understand and annotate the latent relations between two discourse arguments, such as temporal and comparison. Most neural network-based models encode linguistic features (such as syntactic parsing and position information) as embedding vectors, which are prone to error propagation due to unsuitable pre-processing. Other methods apply different attention or memory mechanisms, mainly considering the key points in the discourse, yet ignore some valuable clues. In particular, those using convolution neural networks retain local contexts but lose word order information due to the standard pooling operation. The methods that use bidirectional long short-term memory network consider the word sequence and retain the global information, but cannot capture the context with different range sizes. In this paper, we propose a novel <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">D</b> ynamic <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">C</b> hunk-based Max Pooling <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">B</b> iLSTM- <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">CNN</b> framework ( <bold xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">DC-BCNN</b> ) to address these issues. First, we exploit BiLSTMs to capture the semantic representations of discourse arguments. Second, we adopt the proposed convolutional layer to automatically extract the “multi-granularity” features (just like n-gram) by setting different convolution filter sizes. Then, we design a dynamic chunk-based max pooling strategy to obtain the important scaled features of different parts in one discourse argument. This strategy can dynamically divide each argument into several segments (called chunks) according to the argument length and the number of current pooling layer in the CNN and then select the maximum value of each chunk to indicate crucial information. We further utilize a fully connected layer with a softmax function to recognize discourse relations. The experimental results on two corpora (i.e., PDTB and HIT-CDTB) show that our proposed model is effective in implicit discourse relation recognition.

  • Conference Article
  • Cite Count Icon 1
  • 10.1109/iceiec51955.2021.9463835
Using Connectives in Implicit Discourse Relation Recognition
  • Jun 18, 2021
  • Hongfei Han + 3 more

Discourse relation recognition is an important branch of NLP, which is helpful to solve many NLP downstream tasks. In particular, implicit discourse relation recognition(IDRR) is the focus of many research, but they all ignore the role of connectives in IDRR. In this paper, we propose two methods for using connectives to help IDRR task and achieve better results.

  • Research Article
  • Cite Count Icon 8
  • 10.1145/3485016
Memorizing All for Implicit Discourse Relation Recognition
  • Dec 13, 2021
  • ACM Transactions on Asian and Low-Resource Language Information Processing
  • Kashif Munir + 3 more

Implicit discourse relation recognition is a challenging task due to the absence of the necessary informative clues from explicit connectives. An implicit discourse relation recognizer has to carefully tackle the semantic similarity of sentence pairs and the severe data sparsity issue. In this article, we learn token embeddings to encode the structure of a sentence from a dependency point of view in their representations and use them to initialize a baseline model to make it really strong. Then, we propose a novel memory component to tackle the data sparsity issue by allowing the model to master the entire training set, which helps in achieving further performance improvement. The memory mechanism adequately memorizes information by pairing representations and discourse relations of all training instances, thus filling the slot of the data-hungry issue in the current implicit discourse relation recognizer. The proposed memory component, if attached with any suitable baseline, can help in performance enhancement. The experiments show that our full model with memorizing the entire training data provides excellent results on PDTB and CDTB datasets, outperforming the baselines by a fair margin.

  • Research Article
  • Cite Count Icon 6
  • 10.1016/j.neucom.2019.08.081
Boosting implicit discourse relation recognition with connective-based word embeddings
  • Aug 30, 2019
  • Neurocomputing
  • Changxing Wu + 3 more

Boosting implicit discourse relation recognition with connective-based word embeddings

  • PDF Download Icon
  • Conference Article
  • Cite Count Icon 15
  • 10.18653/v1/2021.emnlp-main.187
Not Just Classification: Recognizing Implicit Discourse Relation on Joint Modeling of Classification and Generation
  • Jan 1, 2021
  • Feng Jiang + 4 more

Implicit discourse relation recognition (IDRR) is a critical task in discourse analysis. Previous studies only regard it as a classification task and lack an in-depth understanding of the semantics of different relations. Therefore, we first view IDRR as a generation task and further propose a method joint modeling of the classification and generation. Specifically, we propose a joint model, CG-T5, to recognize the relation label and generate the target sentence containing the meaning of relations simultaneously. Furthermore, we design three target sentence forms, including the question form, for the generation model to incorporate prior knowledge. To address the issue that large discourse units are hardly embedded into the target sentence, we also propose a target sentence construction mechanism that automatically extracts core sentences from those large discourse units. Experimental results both on Chinese MCDTB and English PDTB datasets show that our model CG-T5 achieves the best performance against several state-of-the-art systems.

  • Research Article
  • Cite Count Icon 5
  • 10.1016/j.neucom.2017.02.084
Leveraging bilingually-constrained synthetic data via multi-task neural networks for implicit discourse relation recognition
  • Mar 8, 2017
  • Neurocomputing
  • Changxing Wu + 4 more

Leveraging bilingually-constrained synthetic data via multi-task neural networks for implicit discourse relation recognition

  • Video Transcripts
  • 10.48448/qgy0-5t40
Not Just Classification: Recognizing Implicit Discourse Relation on Joint Modeling of Classification and Generation
  • Oct 15, 2021
  • Underline Science Inc.
  • Feng Jiang + 4 more

Implicit discourse relation recognition (IDRR) is a critical task in discourse analysis. Previous studies only regard it as a classification task and lack an in-depth understanding of the semantics of different relations. Therefore, we first view IDRR as a generation task and further propose a method joint modeling of the classification and generation. Specifically, we propose a joint model, CG-T5, to recognize the relation label and generate the target sentence containing the meaning of relations simultaneously. Furthermore, we design three target sentence forms, including the question form, for the generation model to incorporate prior knowledge. To address the issue that large discourse units are hardly embedded into the target sentence, we also propose a target sentence construction mechanism that automatically extracts core sentences from those large discourse units. Experimental results both on Chinese MCDTB and English PDTB datasets show that our model CG-T5 achieves the best performance against several state-of-the-art systems.

  • PDF Download Icon
  • Conference Article
  • Cite Count Icon 36
  • 10.18653/v1/p19-1411
Employing the Correspondence of Relations and Connectives to Identify Implicit Discourse Relations via Label Embeddings
  • Jan 1, 2019
  • Linh The Nguyen + 3 more

It has been shown that implicit connectives can be exploited to improve the performance of the models for implicit discourse relation recognition (IDRR). An important property of the implicit connectives is that they can be accurately mapped into the discourse relations conveying their functions. In this work, we explore this property in a multi-task learning framework for IDRR in which the relations and the connectives are simultaneously predicted, and the mapping is leveraged to transfer knowledge between the two prediction tasks via the embeddings of relations and connectives. We propose several techniques to enable such knowledge transfer that yield the state-of-the-art performance for IDRR on several settings of the benchmark dataset (i.e., the Penn Discourse Treebank dataset).

  • Conference Article
  • Cite Count Icon 68
  • 10.24963/ijcai.2017/562
SWIM: A Simple Word Interaction Model for Implicit Discourse Relation Recognition
  • Aug 1, 2017
  • Wenqiang Lei + 5 more

Capturing the semantic interaction of pairs of words across arguments and proper argument representation are both crucial issues in implicit discourse relation recognition. The current state-of-the-art represents arguments as distributional vectors that are computed via bi-directional Long Short-Term Memory networks (BiLSTMs), known to have significant model complexity.In contrast, we demonstrate that word-weighted averaging can encode argument representation which can incorporate word pair information efficiently. By saving an order of magnitude in parameters, our proposed model achieves equivalent performance, but trains seven times faster.

  • Conference Article
  • Cite Count Icon 14
  • 10.18653/v1/2023.findings-acl.510
Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition
  • Jan 1, 2023
  • Yuxin Jiang + 2 more

Due to the absence of explicit connectives, implicit discourse relation recognition (IDRR) remains a challenging task in discourse analysis. The critical step for IDRR is to learn high-quality discourse relation representations between two arguments. Recent methods tend to integrate the whole hierarchical information of senses into discourse relation representations for multi-level sense recognition. Nevertheless, they insufficiently incorporate the static hierarchical structure containing all senses (defined as global hierarchy), and ignore the hierarchical sense label sequence corresponding to each instance (defined as local hierarchy). For the purpose of sufficiently exploiting global and local hierarchies of senses to learn better discourse relation representations, we propose a novel GlObal and Local Hierarchy-aware Contrastive Framework (GOLF), to model two kinds of hierarchies with the aid of multi-task learning and contrastive learning. Experimental results on PDTB 2.0 and PDTB 3.0 datasets demonstrate that our method remarkably outperforms current state-of-the-art models at all hierarchical levels.

  • Research Article
  • 10.1016/j.knosys.2023.110457
Meta-learning based instance manipulation for implicit discourse relation recognition
  • Mar 11, 2023
  • Knowledge-Based Systems
  • Jiali Zeng + 5 more

Meta-learning based instance manipulation for implicit discourse relation recognition

  • Conference Article
  • Cite Count Icon 1
  • 10.18653/v1/2024.findings-acl.500
Multi-Label Classification for Implicit Discourse Relation Recognition
  • Jan 1, 2024
  • Wanqiu Long + 2 more

Discourse relations play a pivotal role in establishing coherence within textual content, uniting sentences and clauses into a cohesive narrative.The Penn Discourse Treebank (PDTB) stands as one of the most extensively utilized datasets in this domain.In PDTB-3 (Webber et al., 2019), the annotators can assign multiple labels to an example, when they believe that multiple relations are present.Prior research in discourse relation recognition has treated these instances as separate examples during training, and only one example needs to have its label predicted correctly for the instance to be judged as correct.However, this approach is inadequate, as it fails to account for the interdependence of labels in real-world contexts and to distinguish between cases where only one sense relation holds and cases where multiple relations hold simultaneously.In our work, we address this challenge by exploring various multi-label classification frameworks to handle implicit discourse relation recognition.We show that multi-label classification methods don't depress performance for single-label prediction.Additionally, we give comprehensive analysis of results and data.Our work contributes to advancing the understanding and application of discourse relations and provide a foundation for the future study.

  • Research Article
  • Cite Count Icon 5
  • 10.1007/s11063-017-9582-x
Co-training for Implicit Discourse Relation Recognition Based on Manual and Distributed Features
  • Jan 23, 2017
  • Neural Processing Letters
  • Changxing Wu + 4 more

Implicit discourse relation recognition aims to discover the semantic relation between two sentences where the discourse connective is absent. Due to the lack of labeled data, previous work tries to generate additional training data automatically by removing discourse connectives from explicit discourse relation instances. However, using these artificial data indiscriminately has been proven to degrade the performance of implicit discourse relation recognition. To address this problem, we propose a co-training approach based on manual features and distributed features, which identifies useful instances from these artificial data to enlarge the labeled data. In addition, the distributed features are learned via recursive autoencoder based approaches, capable of capturing to some extent the semantics of sentences which is valuable for implicit discourse relation recognition. Experiment results on both the PDTB and CDTB data sets indicate that: (1) The learned distributed features are complementary to the manual features, and thus suitable for co-training. (2) Our proposed co-training approach can use these artificial data effectively, and significantly outperforms the baselines.

Save Icon
Up Arrow
Open/Close
Notes

Save Important notes in documents

Highlight text to save as a note, or write notes directly

You can also access these Documents in Paperpal, our AI writing tool

Powered by our AI Writing Assistant