Sentence Encoder Research Articles

In the development of a task-oriented dialogue system, defining the dialogue structure is a time-consuming task. Hence, several works have looked into automatically inferring it from data, e.g., actual conversations between a customer and a support agent. To recover such dialogue structure, recent methods based on discrete variational models learn to jointly encode and cluster utterances into dialogue states, but (i) represent utterances by considering only the preceding dialogue context, and (ii) are slow to train since they are optimized with a compute-expensive decoding objective. We revisit and improve upon an existing efficient pipeline approach, commonly adopted as a baseline, that first encodes utterances and then clusters them with k-means to induce the dialogue structure. However, the existing approach represents utterances as bag-of-words or skip-thought vectors, which have been shown to perform poorly in semantic similarity tasks, and it does not consider dialogue context. We therefore first investigate the use of more powerful transformer-based encoders for encoding utterances. Next, we propose ellodar, a method for learning representations that capture both preceding and subsequent dialogue context, inspired by word2vec training strategies. ellodar is efficient since representations are learned directly in the encoding space by finetuning just a single linear layer on top of a frozen sentence encoder with a vector-to-vector regression training objective.
Extensive experiments on representative datasets for dialogue structure induction (SimDial, Schema Guided Dialogues, DSTC2, and CamRest676) demonstrate that, in terms of effectiveness at inducing the correct dialogue structure, (i) clustering utterances represented by transformer-based encoders improves on recent joint models by 13%–32% on standard cluster metrics, and (ii) clustering ellodar’s representations yields additional improvements ranging from +20% to +26%, with speedups of $10\times$–$10^{4}\times$ compared to the recent joint models.
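
The encode-then-cluster baseline that this abstract revisits can be illustrated with a minimal sketch. It is a toy under stated assumptions, not the authors' implementation: utterances are encoded as normalised bag-of-words vectors (one of the baseline representations the abstract mentions), and k-means is written out in plain Python with a deterministic farthest-point initialisation chosen here only to keep the example reproducible. The function names and the four sample utterances are hypothetical.

```python
import math
from collections import Counter


def bag_of_words(utterances):
    """Encode each utterance as an L2-normalised word-count vector."""
    vocab = sorted({w for u in utterances for w in u.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    vectors = []
    for u in utterances:
        vec = [0.0] * len(vocab)
        for word, count in Counter(u.lower().split()).items():
            vec[index[word]] = float(count)
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def init_centroids(vectors, k):
    """Farthest-point initialisation (an assumption made for determinism)."""
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        farthest = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(farthest))
    return centroids


def kmeans(vectors, k, iters=50):
    """Lloyd's algorithm; returns one cluster label per input vector."""
    centroids = init_centroids(vectors, k)
    labels = None
    for _ in range(iters):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(v, centroids[j])) for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the run has converged
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for j in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical utterances: two booking requests and two closings.
utterances = [
    "book a table",
    "book a table please",
    "thank you goodbye",
    "thank you bye",
]
labels = kmeans(bag_of_words(utterances), k=2)
print(labels)
```

Each cluster label plays the role of an induced dialogue state. Replacing `bag_of_words` with the output of a transformer-based sentence encoder would correspond to the first improvement the abstract reports; the clustering step itself would stay the same.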

With the increase in misinformation across digital platforms, incongruent news detection is becoming an important research problem. Earlier studies have exploited various feature engineering approaches and deep learning models with embeddings to capture incongruity between news headlines and their respective bodies. These studies have broadly considered different combinations of bag-of-words-based features, sequential encoding, hierarchical encoding, headline-guided attention-based encoding, and so on, of the text in headlines and bodies. In this article, we focus on addressing two important limitations observed with hierarchical encoding and headline-guided attention-based encoding methods. First, the existing hierarchical encoding-based studies limit the hierarchical structure of the body of a news article to the paragraph level only, undermining the importance of incorporating long-term dependence from word level to sentence, paragraph, and body. Second, the existing headline-guided attention-based encoding focuses on contents in the body that are contextually similar to the headline, undermining the importance of incorporating contextually dissimilar contents. Motivated by these observations, this article proposes a gated recursive and sequential deep hierarchical encoding (GraSHE) method for detecting incongruent news articles by extending the hierarchical structure of the news body from the body to the word level and incorporating an incongruity weight. Across various experimental setups over three publicly available benchmark datasets, the results indicate that the proposed model outperforms baseline models with bag-of-words-based features and with sequential, hierarchical, and headline-guided attention-based encoding methods. To further validate the performance of the proposed model, we conduct several ablation studies.
The following key observations can be made from the ablation studies: 1) models with hierarchical encoding outperform models with nonhierarchical encoding; 2) recursive encoding of sentences boosts the performance of models as compared with sequential encoding of sentences within paragraphs; and 3) incongruent news article detection is domain-dependent. Incorporating explicit features further boosts the performance of the proposed model and also decreases the domain dependence of models.

Articles published on Sentence Encoder

UniEmbed: A Novel Approach to Detect XSS and SQL Injection Attacks Leveraging Multiple Feature Fusion with Machine Learning Techniques

Soft cosine and extended cosine adaptation for pre-trained language model semantic vector analysis

Developing Semantic Textual Similarity for Guragigna Language Using Deep Learning Approach

Improvement for deep learning for suicide and depression identification with unsupervised label correction

A Chinese Nested Named Entity Recognition Model for Chicken Disease Based on Multiple Fine-Grained Feature Fusion and Efficient Global Pointer

Indonesian-English Textual Similarity Detection Using Universal Sentence Encoder (USE) and Facebook AI Similarity Search (FAISS)

Towards a cyberbullying detection approach: fine-tuned contrastive self-supervised learning for data augmentation

Extracting Key-phrase Embedding using Deep Average Network and Maximal Marginal Relevance to Enhance Information Retrieval

Automatic metaphor processing in developmental dyslexia

Bias Detection in Media: An NLP-Based Approach using Corpus Statistics and Sentence Embeddings

Semantically Enriched Cross-Lingual Sentence Embeddings for Crisis-related Social Media Texts

CrisisTransformers: Pre-trained language models and sentence encoders for crisis-related social media texts

A Sentence-Embedding-Based Dashboard to Support Teacher Analysis of Learner Concept Maps

Code Comments: A Way of Identifying Similarities in the Source Code

Revisiting clustering for efficient unsupervised dialogue structure induction

Unveiling Business Activity Patterns of Digital Transformation through K-Means Clustering with Universal Sentence Encoder in Transport and Logistics Sectors

Unsupervised Extractive Summarization with Learnable Length Control Strategies

Gated Recursive and Sequential Deep Hierarchical Encoding for Detecting Incongruent News Articles

Novelty Evaluation using Sentence Embedding Models in Open-ended Cocreative Problem-solving

Can Generative AI be used to improve doctor/patient relationship?
