Abstract

Fine-tuning pre-trained language models (e.g., BERT) has achieved great success in many language understanding tasks in supervised settings (e.g., text classification). However, relatively little work has focused on applying pre-trained models in unsupervised settings such as text clustering. In this paper, we propose a novel method to fine-tune pre-trained models for text clustering without supervision, simultaneously learning text representations and cluster assignments using a clustering-oriented loss. Experiments on three text clustering datasets (namely TREC-6, Yelp, and DBpedia) show that our model outperforms the baseline methods and achieves state-of-the-art results.
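
The abstract does not specify the clustering-oriented loss. One widely used objective in joint representation-and-clustering work is the DEC loss (Xie et al., 2016, cited in the introduction below); it is sketched here purely as an illustration and may differ from the paper's actual formulation. Writing $z_i$ for the representation of text $i$ and $\mu_j$ for the $j$-th cluster centroid, the soft assignments $Q$, the sharpened target distribution $P$, and the loss $L$ take the form

\[
q_{ij} = \frac{\left(1 + \lVert z_i - \mu_j \rVert^2\right)^{-1}}{\sum_{j'} \left(1 + \lVert z_i - \mu_{j'} \rVert^2\right)^{-1}},
\qquad
p_{ij} = \frac{q_{ij}^2 / \sum_i q_{ij}}{\sum_{j'} \left(q_{ij'}^2 / \sum_i q_{ij'}\right)},
\qquad
L = \mathrm{KL}\left(P \,\Vert\, Q\right) = \sum_i \sum_j p_{ij} \log \frac{p_{ij}}{q_{ij}},
\]

where the loss is minimized with respect to both the encoder parameters (producing $z_i$) and the centroids $\mu_j$, which is what allows representations and cluster assignments to be learned together.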

Highlights

  • Pre-trained language models have shown remarkable progress in many natural language understanding tasks (Radford et al., 2018; Peters et al., 2018; Howard and Ruder, 2018).

  • While BERT has achieved great success in many natural language understanding tasks under supervised fine-tuning, relatively little work has focused on applying pre-trained models in unsupervised settings.

  • Through a case study of text clustering, we investigate how to leverage the pre-trained BERT model and fine-tune it in an unsupervised setting.

Summary

Introduction

Pre-trained language models have shown remarkable progress in many natural language understanding tasks (Radford et al., 2018; Peters et al., 2018; Howard and Ruder, 2018). BERT (Devlin et al., 2018) applies the fine-tuning approach to achieve ground-breaking performance on a set of NLP tasks. However, while BERT has achieved great success under supervised fine-tuning, relatively little work has focused on applying pre-trained models in unsupervised settings. Two-stage approaches use deep learning frameworks to learn representations first and then run clustering algorithms on them (Chen, 2015; Yang et al., 2017). Joint optimization approaches learn representations and cluster assignments together (Xie et al., 2016; Guo et al., 2017). Inspired by these methods, we fine-tune pre-trained models by learning text representations and cluster assignments simultaneously.
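
To make the joint-optimization idea concrete, the following is a minimal sketch that places DEC-style learnable centroids and a KL-based clustering loss (the loss illustrated after the abstract) on top of a pre-trained BERT encoder. The model name, the use of the [CLS] vector as the text representation, and all hyperparameters are illustrative assumptions rather than the paper's reported setup.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F
    from transformers import BertModel, BertTokenizer

    class BertClusterer(nn.Module):
        """BERT encoder plus learnable cluster centroids, trained jointly."""
        def __init__(self, n_clusters, bert_name="bert-base-uncased"):
            super().__init__()
            self.bert = BertModel.from_pretrained(bert_name)
            hidden = self.bert.config.hidden_size
            # Cluster centroids live in the same space as the text representations.
            self.centroids = nn.Parameter(torch.randn(n_clusters, hidden) * 0.02)

        def forward(self, input_ids, attention_mask):
            # Text representation: the [CLS] token embedding (one common choice).
            z = self.bert(input_ids=input_ids,
                          attention_mask=attention_mask).last_hidden_state[:, 0]
            # Soft assignments q_ij via a Student's t-kernel (alpha = 1), as in DEC.
            q = 1.0 / (1.0 + torch.cdist(z, self.centroids) ** 2)
            return q / q.sum(dim=1, keepdim=True)

    def target_distribution(q):
        # Sharpened auxiliary distribution p_ij used as the self-training target.
        weight = q ** 2 / q.sum(dim=0)
        return (weight.t() / weight.sum(dim=1)).t()

    if __name__ == "__main__":
        texts = ["great food and friendly staff", "what is the capital of France?"]
        tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
        batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")

        model = BertClusterer(n_clusters=2)
        optimizer = torch.optim.Adam(model.parameters(), lr=2e-5)

        # One joint update: the clustering-oriented KL loss back-propagates into BERT,
        # so text representations and cluster assignments are refined together.
        q = model(batch["input_ids"], batch["attention_mask"])
        p = target_distribution(q).detach()
        loss = F.kl_div(q.log(), p, reduction="batchmean")
        loss.backward()
        optimizer.step()

In practice, the centroids would typically be initialized with k-means over the initial BERT embeddings and the target distribution refreshed only periodically rather than at every step; those details are omitted here for brevity.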
