Image Clustering: An Unsupervised Approach to Categorize Visual Data in Social Science Research

Han Zhang, Yilang Peng

doi:10.1177/00491241221082603

Abstract

Automated image analysis has received increasing attention in social scientific research, yet existing scholarship has mostly covered the application of supervised learning to classify images into predefined categories. This study focuses on the task of unsupervised image clustering, which aims to automatically discover categories from unlabelled image data. We first review the steps to perform image clustering and then focus on one key challenge in this task—finding intermediate representations of images. We present several methods of extracting intermediate image representations, including the bag-of-visual-words model, self-supervised learning, and transfer learning (in particular, feature extraction with pretrained models). We compare these methods using various visual datasets, including images related to protests in China from Weibo, images about climate change on Instagram, and profile images of the Russian Internet Research Agency on Twitter. In addition, we propose a systematic way to interpret and validate clustering solutions. Results show that transfer learning significantly outperforms the other methods. The dataset used in the pretrained model critically determines what categories the algorithms can discover.
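The clustering step that follows feature extraction can be illustrated with a minimal sketch. The abstract does not say which clustering algorithm the authors use, so k-means with farthest-first initialization is an assumption made here for illustration only. The feature vectors are synthetic placeholders standing in for embeddings that a pretrained model would produce, and every function name below is hypothetical. A real pipeline would more likely rely on an established machine learning library than on hand-written code.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def make_placeholder_features(n_per_group=20, dim=8, seed=1):
    """Synthetic stand-in for image embeddings (NOT real model output).

    Produces two well-separated groups of vectors, as if a pretrained
    model had mapped two visually distinct kinds of images to them.
    """
    rng = random.Random(seed)
    group_a = [[rng.gauss(0.0, 0.1) for _ in range(dim)] for _ in range(n_per_group)]
    group_b = [[rng.gauss(5.0, 0.1) for _ in range(dim)] for _ in range(n_per_group)]
    return group_a + group_b


def farthest_first_init(features, k):
    """Deterministic initialization: start from the first vector, then
    repeatedly add the vector farthest from all centroids chosen so far."""
    centroids = [list(features[0])]
    while len(centroids) < k:
        farthest = max(
            features,
            key=lambda v: min(squared_distance(v, c) for c in centroids),
        )
        centroids.append(list(farthest))
    return centroids


def kmeans(features, k, max_iter=100):
    """Plain k-means (Lloyd's algorithm). Returns (labels, centroids)."""
    centroids = farthest_first_init(features, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each vector joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in features
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the algorithm converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(features, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


features = make_placeholder_features()
labels, centroids = kmeans(features, k=2)
cluster_sizes = [labels.count(c) for c in range(2)]
print("cluster sizes:", cluster_sizes)
```

In practice the number of clusters is not known in advance, which is one reason the interpretation and validation of clustering solutions mentioned in the abstract matters.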

Journal: Sociological Methods & Research	Publication Date: Apr 7, 2022
Citations: 6
