VICTOR: Visual incompatibility detection with transformers and fashion-specific contrastive pre-training

Stefanos-Iordanis Papadopoulos,Christos Koutlis,Symeon Papadopoulos,Ioannis Kompatsiaris

doi:10.1016/j.jvcir.2022.103741

Stefanos-Iordanis Papadopoulos, Christos Koutlis + Show 2 more

Open Access

https://doi.org/10.1016/j.jvcir.2022.103741

Copy DOI

Abstract

For fashion outfits to be considered aesthetically pleasing, the garments that constitute them need to be compatible in terms of visual aspects, such as style, category and color. Previous works have defined visual compatibility as a binary classification task with items in a garment being considered as fully compatible or fully incompatible. However, this is not applicable to Outfit Maker applications where users create their own outfits and need to know which specific items may be incompatible with the rest of the outfit. To address this, we propose the Visual InCompatibility TransfORmer (VICTOR) that is optimized for two tasks: 1) overall compatibility as regression and 2) the detection of mismatching items and utilize fashion-specific contrastive language-image pre-training for fine tuning computer vision neural networks on fashion imagery. We build upon the Polyvore outfit benchmark to generate partially mismatching outfits, creating a new dataset termed Polyvore-MISFITs, that is used to train VICTOR. A series of ablation and comparative analyses show that the proposed architecture can compete and even surpass the current state-of-the-art on Polyvore datasets while reducing the instance-wise floating operations by 88%, striking a balance between high performance and efficiency. We release our code at https://github.com/stevejpapad/Visual-InCompatibility-Transformer

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Visual Communication and Image Representation	Publication Date: Jan 2, 2023
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

VICTOR: Visual incompatibility detection with transformers and fashion-specific contrastive pre-training

Abstract

Talk to us

Similar Papers

More From: Journal of Visual Communication and Image Representation

Lead the way for us

Similar Papers

Multi-modality radiomics of conventional T1 weighted and diffusion tensor imaging for differentiating Parkinson’s disease motor subtypes in early-stages
Mehdi Panahi ... Mahboube Sadat Hosseini
Scientific Reports | VOL. 14
Mehdi Panahi, et. al.Mehdi Panahi ... Mahboube Sadat Hosseini
05 Sep 2024
Scientific Reports | VOL. 14

Quality assessment of abdominal CT images: an improved ResNet algorithm with dual-attention mechanism.
Boying Zhu ... Yuanyuan Yang
American journal of translational research | VOL. 16
Boying Zhu, et. al.Boying Zhu ... Yuanyuan Yang
01 Jan 2024
American journal of translational research | VOL. 16

The application value of Rs-fMRI-based machine learning models for differentiating mild cognitive impairment from Alzheimer's disease: a systematic review and meta-analysis.
Chentong Wang ... Tingting Fu
Neurological sciences : official journal of the Italian Neurological Society and of the Italian Society of Clinical Neurophysiology | VOL. -
Chentong Wang, et. al.Chentong Wang ... Tingting Fu
03 Sep 2024
Neurological sciences : official journal of the Italian Neurological Society and of the Italian Society of Clinical Neurophysiology | VOL. -

On the Behavior of SVM and Some Older Algorithms in Binary Text Classification Tasks
Fabrice Colas ... Pavel Brazdil
-
Fabrice Colas, et. al.Fabrice Colas ... Pavel Brazdil
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

VICTOR: Visual incompatibility detection with transformers and fashion-specific contrastive pre-training

Abstract

Talk to us

Similar Papers

More From: Journal of Visual Communication and Image Representation