Dense feature pyramid network for cartoon dog parsing

Jerome Wan, Guillaume Mougeot, Xubo Yang

doi:10.1007/s00371-020-01887-5

Abstract

While traditional cartoon character drawings are simple for humans to create, they remain highly challenging for machines to interpret. Parsing addresses this problem through fine-grained semantic segmentation of images. Although parsing is well studied on naturalistic images, research on cartoon parsing is sparse. Because of the lack of available datasets and the diversity of artwork styles, cartoon character parsing is more difficult than the well-known human parsing task. In this paper, we study one type of cartoon instance: cartoon dogs. We introduce a novel dataset for cartoon dog parsing and create a new deep convolutional neural network (DCNN) to tackle the problem. Our dataset contains 965 precisely annotated cartoon dog images with seven semantic part labels. Our new model, called dense feature pyramid network (DFPnet), makes use of recent popular techniques in semantic segmentation to handle cartoon dog parsing efficiently. We achieve an mIoU of 68.39%, a Mean Accuracy of 79.4% and a Pixel Accuracy of 93.5% on our cartoon dog validation set. Our method outperforms state-of-the-art models for similar tasks when they are trained on our dataset: CE2P for single human parsing and Mask R-CNN for instance segmentation. We hope this work can be used as a starting point for future research toward digital artwork understanding with DCNNs. Our DFPnet and dataset will be made publicly available.
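The three figures reported in the abstract (mIoU, Mean Accuracy, Pixel Accuracy) are standard segmentation metrics that can all be derived from a per-class confusion matrix. The sketch below shows the usual definitions in plain Python. It is an illustration only: the function names, the flattened label lists, and the choice to skip classes absent from the ground truth are assumptions, and the abstract does not state the authors' exact evaluation protocol.

```python
def confusion_matrix(gt, pred, num_classes):
    """Count pixels by (ground-truth class, predicted class).

    gt and pred are flat sequences of integer class ids (hypothetical input format).
    """
    cm = [[0] * num_classes for _ in range(num_classes)]
    for g, p in zip(gt, pred):
        cm[g][p] += 1
    return cm


def parsing_metrics(cm):
    """Return (pixel accuracy, mean accuracy, mean IoU) from a confusion matrix."""
    n = len(cm)
    tp = [cm[c][c] for c in range(n)]                       # correctly labelled pixels
    gt_total = [sum(cm[c]) for c in range(n)]               # pixels per true class
    pred_total = [sum(cm[r][c] for r in range(n)) for c in range(n)]
    # Assumption: classes that never occur in the ground truth are skipped.
    present = [c for c in range(n) if gt_total[c] > 0]

    pixel_acc = sum(tp) / sum(gt_total)
    mean_acc = sum(tp[c] / gt_total[c] for c in present) / len(present)
    miou = sum(
        tp[c] / (gt_total[c] + pred_total[c] - tp[c]) for c in present
    ) / len(present)
    return pixel_acc, mean_acc, miou


# Toy example with three classes and six pixels.
gt = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
pixel_acc, mean_acc, miou = parsing_metrics(confusion_matrix(gt, pred, 3))
```

Pixel Accuracy is dominated by large regions such as the background, whereas mIoU weights every class equally, which is one reason a single model can show a high Pixel Accuracy alongside a noticeably lower mIoU.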


Journal: The Visual Computer
Publication Date: Jul 9, 2020
Citations: 3

Similar Papers

Part Decomposition and Refinement Network for Human Parsing
Lu Yang ... Tianfei Zhou
IEEE/CAA Journal of Automatica Sinica | VOL. 9
01 Jun 2022

Semantic Image Segmentation with Deep Convolutional Neural Networks and Quick Shift
Sanxing Zhang ... Rui Zhang
Symmetry | VOL. 12
06 Mar 2020

Class-level Aware Network for Human Parsing
Jiayi Yin ... Weibin Liu
20 May 2021

On the use of GNN-based structural information to improve CNN-based semantic image segmentation
Patty Coupeau ... Mickaël Dinomais
Journal of Visual Communication and Image Representation | VOL. 101
01 May 2024
