Analysis of the Human Protein Atlas Image Classification competition

Wei Ouyang,Lovisa Åkesson,Yuanhao Wu,Xuan Cao,Runmin Wei,Alexander Kiselev,Dmitry Buslov,Chuanpeng Li,Bojan Tunguz,Xiaosu Yi ,Anthony J Cesnik,Dmytro Poplavskiy,Russel D Wolfinger,Shubin Dai,Dmytro Panchenko,Martin Hjelmare,Jun Lan,Casper F Winsnes,Yinzheng Gu,Jinbin Xie,Xun Zhu,Kathleen Hwang ,Kuan-Lun Tseng,Park Jinmo,Christof Henkel,Zhifeng Gao,Hao Xu,Devin P Sullivan,Hongdong Zheng,Sergei Fironov,Constantin Kappel,Cheng Ju,Shaikat M Galib,Emma Lundberg

doi:10.1038/s41592-019-0658-6

Abstract

Pinpointing subcellular protein localizations from microscopy images is easy to the trained eye, but challenging to automate. Based on the Human Protein Atlas image collection, we held a competition to identify deep learning solutions to solve this task. Challenges included training on highly imbalanced classes and predicting multiple labels per image. Over 3 months, 2,172 teams participated. Despite convergence on popular networks and training techniques, there was considerable variety among the solutions. Participants applied strategies for modifying neural networks and loss functions, augmenting data and using pretrained networks. The winning models far outperformed our previous effort at multi-label classification of protein localization patterns by ~20%. These models can be used as classifiers to annotate new images, feature extractors to measure pattern similarity or pretrained networks for a wide range of biological applications.

Highlights

Advancement in high-throughput microscopy has propelled the generation of massive amounts of biological imaging data[1]
Unbiased analysis of subcellular protein localizations from our images has greatly enriched our vocabulary for describing cellular systems
This analysis was first performed manually[3], and we have since integrated the labor-intensive annotation tasks into a mainstream video game[5], which produced tens of millions of human annotations. These annotations were successful at the challenging task of identifying mixed patterns of protein localizations, a task called multi-label classification[6]

Summary

Introduction

Advancement in high-throughput microscopy has propelled the generation of massive amounts of biological imaging data[1]. Unbiased analysis of subcellular protein localizations from our images has greatly enriched our vocabulary for describing cellular systems This analysis was first performed manually[3], and we have since integrated the labor-intensive annotation tasks into a mainstream video game[5], which produced tens of millions of human annotations. These annotations were successful at the challenging task of identifying mixed patterns of protein localizations, a task called multi-label classification[6]. Compared to Loc-CAT5, which uses hand-crafted features as inputs, CNNs typically take raw images as inputs and learn hierarchical feature representations in an end-to-end fashion This allows the model to better abstract cellular localization patterns and scale efficiently with data size[14]. Finding the best solution for classifying protein localizations within HPA Cell Atlas Images involves performing searches of Nature Methods | VOL 16 | December 2019 | 1254–1261 | www.nature.com/naturemethods

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature Methods	Publication Date: Nov 28, 2019
Citations: 96	License type: open-access

R Discovery Prime

R Discovery Prime

Analysis of the Human Protein Atlas Image Classification competition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature Methods

Lead the way for us

Similar Papers

AT_CHLORO, a Comprehensive Chloroplast Proteome Database with Subplastidial Localization and Curated Information on Envelope Proteins
Myriam Ferro ... Norbert Rolland
Molecular & Cellular Proteomics | VOL. 9
Myriam Ferro, et. al.Myriam Ferro ... Norbert Rolland
01 Jun 2010
Molecular & Cellular Proteomics | VOL. 9

Automated detection of leukemia by pretrained deep neural networks and transfer learning: A comparison
K.K Anilkumar ... T.M Sagi
Medical Engineering & Physics | VOL. 98
K.K Anilkumar, et. al.K.K Anilkumar ... T.M Sagi
13 Oct 2021
Medical Engineering & Physics | VOL. 98

Quantitative Protein Localization Signatures Reveal an Association between Spatial and Functional Divergences of Proteins
Lit-Hsin Loo ... Christian Von Mering
PLoS Computational Biology | VOL. 10
Lit-Hsin Loo, et. al.Lit-Hsin Loo ... Christian Von Mering
06 Mar 2014
PLoS Computational Biology | VOL. 10

Optimizing the Neural Network Loss Function in Electrical Tomography to Increase Energy Efficiency in Industrial Reactors
Monika Kulisz ... Jolanta Słoniec
Energies | VOL. 17
Monika Kulisz, et. al.Monika Kulisz ... Jolanta Słoniec
31 Jan 2024
Energies | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis of the Human Protein Atlas Image Classification competition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature Methods