Multi-labelled proteins recognition for high-throughput microscopy images using deep convolutional neural networks

Enze Zhang,Xiaohua Wan,Fa Zhang,Shaohan Hu,Zhiyong Liu,Boheng Zhang

doi:10.1186/s12859-021-04196-3

Abstract

BackgroundProteins are of extremely vital importance in the human body, and no movement or activity can be performed without proteins. Currently, microscopy imaging technologies developed rapidly are employed to observe proteins in various cells and tissues. In addition, due to the complex and crowded cellular environments as well as various types and sizes of proteins, a considerable number of protein images are generated every day and cannot be classified manually. Therefore, an automatic and accurate method should be designed to properly solve and analyse protein images with mixed patterns.ResultsIn this paper, we first propose a novel customized architecture with adaptive concatenate pooling and “buffering” layers in the classifier part, which could make the networks more adaptive to training and testing datasets, and develop a novel hard sampler at the end of our network to effectively mine the samples from small classes. Furthermore, a new loss is presented to handle the label imbalance based on the effectiveness of samples. In addition, in our method, several novel and effective optimization strategies are adopted to solve the difficult training-time optimization problem and further increase the accuracy by post-processing.ConclusionOur methods outperformed the SOTA method of multi-labelled protein classification on the HPA dataset, GapNet-PL, by above 2% in the F1 score. Therefore, experimental results based on the test set split from the Human Protein Atlas dataset show that our methods have good performance in automatically classifying multi-class and multi-labelled high-throughput microscopy protein images.

Highlights

Proteins are of extremely vital importance in the human body, and no movement or activity can be performed without proteins
The experimental results demonstrate that our methods make the base models achieve higher F1 scores than other baseline approaches, including the SOTA method
When designing our network architectures, we proposed our customized layers, such as “buffering” layers and adaptive concatenate pooling (ACP) layers, which made our networks more adaptive to the task

Summary

Introduction

Proteins are of extremely vital importance in the human body, and no movement or activity can be performed without proteins. Due to the complex and crowded cellular environments as well as various types and sizes of proteins, a considerable number of protein images are generated every day and cannot be classified manually. Proteins execute all kinds of functions in the human body within different types of cells, and proteins in various environments perform differently. An algorithm capable of classifying mixed patterns of proteins was needed to make the system “smarter”. The algorithm or method is expected to accurately and efficiently recognize multi-patterns that are mixed together among various types of cells. As the HPA project has provided adequate protein microscopy images, as shown, with annotations to feed a large neuron network, we decided to adopt the DNN method, which has good performance in classifying images As the HPA project has provided adequate protein microscopy images, as shown in Fig. 1, with annotations to feed a large neuron network, we decided to adopt the DNN method, which has good performance in classifying images

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: May 1, 2021
Citations: 1	License type: open-access

R Discovery Prime

R Discovery Prime

Multi-labelled proteins recognition for high-throughput microscopy images using deep convolutional neural networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Classifying Mixed Patterns of Proteins in High-Throughput Microscopy Images Using Deep Neural Networks
Enze Zhang ... Xiaohua Wan
-
Enze Zhang, et. al.Enze Zhang ... Xiaohua Wan
01 Jan 2019
01 Jan 2019

Induced Circular Dichroism Analysis of Thermally Induced Conformational Changes on Protein Binding Sites Under a Crowding Environment.
Chikashi Ota ... Kazufumi Takano
Chemphyschem : a European journal of chemical physics and physical chemistry | VOL. 25
Chikashi Ota, et. al.Chikashi Ota ... Kazufumi Takano
15 Nov 2023
Chemphyschem : a European journal of chemical physics and physical chemistry | VOL. 25

Decision letter: Development and evaluation of a live birth prediction model for evaluating human blastocysts from a retrospective study
Larisa V Suturina ... Ricardo Azziz
-
Larisa V Suturina, et. al.Larisa V Suturina ... Ricardo Azziz
12 Dec 2022
12 Dec 2022

Author response: Development and evaluation of a live birth prediction model for evaluating human blastocysts from a retrospective study
Zhuoran Zhang ... Wenyuan Chen
-
Zhuoran Zhang, et. al.Zhuoran Zhang ... Wenyuan Chen
12 Jan 2023
12 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-labelled proteins recognition for high-throughput microscopy images using deep convolutional neural networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics