Abstract

The deep neural network is used to establish a neural network model to solve the problems of low accuracy and poor accuracy of traditional algorithms in screening differentially expressed genes and function prediction during the walnut endocarp hardening stage. The paper walnut is used as the research object to analyze the biological information of paper walnut. The changes of lignin deposition during endocarp hardening from 50 days to 90 days are observed by microscope. Then, the Convolutional Neural Network (CNN) and Long and Short-term Memory (LSTM) network model are adopted to construct an expression gene screening and function prediction model. Then, the transcriptome and proteome sequencing and biological information of walnut endocarp samples at 50, 57, 78, and 90 days after flowering are analyzed and taken as the training data set of the CNN + LSTM model. The experimental results demonstrate that the endocarp of paper walnut began to harden at 57 days, and the endocarp tissue on the hardened inner side also began to stain. This indicates that the endocarp hardened laterally from outside to inside. The screening and prediction results show that the CNN + LSTM model’s highest accuracy can reach 0.9264. The Accuracy, Precision, Recall, and F1-score of the CNN + LSTM model are better than the traditional machine learning algorithm. Moreover, the Receiver Operating Curve (ROC) area enclosed by the CNN + LSTM model and coordinate axis is the largest, and the Area Under Curve (AUC) value is 0.9796. The comparison of ROC and AUC proves that the CNN + LSTM model is better than the traditional algorithm for screening differentially expressed genes and function prediction in the walnut endocarp hardening stage. Using deep learning to predict expressed genes’ function accurately can reduce the breeding cost and significantly improve the yield and quality of crops. This research provides scientific guidance for the scientific breeding of paper walnut.

Highlights

  • The expression gene screening and the prediction model is established by using Convolutional Neural Network (CNN) and Long and Short-term Memory (LSTM)

  • The CNN + LSTM model is used to comprehensively analyze all the genes expressed in the endocarp hardening stage of paper walnut

  • Screening and function prediction model of DEGs based on deep learning Data acquisition and pre-processing

Read more

Summary

Introduction

Biological data is growing exponentially with the rapid development of life science and computer science. The expression gene screening and the prediction model is established by using CNN and LSTM.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call