Abstract
Although with the continuous development of sequencing technology, the number of genome and protein sequences has grown rapidly, these sequences are only a small part of nature. Biologically, it is still a challenging and important problem to detect and predict some new genome or protein sequences based on real sequence data, which motivates us to solve the problem mathematically. The first step to predict the new sequences is determining the nucleotide or amino acid composition of them. In this paper, we apply natural vector method and convex hull principle to determine the nucleotide or amino acid composition of new genome or protein sequences. Our algorithm is based on optimization strategy. The SARS-CoV-2 genome and protein datasets are used to verify the feasibility of our algorithm. Numerical experiments show that our algorithm can detect and predict possible number of each nucleotide or amino acid of genome and protein sequence with respect to the second order natural vectors.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.