Gene sequence analysis model construction based on k-mer statistics.

Dongjie Gao

doi:10.1371/journal.pone.0306480

Open Access

Journal: PLOS ONE
Publication Date: Sep 12, 2024
License type: CC BY 4.0

Abstract

With the rapid development of biotechnology, gene sequencing methods have steadily improved, and the gene sequences under study have become structurally more complex. Traditional sequence alignment methods struggle to handle such complex sequences. To improve the efficiency of gene sequence analysis, this study selects the D2 family of k-mer statistics to build a model for gene sequence alignment analysis. According to the structure of the foreground sequence, the sequence to be aligned is cut at different lengths and divided into multiple subsequences. The maximum dissimilarity among the alignment results of the selected subsequences is then taken as the statistical result. The study also designs an application system that runs the model's sequence alignment analysis. The experimental results showed that the statistical power of the model was positively related to sequence coverage and cutting length, and negatively related to the k value and module length. When the model was applied in the designed system, the maximum storage capacity of the system was 71 GB, the maximum disk capacity was 135 GB, and the running time was less than 2.0 s. The proposed k-mer statistic sequence alignment model and system therefore have considerable application value in gene alignment analysis.
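The cut-and-compare idea described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not say which D2 variant, normalization, or cutting scheme is used. The sketch assumes the plain D2 statistic (the inner product of two k-mer count vectors), a cosine-style normalization into a dissimilarity, and non-overlapping cuts. All function names and the `cut_lengths` parameter are hypothetical.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq, k):
    """Count every overlapping k-mer in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def d2(x, y):
    """Plain D2 statistic: inner product of two k-mer count vectors."""
    return sum(count * y[kmer] for kmer, count in x.items())


def d2_dissimilarity(a, b, k):
    """Assumed normalization: 0 for identical k-mer profiles, 0.5 for disjoint ones."""
    x, y = kmer_counts(a, k), kmer_counts(b, k)
    norm = sqrt(d2(x, x)) * sqrt(d2(y, y))
    if norm == 0:  # a sequence shorter than k has no k-mers
        return 0.5
    # Clamp at 0 to absorb floating-point rounding error.
    return max(0.0, 0.5 * (1 - d2(x, y) / norm))


def max_window_dissimilarity(query, reference, k, cut_lengths):
    """Cut query into non-overlapping pieces of each length (an assumed
    scheme) and return the largest dissimilarity to reference."""
    worst = 0.0
    for length in cut_lengths:
        for start in range(0, len(query) - length + 1, length):
            piece = query[start:start + length]
            worst = max(worst, d2_dissimilarity(piece, reference, k))
    return worst
```

For example, `max_window_dissimilarity("AAAACCCC", "AAAA", 2, [4])` compares the pieces `AAAA` and `CCCC` with the reference and reports the dissimilarity of the worse-matching piece.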
