Integration of hybrid and self-correction method improves the quality of long-read sequencing data.

Tao Tang,Binshuang Zheng,Yuansheng Liu,Yiping Liu,Xiaocai Zhang,Rong Li

doi:10.1093/bfgp/elad026

Abstract

Third-generation sequencing (TGS) technologies have revolutionized genome science in the past decade. However, the long-read data produced by TGS platforms suffer from a much higher error rate than that of the previous technologies, thus complicating the downstream analysis. Several error correction tools for long-read data have been developed; these tools can be categorized into hybrid and self-correction tools. So far, these two types of tools are separately investigated, and their interplay remains understudied. Here, we integrate hybrid and self-correction methods for high-quality error correction. Our procedure leverages the inter-similarity between long-read data and high-accuracy information from short reads. We compare the performance of our method and state-of-the-art error correction tools on Escherichia coli and Arabidopsis thaliana datasets. The result shows that the integration approach outperformed the existing error correction methods and holds promise for improving the quality of downstream analyses in genomic research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Integration of hybrid and self-correction method improves the quality of long-read sequencing data.

Abstract

Talk to us

Similar Papers

More From: Briefings in functional genomics

Lead the way for us

Similar Papers

A comprehensive evaluation of long read error correction methods
Haowen Zhang ... Chirag Jain
BMC Genomics | VOL. 21
Haowen Zhang, et. al.Haowen Zhang ... Chirag Jain
01 Dec 2020
BMC Genomics | VOL. 21

Data from Gene Fusion Detection and Characterization in Long-Read Cancer Transcriptome Sequencing Data with FusionSeeker
Zechen Chong ... Yu Chen
-
Zechen Chong, et. al.Zechen Chong ... Yu Chen
31 Mar 2023
31 Mar 2023

Gene Fusion Detection and Characterization in Long-Read Cancer Transcriptome Sequencing Data with FusionSeeker.
Yu Chen ... Herbert Chen
Cancer Research | VOL. 83
Yu Chen, et. al.Yu Chen ... Herbert Chen
01 Nov 2022
Cancer Research | VOL. 83

Data from Gene Fusion Detection and Characterization in Long-Read Cancer Transcriptome Sequencing Data with FusionSeeker
Weisheng Chen ... Zhengzhi Tan
-
Weisheng Chen, et. al.Weisheng Chen ... Zhengzhi Tan
31 Mar 2023
31 Mar 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integration of hybrid and self-correction method improves the quality of long-read sequencing data.

Abstract

Talk to us

Similar Papers

More From: Briefings in functional genomics