Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution.

Caitriona Brennan, Antonio González, Maria D Tiu, Charles Cowart, Daniel McDonald, Pedro Belda-Ferre, Rodolfo A Salido, Caitlin Tribelhorn, Amir Zarrinpar, Mackenzie Bryant, Rob Knight

doi:10.1128/msystems.00006-23

Open Access

https://doi.org/10.1128/msystems.00006-23

Journal: mSystems
Publication Date: Jun 23, 2023
Citations: 3
License type: CC BY 4.0

Affiliation: University of California, San Diego

Abstract

Next-generation sequencing technologies have enabled many advances across diverse areas of biology, many of which benefit from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared with a given amount of reagents and sequenced in a single run. However, because sample quality is highly variable, over- and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and to additional time and cost spent rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target the adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth.

Importance

High-throughput next-generation sequencing (NGS) has contributed significantly to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which is already of great importance across many areas of biology.
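
The core idea of read-count-based pooling can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `pooling_volumes`, the volume limits, and the simple inverse-proportional rule are assumptions made for illustration. It assumes every library contributed the same volume to the shallow run, so a sample's read count is proportional to its sequenceable molecules per unit volume, and it assigns each sample a volume inversely proportional to that count, capped at a maximum transfer volume.

```python
from typing import Dict


def pooling_volumes(read_counts: Dict[str, int],
                    min_vol: float = 1.0,
                    max_vol: float = 20.0) -> Dict[str, float]:
    """Illustrative only: volume per sample, inversely proportional to read count.

    read_counts: reads per sample from a shallow run of an equal-volume pool.
    min_vol:     volume given to the best-represented sample (assumed units).
    max_vol:     largest volume that can be transferred (assumed limit).
    """
    positive = [count for count in read_counts.values() if count > 0]
    if not positive:
        # No usable signal for any sample: fall back to the maximum volume.
        return {sample: max_vol for sample in read_counts}

    top = max(positive)
    volumes = {}
    for sample, count in read_counts.items():
        if count <= 0:
            # Samples with no reads cannot be scaled; cap at the maximum.
            volumes[sample] = max_vol
        else:
            # Fewer reads in the shallow run means more volume in the final pool.
            volumes[sample] = min(max_vol, min_vol * top / count)
    return volumes


shallow_counts = {"A": 1000, "B": 500, "C": 100, "D": 0}
volumes = pooling_volumes(shallow_counts, min_vol=1.0, max_vol=5.0)
```

To normalize in feature space, as the abstract describes, the same calculation could be fed counts of reads of interest (for example, non-ribosomal reads) instead of total reads per sample. Samples that hit the volume cap, such as C and D above, remain underrepresented and would need separate handling.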
