Librarian: A quality control tool to analyse sequencing library compositions.

Kartavya Vashishtha,Simon Andrews,Christel Krueger,Caroline Gaud

doi:10.12688/f1000research.125325.2

Kartavya Vashishtha, Simon Andrews + Show 2 more

Open Access

https://doi.org/10.12688/f1000research.125325.2

Copy DOI

Journal: F1000Research	Publication Date: Jan 24, 2024
Citations: 1	License type: CC BY 4.0

Abstract

Robust analysis of DNA sequencing data needs to include a set of quality control steps to ensure that technical bias is kept to a minimum. A metric easily obtained is the frequency of each of the nucleobases for each position across all sequencing reads. Here, we explore the differences in nucleobase compositions of various library types produced by standard experimental methodologies. We obtained the compositions of nearly 3000 publicly available datasets and subjected them to Uniform Manifold Approximation and Projection (UMAP) dimensionality reduction for a two-dimensional representation of their composition characteristics. We find that most library types result in a specific composition profile. We use this to give an estimate of how strongly the composition of a test library resembles the profiles of previously published libraries, and how likely the test sample is to be of a particular type. We introduce Librarian, a user-friendly web application and command line tool which enables checking base compositions of test libraries against known library types. Library preparation methods strongly influence the per position nucleobase content. By comparing test libraries to a database of previously published library types we can make predictions regarding the library preparation method. Librarian is a user-friendly tool to access this information for quality assurance purposes as discrepancies can flag potential irregularities very early on.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Librarian: A quality control tool to analyse sequencing library compositions.

Abstract

Talk to us

Similar Papers

More From: F1000Research

Lead the way for us

Similar Papers

Librarian: A quality control tool to analyse sequencing library compositions.
Kartavya Vashishtha ... Caroline Gaud
F1000Research | VOL. 11
Kartavya Vashishtha, et. al.Kartavya Vashishtha ... Caroline Gaud
29 Sep 2022
F1000Research | VOL. 11

Librarian: A quality control tool to analyse sequencing library compositions
Konstantin Okonechnikov ... Kartavya Vashishtha
F1000Research | VOL. 11
Konstantin Okonechnikov, et. al.Konstantin Okonechnikov ... Kartavya Vashishtha
06 Oct 2022
F1000Research | VOL. 11

Author response: Simultaneous trimodal single-cell measurement of transcripts, epitopes, and chromatin accessibility using TEA-seq
Elliott Swanson ... Cara Lord
-
Elliott Swanson, et. al.Elliott Swanson ... Cara Lord
13 Feb 2021
13 Feb 2021

Author response: Single-cell analysis reveals dynamics of human B cell differentiation and identifies novel B and antibody-secreting cell intermediates
Sabrina Pollastro ... Marc Beyer
-
Sabrina Pollastro, et. al.Sabrina Pollastro ... Marc Beyer
31 Jan 2023
31 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Librarian: A quality control tool to analyse sequencing library compositions.

Abstract

Talk to us

Similar Papers

More From: F1000Research