InsertionMapper: a pipeline tool for the identification of targeted sequences from multidimensional high throughput sequencing data

Wenwei Xiong,Chunguang Du,Limei He,Yubin Li,Hugo K Dooner

doi:10.1186/1471-2164-14-679

Wenwei Xiong, Chunguang Du + Show 3 more

Open Access

https://doi.org/10.1186/1471-2164-14-679

Copy DOI

Abstract

BackgroundThe advent of next-generation high-throughput technologies has revolutionized whole genome sequencing, yet some experiments require sequencing only of targeted regions of the genome from a very large number of samples. These regions can be amplified by PCR and sequenced by next-generation methods using a multidimensional pooling strategy. However, there is at present no available generalized tool for the computational analysis of target-enriched NGS data from multidimensional pools.ResultsHere we present InsertionMapper, a pipeline tool for the identification of targeted sequences from multidimensional high throughput sequencing data. InsertionMapper consists of four independently working modules: Data Preprocessing, Database Modeling, Dimension Deconvolution and Element Mapping. We illustrate InsertionMapper with an example from our project 'New reverse genetics resources for maize’, which aims to sequence-index a collection of 15,000 independent insertion sites of the transposon Ds in maize. Identified sequences are validated by PCR assays. This pipeline tool is applicable to similar scenarios requiring analysis of the tremendous output of short reads produced in NGS sequencing experiments of targeted genome sequences.ConclusionsInsertionMapper is proven efficacious for the identification of target-enriched sequences from multidimensional high throughput sequencing data. With adjustable parameters and experiment configurations, this tool can save great computational effort to biologists interested in identifying their sequences of interest within the huge output of modern DNA sequencers. InsertionMapper is freely accessible at https://sourceforge.net/p/insertionmapper and http://bo.csam.montclair.edu/du/insertionmapper.

Highlights

The advent of next-generation high-throughput technologies has revolutionized whole genome sequencing, yet some experiments require sequencing only of targeted regions of the genome from a very large number of samples
We illustrate the use of InsertionMapper with an example from our maize genome project that aims to create a large set of single gene knockouts with a transgenic Ds transposon marked with green fluorescent protein (Dsg). This pipeline tool is applicable to similar situations that require analysis of the tremendous output of short reads produced in Next-generation sequencing (NGS) sequencing experiments of targeted genome sequences
The InsertionMapper pipeline tool was originally designed for the identification of Ds-targeted sequences from NGS data, but should suit most scenarios in which biologists want to identify and map targeted sequences from large multidimensional sample pools using high throughput sequencing technology

Summary

Results

We present InsertionMapper, a pipeline tool for the identification of targeted sequences from multidimensional high throughput sequencing data. InsertionMapper consists of four independently working modules: Data Preprocessing, Database Modeling, Dimension Deconvolution and Element Mapping. We illustrate InsertionMapper with an example from our project ‘New reverse genetics resources for maize’, which aims to sequence-index a collection of 15,000 independent insertion sites of the transposon Ds in maize. Identified sequences are validated by PCR assays. This pipeline tool is applicable to similar scenarios requiring analysis of the tremendous output of short reads produced in NGS sequencing experiments of targeted genome sequences

Conclusions

Background

Results and discussion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Genomics	Publication Date: Oct 4, 2013
Citations: 17	License type: cc-by

R Discovery Prime

R Discovery Prime

InsertionMapper: a pipeline tool for the identification of targeted sequences from multidimensional high throughput sequencing data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

Optimizing Multidimensional Pooling for Variational Quantum Algorithms
Mingyoung Jeng ... Dylan Kneidel
Algorithms | VOL. 17
Mingyoung Jeng, et. al.Mingyoung Jeng ... Dylan Kneidel
15 Feb 2024
Algorithms | VOL. 17

Automation Highlights from the Literature
Xiaole Mao ... Tal Murthy
SLAS TECHNOLOGY: Translating Life Sciences Innovation | VOL. 18
Xiaole Mao, et. al.Xiaole Mao ... Tal Murthy
15 May 2013
SLAS TECHNOLOGY: Translating Life Sciences Innovation | VOL. 18

High-throughput experiments for rare-event rupture of materials
Yifan Zhou ... Tongqing Lu
Matter | VOL. 5
Yifan Zhou, et. al.Yifan Zhou ... Tongqing Lu
20 Jan 2022
Matter | VOL. 5

Batch effect detection and correction in RNA-seq data using machine-learning-based automated assessment of quality
Maximilian Sprang ... Jean-Fred Fontaine
BMC bioinformatics | VOL. 23
Maximilian Sprang, et. al.Maximilian Sprang ... Jean-Fred Fontaine
01 Jul 2022
BMC bioinformatics | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

InsertionMapper: a pipeline tool for the identification of targeted sequences from multidimensional high throughput sequencing data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics