Introduction to the Analysis of Environmental Sequences: Metagenomics with MEGAN.

Caner Bağcı,Anna Górska,Daniel H Huson,Sina Beier

doi:10.1007/978-1-4939-9074-0_19

Caner Bağcı, Anna Górska + Show 2 more

Open Access

https://doi.org/10.1007/978-1-4939-9074-0_19

Copy DOI

Abstract

Metagenomics has become a part of the standard toolkit for scientists interested in studying microbes in the environment. Compared to 16S rDNA sequencing, which allows coarse taxonomic profiling of samples, shotgun metagenomic sequencing provides a more detailed analysis of the taxonomic and functional content of samples. Long read technologies, such as developed by Pacific Biosciences or Oxford Nanopore, produce much longer stretches of informative sequence, greatly simplifying the difficult and time-consuming process of metagenomic assembly. MEGAN6 provides a wide range of analysis and visualization methods for the analysis of short and long read metagenomic data. A simple and efficient analysis pipeline for metagenomic analysis consists of the DIAMOND alignment tool on short reads, or the LAST alignment tool on long reads, followed by MEGAN. This approach performs taxonomic and functional abundance analysis, supports comparative analysis of large-scale experiments, and allows one to involve experimental metadata in the analysis.

Highlights

Introduction to the Analysis of EnvironmentalSequences: Metagenomics with MEGANCaner Bagcı, Sina Beier, Anna Gorska, and Daniel H
Metagenomic sequencing allows the study of microorganisms found in environmental samples without relying on culturing methods or prior knowledge of the composition of the community
Functional classification of long reads does not necessarily assign each read into one functional class, instead reads are assigned to the functional class of best-scoring alignment in each segment, each segment is assigned to one function and one read can be assigned to multiple different functional classes

Summary

Introduction

Metagenomics is the study of microbiome samples, such as obtained from ocean water, soil, plant matter, or feces, say, using high-throughput DNA sequencing [1]. Long read technologies have been considered too expensive, difficult, or error-prone for application in metagenomics. This is changing and computational analysis methods designed for processing short reads need to be modified to work well on long. A major computational challenge in metagenomics is the alignment of sequencing reads against a comprehensive reference database. Long reads require frame-shift aware alignment tools, such as LAST [3, 4], because insertions or deletions due to sequencing errors impact long reads, as discussed in Subheading 2. We will first discuss how to perform basic alignment and analysis of short reads in Subheading 2.1 and long reads in Subheading 2.2. In Subheading 4.1 we describe some additional resources available for using MEGAN 6

Workflows for Metagenomic Analysis with MEGAN

Short Read Pipeline

Taxonomic and Functional Classification with MEGAN6

Investigation of the Results

Long Read Analysis Pipeline

Taxonomic and Functional Classification of Long Reads

Comparison of Multiple Samples

Outlook

Findings

MEGAN Resources

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Methods in molecular biology (Clifton, N.J.)	Publication Date: Jan 1, 2019
Citations: 26	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Introduction to the Analysis of Environmental Sequences: Metagenomics with MEGAN.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Methods in molecular biology (Clifton, N.J.)

Lead the way for us

Similar Papers

Long-read sequencing in ecology and evolution: Understanding how complex genetic and epigenetic variants shape biodiversity.
Dan G Bock ... Polina Novikova
Molecular Ecology | VOL. 32
Dan G Bock, et. al.Dan G Bock ... Polina Novikova
01 Mar 2023
Molecular Ecology | VOL. 32

Unicycler: Resolving bacterial genome assemblies from short and long sequencing reads.
Ryan R Wick ... Adam M Phillippy
PLoS computational biology | VOL. 13
Ryan R Wick, et. al.Ryan R Wick ... Adam M Phillippy
08 Jun 2017
PLoS computational biology | VOL. 13

G-SNPM - A GPU-based SNP mapping tool
Alessandro Orro ... Luciano Milanesi
EMBnet journal | VOL. 18
Alessandro Orro, et. al.Alessandro Orro ... Luciano Milanesi
09 Nov 2012
EMBnet journal | VOL. 18

Cerulean: A Hybrid Assembly Using High Throughput Short and Long Reads
Viraj Deshpande ... Son Pham
-
Viraj Deshpande, et. al.Viraj Deshpande ... Son Pham
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Introduction to the Analysis of Environmental Sequences: Metagenomics with MEGAN.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Methods in molecular biology (Clifton, N.J.)