Five simple guidelines for establishing basic authenticity and reliability of newly generated fungal ITS sequences

R Henrik Nilsson, Teresita M Porter, Martin Hartmann, Ellen Larsson, Parag Vaishampayan, K Martin Eriksson, Martin Ryberg, Erik Kristiansson, Karl-Henrik Larsson, Johannes Bergsten, Kessy Abarenkov, Nils Hallenberg, Leho Tedersoo, Conrad L Schoch, Ari Jumpponen, Johan A A Nylander, Otso Ovaskainen, Johan Bengtsson-Palme, Urmas Kõljalg

doi:10.3897/mycokeys.4.3606

Abstract

Molecular data form an important research tool in most branches of mycology. A non-trivial proportion of the public fungal DNA sequences are, however, compromised in terms of quality and reliability, contributing noise and bias to sequence-borne inferences such as phylogenetic analysis, diversity assessment, and barcoding. In this paper we discuss various aspects and pitfalls of sequence quality assessment. Based on our observations, we provide a set of guidelines to assist in manual quality management of newly generated, near-full-length (Sanger-derived) fungal ITS sequences and to some extent also sequences of shorter read lengths, other genes or markers, and groups of organisms. The guidelines are intentionally non-technical and do not require substantial bioinformatics skills or significant computational power. Despite their simple nature, we feel they would have caught the vast majority of the severely compromised ITS sequences in the public corpus. Our guidelines are nevertheless not infallible, and common sense and intuition remain important elements in the pursuit of compromised sequence data. The guidelines focus on basic sequence authenticity and reliability of the newly generated sequences, and the user may want to consider additional resources and steps to accomplish the best possible quality control. A discussion on the technical resources for further sequence quality management is therefore provided in the supplementary material.

Highlights

The inconspicuous and largely subterranean or endophytic nature of much of fungal life presents a challenge to mycology
Discriminatory yet assessed morphological characters are something of a rare commodity in mycology, and morphology alone often falls short of providing unequivocal species identification and delimitation
DNA sequences represent a key source of information in most branches of mycology, including systematics, taxonomy, and ecology (Stajich et al 2009), and the landmarks include the establishment of a phylogenetic backbone and a classification system for the fungal kingdom (Blackwell et al 2006; James et al 2006; Hibbett et al 2007)

Summary

Introduction

The inconspicuous and largely subterranean or endophytic nature of much of fungal life presents a challenge to mycology. The guidelines are simple and straightforward to apply; substantial bioinformatics expertise is not required, and only on-line resources of the paste-and-click type are used. Their simple nature notwithstanding, we believe that these guidelines would have caught the vast majority of the present severely compromised fungal ITS sequences in the public corpus, had they been available and applied at the time of data generation and accessioning. We would like to stress that the guidelines described here focus on basic sequence authenticity and reliability; they are certainly no panacea for sequence quality management. Their purpose is to assist in pruning severely compromised entries from newly generated, nearly full-length (typically, but not exclusively, Sanger-derived) fungal ITS datasets before those sequences are put to scientific use.

Establish that the sequences come from the intended gene or marker

Establish that any taxonomic annotations given to the sequences make sense

Findings

Concluding remarks

Journal: MycoKeys
Publication Date: Sep 5, 2012
Citations: 159
License type: CC BY 3.0
