Identifying errors in avian influenza virus gene sequences and implications for data usage of public databases

Jinling Li,Heinrich Zu Dohna,Joy Miller,Carol J Cardona,Tim E Carpenter

doi:10.1016/j.ygeno.2009.09.005

Jinling Li, Heinrich Zu Dohna + Show 3 more

https://doi.org/10.1016/j.ygeno.2009.09.005

Copy DOI

Journal: Genomics	Publication Date: Sep 18, 2009
Citations: 5	License type: elsevier-specific: oa user license

Affiliation: University of California, Davis

Abstract

Public gene sequence databases have become important research tools to understand viruses and other organisms. Evidence suggests that the identifying information for some of the sequences in these databases might not belong to the sequences they are associated with. We developed two tests to conduct a comprehensive analysis of all published sequences of the hemaglutinin and neuramidase genes of avian influenza viruses (AIVs) to identify sequences that may have been misclassified. One test identified sequence pairs with highly similar nucleotide sequences despite a difference of several years between their sampling dates. Another test, which was applied to samples sequenced and deposited more than once, detected sequences with more nucleotide differences to their own than to their closest relatives. All sequences identified as misclassified were further traced to relevant publications to assess the likelihood of contamination and determine if any conclusions were associated with the use of these sequences. Our results suggested that among 4040 published gene sequences examined, approximately 0.8% might be misclassified and that publications using these sequences may include inaccurate statements. Findings from this report suggest that using laboratory-adapted strains and handling multiple samples simultaneously increases the risk of contamination. The tests reported here may be useful for screening new submissions to public sequence databases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Identifying errors in avian influenza virus gene sequences and implications for data usage of public databases

Abstract

Talk to us

Similar Papers

More From: Genomics

Lead the way for us

Similar Papers

Evolution of the NS genes of the influenza a viruses. I. The genetic relatedness of the NS genes of animal influenza viruses
Katsuhisa Nakajima ... Takashi Ogawa
Virus Genes | VOL. 4
Katsuhisa Nakajima, et. al.Katsuhisa Nakajima ... Takashi Ogawa
01 Jun 1990
Virus Genes | VOL. 4

Clade 2.3.2 Avian Influenza Virus (H5N1), Qinghai Lake Region, China, 2009–2010
Xudong Hu ... Qingyu Zhu
Emerging Infectious Diseases | VOL. 17
Xudong Hu, et. al.Xudong Hu ... Qingyu Zhu
01 Mar 2011
Clade 2.3.2 Avian Influenza Virus (H5N1), Qinghai Lake Region, China, 2009–2010
Xudong Hu ... Qingyu Zhu

Pathology, Molecular Biology, and Pathogenesis of Avian Influenza A (H5N1) Infection in Humans
Christine Korteweg ... Jiang Gu
The American Journal of Pathology | VOL. 172
Christine Korteweg, et. al.Christine Korteweg ... Jiang Gu
01 May 2008
The American Journal of Pathology | VOL. 172

Virus Pathotype and Deep Sequencing of the HA Gene of a Low Pathogenicity H7N1 Avian Influenza Virus Causing Mortality in Turkeys
Munir Iqbal ... Steve C. Essen
PLoS ONE | VOL. 9
Munir Iqbal, et. al.Munir Iqbal ... Steve C. Essen
28 Jan 2014
PLoS ONE | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identifying errors in avian influenza virus gene sequences and implications for data usage of public databases

Abstract

Talk to us

Similar Papers

More From: Genomics