Abstract

AbstractThis talk describes a mapping between the NCBI taxonomy database and Wikipedia. These two databases were chosen because the NCBI taxonomy contains all the taxa for which sequences are publicly available, and for many taxa Wikipedia is the first site returned in a Google search on that taxon's scientific name. The NCBI web pages for nearly 53,000 NCBI taxa now have a link to the corresponding page in Wikipedia.

Highlights

  • One
of
the
great
challenges
of
phyloinformatics
is
linking
together
information
 on
phylogenies,
taxa,
genomes,
specimens,
and
publications.

  • One
approach
to
 linking
disparate
data
is
to
use
shared
identifiers.

  • For
example,
if
a
bibliographic
 database
and
a
nomenclature
database
both
use
the
same
identifier
for
a
 publication
(such
as
a
DOI),

we
can

link
the
two
pieces
of
information
 together
using
that
identifier.

Read more

Summary

Introduction

One
of
the
great
challenges
of
phyloinformatics
is
linking
together
information
 on
phylogenies,
taxa,
genomes,
specimens,
and
publications. One
approach
to
 linking
disparate
data
is
to
use
shared
identifiers. For
example,
if
a
bibliographic
 database
and
a
nomenclature
database
both
use
the
same
identifier
for
a
 publication
(such
as
a
DOI),

we
can

link
the
two
pieces
of
information
 together
using
that
identifier. An
obstacle
to
this
approach
is
the
lack
identifiers,
 or
failure
to
reuse
existing
identifiers
(Page,
2008).

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call