Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus wood.

Tuo He,Alex C Wiedenhoeft,Yafang Yin,Lichao Jiao

doi:10.1007/s00425-019-03116-3

Abstract

Machine-learning approaches (MLAs) for DNA barcoding outperform distance- and tree-based methods on identification accuracy and cost-effectiveness to arrive at species-level identification of wood. DNA barcoding is a promising tool to combat illegal logging and associated trade, and the development of reliable and efficient analytical methods is essential for its extensive application in the trade of wood and in the forensics of natural materials more broadly. In this study, 120 DNA sequences of four barcodes (ITS2, matK, ndhF-rpl32, and rbcL) generated in our previous study and 85 downloaded from National Center for Biotechnology Information (NCBI) were collected to establish a reference data set for six commercial Pterocarpus woods. MLAs (BLOG, BP-neural network, SMO and J48) were compared with distance- (TaxonDNA) and tree-based (NJ tree) methods based on identification accuracy and cost-effectiveness across these six species, and also were applied to discriminate the CITES-listed species Pterocarpus santalinus from its anatomically similar species P. tinctorius for forensic identification. MLAs provided higher identification accuracy (30.8-100%) than distance- (15.1-97.4%) and tree-based methods (11.1-87.5%), with SMO performing the best among the machine learning classifiers. The two-locus combination ITS2 + matK when using SMO classifier exhibited the highest resolution (100%) with the fewest barcodes for discriminating the six Pterocarpus species. The CITES-listed species P. santalinus was discriminated successfully from P. tinctorius using MLAs with a single barcode, ndhF-rpl32. This study shows that MLAs provided higher identification accuracy and cost-effectiveness for forensic application over other analytical methods in DNA barcoding of Pterocarpus wood.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus wood.

Abstract

Talk to us

Similar Papers

More From: Planta

Lead the way for us

Similar Papers

DNA barcoding identification of IUCN Red listed threatened species in the genus Aquilaria (Thymelaeaceae) using machine learning approaches
Yuexia Lin ... Zhaoyu Wang
Phytochemistry Letters | VOL. 55
Yuexia Lin, et. al.Yuexia Lin ... Zhaoyu Wang
01 Jun 2023
Phytochemistry Letters | VOL. 55

Testing efficacy of distance and tree-based methods for DNA barcoding of grasses (Poaceae tribe Poeae) in Australia.
Joanne L Birch ... Neville G Walsh
PloS one | VOL. 12
Joanne L Birch, et. al.Joanne L Birch ... Neville G Walsh
30 Oct 2017
PloS one | VOL. 12

Identifying species of moths (Lepidoptera) from Baihua Mountain, Beijing, China, using DNA barcodes.
Xiao F Liu ... Cong H Yang
Ecology and evolution | VOL. 4
Xiao F Liu, et. al.Xiao F Liu ... Cong H Yang
20 May 2014
Ecology and evolution | VOL. 4

DNA barcoding evaluation of geophytes: Comparative efficiency of three barcode loci for Anemone (Ranunculaceae) and Gladiolus (Iridaceae)
Zübeyde Uğurlu Aydın ... Ali A Dönmez
Plant Biosystems - An International Journal Dealing with all Aspects of Plant Biology | VOL. 156
Zübeyde Uğurlu Aydın, et. al.Zübeyde Uğurlu Aydın ... Ali A Dönmez
18 Sep 2021
Plant Biosystems - An International Journal Dealing with all Aspects of Plant Biology | VOL. 156

Journal: Planta	Publication Date: Mar 1, 2019
Citations: 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Machine learning approaches outperform distance- and tree-based methods for DNA barcoding of Pterocarpus wood.

Abstract

Talk to us

Similar Papers

More From: Planta