Abstract
Nucleic acid biomarkers embody inherent importance for differentiating disease-causing organisms or environmental pathogens. Identifying unknown nucleic acids in low abundance remains extremely challenging. Previously, we reported a method to identify complementary DNA (cDNA) molecules based on sequence-specific topographical labels measured by atomic force microscopy (AFM). However, the accuracy is limited because only one type of nicking endonuclease was used as the labeling agent. Here we investigate how accuracy is improved using multiple types of nicking endonucleases in combinations. The numerical experiments created cDNA molecules incorporating measurement error or labeling defects, which were later compared with the 29,563 human messenger RNA (mRNA) transcript database with ideal labels. After comparison, the unknown cDNA molecule was identified as the transcript with the highest matching score. Thus, the accuracy was determined by the rate of true positives. We found that the accuracy is positively proportional to the label number. Compared with cases using single nicking endonuclease, which has an average accuracy of 51.2% ± 34.4%, the average accuracy was improved to 97.1% ± 5.6% using an optimized combination of NtBsmAI + NtBstNBI + NtAlwI. This improved accuracy is applicable to more than 85% of human mRNA transcripts. Together, our study suggests an optimization strategy for identifying nucleic acids in low abundance using the AFM-based method, with implications for diseases diagnosis, pathogen identification, and forensics at the single molecule level.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have