Existing Sequencing Methods Research Articles

Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below those of current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.

A mismatch-free hybridization of oligonucleotides containing from 11 to 20 monomers to unknown DNA represents, in essence, a sequencing of a complementary target. Realizing this, we have used probability calculations and, in part, computer simulations to estimate the types and numbers of oligonucleotides that would have to be synthesized in order to sequence a megabase-plus segment of DNA. We estimate that 95,000 specific mixes of 11-mers, mainly of the 5′ (A,T,C,G)(A,T,C,G)N8(A,T,C,G) 3′ type, hybridized consecutively to dot blots of cloned genomic DNA fragments would provide primary data for the sequence assembly. An optimal mixture of representative libraries in M13 vector, having inserts of (i) 7 kb, (ii) 0.5 kb genomic fragments randomly ligated in up to 10-kb inserts, and (iii) tandem “jumping” fragments 100 kb apart in the genome, will be needed. To sequence each million base pairs of DNA, one would need hybridization data from about 2100 separate hybridization sample dots. Inevitable gaps and uncertainties in alignment of sequenced fragments arising from nonrandom or repetitive sequence organization of complex genomes and difficulties in cloning “poisonous” sequences in Escherichia coli, inherent to large-scale sequencing by any method, have been considered and minimized by choice of libraries and number of subclones used for hybridization. Because it is based on simpler biochemical procedures, our method is inherently easier to automate than existing sequencing methods. The sequence can be derived from simple primary data only by extensive computing. Phased experimental tests and computer simulations increasing in complexity are needed before accurate estimates can be made in terms of cost and speed of sequencing by the new approach. Nevertheless, sequencing by hybridization should show advantages over existing methods because of the inherent redundancy and parallelism in its data gathering.
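The idea that a sequence can be computed back from hybridization data can be illustrated with a toy sketch. It is not the assembly procedure from the abstract above: it assumes an idealized, error-free experiment that reports exactly which k-mers occur in a short target, and it assumes no (k-1)-mer repeats in that target. The function names `spectrum` and `reconstruct` are hypothetical, chosen only for this illustration.

```python
def spectrum(seq, k):
    """Return the set of k-mers in seq (an idealized, error-free hybridization result)."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def reconstruct(kmers, k):
    """Rebuild a sequence from its k-mer set by chaining (k-1)-base overlaps.

    Assumption: every (k-1)-mer occurs at most once in the target, so each
    extension step is unambiguous. A ValueError is raised otherwise.
    """
    if k < 2:
        raise ValueError("k must be at least 2")
    # Index each k-mer by its first k-1 bases.
    by_prefix = {}
    for kmer in kmers:
        if kmer[:-1] in by_prefix:
            raise ValueError("ambiguous overlap: repeated (k-1)-mer")
        by_prefix[kmer[:-1]] = kmer
    # The starting k-mer is the one whose prefix is not the suffix of any k-mer.
    suffixes = {kmer[1:] for kmer in kmers}
    starts = [kmer for kmer in kmers if kmer[:-1] not in suffixes]
    if len(starts) != 1:
        raise ValueError("no unique starting k-mer")
    seq = starts[0]
    used = 1
    # Extend one base at a time while some k-mer overlaps the current end.
    while seq[-(k - 1):] in by_prefix:
        seq += by_prefix[seq[-(k - 1):]][-1]
        used += 1
        if used > len(kmers):
            raise ValueError("cycle detected in overlaps")
    return seq


target = "ATGGCGTGCAATCC"  # made-up example sequence
probes = spectrum(target, 4)
print(reconstruct(probes, 4) == target)
```

Real genomes break the no-repeat assumption, which is one reason the abstract stresses library design, redundancy, and extensive computing.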

Articles published on Existing Sequencing Methods

Long read mitochondrial genome sequencing using Cas9-guided adaptor ligation

Nanopore Sequencing Accurately Identifies the Cisplatin Adduct on DNA.

Sequencing of Nucleic Acids: from the First Human Genome to Next Generation Sequencing in COVID-19 Pandemic

Genome reconstruction and haplotype phasing using chromosome conformation capture methodologies.

Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.

Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls

A high-throughput and quantitative method to assess the mutagenic potential of translesion DNA synthesis

An effective mixed‐model assembly line sequencing heuristic for just‐in‐time production systems

Sequencing of megabase plus DNA by hybridization: Theory of the method
