Abstract

This chapter describes the different sequencing strategies, the pros and cons of the different strategies to help you select the optimal DNA sequencing strategy for your research question, and how to assembly and annotate DNA sequences. DNA sequencing is the determination of the order of nucleotides of parts or whole chromosomes of organisms and virus. DNA sequencing can be done for a single gene or a whole genome or many genomes at a time such as in metagenomics. One of the most popular sequencing machines is the MiSeq from Illumina which is capable of doing small whole-genome sequencing, transcriptomics, and 16S rRNA metagenomics. It is possible to multiplex by using unique combinations of specific barcodes and indexes. Real-time, single-molecule sequencing allows for sequencing of the native DNA, resulting in significantly longer read lengths and sequence information available when the bases are incorporated, i.e., information available in real time. Base calling is the first step in sequencing where the electronic signal generated in the sequencing machine is separated from random noise and converted to nucleotide information. Then the nucleotide information needs to be assembled to DNA sequences which resemble the original DNA sequenced as best as possible. This can either be done de novo without a reference or with a reference if the genome of the organism or virus is well known. The most important quality parameter to consider is the coverage. Another important parameter is N50. Comparison of different assemblies can be made with Quast. The “minimum information about a genome sequence (MIGS) specification provides an exhaustive list of the information required for genomic sequences including demands to metadata. Genome annotation is the identification and labeling of all the relevant features of the genomic sequence. At first, this includes the coordinates provided as nucleotide positions where coding regions are predicted. It is mainly a prediction of coding genes; however, other structural genes such as rRNA are also identified.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.