The cobia Rachycentron canadum, mainly distributed in the warm waters of tropical and subtropical regions around the world, remains a fish of considerable economic importance. Detailed diversity and the number of microsatellite sequences in the cobia genome are still unintelligible. The primary aim of this work was to identify and quantify the miscellaneous SSR sequences in the cobia genome. More than 280,000 sequences were sequenced and screened using next-generation sequencing technology and microsatellite identification. Perfect mononucleotide repeats, dinucleotide microsatellites, and trinucleotide microsatellites contain (A)10 /(T)10 , (AC)6 /(TG)6 , and (AAT)5-32 as the largest number of motifs in each type of microsatellite, respectively. The tetranucleotide and pentanucleotide microsatellites (TTM and PTM) consist of the largest number of motifs of both (ATCT)5-32 and (TCAT)5-31 in TTMs, and (CTCTC)5-9 in PTMs, whereas the hexanucleotide microsatellites are rarely observed in the cobia genome. All c. 38000 sequences of composite microsatellites are extremely diverse, including compound (11.71%), interrupted compound (71.77%), complex (0.45%), and interrupted complex (16.07%). In this study, we developed a convenient and useful recording system for writing down and categorizing diverse composite microsatellite types. This system will provide great support for exploring repeat origins, evolutionary mechanisms, and the application of polymorphic microsatellites.
Read full abstract