Abstract
BackgroundThe presence of non-coding introns is a characteristic feature of most eukaryotic genes. While the size of the introns, number of introns per gene and the number of intron-containing genes can vary greatly between sequenced eukaryotic genomes, the structure of a gene with reference to intron presence and positions is typically conserved in closely related species. Unexpectedly, the ABCB1 (ATP-Binding Cassette Subfamily B Member 1) gene which encodes a P-glycoprotein and underlies dwarfing traits in maize (br2), sorghum (dw3) and pearl millet (d2) displayed considerable variation in intron composition.ResultsAn analysis of the ABCB1 gene structure in 80 angiosperms revealed that the number of introns ranged from one to nine. All introns in ABCB1 underwent either a one-time loss (single loss in one lineage/species) or multiple independent losses (parallel loss in two or more lineages/species) with the majority of losses occurring within the grass family. In contrast, the structure of the closest homolog to ABCB1, ABCB19, remained constant in the majority of angiosperms analyzed. Using known phylogenetic relationships within the grasses, we determined the ancestral branch-points where the losses occurred. Intron 7, the longest intron, was lost in only a single species, Mimulus guttatus, following duplication of ABCB1. Semiquantitative PCR showed that the M. guttatus ABCB1 gene copy without intron 7 had significantly lower transcript levels than the gene copy with intron 7. We further demonstrated that intron 7 carried two motifs that were highly conserved across the monocot-dicot divide.ConclusionsThe ABCB1 gene structure is highly dynamic, while the structure of ABCB19 remained largely conserved through evolution. Precise removal of introns, preferential removal of smaller introns and presence of at least 2 bp of microhomology flanking most introns indicated that intron loss may have predominantly occurred through non-homologous end-joining (NHEJ) repair of double strand breaks. Lack of microhomology in the exon upstream of lost phase I introns was likely due to release of the selective constraint on the penultimate base (3rd base in codon) of the terminal codon by the splicing machinery. In addition to size, the presence of regulatory motifs will make introns recalcitrant to loss.
Highlights
The presence of non-coding introns is a characteristic feature of most eukaryotic genes
Expanding the ATP Binding Cassette Subfamily B Member 1 (ABCB1) comparison to all sequenced genomes available at the time demonstrated that 28 out of Because ABCB1 is a member of a multigene family, we examined the structure of ABCB19, the closest extant paralog of ABCB1
If intron loss occurred before the duplication of ABCB19, one of the ABCB1 copies must have gained an intron in B. oleraceae
Summary
The presence of non-coding introns is a characteristic feature of most eukaryotic genes. While the size of the introns, number of introns per gene and the number of intron-containing genes can vary greatly between sequenced eukaryotic genomes, the structure of a gene with reference to intron presence and positions is typically conserved in closely related species. Introns are a characteristic and common feature in eukaryotic genomes. They likely accumulated very early in eukaryotic evolution and some introns have remained in conserved positions across kingdoms for a period close to two billion years [1,2,3]. Similar rates of intron loss have been observed in plants, including Arabidopsis thaliana (1–3 × 10−10 [8]), A. lyrata (2.73 × 10−11 [8]), Oryza sativa (3.3 × 10−10 [10]; 8.1 × 10−11 [9]), and the grasses Setaria italica, Brachypodium distachyon, Sorghum bicolor and Zea mays (1.1–1.8 × 10−10 [9])
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.