Abstract

We identified 411 processed sequences in the Arabidopsis thaliana genome on the basis that they have lost their intron(s) and have a length that is at least 95% of the length of the gene that gave rise to them. These sequences were generated by 230 different genes and clearly originated from retrotransposition events, because most of them (91%) have a poly(A) tail. They comprise 376 sequences with frameshifts and/or premature stop codons (processed pseudogenes) and 35 sequences without disablements (processed genes). Eleven of these processed genes are likely functional retrotransposed genes because they have low Ka/Ks ratios and high Ks values, and their sequences match numerous Arabidopsis ESTs. Processed sequences are mostly randomly distributed in the Arabidopsis genome, and their rate of accumulation has been steadily decreasing since it peaked some 50 MYA. In contrast with the situation observed in mammals, the processed sequences found in the Arabidopsis genome originate from genes with high copy numbers and not from highly expressed genes. The patterns of spontaneous mutations in Arabidopsis are slightly different from those of mammals but are similar to those observed in Drosophila. This suggests that methylated cytosine deamination is less frequent in Arabidopsis than in mammals.
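The classification criteria stated above can be illustrated with a minimal sketch. It is not the authors' pipeline: the `Candidate` record, its field names, and the `classify` function are hypothetical, and the length measure used for the parent gene is an assumption, since the abstract does not specify it. Only the 95% threshold, the intron-loss requirement, and the pseudogene/gene distinction come from the text.

```python
from dataclasses import dataclass

# Threshold stated in the abstract: candidate length >= 95% of parent length.
MIN_LENGTH_PERCENT = 95


@dataclass
class Candidate:
    """Hypothetical record for one candidate copy of a parent gene."""
    length: int               # length of the candidate sequence
    parent_length: int        # reference length of the parent gene (measure assumed)
    has_introns: bool         # True if any parent intron is retained
    has_frameshift: bool      # True if at least one frameshift is present
    has_premature_stop: bool  # True if at least one premature stop codon is present


def classify(c: Candidate) -> str:
    """Return 'processed pseudogene', 'processed gene', or 'not processed'."""
    # Integer arithmetic avoids floating-point edge cases at exactly 95%.
    too_short = c.length * 100 < MIN_LENGTH_PERCENT * c.parent_length
    if c.has_introns or too_short:
        return "not processed"
    # A disablement (frameshift or premature stop) marks a processed pseudogene.
    if c.has_frameshift or c.has_premature_stop:
        return "processed pseudogene"
    return "processed gene"
```

Under these assumptions, an intronless copy of sufficient length with a frameshift is labelled a processed pseudogene, while one without any disablement is labelled a processed gene; the two classes reported above (376 and 35) sum to the 411 processed sequences.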
