Splicing is an important step of gene expression in all eukaryotes. Splice sites might be used with different efficiency, giving rise to alternative splicing products. At the same time, splice sites might be used at a variable rate. We used 5-ethynyl uridine labeling to sequence a nascent transcriptome of HeLa cells and deduced the rate of splicing for each donor and acceptor splice site. The following correlation analysis showed a correspondence of primary transcript features with the rate of splicing. Some dependencies we revealed were anticipated, such as a splicing rate decrease with a decreased complementarity of the donor splice site to U1 and acceptor sites to U2 snRNAs. Other dependencies were more surprising, like a negative influence of a distance to the 5' end on the rate of the acceptor splicing site utilization, or the differences in splicing rate between long, short, and RBM17-dependent introns. We also observed a deceleration of last intron splicing with an increase of the distance to the poly(A) site, which might be explained by the cooperativity of the splicing and polyadenylation. Additional analysis of splicing kinetics of SF3B4 knockdown cells suggested the impairment of a U2 snRNA recognition step. As a result, we deconvoluted the effects of several examined features on the splicing rate into a single regression model. The data obtained here are useful for further studies in the field, as they provide general splicing rate dependencies as well as help to justify the existence of slowly removed splice sites.
Read full abstract