Targeted mouse mutants are a common tool used to investigate gene function. The International Knockout Mouse Consortium undertook a large-scale screen of mouse mutants, making use of the knockout-first allele design that contains the En2 splice acceptor sequence coupled to the lacZ reporter gene. Although the knockout-first allele was designed to interfere with splicing and thus disrupt gene function, the En2 sequence has been reported to be transcribed within the host gene mRNA due to a cryptic splice site within the En2 sequence which allows splicing to the next exon of the host gene. In some circumstances, this has the potential to permit translation of a mutant protein. Here, we describe our computational analysis of all the mouse protein-coding genes with established knockout-first embryonic stem cell lines, and our predictions of their transcription outcome should the En2 sequence be included. As part of the large-scale mutagenesis program, mutant mice underwent a broad phenotyping screen, and their phenotypes are available. No wide-scale effects on mouse phenotypes reported were found as a result of the predicted En2 insertion. However, the En2 insertion was found experimentally in the transcripts of 24 of 35 mutant alleles examined, including the five already described, two with evidence of readthrough. Splicing from the cryptic splice site also has the potential to disrupt expression of the lacZ reporter gene. It is recommended that mutant transcripts be checked for this insertion as well as for leaky transcription in studies involving knockout-first alleles.
Read full abstract