ABSTRACTGene trapping is a high-throughput approach that has been used to introduce insertional mutations into the genome of mouse embryonic stem (ES) cells. It is performed with generic gene trap vectors that simultaneously mutate and report the expression of the endogenous gene at the site of insertion and provide a DNA sequence tag for the rapid identification of the disrupted gene. Large-scale international efforts assembled a gene trap library of 566,554 ES cell lines with single gene trap integrations distributed throughout the genome. Here, we re-investigated this unique library and identified mutations in 2202 non-coding RNA (ncRNA) genes, in addition to mutations in 12,078 distinct protein-coding genes. Moreover, we found certain types of gene trap vectors preferentially integrating into genes expressing specific long non-coding RNA (lncRNA) biotypes. Together with all other gene-trapped ES cell lines, lncRNA gene-trapped ES cell lines are readily available for functional in vitro and in vivo studies.
Read full abstract