Evaluation of host integration profiles by adeno-associated virus (AAV) is an important component of de-risking novel AAV gene therapies. Targeted enrichment sequencing (TES) is a cost-effective and comprehensive method for assessing integration. Most published TES datasets have been generated using short-read sequencing, which enables quantitation of integration sites (IS) and identifies patterns such as hotspots or clonal expansion. Characteristics such as IS length and recombination require longer reads to measure. The present study compared short-read to long-read TES using samples from monkeys treated with AAV, and used in vitro lentiviral-treated samples, a stable cell line and an engineered spike-in as controls. Both methods showed stochastic integration by both AAV and lentivirus, with most vector domains identified in IS. More IS were identified by short-read TES, as deeper coverage per base was achieved from a single sequencing run. AAV-treated samples showed minimal evidence of clonal expansion, in contrast to in vitro treated and stably transduced lentiviral cell line samples. Long-read TES revealed vector rearrangement in 4-40% of IS in AAV-treated animals. In summary, both methods yielded similar conclusions about relative numbers of IS and overall patterns. Long-read TES identified fewer IS, but enabled measurement of IS length and recombination patterns.
Read full abstract