Although multiple vaccines have been developed and widely administered, several severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants have been reported to evade immune responses and spread diffusely. Here, 108 RNA-seq files from coronavirus disease 2019 (COVID-19) patients and healthy donors (HD) were downloaded to extract their TCR immune repertoire by MiXCR. Those extracted TCR repertoire were compared and it was found that disease progression was related negatively with diversity and positively with clonality. Specifically, greater proportions of high-abundance clonotypes were observed in active and severe COVID-19 samples, probably resulting from strong stimulation of SARS-CoV-2 epitopes and a continued immune response in host. To investigate the specific recognition between TCR CDR3 and SARS-CoV-2 epitopes, we constructed an accurate classifier CoV2-TCR with an AUC of 0.967 in an independent dataset, which outperformed several similar tools. Based on this model, we observed a huge range in the number of those TCR CDR3 recognizing those different peptides, including 28 MHC-I epitopes from SARS-CoV-2 and 22 immunogenic peptides from SARS-CoV-2 variants. Interestingly, their proportions of high-abundance, low-abundance and rare clonotypes were close for each peptide. To expand the potential application of this model, we established the webserver, CoV2-TCR, in which users can obtain those recognizing CDR3 sequences from the TCR repertoire of COVID-19 patients based on the 9-mer peptides containing mutation site(s) on the four main proteins of SARS-CoV-2 variants. Overall, this study provides preliminary screening for candidate antigen epitopes and the TCR CDR3 that recognizes them, and should be helpful for vaccine design on SARS-CoV-2 variants.
Read full abstract