Background Here, we use summary statistics from Genome-Wide Association Studies (GWAS) to find significant biological pathways, disease pathways, drug targets and drug gene-sets in schizophrenia, BMI, Body Fat % (BF%), and Fat Free Mass (FFM). We introduce a workflow encompassing data collection and curation, analysis of drug gene-sets and biological pathways, drug class enrichment analysis, and visualization of pathway landscapes. Methods We used the schizophrenia GWAS from the PGC Schizophrenia working group phase 2 (35,476 cases), and new GWASs of BMI, BF% and FFM based on UK Biobank data (N=83,477). We curated drug/gene interactions from various databases: DGIdb, DSigDB, Ki DB, PHAROS, and ChEMBL. Biological pathways were collected from MSigDB (GO and canonical pathways) and disease/phenotype pathways from the Open Targets platform. The software MAGMA was used to produce gene-wise and pathway-wise associations, and MetaXcan for transcriptome prediction based on GTEx data (Genotype-Tissue Expression project). The enrichment of drug classes was estimated by grouping drugs by ATC class (Anatomical Therapeutic Chemical) and estimating the enrichment using Wilcoxon-Mann-Whitney test and the AUC (area under the enrichment curve). Pathway landscapes were generated using the kernel Generative Topographic Mapping (GTM), a probabilistic dimensionality reduction algorithm. On these maps, gene-sets are represented by points and the background color is derived from GWAS analysis p-values. Results Results show new Bonferroni-significant pathways in schizophrenia. Antipsychotics and antiepileptics are enriched in the latest and largest schizophrenia GWAS from the PGC Schizophrenia working group. Different disease/phenotype pathways are found in BMI and BF% - e.g., “binge eating” and “age at menarche” for BMI. 47 disease/phenotype pathways and 24 biological pathways are significantly associated with FFM: anthropometric measures, height, myopathies, midgut development, bone development, etc. Pathway landscapes reveal that many significant pathways overlap in “pathway clusters”, such as the skeletal system cluster or the myopathy cluster on FFM maps. Discussion A comprehensive approach is necessary to investigate drug repurposing opportunities and biological pathways using GWAS results. New visualization approaches may help to highlight the overlap between pathways and the genes driving the association. Two key issues are data availability (drug/target affinities, gene annotations) and sample sizes for GWAS studies. Growing public databases and increasing sample sizes will help us to improve our understanding of the genetic etiology of complex diseases.
Read full abstract