Abstract
BackgroundBlood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. However, accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasible.MethodsWhole blood gene expression data and DNA methylation data were downloaded from Gene Expression Omnibus (GEO) database. A linear model was used to identify significantly differentially expressed genes (DEGs) and differentially methylated genes (DMGs) according to specific gene regions 5′—C—phosphate—G—3′ (CpGs) or all gene regions CpGs in PD. Gene set enrichment analysis was then applied to DEGs and DMGs. Subsequently, data integration analysis was performed to identify robust PD-associated blood biomarkers. Finally, the random forest algorithm and a leave-one-out cross validation method were performed to construct classifiers based on gene expression data integrated with methylation data.ResultsEighty-five (85) significantly hypo-methylated and upregulated genes in PD patients compared to healthy controls were identified. The dominant hypo-methylated regions of these genes were significantly different. Some genes had a single dominant hypo-methylated region, while others had multiple dominant hypo-methylated regions. One gene expression classifier and two gene methylation classifiers based on all or dominant methylation-altered region CpGs were constructed. All have a good prediction power for PD.ConclusionsGene expression and methylation data integration analysis identified a blood-based 53-gene signature, which could be applied as a biomarker for PD.
Highlights
Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable
We identified 1045 significantly Differentially expressed gene (DEG) in blood of PD compared to healthy controls, in which 108 genes are downregulated and 937 genes are upregulated (Additional file 1: Table S1)
In order to compare our differential results with the results from the original paper, we have made a table including the 100 most differential probes that are mapped to 87 genes from the original paper with logFC, p value, and Benjamini and Hochberg’s method (BH) adjusted p value from our analysis (Additional file 1: Table S2)
Summary
Blood-based gene expression or epigenetic biomarkers of Parkinson’s disease (PD) are highly desirable. Accuracy and specificity need to be improved, and methods for the integration of gene expression with epigenetic data need to be developed in order to make this feasible. Parkinson’s disease (PD) is the second most common neurodegenerative disease, following Alzheimer’s disease. Brain imaging of the nigrostriatal dopamine system has been used as a biomarker for early disease along with cerebrospinal fluid analysis of α-synuclein. These methods remain costly or are invasive [4]. Blood biomarkers are easier to obtain, much cheaper, and less invasive [5], and some researchers have
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have