Based on bioinformatics and machine learning methods, we conducted a study to screen the core genes of minimal change disease (MCD) and further explore its pathogenesis. First, we obtained the chip data sets GSE108113 and GSE200828 from the Gene Expression Comprehensive Database (GEO), which contained MCD information. We then used R software to analyze the gene chip data and performed functional enrichment analysis. Subsequently, we employed Cytoscape to screen the core genes and utilized machine learning algorithms (random forest and LASSO regression) to accurately identify them. To validate and analyze the core genes, we conducted immunohistochemistry (IHC) and gene set enrichment analysis (GSEA). Our results revealed a total of 394 highly expressed differential genes. Enrichment analysis indicated that these genes are primarily involved in T cell differentiation and p13k-akt signaling pathway of immune response. We identified NOTCH1, TP53, GATA3, and TGF-β1 as the core genes. IHC staining demonstrated significant differences in the expression of these four core genes between the normal group and the MCD group. Furthermore, GSEA suggested that their up-regulation may be closely associated with the pathological changes in MCD kidneys, particularly in the glycosaminoglycans signaling pathway. In conclusion, our study highlights NOTCH1, TP53, GATA3, and TGF-β1 as the core genes in MCD and emphasizes the close relationship between glycosaminoglycans and pathogenesis of MCD.
Read full abstract