Idiopathic pulmonary fibrosis (IPF) is a long-term condition with an unidentified cause, and currently there are no specific treatment options available. Alveolar epithelial type II cells (AT2) constitute a heterogeneous population crucial for secreting and regenerative functions in the alveolus, essential for maintaining lung homeostasis. However, a comprehensive investigation into their cellular diversity, molecular features, and clinical implications is currently lacking. In this study, we conducted a comprehensive examination of single-cell RNA sequencing data from both normal and fibrotic lung tissues. We analyzed alterations in cellular composition between IPF and normal tissue and investigated differentially expressed genes across each cell population. This analysis revealed the presence of two distinct subpopulations of IPF-related alveolar epithelial type II cells (IR_AT2). Subsequently, three unique gene co-expression modules associated with the IR_AT2 subtype were identified through the use of hdWGCNA. Furthermore, we refined and identified IPF-related AT2-related gene (IARG) signatures using various machine learning algorithms. Our analysis demonstrated a significant association between high IARG scores in IPF patients and shorter survival times (p-value < 0.01). Additionally, we observed a negative correlation between the percent predicted diffusing capacity for lung carbon monoxide (% DLCO) and increased IARG scores (cor = -0.44, p-value < 0.05). The cross-validation findings demonstrated a high level of accuracy (AUC > 0.85, p-value < 0.01) in the prognostication of patients with IPF utilizing the identified IARG signatures. Our study has identified distinct molecular and biological features among AT2 subpopulations, specifically highlighting the unique characteristics of IPF-related AT2 cells. Importantly, our findings underscore the prognostic relevance of specific genes associated with IPF-related AT2 cells, offering valuable insights into the advancement of IPF.
Read full abstract