ObjectiveDue to the high global prevalence of silicosis and the ongoing challenges in its diagnosis, this pilot study aims to screen biomarkers from routine blood parameters and develop a multi-biomarker model for its early detection.MethodsA case-control study was conducted to screen biomarkers for the diagnosis of silicosis using LASSO regression, SVM and RF. A sample of 612 subjects (half cases and half controls) were randomly divided into training and test groups in a 2:1 ratio. Logistic regression analysis and receiver operating characteristic (ROC) curves were used to construct a multiple biomarker-based model for the diagnosis of silicosis, which was applied to both the training and the testing datasets.ResultsThe training cohort revealed significant statistical differences (P < 0.05) in multiple hematologic parameters between silicosis patients and healthy individuals. Based on machine learning, eight silicosis biomarkers were screened and identified from routine blood cell, biochemical and coagulation parameters. D-dimer (DD), Albumin/Globulin (A/G), lactate dehydrogenase (LDH) and white blood cells (WBC) were selected for constructing the logistic regression model for silicosis diagnostics. This model had a satisfactory performance in the training cohort with an area under the ROC curve (AUC) of 0.982, a diagnostic sensitivity of 95.4%, and a specificity of 92.2%. In addition, the model had a prediction accuracy of 0.936 with an AUC of 0.979 in the independent test cohort. Moreover, the diagnostic accuracies of the logistic model in silicosis stages 1, 2, and 3 were 88.0, 95.4, and 94.3% with an AUC of 0.968, 0.983, and 0.990 for silicosis, respectively.ConclusionA diagnostic model based on DD, A/G, LDH and WBC is successfully proposed for silicosis diagnostics. It is cheap, sensitive, specific, and preliminarily offers a potential strategy for the large-scale screening of silicosis.
Read full abstract