Additional Test Set Research Articles

BackgroundWe aimed to establish and validate a deep learning-based hybrid artificial intelligence (AI) model for the objective morphometric and colorimetric assessment of vitiligo lesions.MethodsTwo main datasets containing curated images of vitiligo lesions from Chinese patients (Fitzpatrick skin types III or IV) were established, including one with 2,720 images for lesion localization study and the other with 1,262 images for lesion segmentation study. Besides, an additional test set containing 145 images of vitiligo lesions from other Fitzpatrick skin types (I, II, or V) was also generated. A 3-stage hybrid model was constructed. YOLO v3 (You Only Look Once, v3) architecture was trained and validated to classify and localize vitiligo lesions, with sensitivity and error rate as primary performance outcomes. Then a segmentation study comparing 3 deep convolutional neural networks (DCNNs), Pyramid Scene Parsing Network (PSPNet), UNet, and UNet++, was carried out based on the Jaccard index (JI). The architecture with the best performance was integrated into the model. Three add-on metrics, namely VAreaA, VAreaR, and VColor were finally developed to measure absolute, relative size changes and pigmentation, respectively. Agreement between the AI model and dermatologist evaluators were assessed.ResultsThe sensitivity of the YOLO v3 architecture to detect vitiligo lesions was 92.91% with an error rate of 14.98%. The UNet++ architecture outperformed the others in the segmentation study (JI, 0.79) and was integrated into the model. On the additional test set, however, the model achieved a lower detection sensitivity (72.41%) and a lower segmentation score (JI, 0.69). With respect to size changes, no difference was observed between the AI model, trained dermatologists (W=0.812, P<0.05), and Photoshop analysis (P=0.075, P=0.212 respectively), which all displayed good concordance.ConclusionsWe developed a novel, convenient, objective, and quantitative deep learning-based hybrid model which simultaneously evaluated both morphometric and colorimetric vitiligo lesions from patients with Fitzpatrick skin types III or IV, rendering it suitable for the assessment of severity of vitiligo lesions in Asians in both clinic and research scenarios. More work is also warranted for its use in other ethnic skin groups.

Read full abstract

Quantitative Structure Activity Relationship (QSAR) models can inform on the correlation between activities and structure-based molecular descriptors. This information is important for the understanding of the factors that govern molecular properties and for designing new compounds with favorable properties. Due to the large number of calculate-able descriptors and consequently, the much larger number of descriptors combinations, the derivation of QSAR models could be treated as an optimization problem. For continuous responses, metrics which are typically being optimized in this process are related to model performances on the training set, for example, and . Similar metrics, calculated on an external set of data (e.g., ), are used to evaluate the performances of the final models. A common theme of these metrics is that they are context -” ignorant”. In this work we propose that QSAR models should be evaluated based on their intended usage. More specifically, we argue that QSAR models developed for Virtual Screening (VS) should be derived and evaluated using a virtual screening-aware metric, e.g., an enrichment-based metric. To demonstrate this point, we have developed 21 Multiple Linear Regression (MLR) models for seven targets (three models per target), evaluated them first on validation sets and subsequently tested their performances on two additional test sets constructed to mimic small-scale virtual screening campaigns. As expected, we found no correlation between model performances evaluated by “classical” metrics, e.g., and and the number of active compounds picked by the models from within a pool of random compounds. In particular, in some cases models with favorable and/or values were unable to pick a single active compound from within the pool whereas in other cases, models with poor and/or values performed well in the context of virtual screening. We also found no significant correlation between the number of active compounds correctly identified by the models in the training, validation and test sets. Next, we have developed a new algorithm for the derivation of MLR models by optimizing an enrichment-based metric and tested its performances on the same datasets. We found that the best models derived in this manner showed, in most cases, much more consistent results across the training, validation and test sets and outperformed the corresponding MLR models in most virtual screening tests. Finally, we demonstrated that when tested as binary classifiers, models derived for the same targets by the new algorithm outperformed Random Forest (RF) and Support Vector Machine (SVM)-based models across training/validation/test sets, in most cases. We attribute the better performances of the Enrichment Optimizer Algorithm (EOA) models in VS to better handling of inactive random compounds. Optimizing an enrichment-based metric is therefore a promising strategy for the derivation of QSAR models for classification and virtual screening.

Read full abstract

Additional Test Set Research Articles

Related Topics

Articles published on Additional Test Set

Automated extraction of information of lung cancer staging from unstructured reports of PET-CT interpretation: natural language processing with deep-learning

Digital finance and investment of micro and small enterprises: Evidence from China

A deep learning-based hybrid artificial intelligence model for the detection and severity assessment of vitiligo lesions

Does directors' and officers' liability insurance induce empire building? Evidence from corporate labor investment

Customer concentration and bank loan contracting: evidence from China

Correcting the Estimation of Viral Taxa Distributions in Next-Generation Sequencing Data after Applying Artificial Neural Networks.

Retention time prediction in hydrophilic interaction liquid chromatography with graph neural network and transfer learning

Deep learning for diagnosis and survival prediction in soft tissue sarcoma

How do Stock Market Participants Value ESG Performance? Evidence from Middle Eastern and North African Countries

Do facilitation payments affect earnings management? Evidence from China

Fusion of neural networks, for LIDAR‐based evidential road mapping

In-line Phosphoramidite Identification by FTIR to Support Real-Time Oligonucleotide Sequence Confirmation

Experimental Analysis and Optimization to Maximize Ultimate Tensile Strength and Ultimate Elongation of Friction Stir Welded AA6082 Aluminum Alloy

Evaluation of QSAR Equations for Virtual Screening.

Intensity Augmentation to Improve Generalizability of Breast Segmentation Across Different MRI Scan Protocols.

Hilbert spectrum analysis for automatic detection and evaluation of Parkinson’s speech

Do Facilitation Payments Affect Earnings Management? Evidence from China

Fusion of Heterogeneous Earth Observation Data for the Classification of Local Climate Zones

Time-Domain Analysis of Molecular Dynamics Trajectories Using Deep Neural Networks: Application to Activity Ranking of Tankyrase Inhibitors.

Automatic Measurement of Kidney and Liver Volumes from MR Images of Patients Affected by Autosomal Dominant Polycystic Kidney Disease

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Additional Test Set Research Articles

Related Topics

Articles published on Additional Test Set

Automated extraction of information of lung cancer staging from unstructured reports of PET-CT interpretation: natural language processing with deep-learning

Digital finance and investment of micro and small enterprises: Evidence from China

A deep learning-based hybrid artificial intelligence model for the detection and severity assessment of vitiligo lesions

Does directors' and officers' liability insurance induce empire building? Evidence from corporate labor investment

Customer concentration and bank loan contracting: evidence from China

Correcting the Estimation of Viral Taxa Distributions in Next-Generation Sequencing Data after Applying Artificial Neural Networks.

Retention time prediction in hydrophilic interaction liquid chromatography with graph neural network and transfer learning

Deep learning for diagnosis and survival prediction in soft tissue sarcoma

How do Stock Market Participants Value ESG Performance? Evidence from Middle Eastern and North African Countries

Do facilitation payments affect earnings management? Evidence from China

Fusion of neural networks, for LIDAR‐based evidential road mapping

In-line Phosphoramidite Identification by FTIR to Support Real-Time Oligonucleotide Sequence Confirmation

Experimental Analysis and Optimization to Maximize Ultimate Tensile Strength and Ultimate Elongation of Friction Stir Welded AA6082 Aluminum Alloy

Evaluation of QSAR Equations for Virtual Screening.

Intensity Augmentation to Improve Generalizability of Breast Segmentation Across Different MRI Scan Protocols.

Hilbert spectrum analysis for automatic detection and evaluation of Parkinson’s speech

Do Facilitation Payments Affect Earnings Management? Evidence from China

Fusion of Heterogeneous Earth Observation Data for the Classification of Local Climate Zones

Time-Domain Analysis of Molecular Dynamics Trajectories Using Deep Neural Networks: Application to Activity Ranking of Tankyrase Inhibitors.

Automatic Measurement of Kidney and Liver Volumes from MR Images of Patients Affected by Autosomal Dominant Polycystic Kidney Disease