Interpretable Representation Research Articles

Abstract

Background: Dyslipidaemia encompasses a wide range of lipoprotein disorders, categorised using one of two classifications: Fredrickson-Levy (FL) or Sniderman.(1,2) However, both classifications are criticised for relying on incomplete knowledge of lipoprotein metabolism, especially with the emergence of novel treatment options and variations in individual treatment responses.(3) Clustering, an unsupervised machine learning (ML) technique that can process a wide range of variables, has the potential to unmask patient groups with distinct molecular profiles and unique therapeutic targets that can inform more effective prevention strategies for cardiovascular disease (CVD).(4)

Aim: We aimed to use unsupervised ML algorithms to discover intrinsic dyslipidaemia categories from lipoprotein measurements, to identify the components of lipid panels needed for classification, and to analyse the similarities between the newly formed clusters and the FL and Sniderman classifications.

Methods: Lipid profiles of 5,080,248 patients were obtained from the Very Large Database of Lipids. This yielded up to 78 blood components per patient, including at least 31 lipoprotein variables. The analysis involved unsupervised K-means clustering, with the value of K and the subset of variables both optimised in an unsupervised manner using a suitable measure of complexity. We then interpreted our clusters using probabilistic decision trees to provide compact and interpretable representations. Finally, we compared the clusters with the Sniderman and FL categories.

Results: In a completely unsupervised fashion, we identified 14 clusters that could be matched to Sniderman categories. The confusion matrix showed total agreement of 76% (see Figure 1, left panel), a relative Cohen’s kappa of 0.78 (the relative version captures accuracy on categories containing smaller numbers of patient profiles) and an accuracy of 96% on the small Type III class. Similar results were observed when matching to FL types. We accurately represented our clusters using probabilistic decision trees of small depth (see Figure 2). We discovered that the data had low intrinsic dimension and a manifold-like structure in which the different clusters could be illustrated (see Figure 1, right panel). Specifically, only 3 variables were needed to obtain our classification: apolipoprotein B, total cholesterol and triglycerides.

Conclusion: We showed that completely unsupervised ML techniques can uncover dyslipidaemia categories in lipoprotein profiles from a large patient population. The categories largely align with existing classifications based on prior knowledge of lipoprotein metabolism. Furthermore, few lipoprotein variables were required for categorisation (low-dimensional data), which could help determine which lipoproteins should be measured in a clinical setting. Further analysis of the differences between ML clusters and traditional classifications is needed and may enhance CVD risk management.
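The pipeline described in this abstract (choose K, cluster, summarise the clusters with a shallow tree, then compare against a reference classification) can be sketched roughly as follows. This is a minimal illustration on synthetic data, not the authors' implementation: the abstract does not name its complexity measure, so the silhouette score is swapped in as an assumed selection criterion, a standard scikit-learn decision tree stands in for the probabilistic trees, plain Cohen's kappa replaces the relative variant, and the helper names `choose_k` and `map_clusters_to_reference` are hypothetical.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import cohen_kappa_score, silhouette_score
from sklearn.tree import DecisionTreeClassifier


def choose_k(X, candidates, seed=0):
    """Return the K with the highest silhouette score (an assumed criterion)."""
    scores = {}
    for k in candidates:
        labels = KMeans(n_clusters=k, n_init=10, random_state=seed).fit_predict(X)
        scores[k] = silhouette_score(X, labels)
    return max(scores, key=scores.get), scores


def map_clusters_to_reference(cluster_labels, reference_labels):
    """Relabel each cluster with the most common reference category inside it."""
    mapped = np.empty_like(reference_labels)
    for c in np.unique(cluster_labels):
        members = cluster_labels == c
        values, counts = np.unique(reference_labels[members], return_counts=True)
        mapped[members] = values[np.argmax(counts)]
    return mapped


# Synthetic stand-in for three standardised lipid variables
# (for example apolipoprotein B, total cholesterol, triglycerides).
centers = [[0, 0, 0], [6, 0, 0], [0, 6, 0], [0, 0, 6]]
X, reference = make_blobs(n_samples=600, centers=centers, cluster_std=0.5, random_state=0)

# 1. Pick K without using the reference labels, then cluster.
best_k, scores = choose_k(X, range(2, 8))
clusters = KMeans(n_clusters=best_k, n_init=10, random_state=0).fit_predict(X)

# 2. Fit a shallow tree to the cluster labels to get a compact description.
#    tree.predict_proba gives per-leaf class frequencies.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, clusters)
fidelity = tree.score(X, clusters)

# 3. Compare the clusters with the reference categories.
mapped = map_clusters_to_reference(clusters, reference)
agreement = float(np.mean(mapped == reference))
kappa = cohen_kappa_score(reference, mapped)

print(f"K={best_k}, tree fidelity={fidelity:.2f}, agreement={agreement:.2f}, kappa={kappa:.2f}")
```

The majority-vote mapping is one simple way to match clusters to categories when there are more clusters than categories, as with the 14 clusters reported above; other matching schemes would give different agreement figures.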

Abstract

Study question: Can we decipher the underlying visual properties that drive image-based AI embryo classification models to assist clinical decisions and biological discovery?

Summary answer: Our framework identified which annotated and non-explicitly annotated phenotypes affect model predictions and ranked their importance. These discoveries were aligned with known blastocyst quality criteria.

What is known already: Deep learning models have shown great promise for complex pattern recognition when applied to embryo images. The success of these models relies on their ability to perform non-linear optimization of feature extraction during model construction. However, this entangles multiple classification-driving image properties, producing ‘black-box’ systems that lack interpretability and do not earn user confidence or trust. Therefore, there is an urgent need for an interpretability method that can uncover the semantic image properties that contribute to the predictions of ‘black-box’ embryo image-based AI classification models, to assist in blastocyst selection.

Study design, size, duration: 11,211 time-lapse videos were retrospectively collected from three IVF centers. A deep convolutional neural network was first trained to discriminate high-quality from low-quality blastocysts. We then developed DISCOVER, a general-purpose interpretability method designed to discover the underlying visual properties driving the classifier. DISCOVER encodes an image into an interpretable lower-dimensional representation that is correlated with the classifier and in which each dimension captures a distinct phenotype.

Participants/materials, setting, methods: Encoding embryo images into low-dimensional representations enables interpretability both globally and locally. Globally, the embryo images are synthetically altered by amplifying subtle properties that affect the classification decision. With our method this can be done one property at a time, thereby separating confounding properties. By evaluating the altered images, embryologists can decipher their meaning. Locally, each of the discovered properties can be ranked by its importance for a specific embryo instance.

Main results and the role of chance: Using DISCOVER, we interpreted the features driving the classification model. We quantitatively linked the top two classification features to blastocyst size (a proxy for degree of expansion and development) and trophectoderm quality, using embryologists' evaluation and annotations. We then asked whether DISCOVER can identify non-explicitly annotated latent features that encode morphologic properties not defined by the ASEBIR/Gardner criteria. An expert embryologist interpreted the third top classification feature to be the blastocoel. DISCOVER interpreted high-quality embryos as having denser and more granular blastocoelic regions, suggesting that this change in blastocoel appearance is one of the encoded classification-driving morphologic properties. This visualization indicates that parameters of the blastocoel beyond its volume expansion are associated with its quality. We showed how embryo properties can be weighted differently by the classifier on a per-embryo basis, giving clinical insight into which properties influence the classification of a specific instance. These results indicate that DISCOVER enables expert-in-the-loop interpretation of the classification model both globally, by discovering the main properties driving the classifier overall, and locally, by providing a per-instance explanation.

Limitations, reasons for caution: DISCOVER failed to identify the inner cell mass (ICM) as a classification-driving feature in its latent representation, even though the ICM was explicitly used to label the data for training the classification model. It is possible that other properties collectively contained the discriminative information encoded in the ICM.

Wider implications of the findings: This analysis demonstrates the feasibility of providing interpretability for biomedical image-based classification models for clinical use in the IVF clinic.

Trial registration number: Not applicable
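The global and local interpretation steps described in this abstract (alter one latent dimension at a time, then rank dimensions by their effect on a given instance) can be illustrated with a small sketch. This is not DISCOVER itself, whose architecture is not specified here: PCA is swapped in as an assumed stand-in for the encoder and decoder, logistic regression stands in for the deep classifier, the data are random, and `traverse` and `rank_dimensions` are hypothetical helper names.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Hypothetical stand-in data: 200 flattened 8x8 "images" with binary quality labels.
images = rng.normal(size=(200, 64))
labels = (images[:, :4].sum(axis=1) > 0).astype(int)

# Stand-ins for the trained classifier and the learned low-dimensional encoding.
classifier = LogisticRegression(max_iter=1000).fit(images, labels)
encoder = PCA(n_components=5, random_state=0).fit(images)


def traverse(image, dim, step):
    """Decode a copy of `image` whose latent coordinate `dim` is shifted by `step`."""
    z = encoder.transform(image[None, :])
    z[0, dim] += step
    return encoder.inverse_transform(z)[0]


def rank_dimensions(image, step=3.0):
    """Rank latent dimensions by how far shifting each one moves the classifier score."""
    reconstruction = traverse(image, 0, 0.0)  # unaltered reconstruction as the baseline
    base = classifier.predict_proba(reconstruction[None, :])[0, 1]
    effects = []
    for dim in range(encoder.n_components_):
        altered = traverse(image, dim, step)
        score = classifier.predict_proba(altered[None, :])[0, 1]
        effects.append(abs(score - base))
    effects = np.array(effects)
    order = np.argsort(effects)[::-1]  # most influential dimension first
    return order, effects


# Local explanation for one instance: which latent dimension matters most?
order, effects = rank_dimensions(images[0])
print("dimensions ranked by effect:", order.tolist())
```

In the workflow the abstract describes, the altered images produced by a step like `traverse` would be shown to embryologists, who assign a meaning to each dimension; the sketch covers only the mechanical part.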

Articles published on Interpretable Representation

TorchSISSO: A PyTorch-based implementation of the sure independence screening and sparsifying operator for efficient and interpretable model discovery

A cell atlas foundation model for scalable search of similar human cells.

Mathematical foundation of High-Dimensional Data Analysis: Leveraging Topology and Geometry for Enhanced Model Interpretability in AI

Synchronization-Inspired Interpretable Neural Networks.

Chimeric U-Net – Modifying the standard U-Net towards explainability

A novel classification of dyslipidaemia through the analysis of five million lipid profiles: an unsupervised machine learning approach

Multi-adversarial autoencoders: Stable, faster and self-adaptive representation learning

Interpretable Representation and Customizable Retrieval of Traffic Congestion Patterns Using Causal Graph-Based Feature Associations

Graph Fourier transform for spatial omics representation and analyses of complex organs

Deciphering 3'UTR Mediated Gene Regulation Using Interpretable Deep Representation Learning.

Interpretable representation learning for 3D multi-piece intracellular structures using point clouds.

Unsupervised Anomaly Detection via Nonlinear Manifold Learning

Leveraging Brain Modularity Prior for Interpretable Representation Learning of fMRI.

Developing a fair and interpretable representation of the clock drawing test for mitigating low education and racial bias

Improved diabetic retinopathy severity classification using squeeze-and-excitation and sparse light weight multi-level attention u-net with transfer learning from xception.

Multiview representation learning for identification of novel cancer genes and their causative biological mechanisms.

Interpretable wind power forecasting combining seasonal-trend representations learning with temporal fusion transformers architecture

Unsupervised Learning of Disentangled and Interpretable Representations of Material Appearance

Santiago Jiménez - Unsupervised Learning of Disentangled and Interpretable Representations of Material Appearance

P-149 A visual interpretability method to unbox ‘black-box’ deep learning image-based classification of embryo properties
