Low Intrinsic Dimensionality Research Articles

Abstract

Background: Dyslipidaemia encompasses a wide range of lipoprotein disorders, categorised using one of two classifications (Fredrickson-Levy [FL] or Sniderman).(1,2) However, both classifications are criticised for relying on incomplete knowledge of lipoprotein metabolism, especially given the emergence of novel treatment options and variation in individual treatment responses.(3) Clustering, an unsupervised machine learning (ML) technique that can process a wide range of variables, has the potential to unmask patient groups with distinct molecular profiles and unique therapeutic targets that can inform more effective prevention strategies for cardiovascular disease (CVD).(4)

Aim: We aimed to use unsupervised ML algorithms to discover intrinsic dyslipidaemia categories from lipoprotein measurements, identify the components of lipid panels that are necessary for classification, and analyse the similarities between the newly formed clusters and the FL and Sniderman classifications.

Methods: Lipid profiles of 5,080,248 patients were obtained from the ‘Very Large Database of Lipids’. This yielded up to 78 blood components per patient, including at least 31 lipoprotein variables. The analysis involved unsupervised K-means clustering, with the value of K and the subset of variables both optimised in an unsupervised manner using a suitable measure of complexity. We then interpreted our clusters using probabilistic decision trees to provide compact and interpretable representations. Finally, we compared the clusters with the Sniderman and FL categories.

Results: In a completely unsupervised fashion, we identified 14 clusters that could be matched to Sniderman categories. The confusion matrix showed total agreement of 76% (see Figure 1, left panel), a relative Cohen’s kappa of 0.78 (the relative version captures accuracy on categories containing smaller numbers of patient profiles) and an accuracy of 96% on the small Type III class. Similar results were observed when matching to FL types. We accurately represented our clusters using probabilistic decision trees of small depth (see Figure 2). We discovered that the data had low intrinsic dimension and a manifold-like structure in which the different clusters could be illustrated (see Figure 1, right panel). Specifically, only 3 variables were needed to obtain our classification: apolipoprotein B, total cholesterol and triglycerides.

Conclusion: We showed that completely unsupervised ML techniques can uncover dyslipidaemia categories in lipoprotein profiles from a large patient population. The categories largely align with existing classifications based on prior knowledge of lipoprotein metabolism. Furthermore, few lipoprotein variables were required for categorisation (low-dimensional data), which could aid in determining which lipoproteins should be measured in a clinical setting. Further analysis of the differences between ML clusters and traditional classifications is needed and may enhance CVD risk management.
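The clustering-and-comparison workflow described in this abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic, well-separated points in three unnamed coordinates (stand-ins for a three-variable lipid panel, not real measurements), the group names "A", "B" and "C" are hypothetical reference categories, the deterministic farthest-point initialisation is a convenience and not taken from the study, and the kappa computed is the standard Cohen's kappa, not the relative version the authors report.

```python
import random
from collections import Counter


def sq_dist(p, q):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def farthest_point_init(points, k):
    """Deterministic initialisation: start at the first point, then repeatedly
    add the point farthest from all centers chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(sq_dist(p, c) for c in centers)))
    return centers


def kmeans(points, k, max_iter=100):
    """Plain K-means (Lloyd's algorithm): alternate assignment and mean update."""
    centers = farthest_point_init(points, k)
    labels = []
    for _ in range(max_iter):
        new_labels = [min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in points]
        if new_labels == labels:  # assignments stable, so the algorithm has converged
            break
        labels = new_labels
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centers[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return labels, centers


def majority_map(labels, reference):
    """Relabel each cluster with the reference category most common inside it."""
    mapping = {}
    for cluster in set(labels):
        votes = Counter(r for lab, r in zip(labels, reference) if lab == cluster)
        mapping[cluster] = votes.most_common(1)[0][0]
    return [mapping[lab] for lab in labels]


def cohen_kappa(a, b):
    """Standard (unweighted) Cohen's kappa between two labelings."""
    n = len(a)
    observed = sum(x == y for x, y in zip(a, b)) / n
    count_a, count_b = Counter(a), Counter(b)
    expected = sum(count_a[c] * count_b[c] for c in set(a) | set(b)) / n ** 2
    return (observed - expected) / (1 - expected)


# Synthetic data: three hypothetical reference groups in three toy coordinates.
rng = random.Random(0)
group_centers = {"A": (0.0, 0.0, 0.0), "B": (10.0, 0.0, 0.0), "C": (0.0, 10.0, 0.0)}
points, reference = [], []
for name, center in group_centers.items():
    for _ in range(50):
        points.append(tuple(c + rng.uniform(-1.0, 1.0) for c in center))
        reference.append(name)

labels, centers = kmeans(points, k=3)
predicted = majority_map(labels, reference)
agreement = sum(p == r for p, r in zip(predicted, reference)) / len(reference)
kappa = cohen_kappa(predicted, reference)
print(f"agreement={agreement:.2f}, kappa={kappa:.2f}")
```

Because the toy groups are far apart relative to their spread, the clusters recover the reference groups cleanly; real lipid profiles would overlap far more, which is why the reported agreement is below 100%.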

Radiation therapy treatment planning can be viewed as an iterative hyperparameter tuning process to balance conflicting clinical goals. In this work, we investigated the performance of modern Bayesian optimization (BO) methods on automated treatment planning problems in high-dimensional settings. Twenty locally advanced rectal cancer patients treated with intensity-modulated radiation therapy (IMRT) were retrospectively selected as test cases. The adjustable planning parameters included both dose objectives and their corresponding weights. We implemented an automated treatment planning framework and tested the performance of two BO methods on the treatment planning task: one standard BO method (Gaussian Process with Expected Improvement [GPEI]) and one BO method dedicated to high-dimensional problems (Sparse Axis Aligned Subspace BO [SAAS-BO]). Another derivative-free method (Nelder-Mead simplex search) and the random tuning method were also included as baselines. The plan quality and planning efficiency of the four automated methods were compared with the clinical plans in terms of target coverage and organs at risk (OAR) sparing. The predictive models of the two BO methods were compared to analyze their different search patterns. For the target structures, the SAAS-BO plans achieved comparable hot spot control ( ) and homogeneity ( ) with the clinical plans, significantly better than the GPEI and Nelder-Mead plans ( ). Both SAAS-BO and GPEI plans significantly outperformed the clinical plans in conformity and dose spillage ( ). Compared with the clinical plans, the treatment plans generated by the four automated methods all reduced the evaluated dosimetric indices for the femoral head and the bladder. The Nelder-Mead plans achieved plan quality scores similar to those of the BO plans, but exhibited poorer control of the target hot spot and dose spillage.
The analysis of the underlying predictive models showed that both BO methods identified similar sensitive planning parameters. This work implemented a BO-based hyperparameter tuning framework for automated treatment planning. Both tested BO methods were able to produce high-quality treatment plans and reduce the workload of treatment planners. The model analysis also confirmed the intrinsic low dimensionality of the tested treatment planning problems.
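To make the idea of a tuning problem with low intrinsic dimensionality concrete, here is a minimal sketch of the random tuning baseline on a toy objective. All names and numbers are assumptions for illustration: `plan_penalty` is a hypothetical stand-in for a plan-quality score (lower is better), the 20 parameters and the two "sensitive" indices in `ACTIVE` are invented, and nothing here reproduces the study's planning framework or its BO methods.

```python
import random

N_PARAMS = 20     # hypothetical number of tunable planning parameters
ACTIVE = (3, 11)  # hypothetical indices of the only parameters that matter


def plan_penalty(params):
    """Toy score (lower is better) that depends on just two of the parameters,
    so the 20-dimensional problem has an effective dimension of 2."""
    return (params[ACTIVE[0]] - 0.7) ** 2 + (params[ACTIVE[1]] - 0.2) ** 2


def random_tuning(objective, n_params, n_trials, seed=0):
    """Random tuning baseline: sample parameters uniformly in [0, 1) and keep the best."""
    rng = random.Random(seed)
    best_params, best_score = None, float("inf")
    history = []
    for _ in range(n_trials):
        candidate = [rng.random() for _ in range(n_params)]
        score = objective(candidate)
        history.append(score)
        if score < best_score:
            best_params, best_score = candidate, score
    return best_params, best_score, history


best_params, best_score, history = random_tuning(plan_penalty, N_PARAMS, n_trials=200)

# Overwrite every inactive parameter; the score cannot change, which is what
# low intrinsic dimensionality means for this toy objective.
perturbed = [v if i in ACTIVE else 0.5 for i, v in enumerate(best_params)]
print(f"best score: {best_score:.4f}")
print(f"score after changing the 18 inactive parameters: {plan_penalty(perturbed):.4f}")
```

In a problem like this, a method that learns which parameters are sensitive only has to search a small subspace, whereas random tuning spends its samples across all 20 dimensions.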


Articles published on Low Intrinsic Dimensionality

A novel classification of dyslipidaemia through the analysis of five million lipid profiles: an unsupervised machine learning approach

Computed tomography of chemiluminescence using a data-driven sparse sensing framework

Towards Metric DBSCAN: Exact, Approximate, and Streaming Algorithms

Neural Network Approximation for Pessimistic Offline Reinforcement Learning

Simple Orthogonal Graph Representation Learning (Student Abstract)

Self-Assembly of Delta-Formamidinium Lead Iodide Nanoparticles to Nanorods: Study of Memristor Properties and Resistive Switching Mechanism.

Derivative-Informed Neural Operator: An efficient framework for high-dimensional parametric derivative learning

Symplectic model reduction of Hamiltonian systems using data-driven quadratic manifolds

Multi‐fidelity data fusion through parameter space reduction with applications to automotive engineering

Deep estimation for Q⁎ with minimax Bellman error minimization

A Data-dependent Approach for High-dimensional (Robust) Wasserstein Alignment

Correction to: Just Least Squares: Binary Compressive Sampling with Low Generative Intrinsic Dimension

HIPPYlib-MUQ: A Bayesian Inference Software Framework for Integration of Data with Complex Predictive Models under Uncertainty

Large-Scale Bayesian Optimal Experimental Design with Derivative-Informed Projected Neural Network

Just Least Squares: Binary Compressive Sampling with Low Generative Intrinsic Dimension

High-dimensional automated radiation therapy treatment planning via Bayesian optimization.

Inverse problems on low-dimensional manifolds

Bound-constrained global optimization of functions with low effective dimensionality using multiple random embeddings

Dynamics of Drosophila endoderm specification

Functional principal subspace sampling for large scale functional data analysis
