Comparison Of Dimensionality Reduction Techniques Research Articles

We have used a Ligand Knowledge Base for bidentate P,P-donor ligands of potential interest to homogeneous catalysis to compare three dimensionality reduction techniques, namely Principal Component Analysis (PCA), Uniform Manifold Approximation and Projection (UMAP) and t-distributed Stochastic Neighbor Embedding (t-SNE). While our previous work on Ligand Knowledge Bases has focused on PCA, here we compare this approach with more recently-published approaches and assess the information retention, visualization, clustering and interpretability which can be achieved for each approach. We find that potential advantages of t-SNE are not realized with a database of the current size (275 entries), and that there is a degree of complementarity between PCA and UMAP. The statistics underlying PCA rely on linear relationships, making interpretation of the resulting plots comparatively straightforward. Since much of chemistry relies on linear structure-property relationships and low-dimensional visualization, the explainability and information retention achieved is attractive. UMAP proved more challenging to interpret, but achieved clear clustering which was often chemically meaningful, and it would be a useful approach for ensuring that distinct subsets of compounds are sampled in a machine-learning context. This analysis also highlighted that the tunability of catalysis achieved through ligand exchange maps well onto some areas of chemical space where closely related ligands cluster, while others represent outliers; these arise from different combinations of steric and electronic effects which chemists will find intuitive.

Read full abstract

Fluorescence spectroscopy shows promise as a tool for monitoring water quality due to its real-time capabilities and sensitive detection of several compounds of interest. Previous work has shown the possible use of fluorescence to detect and quantify low levels of polycyclic aromatic hydrocarbons and fluorescing pesticides. However, the fluorescence-based contaminant detection models are highly source-specific and require significant effort and resources to build and calibrate them for each source water of interest. In this study, the novel application of data processing techniques was investigated to enable the transfer of fluorescence detection models from one water source to another. A contaminant detection model from a relatively consistent and low organic background source (Lake Ontario, TOC: 2.07–2.26 mg L−1) was transferred to the Otonabee River, which has higher organic concentrations and distinct characteristics (TOC: 5.20–5.66 mg L−1). Only a few additional fluorescence spectra of the background water quality and contaminants of interest were required to successfully transfer the model, without the need for labelled samples in the new source. Notable differences in peak location and spectral shape of identical compounds were found in source-specific models between the two water sources, implying variability in fluorescence signals resulting from environmental conditions. Despite the impact of environmental conditions, features identified by principal component analysis (PCA) and an autoencoder produced sensitive transferred models capable of addressing the spatial and temporal source diversity with mean absolute error (MAE) < 0.5 μg L−1 for quantification of PAHs and pesticides at concentrations between 0.1 and 7 μg L−1. The results of this study show the potential of the cross-source transferred model to be implemented in a wide range of environmental conditions.

Read full abstract

Comparison Of Dimensionality Reduction Techniques Research Articles

Articles published on Comparison Of Dimensionality Reduction Techniques

Comparison of dimensionality reduction techniques for the visualisation of chemical space in organometallic catalysis

Comparison of dimensionality reduction techniques for multi-variable spatiotemporal flow fields

Comparison of dimensionality reduction techniques for cross-source transfer of fluorescence contaminant detection models

Unsupervised Adaptation for High-Dimensional with Limited-Sample Data Classification Using Variational Autoencoder

Evaluation of Dimensionality Reduction Techniques for Load Profiling Application in Smart Grid Environment

Accuracy comparison of dimensionality reduction techniques to determine significant features from IMU sensor-based data to diagnose vestibular system disorders

Efficient Information Retrieval through Comparison of Dimensionality Reduction Techniques with Clustering Approach

Comparison of dimensionality reduction techniques for the fault diagnosis of mono block centrifugal pump using vibration signals

Radial basis function neural network based comparison of dimensionality reduction techniques for effective bearing diagnostics

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Comparison Of Dimensionality Reduction Techniques Research Articles

Articles published on Comparison Of Dimensionality Reduction Techniques

Comparison of dimensionality reduction techniques for the visualisation of chemical space in organometallic catalysis

Comparison of dimensionality reduction techniques for multi-variable spatiotemporal flow fields

Comparison of dimensionality reduction techniques for cross-source transfer of fluorescence contaminant detection models

Unsupervised Adaptation for High-Dimensional with Limited-Sample Data Classification Using Variational Autoencoder

Evaluation of Dimensionality Reduction Techniques for Load Profiling Application in Smart Grid Environment

Accuracy comparison of dimensionality reduction techniques to determine significant features from IMU sensor-based data to diagnose vestibular system disorders

Efficient Information Retrieval through Comparison of Dimensionality Reduction Techniques with Clustering Approach

Comparison of dimensionality reduction techniques for the fault diagnosis of mono block centrifugal pump using vibration signals

Radial basis function neural network based comparison of dimensionality reduction techniques for effective bearing diagnostics