Nonlinear Dimensionality Reduction Method Research Articles

Geochemical data are usually high-dimensional data that could contain dozens of elements. Geochemical distribution patterns and anomalies related to mineralization and lithological features are always hidden in these high-dimensional data, which cannot be directly observed from the data. To solve this problem, a manifold learning-based uniform manifold approximation and projection (UMAP) method, was introduced to recognize mineralization-related geochemical anomalies from high-dimensional geochemical data in this study. The UMAP method is a nonlinear dimensionality reduction method, which is suitable for dimensionality reduction and visualization of high-dimensional data. A case study was conducted to demonstrate the advantages of the UMAP method for identifying ion-adsorbed rare-earth-element (REE) mineralization-related anomalies from high-dimensional data in the Nanling region, China. Factor analysis was used to determine ion-adsorbed REE mineralization-related element combination that consists of 10 elements. High-dimensional geochemical data were reduced to two dimensions based on the UMAP method. The results indicated that the UMAP method can effectively characterize the spatial distributions of ion-adsorbed REE mineralization-related anomalies by dimensionality reduction analysis and visualization analysis of high-dimensional geochemical data in the study area. To illustrate the superiority of the UMAP method, a comparative study was conducted between the UMAP and other three manifold learning methods, namely locally linear embedding (LLE), isometric feature mapping (Isomap) and t-distributed stochastic neighbor embedding (t-SNE). The performance of the four manifold learning methods was evaluated by receiver operating characteristic (ROC) curve and prediction-area (P-A) plot, showing that the performance of the UMAP method is superior to that of the LLE, Isomap and t-SNE methods in terms of recognizing ion-adsorbed REE mineralization-related anomalies and the spatial distributions of the REE-bearing geological bodies in the Nanling belt.

Read full abstract

Abstract [Introduction] Human epidermal growth factor receptor 2 (HER2), which is characterized by ERBB2 amplification, is one of the important markers for treatment decision related to breast cancer. Many targeted therapies for HER2-positive (immunohistochemistry [IHC] scores of 3+ or 2+ with an in situ hybridization [ISH] gene amplification) or HER2-low (IHC score 1+ or 2+ with no ISH gene amplification) breast cancers have been extensively developed thus far. Meanwhile, tumoral heterogeneity is considered one of the mechanisms for drug resistance; however, it has not been fully elucidated on each HER2 status at single-cell level. Therefore, an integrated analysis of the breast cancer single-cell gene expression data of both public datasets and our cohort was used to investigate tumor heterogeneity based on HER2 status. [Methods] We collected 21 and 6 scRNA-seq samples of primary breast cancer from the public Gene Expression Omnibus (GEO) and our institution, respectively. A total of 27 samples included HER2-positive cases (pure HER2 cases: estrogen receptor [ER]-negative/HER2-positive and Luminal-HER2 cases: ER-positive/HER2-positive) and Luminal cases (ER-positive/HER2-negative). These datasets were imported into R software version 4.2.0. and transformed into Seurat objects with the package Seurat version 4.3.0. UMAP plots, which is a non-linear dimension reduction method, were used for clustering analyses. Seurat in R was used to generate UMAP, feature, and violin plots. Additionally, we performed pathway enrichment analysis with the significant gene list from each cluster between the ERBB2-high and ERBB2-low groups. [Results] Clustering analysis revealed heterogeneous distribution in each gene related to breast cancer (ESR1, PGR, ERBB2, and MKI67). One of the clusters revealed high MKI67 expression. ERBB2 expressions were diffusely distributed on each cluster of pure HER2 cases; however, the expressions were considerably higher in one of the clusters in cases of Luminal-HER2. The ERBB2 expression in Luminal cases, which included HER2-low status, was lower than that in HER2-positive cases, albeit slight expression was observed. Cell proliferation factors, including IGF, IGF1R, and EGF, were included in the ERBB2-high expression group compared with the ERBB2-low expression group for pathway analysis. Further, we examined typical gene expressions which are associated with markers related to breast cancer, cancer stem cell, and epithelial-to-mesenchymal transition in each case and revealed heterogeneous patterns across patients. [Conclusion] Heterogeneous ERBB2 expression distribution was observed in HER2-positive cases, and slight ERBB2 expression was identified in the Luminal cases. Moreover, Luminal-HER2 cases could be considered a more heterogeneous subtype compared with pure HER2 cases. Additionally, gene expressions of typical gene markers varied across patients. These results indicated that breast cancer displays heterogeneous patterns on each HER2 status not only intra-tumoral heterogeneity but also inter-patient heterogeneity. Citation Format: Sho Shiino, Momoko Tokura, Jun Nakayama, Masayuki Yoshida, Akihiko Suto, Yusuke Yamamoto. Investigation of tumor heterogeneity using integrated single-cell RNA sequence data based on HER2 status in patients with breast cancer [abstract]. In: Proceedings of the AACR Special Conference in Cancer Research: Advances in Breast Cancer Research; 2023 Oct 19-22; San Diego, California. Philadelphia (PA): AACR; Cancer Res 2024;84(3 Suppl_1):Abstract nr B063.

Read full abstract

Nonlinear Dimensionality Reduction Method Research Articles

Related Topics

Articles published on Nonlinear Dimensionality Reduction Method

Wide Area VISTA Extra-galactic Survey (WAVES): Unsupervised star-galaxy separation on the WAVES-Wide photometric input catalogue using UMAP and hdbscan

Calibrating dimension reduction hyperparameters in the presence of noise.

Mesoscopic structure graphs for interpreting uncertainty in non-linear embeddings

Manifold learning-based UMAP method for geochemical anomaly identification

Exploring visual quality of multidimensional time series projections

Learnable faster kernel-PCA for nonlinear fault detection: Deep autoencoder-based realization

Interpretation of autoencoder-learned collective variables using Morse-Smale complex and sublevelset persistent homology: An application on molecular trajectories.

Identification of control equations using low-dimensional flow representations of pitching airfoil

Automatic Active Lesion Tracking in Multiple Sclerosis Using Unsupervised Machine Learning.

Nonlinear dimensionality reduction with q-Gaussian distribution

Predicting S. aureus antimicrobial resistance with interpretable genomic space maps.

Abstract B063: Investigation of tumor heterogeneity using integrated single-cell RNA sequence data based on HER2 status in patients with breast cancer

Combustion instability analysis in an ethylene-fueled scramjet combustor under various fuel penetration height conditions using an image-based nonlinear dimensionality reduction method

Soft sensor for predicting indoor PM2.5 concentration in subway with adaptive boosting deep learning model

Model-based evaluation of spatiotemporal data reduction methods with unknown ground truth through optimal visualization and interpretability metrics.

Trade-offs in the latent representation of microstructure evolution

A software defect prediction method based on learnable three-line hybrid feature fusion

Nonlinear dimensionality reduction methods for potentiometric multisensor systems data analysis

Absence of enterotypes in the human gut microbiomes reanalyzed with non-linear dimensionality reduction methods.

Laplacian-based Cluster-Contractive t-SNE for High-Dimensional Data Visualization

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Nonlinear Dimensionality Reduction Method Research Articles

Related Topics

Articles published on Nonlinear Dimensionality Reduction Method

Wide Area VISTA Extra-galactic Survey (WAVES): Unsupervised star-galaxy separation on the WAVES-Wide photometric input catalogue using UMAP and hdbscan

Calibrating dimension reduction hyperparameters in the presence of noise.

Mesoscopic structure graphs for interpreting uncertainty in non-linear embeddings

Manifold learning-based UMAP method for geochemical anomaly identification

Exploring visual quality of multidimensional time series projections

Learnable faster kernel-PCA for nonlinear fault detection: Deep autoencoder-based realization

Interpretation of autoencoder-learned collective variables using Morse-Smale complex and sublevelset persistent homology: An application on molecular trajectories.

Identification of control equations using low-dimensional flow representations of pitching airfoil

Automatic Active Lesion Tracking in Multiple Sclerosis Using Unsupervised Machine Learning.

Nonlinear dimensionality reduction with q-Gaussian distribution

Predicting S. aureus antimicrobial resistance with interpretable genomic space maps.

Abstract B063: Investigation of tumor heterogeneity using integrated single-cell RNA sequence data based on HER2 status in patients with breast cancer

Combustion instability analysis in an ethylene-fueled scramjet combustor under various fuel penetration height conditions using an image-based nonlinear dimensionality reduction method

Soft sensor for predicting indoor PM2.5 concentration in subway with adaptive boosting deep learning model

Model-based evaluation of spatiotemporal data reduction methods with unknown ground truth through optimal visualization and interpretability metrics.

Trade-offs in the latent representation of microstructure evolution

A software defect prediction method based on learnable three-line hybrid feature fusion

Nonlinear dimensionality reduction methods for potentiometric multisensor systems data analysis

Absence of enterotypes in the human gut microbiomes reanalyzed with non-linear dimensionality reduction methods.

Laplacian-based Cluster-Contractive t-SNE for High-Dimensional Data Visualization