Abstract
Algorithmic feature learners provide high-dimensional vector representations for non-matrix structured data, such as image or text collections. Low-dimensional projections derived from these representations, called embeddings, are often used to explore variation in these data. However, it is not clear how to assess embedding uncertainty. We adapt methods developed for bootstrapping principal components analysis to the setting where features are algorithmically derived from non-matrix data. Through simulations, we empirically compare the resulting confidence areas while varying factors that influence feature learning and the bootstrap, such as feature-learning algorithm complexity and bootstrap sample size. We illustrate the proposed approaches on a spatial proteomics dataset, where we observe that embedding precision is not uniform across tissue types. Code, data, and pretrained models are available online as an R compendium in the supplementary materials.
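The general idea of bootstrapping an embedding can be sketched as follows: resample observations, recompute the principal-components projection on each resample, align each bootstrap embedding to a reference via orthogonal Procrustes rotation, and summarize per-point spread. This is a minimal Python/numpy sketch of that generic recipe, not the paper's R implementation; all function names and the toy data are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

def pca_scores(X, k=2):
    # Center and project onto the top-k right singular vectors.
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T, Vt[:k]

def procrustes_align(scores, ref):
    # Orthogonal rotation R minimizing ||scores @ R - ref||_F
    # (standard solution R = U V^T from the SVD of scores^T ref).
    U, _, Vt = np.linalg.svd(scores.T @ ref)
    return scores @ (U @ Vt)

# Toy stand-in for learned features: n samples, p-dimensional representations.
n, p, B = 100, 20, 200
X = rng.normal(size=(n, p))

ref, _ = pca_scores(X)
boot = []
for _ in range(B):
    idx = rng.integers(0, n, n)              # bootstrap resample of rows
    _, V = pca_scores(X[idx])
    # Project all points onto the bootstrap loadings, then align to reference
    # so that sign flips / rotations of the PCs do not inflate the spread.
    all_scores = (X - X[idx].mean(axis=0)) @ V.T
    boot.append(procrustes_align(all_scores, ref))

boot = np.stack(boot)                        # shape (B, n, 2)
sd = boot.std(axis=0)                        # per-point embedding spread
```

The per-point standard deviations (or 2D quantile contours of `boot[:, i, :]`) give confidence areas around each embedded point; in the feature-learning setting, the paper's question is how such areas behave when `X` itself comes from an algorithmic feature extractor rather than a fixed data matrix.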