Photometric redshift estimation with convolutional neural networks and galaxy images: Case study of resolving biases in data-driven methods

Q Lin,S Arnouts,J Pasquet,R Ait Ouahmed,O Ilbert,D Fouchez,M Treyer

doi:10.1051/0004-6361/202142751

Abstract

Deep-learning models have been increasingly exploited in astrophysical studies, but these data-driven algorithms are prone to producing biased outputs that are detrimental for subsequent analyses. In this work, we investigate two main forms of biases: class-dependent residuals, and mode collapse. We do this in a case study, in which we estimate photometric redshift as a classification problem using convolutional neural networks (CNNs) trained with galaxy images and associated spectroscopic redshifts. We focus on point estimates and propose a set of consecutive steps for resolving the two biases based on CNN models, involving representation learning with multichannel outputs, balancing the training data, and leveraging soft labels. The residuals can be viewed as a function of spectroscopic redshift or photometric redshift, and the biases with respect to these two definitions are incompatible and should be treated individually. We suggest that a prerequisite for resolving biases in photometric space is resolving biases in spectroscopic space. Experiments show that our methods can better control biases than benchmark methods, and they are robust in various implementing and training conditions with high-quality data. Our methods hold promises for future cosmological surveys that require a good constraint of biases, and they may be applied to regression problems and other studies that make use of data-driven models. Nonetheless, the bias-variance tradeoff and the requirement of sufficient statistics suggest that we need better methods and optimized data usage strategies.

Highlights

Estimating galaxy redshifts is crucial for studies of galaxy evolution and cosmology
Discussion of the bias behaviors Following our discussions of correcting for zspec-dependent biases, we investigated the behaviors of biases and the performance of our methods by controlling the convolutional neural networks (CNNs) models with varying implementing and training conditions
We analyzed two biases that are generally present in data-driven methods, namely class-dependent residuals and mode collapse, which are two effects imposed by the prior of training data and the model implementation

Summary

Introduction

Estimating galaxy redshifts is crucial for studies of galaxy evolution and cosmology. While redshifts obtained by spectroscopic measurements (spec-z) typically have high accuracy, they are highly time intensive and not ideal for the extremely large data sizes from ongoing or future imaging surveys There are two broad categories of methods for estimating photometric redshifts for individual galaxies: template-fitting methods, and data-driven methods (see Salvato et al 2019 for a review). Template-fitting methods model the galaxy spectral energy distribution (SED) and infer redshifts by fitting the galaxy photometry based on the SED templates (e.g., Arnouts et al 1999; Feldmann et al 2006; Ilbert et al 2006; Greisel et al 2015; Leistedt et al 2019)

Objectives

Findings

Methods

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Astronomy & Astrophysics	Publication Date: Jun 1, 2022
Citations: 7	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Photometric redshift estimation with convolutional neural networks and galaxy images: Case study of resolving biases in data-driven methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Astronomy & Astrophysics

Lead the way for us

Similar Papers

Development of hybrid models based on deep learning and optimized machine learning algorithms for brain tumor Multi-Classification
Muhammed Celik ... Ozkan Inik
Expert Systems with Applications | VOL. 238
Muhammed Celik, et. al.Muhammed Celik ... Ozkan Inik
18 Oct 2023
Expert Systems with Applications | VOL. 238

Spectroscopic needs for imaging dark energy experiments
Jeffrey A Newman ...
Astroparticle Physics | VOL. 63
Jeffrey A Newman, et. al.Jeffrey A Newman ...
05 Jul 2014
Astroparticle Physics | VOL. 63

Short-term water quality variable prediction using a hybrid CNN–LSTM deep learning model
Rahim Barzegar ... Jan Adamowski
Stochastic Environmental Research and Risk Assessment | VOL. 34
Rahim Barzegar, et. al.Rahim Barzegar ... Jan Adamowski
01 Feb 2020
Stochastic Environmental Research and Risk Assessment | VOL. 34

Hyperspectral signature-band extraction and learning: an example of sugar content prediction of Syzygium samarangense
Yung-Jhe Yan ... Mang Ou-Yang
Scientific Reports | VOL. 13
Yung-Jhe Yan, et. al.Yung-Jhe Yan ... Mang Ou-Yang
12 Sep 2023
Scientific Reports | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Photometric redshift estimation with convolutional neural networks and galaxy images: Case study of resolving biases in data-driven methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Astronomy &amp; Astrophysics

More From: Astronomy & Astrophysics