Normalized to: Frontera-Pons, J.
[1]
oai:arXiv.org:1903.04949 [pdf] - 1882572
Representation learning for automated spectroscopic redshift estimation
Submitted: 2019-03-12
Determining the radial positions of galaxies to high accuracy depends on
the correct identification of salient features in their spectra. Classical
techniques for spectroscopic redshift estimation make use of template matching
with cross-correlation. These templates are usually constructed from empirical
spectra or simulations based on the modeling of local galaxies. We propose two
new spectroscopic redshift estimation schemes based on representation learning
for galaxy spectra, using either dictionary learning for sparse representation
or denoising autoencoders. We investigate how these
representations impact redshift estimation. These methods have been tested on
realistic simulated galaxy spectra, with photometry modelled after the Large
Synoptic Survey Telescope (LSST) and spectroscopy reproducing properties of the
Sloan Digital Sky Survey (SDSS). They were compared to Darth Fader, a robust
technique that extracts line features and estimates redshift through
eigentemplate cross-correlation. We show that both dictionary learning and
denoising autoencoders provide improved accuracy and reliability across all
signal-to-noise regimes and galaxy types. The representation learning framework
for spectroscopic redshift analysis introduced in this work offers high
performance in feature extraction and redshift estimation, improving on a
classical eigentemplate approach. Such performance is a necessity for
next-generation
galaxy surveys, and we demonstrate a successful application in realistic
simulated survey data.
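
The classical template-matching baseline mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation, nor the Darth Fader algorithm. It assumes that the observed spectrum and a single rest-frame template share one uniform log10-wavelength grid, so that a redshift z becomes a pixel shift of log10(1 + z) / dloglam. The names estimate_redshift, toy_template and dloglam, and the synthetic two-line spectrum, are hypothetical and serve only as illustration.

```python
import numpy as np


def estimate_redshift(observed, template, dloglam):
    """Return the redshift whose pixel lag maximizes the cross-correlation.

    Assumption: both spectra are sampled on the same uniform
    log10-wavelength grid with step `dloglam`.
    """
    # Subtract the mean so the overall flux level does not dominate.
    obs = observed - observed.mean()
    tpl = template - template.mean()
    # In "full" mode, output index len(tpl) - 1 corresponds to zero lag.
    xcorr = np.correlate(obs, tpl, mode="full")
    lag = int(np.argmax(xcorr)) - (len(tpl) - 1)
    # A lag of `lag` pixels means log10(1 + z) = lag * dloglam.
    return 10.0 ** (lag * dloglam) - 1.0


def toy_template(n_pix=2000):
    """Hypothetical rest-frame template: two Gaussian emission lines."""
    pix = np.arange(n_pix)
    lines = [(500, 1.0), (800, 0.6)]  # (pixel centre, amplitude)
    return sum(a * np.exp(-0.5 * ((pix - c) / 3.0) ** 2) for c, a in lines)


dloglam = 1e-4                       # assumed grid step in log10(wavelength)
template = toy_template()
observed = np.roll(template, 300)    # synthetic spectrum shifted by 300 pixels
z_hat = estimate_redshift(observed, template, dloglam)
```

The resolution of this estimate is limited to one pixel of the log-wavelength grid. A practical pipeline would typically refine the peak and use several templates or eigentemplates rather than a single one.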
[2]
oai:arXiv.org:1705.05620 [pdf] - 1583467
Unsupervised feature-learning for galaxy SEDs with denoising
autoencoders
Submitted: 2017-05-16
With the increasing number of deep multi-wavelength galaxy surveys, the
spectral energy distribution (SED) of galaxies has become an invaluable tool
for studying the formation of their structures and their evolution. In this
context, standard analysis relies on simple spectro-photometric selection
criteria based on a few SED colors. Although this fully supervised
classification has already yielded clear achievements, it is not optimal for
extracting relevant information from the data. In this article, we propose to
employ very recent
advances in machine learning, and more precisely in feature learning, to derive
a data-driven diagram. We show that the proposed approach based on denoising
autoencoders recovers the bimodality in the galaxy population in an
unsupervised manner, without using any prior knowledge of galaxy SED
classification. This technique has been compared to principal component
analysis (PCA) and to standard color/color representations. In addition,
preliminary results indicate that the learned representation captures
additional physically meaningful information, such as redshift dependence,
galaxy mass evolution and variation in specific star formation rate (sSFR).
PCA also yields an unsupervised representation that reflects physical
properties such as mass and sSFR, although it separates other characteristics
(bimodality, redshift evolution) less clearly than denoising autoencoders do.
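
The PCA baseline against which the autoencoder representation is compared can be sketched as follows. This is an illustration of PCA only, not of the denoising autoencoder and not of the authors' code. The function pca_project, the five-band synthetic "red" and "blue" populations, and all numerical values are hypothetical.

```python
import numpy as np


def pca_project(X, n_components=2):
    """Project the rows of X onto their leading principal components.

    Returns the low-dimensional scores, the component vectors, and the
    fraction of total variance carried by each retained component.
    """
    Xc = X - X.mean(axis=0)                      # centre each band
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:n_components]               # orthonormal rows
    explained = S[:n_components] ** 2 / np.sum(S ** 2)
    return Xc @ components.T, components, explained


# Hypothetical toy SEDs: two populations with different mean fluxes in 5 bands.
rng = np.random.default_rng(0)
red = rng.normal(loc=[1.0, 0.8, 0.6, 0.4, 0.2], scale=0.05, size=(100, 5))
blue = rng.normal(loc=[0.2, 0.4, 0.6, 0.8, 1.0], scale=0.05, size=(100, 5))
seds = np.vstack([red, blue])

scores, components, explained = pca_project(seds, n_components=2)
```

In this toy setting the first component simply tracks the difference between the two populations. Real SEDs are not expected to separate this cleanly under a linear projection, which is the limitation the abstract attributes to PCA.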