Normalized to: Almosallam, I.
[1]
oai:arXiv.org:2001.03621 [pdf] - 2029805
Evaluation of probabilistic photometric redshift estimation approaches
for LSST
Schmidt, S. J.;
Malz, A. I.;
Soo, J. Y. H.;
Almosallam, I. A.;
Brescia, M.;
Cavuoti, S.;
Cohen-Tanugi, J.;
Connolly, A. J.;
DeRose, J.;
Freeman, P. E.;
Graham, M. L.;
Iyer, K. G.;
Jarvis, M. J.;
Kalmbach, J. B.;
Kovacs, E.;
Lee, A. B.;
Longo, G.;
Morrison, C. B.;
Newman, J. A.;
Nourbakhsh, E.;
Nuss, E.;
Pospisil, T.;
Tranin, H.;
Wechsler, R. H.;
Zhou, R.;
Izbicki, R.;
Collaboration, The LSST Dark Energy Science
Submitted: 2020-01-10
Many scientific investigations of photometric galaxy surveys require redshift
estimates, whose uncertainty properties are best encapsulated by photometric
redshift (photo-z) posterior probability density functions (PDFs). Photo-z PDF
estimation methodologies abound, producing discrepant results
with no consensus on a preferred approach. We present the results of a
comprehensive experiment comparing twelve photo-z algorithms applied to mock
data produced for the Large Synoptic Survey Telescope (LSST) Dark Energy
Science Collaboration (DESC). By supplying perfect prior information, in the
form of the complete template library and a representative training set as
inputs to each code, we demonstrate the impact of the assumptions underlying
each technique on the output photo-z PDFs. In the absence of a notion of true,
unbiased photo-z PDFs, we evaluate and interpret multiple metrics of the
ensemble properties of the derived photo-z PDFs as well as traditional
reductions to photo-z point estimates. We report systematic biases and overall
over/under-breadth of the photo-z PDFs of many popular codes, which may
indicate avenues for improvement in the algorithms or implementations.
Furthermore, we raise attention to the limitations of established metrics for
assessing photo-z PDF accuracy; though we identify the conditional density
estimate (CDE) loss as a promising metric of photo-z PDF performance in the
case where true redshifts are available but true photo-z PDFs are not, we
emphasize the need for science-specific performance metrics.
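As a rough illustration of the CDE loss named above, the sketch below estimates it for PDFs tabulated on a redshift grid. It assumes the commonly used form of the estimator: the sample mean of the integral of the squared estimated density, minus twice the sample mean of the estimated density evaluated at the true redshift, which matches the true loss only up to an additive constant. The function name `cde_loss`, the evenly spaced grid, and the nearest-grid-point lookup are assumptions made for illustration and are not taken from the paper.

```python
def cde_loss(pdfs, z_grid, z_true):
    """Estimate the CDE loss (up to an additive constant) for gridded PDFs.

    pdfs   : list of lists; pdfs[i][j] is the estimated density of galaxy i at z_grid[j]
    z_grid : evenly spaced redshift grid (an assumption of this sketch)
    z_true : true redshifts, one per galaxy
    """
    dz = z_grid[1] - z_grid[0]
    n = len(pdfs)
    # First term: mean over galaxies of the integral of the squared density (Riemann sum).
    term1 = sum(sum(p * p for p in pdf) * dz for pdf in pdfs) / n
    # Second term: mean estimated density at the grid point nearest the true redshift.
    term2 = 0.0
    for pdf, zt in zip(pdfs, z_true):
        j = min(range(len(z_grid)), key=lambda k: abs(z_grid[k] - zt))
        term2 += pdf[j]
    term2 /= n
    return term1 - 2.0 * term2
```

Lower values indicate better conditional density estimates. Because of the unknown constant, the number is meaningful only when comparing codes on the same galaxies.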
[2]
oai:arXiv.org:1712.02256 [pdf] - 1622376
Improving Photometric Redshift Estimation using GPz: size information,
post processing and improved photometry
Submitted: 2017-12-06
The next generation of large scale imaging surveys (such as those conducted
with the Large Synoptic Survey Telescope and Euclid) will require accurate
photometric redshifts in order to optimally extract cosmological information.
Gaussian Processes for photometric redshift estimation (GPz) is a promising new
method that has been proven to provide efficient, accurate photometric redshift
estimations with reliable variance predictions. In this paper, we investigate a
number of methods for improving the photometric redshift estimations obtained
using GPz (but which are also applicable to others). We use spectroscopy from
the Galaxy and Mass Assembly Data Release 2 with a limiting magnitude of r<19.4
along with corresponding Sloan Digital Sky Survey visible (ugriz) photometry
and the UKIRT Infrared Deep Sky Survey Large Area Survey near-IR (YJHK)
photometry. We evaluate the effects of adding near-IR magnitudes and angular
size as features for the training, validation and testing of GPz and find that
these improve the accuracy of the results by ~15-20 per cent. In addition, we
explore a post-processing method of shifting the probability distributions of
the estimated redshifts based on their Quantile-Quantile plots and find that it
improves the bias by ~40 per cent. Finally, we investigate the effects of using
more precise photometry obtained from the Hyper Suprime-Cam Subaru Strategic
Program Data Release 1 and find that it produces significant improvements in
accuracy, similar to the effect of including additional features.
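The Quantile-Quantile diagnostic mentioned above can be sketched as follows, assuming each galaxy has a Gaussian predictive distribution described by a mean and a standard deviation. The code only builds the points of a Q-Q plot from probability integral transform (PIT) values; it does not reproduce the shifting correction described in the abstract, whose details are not given here. The function names and the plotting-position convention are illustrative assumptions.

```python
import math


def gaussian_pit(z_true, mu, sigma):
    """PIT value: the predicted Gaussian CDF evaluated at the true redshift."""
    return 0.5 * (1.0 + math.erf((z_true - mu) / (sigma * math.sqrt(2.0))))


def qq_points(pit_values):
    """Return (theoretical, empirical) quantile pairs against Uniform(0, 1)."""
    ordered = sorted(pit_values)
    n = len(ordered)
    # Plotting positions (i + 0.5) / n are one common convention (an assumption here).
    return [((i + 0.5) / n, q) for i, q in enumerate(ordered)]
```

For well-calibrated predictive distributions the PIT values are uniformly distributed, so the points lie close to the diagonal. Systematic departures from the diagonal indicate bias or over/under-estimated variance.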
[3]
oai:arXiv.org:1604.03593 [pdf] - 1457233
GPz: Non-stationary sparse Gaussian processes for heteroscedastic
uncertainty estimation in photometric redshifts
Submitted: 2016-04-12, last modified: 2016-06-16
The next generation of cosmology experiments will be required to use
photometric redshifts rather than spectroscopic redshifts. Obtaining accurate
and well-characterized photometric redshift distributions is therefore critical
for Euclid, the Large Synoptic Survey Telescope and the Square Kilometre Array.
In particular, determining accurate variance predictions alongside single point
estimates is crucial, as they can be used to optimize the sample of galaxies
for the specific experiment (e.g. weak lensing, baryon acoustic oscillations,
supernovae), trading off between completeness and reliability in the galaxy
sample. The various sources of uncertainty in measurements of the photometry
and redshifts put a lower bound on the accuracy that any model can hope to
achieve. The intrinsic uncertainty associated with estimates is often
non-uniform and input-dependent, commonly known in statistics as
heteroscedastic noise. However, existing approaches are susceptible to outliers
and do not take into account variance induced by non-uniform data density and
in most cases require manual tuning of many parameters. In this paper, we
present a Bayesian machine learning approach, which we refer to as Gaussian
processes for photometric redshifts (GPz), that jointly optimizes the model with
respect to both the predictive mean and variance. The predictive variance of the model
takes into account both the variance due to data density and photometric noise.
Using the SDSS DR12 data, we show that our approach substantially outperforms
other machine learning methods for photo-z estimation and their associated
variance, such as TPZ and ANNz2. We provide Matlab and Python implementations,
available for download at https://github.com/OxfordML/GPz .
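To make the idea of input-dependent (heteroscedastic) noise concrete, the sketch below evaluates a Gaussian negative log-likelihood in which every object carries its own predicted variance. This is a generic textbook objective shown under that assumption. It is not the GPz objective itself, and the function name `heteroscedastic_nll` is hypothetical.

```python
import math


def heteroscedastic_nll(z_true, mu, var):
    """Mean Gaussian negative log-likelihood with a separate variance per object."""
    total = 0.0
    for z, m, v in zip(z_true, mu, var):
        # The log term penalizes large variances; the quadratic term penalizes
        # residuals that are large relative to the predicted variance.
        total += 0.5 * (math.log(2.0 * math.pi * v) + (z - m) ** 2 / v)
    return total / len(z_true)
```

A model scored this way is rewarded for assigning large variance where its mean prediction is poor and small variance where it is good, which is the trade-off that joint optimization of mean and variance exploits.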
[4]
oai:arXiv.org:1505.05489 [pdf] - 1316987
A Sparse Gaussian Process Framework for Photometric Redshift Estimation
Submitted: 2015-05-20, last modified: 2015-10-19
Accurate photometric redshifts are a lynchpin for many future experiments to
pin down the cosmological model and for studies of galaxy evolution. In this
study, a novel sparse regression framework for photometric redshift estimation
is presented. Simulated and real data from SDSS DR12 were used to train and
test the proposed models. We show that approaches which include careful data
preparation and model design offer a significant improvement in comparison with
several competing machine learning algorithms. Standard implementations of most
regression algorithms have as the objective the minimization of the sum of
squared errors. For redshift inference, however, this induces a bias in the
posterior mean of the output distribution, which can be problematic. In this
paper we directly target minimizing $\Delta z = (z_\textrm{s} -
z_\textrm{p})/(1+z_\textrm{s})$ and address the bias problem via a
distribution-based weighting scheme, incorporated as part of the optimization
objective. The results are compared with other machine learning algorithms in
the field such as Artificial Neural Networks (ANN), Gaussian Processes (GPs)
and sparse GPs. The proposed framework reaches a mean absolute $\Delta z =
0.0026(1+z_\textrm{s})$, over the redshift range of $0 \le z_\textrm{s} \le 2$
on the simulated data, and $\Delta z = 0.0178(1+z_\textrm{s})$ over the entire
redshift range on the SDSS DR12 survey, outperforming the standard ANNz used in
the literature. We also investigate how the relative size of the training set
affects the photometric redshift accuracy. We find that a training set larger
than 30 per cent of the total sample size provides little additional
constraint on the photometric redshifts, and note that our GP formalism
strongly outperforms ANNz in the sparse data regime for the simulated data set.
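The normalized residual $\Delta z = (z_\textrm{s} - z_\textrm{p})/(1+z_\textrm{s})$ quoted above, and its mean absolute value, can be computed as in the following minimal sketch. The function names are illustrative assumptions. The inputs are plain lists of spectroscopic and photometric redshifts.

```python
def normalized_residuals(z_spec, z_phot):
    """Delta z = (z_s - z_p) / (1 + z_s) for each galaxy."""
    return [(zs - zp) / (1.0 + zs) for zs, zp in zip(z_spec, z_phot)]


def mean_abs_residual(z_spec, z_phot):
    """Mean absolute normalized residual over the sample."""
    res = normalized_residuals(z_spec, z_phot)
    return sum(abs(r) for r in res) / len(res)
```

Dividing by $1+z_\textrm{s}$ keeps errors at high redshift from dominating the average, which is why the abstract quotes accuracies in units of $(1+z_\textrm{s})$.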