Normalized to: Pospisil, T.
[1]
oai:arXiv.org:2001.03621 [pdf] - 2029805
Evaluation of probabilistic photometric redshift estimation approaches
for LSST
Schmidt, S. J.;
Malz, A. I.;
Soo, J. Y. H.;
Almosallam, I. A.;
Brescia, M.;
Cavuoti, S.;
Cohen-Tanugi, J.;
Connolly, A. J.;
DeRose, J.;
Freeman, P. E.;
Graham, M. L.;
Iyer, K. G.;
Jarvis, M. J.;
Kalmbach, J. B.;
Kovacs, E.;
Lee, A. B.;
Longo, G.;
Morrison, C. B.;
Newman, J. A.;
Nourbakhsh, E.;
Nuss, E.;
Pospisil, T.;
Tranin, H.;
Wechsler, R. H.;
Zhou, R.;
Izbicki, R.;
Collaboration, The LSST Dark Energy Science
Submitted: 2020-01-10
Many scientific investigations of photometric galaxy surveys require redshift
estimates, whose uncertainty properties are best encapsulated by photometric
redshift (photo-z) posterior probability density functions (PDFs). A plethora
of photo-z PDF estimation methodologies abound, producing discrepant results
with no consensus on a preferred approach. We present the results of a
comprehensive experiment comparing twelve photo-z algorithms applied to mock
data produced for the Large Synoptic Survey Telescope (LSST) Dark Energy
Science Collaboration (DESC). By supplying perfect prior information, in the
form of the complete template library and a representative training set as
inputs to each code, we demonstrate the impact of the assumptions underlying
each technique on the output photo-z PDFs. In the absence of a notion of true,
unbiased photo-z PDFs, we evaluate and interpret multiple metrics of the
ensemble properties of the derived photo-z PDFs as well as traditional
reductions to photo-z point estimates. We report systematic biases and overall
over/under-breadth of the photo-z PDFs of many popular codes, which may
indicate avenues for improvement in the algorithms or implementations.
Furthermore, we raise attention to the limitations of established metrics for
assessing photo-z PDF accuracy; though we identify the conditional density
estimate (CDE) loss as a promising metric of photo-z PDF performance in the
case where true redshifts are available but true photo-z PDFs are not, we
emphasize the need for science-specific performance metrics.
[2]
oai:arXiv.org:1908.11523 [pdf] - 2031949
Conditional Density Estimation Tools in Python and R with Applications
to Photometric Redshifts and Likelihood-Free Cosmological Inference
Submitted: 2019-08-29, last modified: 2019-12-20
It is well known in astronomy that propagating non-Gaussian prediction
uncertainty in photometric redshift estimates is key to reducing bias in
downstream cosmological analyses. Similarly, likelihood-free inference
approaches, which are beginning to emerge as a tool for cosmological analysis,
require a characterization of the full uncertainty landscape of the parameters
of interest given observed data. However, most machine learning (ML) or
training-based methods with open-source software target point prediction or
classification, and hence fall short in quantifying uncertainty in complex
regression and parameter inference settings. As an alternative to methods that
focus on predicting the response (or parameters) $\mathbf{y}$ from features
$\mathbf{x}$, we provide nonparametric conditional density estimation (CDE)
tools for approximating and validating the entire probability density function
(PDF) $\mathrm{p}(\mathbf{y}|\mathbf{x})$ of $\mathbf{y}$ given (i.e.,
conditional on) $\mathbf{x}$. As there is no one-size-fits-all CDE method, the
goal of this work is to provide a comprehensive range of statistical tools and
open-source software for nonparametric CDE and method assessment which can
accommodate different types of settings and be easily fit to the problem at
hand. Specifically, we introduce four CDE software packages in
$\texttt{Python}$ and $\texttt{R}$ based on ML prediction methods adapted and
optimized for CDE: $\texttt{NNKCDE}$, $\texttt{RFCDE}$, $\texttt{FlexCode}$,
and $\texttt{DeepCDE}$. Furthermore, we present the $\texttt{cdetools}$
package, which includes functions for computing a CDE loss function for tuning
and assessing the quality of individual PDFs, along with diagnostic functions.
We provide sample code in $\texttt{Python}$ and $\texttt{R}$ as well as
examples of applications to photometric redshift estimation and likelihood-free
cosmological inference via CDE.
[3]
oai:arXiv.org:1905.03779 [pdf] - 1880595
Non-Gaussianity in the Weak Lensing Correlation Function Likelihood -
Implications for Cosmological Parameter Biases
Submitted: 2019-05-09
We study the significance of non-Gaussianity in the likelihood of weak
lensing shear two-point correlation functions, detecting significantly non-zero
skewness and kurtosis in one-dimensional marginal distributions of shear
two-point correlation functions in simulated weak lensing data though the full
multivariate distributions are relatively more Gaussian. We examine the
implications in the context of future surveys, in particular LSST, with
derivations of how the non-Gaussianity scales with survey area. We show that
there is no significant bias in one-dimensional posteriors of $\Omega_{\rm m}$
and $\sigma_{\rm 8}$ due to the non-Gaussian likelihood distributions of shear
correlations functions using the mock data ($100$ deg$^{2}$). We also present a
systematic approach to constructing an approximate multivariate likelihood
function by decorrelating the data points using principal component analysis
(PCA). When using a subset of the PCA components that account for the majority
of the cosmological signal as a data vector, the one-dimensional marginal
likelihood distributions of those components exhibit less skewness and kurtosis
than the original shear correlation functions. We further demonstrate that the
difference in cosmological parameter constraints between the multivariate
Gaussian likelihood model and more complex non-Gaussian likelihood models would
be even smaller for an LSST-like survey due to the area effect. In addition,
the PCA approach automatically serves as a data compression method, enabling
the retention of the majority of the cosmological information while reducing
the dimensionality of the data vector by a factor of $\sim$5.