Normalized to: Dalmasso, N.
[1]
oai:arXiv.org:1908.11523 [pdf] - 2031949
Conditional Density Estimation Tools in Python and R with Applications
to Photometric Redshifts and Likelihood-Free Cosmological Inference
Submitted: 2019-08-29, last modified: 2019-12-20
It is well known in astronomy that propagating non-Gaussian prediction
uncertainty in photometric redshift estimates is key to reducing bias in
downstream cosmological analyses. Similarly, likelihood-free inference
approaches, which are beginning to emerge as a tool for cosmological analysis,
require a characterization of the full uncertainty landscape of the parameters
of interest given observed data. However, most machine learning (ML) or
training-based methods with open-source software target point prediction or
classification, and hence fall short in quantifying uncertainty in complex
regression and parameter inference settings. As an alternative to methods that
focus on predicting the response (or parameters) $\mathbf{y}$ from features
$\mathbf{x}$, we provide nonparametric conditional density estimation (CDE)
tools for approximating and validating the entire probability density function
(PDF) $\mathrm{p}(\mathbf{y}|\mathbf{x})$ of $\mathbf{y}$ given (i.e.,
conditional on) $\mathbf{x}$. As there is no one-size-fits-all CDE method, the
goal of this work is to provide a comprehensive range of statistical tools and
open-source software for nonparametric CDE and method assessment which can
accommodate different types of settings and be easily fit to the problem at
hand. Specifically, we introduce four CDE software packages in
$\texttt{Python}$ and $\texttt{R}$ based on ML prediction methods adapted and
optimized for CDE: $\texttt{NNKCDE}$, $\texttt{RFCDE}$, $\texttt{FlexCode}$,
and $\texttt{DeepCDE}$. Furthermore, we present the $\texttt{cdetools}$
package, which includes functions for computing a CDE loss function for tuning
and assessing the quality of individual PDFs, along with diagnostic functions.
We provide sample code in $\texttt{Python}$ and $\texttt{R}$ as well as
examples of applications to photometric redshift estimation and likelihood-free
cosmological inference via CDE.
[2]
oai:arXiv.org:1707.00007 [pdf] - 1648585
Clarifying the Hubble constant tension with a Bayesian hierarchical
model of the local distance ladder
Submitted: 2017-06-30, last modified: 2017-11-08
Estimates of the Hubble constant, $H_0$, from the distance ladder and the
cosmic microwave background (CMB) differ at the $\sim$3-$\sigma$ level,
indicating a potential issue with the standard $\Lambda$CDM cosmology.
Interpreting this tension correctly requires a model comparison calculation
depending on not only the traditional `$n$-$\sigma$' mismatch but also the
tails of the likelihoods. Determining the form of the tails of the local $H_0$
likelihood is impossible with the standard Gaussian least-squares
approximation, as it requires using non-Gaussian distributions to faithfully
represent anchor likelihoods and model outliers in the Cepheid and supernova
(SN) populations, and simultaneous fitting of the full distance-ladder dataset
to correctly propagate uncertainties. We have developed a Bayesian hierarchical
model that describes the full distance ladder, from nearby geometric anchors
through Cepheids to Hubble-Flow SNe. This model does not rely on any
distributions being Gaussian, allowing outliers to be modeled and obviating the
need for arbitrary data cuts. Sampling from the $\sim$3000-parameter joint
posterior using Hamiltonian Monte Carlo, we find $H_0$ = (72.72 $\pm$ 1.67)
${\rm km\,s^{-1}\,Mpc^{-1}}$ when applied to the outlier-cleaned Riess et al.
(2016) data, and ($73.15 \pm 1.78$) ${\rm km\,s^{-1}\,Mpc^{-1}}$ with SN
outliers reintroduced. Our high-fidelity sampling of the low-$H_0$ tail of the
distance-ladder likelihood allows us to apply Bayesian model comparison to
assess the evidence for deviation from $\Lambda$CDM. We set up this comparison
to yield a lower limit on the odds of the underlying model being $\Lambda$CDM
given the distance-ladder and Planck XIII (2016) CMB data. The odds against
$\Lambda$CDM are at worst 10:1 or 7:1, depending on whether the SNe outliers
are cut or modeled, or 60:1 if an approximation to the Planck Int. XLVI (2016)
likelihood is used.