
Pospisil, Taylor

Normalized to: Pospisil, T.

3 article(s) in total. 32 co-authors, from 1 to 3 common article(s). Median position in authors list is 4.0.

[1] oai:arXiv.org:2001.03621 [pdf] - 2029805

Evaluation of probabilistic photometric redshift estimation approaches for LSST

Comments: submitted to MNRAS

Submitted: 2020-01-10

Many scientific investigations of photometric galaxy surveys require redshift estimates, whose uncertainty properties are best encapsulated by photometric redshift (photo-z) posterior probability density functions (PDFs). Many photo-z PDF estimation methodologies exist, producing discrepant results with no consensus on a preferred approach. We present the results of a comprehensive experiment comparing twelve photo-z algorithms applied to mock data produced for the Large Synoptic Survey Telescope (LSST) Dark Energy Science Collaboration (DESC). By supplying perfect prior information, in the form of the complete template library and a representative training set as inputs to each code, we demonstrate the impact of the assumptions underlying each technique on the output photo-z PDFs. In the absence of a notion of true, unbiased photo-z PDFs, we evaluate and interpret multiple metrics of the ensemble properties of the derived photo-z PDFs as well as traditional reductions to photo-z point estimates. We report systematic biases and overall over/under-breadth of the photo-z PDFs of many popular codes, which may indicate avenues for improvement in the algorithms or implementations. Furthermore, we draw attention to the limitations of established metrics for assessing photo-z PDF accuracy. Though we identify the conditional density estimate (CDE) loss as a promising metric of photo-z PDF performance in the case where true redshifts are available but true photo-z PDFs are not, we emphasize the need for science-specific performance metrics.
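For orientation, the CDE loss named in this abstract is usually written as follows. The abstract itself does not give the formula, so this is a sketch under assumed notation: $\hat{f}(z|\mathbf{x})$ is the estimated photo-z PDF, $f(z|\mathbf{x})$ the unknown true conditional density, and $(\mathbf{x}_i, z_i)$, $i = 1, \dots, n$, are test galaxies with known redshifts.

```latex
% CDE loss: squared error between estimated and true conditional densities,
% averaged over the distribution P of the features x (notation assumed here).
L(\hat{f}, f) = \iint \left( \hat{f}(z \mid \mathbf{x}) - f(z \mid \mathbf{x}) \right)^{2} \, dz \, dP(\mathbf{x})

% Empirical estimate, up to an additive constant K_f that depends only on f:
\hat{L}(\hat{f}, f) = \frac{1}{n} \sum_{i=1}^{n} \int \hat{f}(z \mid \mathbf{x}_i)^{2} \, dz
                      \; - \; \frac{2}{n} \sum_{i=1}^{n} \hat{f}(z_i \mid \mathbf{x}_i) + K_f
```

Expanding the square shows why true PDFs are not needed: the cross term is an expectation over the joint distribution of $(\mathbf{x}, z)$ and so can be estimated from true redshifts alone, while the only term involving $f$ by itself is the constant $K_f$, which is the same for every method being compared.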

[2] oai:arXiv.org:1908.11523 [pdf] - 2031949

Conditional Density Estimation Tools in Python and R with Applications to Photometric Redshifts and Likelihood-Free Cosmological Inference

Dalmasso, Niccolò; Pospisil, Taylor; Lee, Ann B.; Izbicki, Rafael; Freeman, Peter E.; Malz, Alex I.

Comments: 27 pages, 7 figures, 4 tables

Submitted: 2019-08-29, last modified: 2019-12-20

It is well known in astronomy that propagating non-Gaussian prediction uncertainty in photometric redshift estimates is key to reducing bias in downstream cosmological analyses. Similarly, likelihood-free inference approaches, which are beginning to emerge as a tool for cosmological analysis, require a characterization of the full uncertainty landscape of the parameters of interest given observed data. However, most machine learning (ML) or training-based methods with open-source software target point prediction or classification, and hence fall short in quantifying uncertainty in complex regression and parameter inference settings. As an alternative to methods that focus on predicting the response (or parameters) $\mathbf{y}$ from features $\mathbf{x}$, we provide nonparametric conditional density estimation (CDE) tools for approximating and validating the entire probability density function (PDF) $\mathrm{p}(\mathbf{y}|\mathbf{x})$ of $\mathbf{y}$ given (i.e., conditional on) $\mathbf{x}$. As there is no one-size-fits-all CDE method, the goal of this work is to provide a comprehensive range of statistical tools and open-source software for nonparametric CDE and method assessment which can accommodate different types of settings and be easily fit to the problem at hand. Specifically, we introduce four CDE software packages in $\texttt{Python}$ and $\texttt{R}$ based on ML prediction methods adapted and optimized for CDE: $\texttt{NNKCDE}$, $\texttt{RFCDE}$, $\texttt{FlexCode}$, and $\texttt{DeepCDE}$. Furthermore, we present the $\texttt{cdetools}$ package, which includes functions for computing a CDE loss function for tuning and assessing the quality of individual PDFs, along with diagnostic functions. We provide sample code in $\texttt{Python}$ and $\texttt{R}$ as well as examples of applications to photometric redshift estimation and likelihood-free cosmological inference via CDE.
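As a rough illustration of the CDE loss mentioned above, the sketch below estimates it, up to an additive constant, from PDFs tabulated on a shared grid. The estimator is the mean over test points of $\int \hat{f}(z|\mathbf{x}_i)^2 dz$ minus twice the mean of $\hat{f}(z_i|\mathbf{x}_i)$. The function name `cde_loss_sketch` and its arguments are hypothetical and are not the $\texttt{cdetools}$ API; the trapezoid rule and linear interpolation are choices made here for simplicity.

```python
import numpy as np


def cde_loss_sketch(cde_estimates, z_grid, z_true):
    """Estimate the CDE loss up to an additive constant (hypothetical helper).

    cde_estimates : array (n_obs, n_grid), estimated densities on z_grid
    z_grid        : array (n_grid,), strictly increasing grid of z values
    z_true        : array (n_obs,), true responses for the test points
    """
    cde_estimates = np.asarray(cde_estimates, dtype=float)
    z_grid = np.asarray(z_grid, dtype=float)
    z_true = np.asarray(z_true, dtype=float)

    # Term 1: mean over observations of the integral of the squared density,
    # approximated with the trapezoid rule on the grid.
    squared = cde_estimates ** 2
    widths = np.diff(z_grid)
    integrals = np.sum(0.5 * (squared[:, 1:] + squared[:, :-1]) * widths, axis=1)
    term1 = integrals.mean()

    # Term 2: mean estimated density at the true response, obtained by
    # linear interpolation of each row on the grid.
    at_truth = np.array(
        [np.interp(z, z_grid, row) for z, row in zip(z_true, cde_estimates)]
    )
    term2 = at_truth.mean()

    return term1 - 2.0 * term2
```

Lower values indicate better estimates. Because of the omitted constant, the number is only meaningful when comparing methods on the same test set.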

[3] oai:arXiv.org:1905.03779 [pdf] - 1880595

Non-Gaussianity in the Weak Lensing Correlation Function Likelihood - Implications for Cosmological Parameter Biases

Lin, Chien-Hao; Harnois-Déraps, Joachim; Eifler, Tim; Pospisil, Taylor; Mandelbaum, Rachel; Lee, Ann B.; Singh, Sukhdeep

Comments: 16 pages, 10 figures, submitted to MNRAS

Submitted: 2019-05-09

We study the significance of non-Gaussianity in the likelihood of weak lensing shear two-point correlation functions. In simulated weak lensing data, we detect significantly non-zero skewness and kurtosis in the one-dimensional marginal distributions of shear two-point correlation functions, though the full multivariate distributions are relatively more Gaussian. We examine the implications in the context of future surveys, in particular LSST, with derivations of how the non-Gaussianity scales with survey area. We show that there is no significant bias in one-dimensional posteriors of $\Omega_{\rm m}$ and $\sigma_{\rm 8}$ due to the non-Gaussian likelihood distributions of shear correlation functions using the mock data ($100$ deg$^{2}$). We also present a systematic approach to constructing an approximate multivariate likelihood function by decorrelating the data points using principal component analysis (PCA). When using a subset of the PCA components that account for the majority of the cosmological signal as a data vector, the one-dimensional marginal likelihood distributions of those components exhibit less skewness and kurtosis than the original shear correlation functions. We further demonstrate that the difference in cosmological parameter constraints between the multivariate Gaussian likelihood model and more complex non-Gaussian likelihood models would be even smaller for an LSST-like survey due to the area effect. In addition, the PCA approach automatically serves as a data compression method, enabling the retention of the majority of the cosmological information while reducing the dimensionality of the data vector by a factor of $\sim$5.
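The PCA decorrelation and compression step described above can be sketched roughly as follows. This is an illustration under assumptions and not the authors' pipeline: the function name `pca_compress` is hypothetical, the input is assumed to be an array of simulated data vectors (one realization per row), and components are ranked by variance, whereas the abstract selects components by how much cosmological signal they carry.

```python
import numpy as np


def pca_compress(data_vectors, n_keep):
    """Decorrelate and compress data vectors with PCA (hypothetical helper).

    data_vectors : array (n_realizations, n_data), one data vector per row
    n_keep       : number of principal components to retain

    Returns the compressed vectors (n_realizations, n_keep), the PCA basis
    (n_data, n_keep), and the mean data vector (n_data,).
    """
    X = np.asarray(data_vectors, dtype=float)
    mean = X.mean(axis=0)
    centered = X - mean

    # Sample covariance between data points, estimated across realizations.
    cov = np.cov(centered, rowvar=False)

    # eigh returns eigenvalues in ascending order for symmetric matrices;
    # reorder so the largest-variance components come first.
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]
    basis = eigvecs[:, order[:n_keep]]

    # Projecting onto the eigenvectors yields mutually uncorrelated components.
    compressed = centered @ basis
    return compressed, basis, mean
```

Reducing a data vector by a factor of about 5, as quoted in the abstract, would correspond to setting `n_keep` to roughly one fifth of the original length. The marginal skewness and kurtosis of each retained component can then be checked column by column.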