Full-text search for arXiv

25 article(s) in total. 69 co-authors, from 1 to 9 common article(s). Median position in authors list is 2,0.

[1] oai:arXiv.org:2007.01844 [pdf] - 2127600

KiDS-1000 Methodology: Modelling and inference for joint weak gravitational lensing and spectroscopic galaxy clustering analysis

Comments: 44 pages, 34 figures, submitted to A&A. This paper is part of the KiDS-1000 series of papers, accompanying Giblin et al. appearing on the arXiv today

Submitted: 2020-07-03

We present the methodology for a joint cosmological analysis of weak gravitational lensing from the fourth data release of the ESO Kilo-Degree Survey (KiDS-1000) and galaxy clustering from the partially overlapping BOSS and 2dFLenS surveys. Cross-correlations between galaxy positions and ellipticities are incorporated into the analysis, developing a hybrid model of non-linear scales that blends perturbative and non-perturbative approaches, and assessing signal contributions by astrophysical effects. All weak lensing signals are measured consistently via Fourier-space statistics that are insensitive to the survey mask and display low levels of mode mixing. The calibration of photometric redshift distributions and multiplicative gravitational shear bias is updated, and a more complete tally of residual calibration uncertainties is propagated into the likelihood. A dedicated suite of more than 20000 mocks is used to assess the performance of covariance models and to quantify the impact of survey geometry and spatial variations of survey depth on signals and their errors. The sampling distributions for the likelihood and the $\chi^2$ goodness-of-fit statistic are validated, with proposed changes to calculating the effective number of degrees of freedom. Standard weak lensing point estimates on $S_8=\sigma_8\,(\Omega_{\rm m}/0.3)^{1/2}$ derived from its marginal posterior are easily misinterpreted to be biased low, and an alternative estimator and associated credible interval are proposed. Known systematic effects pertaining to weak lensing modelling and inference are shown to bias $S_8$ by no more than 0.1 standard deviations, with the caveat that no conclusive validation data exist for models of intrinsic galaxy alignments. Compared to the previous KiDS analyses, $S_8$ constraints are expected to improve by 20% for weak lensing alone and by 29% for the joint analysis. [abridged]

[2] oai:arXiv.org:2007.00356 [pdf] - 2126348

Galactic potential constraints from clustering in action space of combined stellar stream data

Reino, Stella; Rossi, Elena M.; Sanderson, Robyn E.; Sellentin, Elena; Helmi, Amina; Koppelman, Helmer H.; Sharma, Sanjib

Comments: 24 pages, 13 figures. Submitted to MNRAS

Submitted: 2020-07-01

Stream stars removed by tides from their progenitor satellite galaxy or globular cluster act as a group of test particles on neighboring orbits, probing the gravitational field of the Milky Way. While constraints from individual streams have been shown to be susceptible to biases, combining several streams from orbits with various distances reduces these biases. We fit a common gravitational potential to multiple stellar streams simultaneously by maximizing the clustering of the stream stars in action space. We apply this technique to members of the GD-1, Pal 5, Orphan and Helmi streams, exploiting both the individual and combined data sets. We describe the Galactic potential with a St\"ackel model, and vary up to five parameters simultaneously. We find that we can only constrain the enclosed mass, and that the strongest constraints come from the GD-1 and Pal 5 streams whose combined data set yields $M(< 20\ \mathrm{kpc}) = 3.47^{+0.95}_{-1.44} \times 10^{11} \ M_{\odot}$. When including the Orphan and Helmi stream in the data set, the mass uncertainty increases to $M(< 20\ \mathrm{kpc}) = 3.12^{+5.69}_{-1.07} \times 10^{11} \ M_{\odot}$, indicative of the hidden systematic errors in fits to individual streams. These systematics are likely due to insufficiencies in our St\"ackel model of the potential and to the limited phase space explored by the data set of 4 streams, so the larger uncertainty of the combined fit is the more robust measure of the actual uncertainty in the Milky Way potential.

[3] oai:arXiv.org:2006.06706 [pdf] - 2113397

Extreme data compression while searching for new physics

Heavens, Alan; Sellentin, Elena; Jaffe, Andrew

Comments: 11 pages, 12 figures

Submitted: 2020-06-11

Bringing a high-dimensional dataset into science-ready shape is a formidable challenge that often necessitates data compression. Compression has accordingly become a key consideration for contemporary cosmology, affecting public data releases, and reanalyses searching for new physics. However, data compression optimized for a particular model can suppress signs of new physics, or even remove them altogether. We therefore provide a solution for exploring new physics \emph{during} data compression. In particular, we store additional agnostic compressed data points, selected to enable precise constraints of non-standard physics at a later date. Our procedure is based on the maximal compression of the MOPED algorithm, which optimally filters the data with respect to a baseline model. We select additional filters, based on a generalised principal component analysis, which are carefully constructed to scout for new physics at high precision and speed. We refer to the augmented set of filters as MOPED-PC. They enable an analytic computation of Bayesian evidences that may indicate the presence of new physics, and fast analytic estimates of best-fitting parameters when adopting a specific non-standard theory, without further expensive MCMC analysis. As there may be large numbers of non-standard theories, the speed of the method becomes essential. Should no new physics be found, then our approach preserves the precision of the standard parameters. As a result, we achieve very rapid and maximally precise constraints of standard and non-standard physics, with a technique that scales well to large dimensional datasets.

[4] oai:arXiv.org:2005.14049 [pdf] - 2103532

Trimodal structure of Hercules stream explained by originating from bar resonances

Asano, Tetsuro; Fujii, Michiko S.; Baba, Junichi; Bédorf, Jeroen; Sellentin, Elena; Zwart, Simon Portegies

Comments: 15 pages, 13 figures, submitted to MNRAS

Submitted: 2020-05-28

Gaia Data Release 2 revealed detailed structures of nearby stars in phase space. These include the Hercules stream, whose origin is still debated. Earlier numerical studies conjectured that the observed structures originate from orbits in resonance with the bar, based on static potential models for the Milky Way. We, in contrast, approach the problem via a self-consistent, dynamic, and morphologically well-resolved model, namely a full $N$-body simulation of the Milky Way. Our simulation comprises about 5.1 billion particles in the galactic stellar bulge, bar, disk, and dark-matter halo and is evolved to 10 Gyr. Our model's disk component is composed of 200 million particles, and its simulation snapshots are stored every 10 Myr, enabling us to resolve and classify resonant orbits of representative samples of stars. After identifying the Sun's position in the simulation, we compare the distribution of stars in its neighborhood with Gaia's astrometric data, thereby establishing the role of identified resonantly trapped stars in the formation of Hercules-like structures. From our orbital spectral-analysis we identify multiple resonances and conclude that the Hercules stream is dominated by the 4:1 and 5:1 outer Lindblad and corotation resonances. In total, this yields a trimodal structure of the Hercules stream. From the relation between resonances and ridges in phase space, we conclude that the pattern speed of the Milky-Way bar is 40$-$45 km s$^{-1}$ kpc$^{-1}$.

[5] oai:arXiv.org:2005.07281 [pdf] - 2095676

Report from the Tri-Agency Cosmological Simulation Task Force

Battaglia, Nick; Benson, Andrew; Eifler, Tim; Hearin, Andrew; Heitmann, Katrin; Ho, Shirley; Kiessling, Alina; Lukic, Zarija; Schneider, Michael; Sellentin, Elena; Stadel, Joachim

Comments: 36 pages, 3 figures. Delivered to NASA, NSF, and DOE in Dec 2018

Submitted: 2020-05-14

The Tri-Agency Cosmological Simulations (TACS) Task Force was formed when Program Managers from the Department of Energy (DOE), the National Aeronautics and Space Administration (NASA), and the National Science Foundation (NSF) expressed an interest in receiving input into the cosmological simulations landscape related to the upcoming DOE/NSF Vera Rubin Observatory (Rubin), NASA/ESA's Euclid, and NASA's Wide Field Infrared Survey Telescope (WFIRST). The Co-Chairs of TACS, Katrin Heitmann and Alina Kiessling, invited community scientists from the USA and Europe who are each subject matter experts and are also members of one or more of the surveys to contribute. The following report represents the input from TACS that was delivered to the Agencies in December 2018.

[6] oai:arXiv.org:1401.6892 [pdf] - 2044522

Breaking the spell of Gaussianity: forecasting with higher order Fisher matrices

Sellentin, Elena; Quartin, Miguel; Amendola, Luca

Comments: v3: Corrected an error in Appendix A. Results and conclusions unchanged. v2: New figures and references added. Typos corrected. Matches published version. 10 pages, 6 figures

Submitted: 2014-01-27, last modified: 2020-02-06

We present the new method DALI (Derivative Approximation for LIkelihoods) for reconstructing and forecasting posteriors. DALI extends the Fisher Matrix formalism but allows for a much wider range of posterior shapes. While the Fisher Matrix formalism is limited to yield ellipsoidal confidence contours, our method can reproduce the often observed flexed, deformed or curved shapes of known posteriors. This gain in shape fidelity is obtained by expanding the posterior to higher order in derivatives with respect to parameters, such that non-Gaussianity in the parameter space is taken into account. The resulting expansion is positive definite and normalizable at every order. Here, we present the new technique, highlight its advantages and limitations, and show a representative application to a posterior of dark energy parameters from supernovae measurements.

[7] oai:arXiv.org:1907.05881 [pdf] - 2001403

Euclid-era cosmology for everyone: Neural net assisted MCMC sampling for the joint 3x2 likelihood

Manrique-Yus, Andrea; Sellentin, Elena

Comments: Original version of the authors, as accepted by the referee(s)

Submitted: 2019-07-12, last modified: 2019-11-20

We develop a fully non-invasive use of machine learning in order to enable open research on Euclid-sized data sets. Our algorithm leaves complete control over theory and data analysis, unlike many black-box like uses of machine learning. Focusing on a `3x2 analysis' which combines cosmic shear, galaxy clustering and tangential shear at a Euclid-like sky coverage, we arrange a total of 348000 data points into data matrices whose structure permits not only an easy prediction by neural nets, but it additionally permits the essential removal from the data of patterns which the neural nets could not `understand'. The latter provides an often lacking mechanism to control and debias the inference of physics. The theoretical backbone to our neural net training can be any conventional (deterministic) theory code, where we chose CLASS. After training, we infer the seven parameters of a $w$CDM cosmology by Monte Carlo Markov sampling posteriors at Euclid-like precision within a day. We publicly provide the neural nets which memorise and output all 3x2 power spectra at a Euclid-like sky coverage and redshift binning.

[8] oai:arXiv.org:1910.08533 [pdf] - 2030571

A blinding solution for inference from astronomical data

Sellentin, Elena

Comments:

Submitted: 2019-10-18

This paper presents a joint blinding and deblinding strategy for inference of physical laws from astronomical data. The strategy allows for up to three blinding stages, where the data may be blinded, the computations of theoretical physics may be blinded, and --assuming Gaussianly distributed data-- the covariance matrix may be blinded. We found covariance blinding to be particularly effective, as it enables the blinder to determine close to exactly where the blinded posterior will peak. Accordingly, we present an algorithm which induces posterior shifts in predetermined directions by hiding untraceable biases in a covariance matrix. The associated deblinding takes the form of a numerically lightweight post-processing step, where the blinded posterior is multiplied with deblinding weights. We illustrate the blinding strategy for cosmic shear from KiDS-450, and show that even though there is no direct evidence of the KiDS-450 covariance matrix being biased, the famous cosmic shear tension with Planck could easily be induced by a mischaracterization of correlations between $\xi_-$ at the highest redshift and all lower redshifts. The blinding algorithm illustrates the increasing importance of accurate uncertainty assessment in astronomical inferences, as otherwise involuntary blinding through biases occurs.

[9] oai:arXiv.org:1902.00709 [pdf] - 1953401

Debiasing inference with approximate covariance matrices and other unidentified biases

Sellentin, Elena; Starck, Jean-Luc

Comments:

Submitted: 2019-02-02

When a posterior peaks in unexpected regions of parameter space, new physics has either been discovered, or a bias has not been identified yet. To tell these two cases apart is of paramount importance. We therefore present a method to indicate and mitigate unrecognized biases: Our method runs any pipeline with possibly unknown biases on both simulations and real data. It computes the coverage probability of posteriors, which measures whether posterior volume is a faithful representation of probability or not. If found to be necessary, the posterior is then corrected. This is a non-parametric debiasing procedure which complies with objective Bayesian inference. We use the method to debias inference with approximate covariance matrices and redshift uncertainties. We demonstrate why approximate covariance matrices bias physical constraints, and how this bias can be mitigated. We show that for a Euclid-like survey, if a traditional likelihood exists, then 25 end-to-end simulations suffice to guarantee that the figure of merit deteriorates maximally by 22 percent, or by 10 percent for 225 simulations. Thus, even a pessimistic analysis of Euclid-like data will still constitute an 25-fold increase in precision on the dark energy parameters in comparison to the state of the art (2018) set by KiDS and DES. We provide a public code of our method.

[10] oai:arXiv.org:1801.02518 [pdf] - 1699778

General Relativistic corrections in density-shear correlations

Ghosh, Basundhara; Durrer, Ruth; Sellentin, Elena

Comments: version published in JCAP

Submitted: 2018-01-08, last modified: 2018-06-14

We investigate the corrections which relativistic light-cone computations induce on the correlation of the tangential shear with galaxy number counts, also known as galaxy-galaxy lensing. The standard-approach to galaxy-galaxy lensing treats the number density of sources in a foreground bin as observable, whereas it is in reality unobservable due to the presence of relativistic corrections. We find that already in the redshift range covered by the DES first year data, these currently neglected relativistic terms lead to a systematic correction of up to 50% in the density-shear correlation function for the highest redshift bins. This correction is dominated by the the fact that a redshift bin of number counts does not only lens sources in a background bin, but is itself again lensed by all masses between the observer and the counted source population. Relativistic corrections are currently ignored in the standard galaxy-galaxy analyses, and the additional lensing of a counted source populations is only included in the error budget (via the covariance matrix). At increasingly higher redshifts and larger scales, these relativistic and lensing corrections become however increasingly more important, and we here argue that it is then more efficient, and also cleaner, to account for these corrections in the density-shear correlations.

[11] oai:arXiv.org:1802.09450 [pdf] - 1674895

Objective Bayesian analysis of neutrino masses and hierarchy

Heavens, Alan F.; Sellentin, Elena

Comments: 15 pages. Minor changes to match accepted version in JCAP

Submitted: 2018-02-26, last modified: 2018-04-06

Given the precision of current neutrino data, priors still impact noticeably the constraints on neutrino masses and their hierarchy. To avoid our understanding of neutrinos being driven by prior assumptions, we construct a prior that is mathematically minimally informative. Using the constructed uninformative prior, we find that the normal hierarchy is favoured but with inconclusive posterior odds of 5.1:1. Better data is hence needed before the neutrino masses and their hierarchy can be well constrained. We find that the next decade of cosmological data should provide conclusive evidence if the normal hierarchy with negligible minimum mass is correct, and if the uncertainty in the sum of neutrino masses drops below 0.025 eV. On the other hand, if neutrinos obey the inverted hierarchy, achieving strong evidence will be difficult with the same uncertainties. Our uninformative prior was constructed from principles of the Objective Bayesian approach. The prior is called a reference prior and is minimally informative in the specific sense that the information gain after collection of data is maximised. The prior is computed for the combination of neutrino oscillation data and cosmological data and still applies if the data improve.

[12] oai:arXiv.org:1708.00492 [pdf] - 1659534

The full-sky relativistic correlation function and power spectrum of galaxy number counts: I. Theoretical aspects

Tansella, Vittorio; Bonvin, Camille; Durrer, Ruth; Ghosh, Basundhara; Sellentin, Elena

Comments: 49 pages, 19 figures. v2: minor corrections, references added, typos corrected in eqs. (2.3)-(2.6). Matches version published in JCAP

Submitted: 2017-08-01, last modified: 2018-04-03

We derive an exact expression for the correlation function in redshift shells including all the relativistic contributions. This expression, which does not rely on the distant-observer or flat-sky approximation, is valid at all scales and includes both local relativistic corrections and integrated contributions, like gravitational lensing. We present two methods to calculate this correlation function, one which makes use of the angular power spectrum C_ell(z1,z2) and a second method which evades the costly calculations of the angular power spectra. The correlation function is then used to define the power spectrum as its Fourier transform. In this work theoretical aspects of this procedure are presented, together with quantitative examples. In particular, we show that gravitational lensing modifies the multipoles of the correlation function and of the power spectrum by a few percent at redshift z=1 and by up to 30% and more at z=2. We also point out that large-scale relativistic effects and wide-angle corrections generate contributions of the same order of magnitude and have consequently to be treated in conjunction. These corrections are particularly important at small redshift, z=0.1, where they can reach 10%. This means in particular that a flat-sky treatment of relativistic effects, using for example the power spectrum, is not consistent.

[13] oai:arXiv.org:1712.04923 [pdf] - 1674800

The skewed weak lensing likelihood: why biases arise, despite data and theory being sound

Sellentin, Elena; Heymans, Catherine; Harnois-Déraps, Joachim

Comments: Submitted to MNRAS

Submitted: 2017-12-13

We derive the essentials of the skewed weak lensing likelihood via a simple Hierarchical Model. Our likelihood passes four objective and cosmology-independent tests which a standard Gaussian likelihood fails. We demonstrate that sound weak lensing analyses are naturally biased low, and this does not indicate any new physics such as deviations from $\Lambda$CDM. Mathematically, the biases arise because noisy two-point functions follow skewed distributions. This form of bias is already known from CMB analyses, where the low multipoles have asymmetric error bars. Weak lensing is more strongly affected by this asymmetry as galaxies form a discrete set of shear tracer particles, in contrast to a smooth shear field. We demonstrate that the biases can be up to 30 percent of the standard deviation per data point, dependent on the properties of the weak lensing survey. Our likelihood provides a versatile framework with which to address this bias in future weak lensing analyses.

[14] oai:arXiv.org:1707.04488 [pdf] - 1585942

On the insufficiency of arbitrarily precise covariance matrices: non-Gaussian weak lensing likelihoods

Sellentin, Elena; Heavens, Alan F.

Comments: Replacement to match accepted MNRAS version

Submitted: 2017-07-14, last modified: 2017-09-25

We investigate whether a Gaussian likelihood, as routinely assumed in the analysis of cosmological data, is supported by simulated survey data. We define test statistics, based on a novel method that first destroys Gaussian correlations in a dataset, and then measures the non-Gaussian correlations that remain. This procedure flags pairs of datapoints which depend on each other in a non-Gaussian fashion, and thereby identifies where the assumption of a Gaussian likelihood breaks down. Using this diagnostic, we find that non-Gaussian correlations in the CFHTLenS cosmic shear correlation functions are significant. With a simple exclusion of the most contaminated datapoints, the posterior for $s_8$ is shifted without broadening, but we find no significant reduction in the tension with $s_8$ derived from Planck Cosmic Microwave Background data. However, we also show that the one-point distributions of the correlation statistics are noticeably skewed, such that sound weak lensing data sets are intrinsically likely to lead to a systematically low lensing amplitude being inferred. The detected non-Gaussianities get larger with increasing angular scale such that for future wide-angle surveys such as Euclid or LSST, with their very small statistical errors, the large-scale modes are expected to be increasingly affected. The shifts in posteriors may then not be negligible and we recommend that these diagnostic tests be run as part of future analyses.

[15] oai:arXiv.org:1709.03452 [pdf] - 1588202

On the use of the Edgeworth expansion in cosmology I: how to foresee and evade its pitfalls

Sellentin, Elena; Jaffe, Andrew H.; Heavens, Alan F.

Comments: 25 pages, 7 figures

Submitted: 2017-09-11

Non-linear gravitational collapse introduces non-Gaussian statistics into the matter fields of the late Universe. As the large-scale structure is the target of current and future observational campaigns, one would ideally like to have the full probability density function of these non-Gaussian fields. The only viable way we see to achieve this analytically, at least approximately and in the near future, is via the Edgeworth expansion. We hence rederive this expansion for Fourier modes of non-Gaussian fields and then continue by putting it into a wider statistical context than previously done. We show that in its original form, the Edgeworth expansion only works if the non-Gaussian signal is averaged away. This is counterproductive, since we target the parameter-dependent non-Gaussianities as a signal of interest. We hence alter the analysis at the decisive step and now provide a roadmap towards a controlled and unadulterated analysis of non-Gaussianities in structure formation (with the Edgeworth expansion). Our central result is that, although the Edgeworth expansion has pathological properties, these can be predicted and avoided in a careful manner. We also show that, despite the non-Gaussianity coupling all modes, the Edgeworth series may be applied to any desired subset of modes, since this is equivalent (to the level of the approximation) to marginalising over the exlcuded modes. In this first paper of a series, we restrict ourselves to the sampling properties of the Edgeworth expansion, i.e.~how faithfully it reproduces the distribution of non-Gaussian data. A follow-up paper will detail its Bayesian use, when parameters are to be inferred.

[16] oai:arXiv.org:1707.06529 [pdf] - 1586176

Massive data compression for parameter-dependent covariance matrices

Heavens, Alan; Sellentin, Elena; de Mijolla, Damien; Vianello, Alvise

Comments: 8 pages. Accepted by MNRAS

Submitted: 2017-07-20, last modified: 2017-09-05

We show how the massive data compression algorithm MOPED can be used to reduce, by orders of magnitude, the number of simulated datasets that are required to estimate the covariance matrix required for the analysis of gaussian-distributed data. This is relevant when the covariance matrix cannot be calculated directly. The compression is especially valuable when the covariance matrix varies with the model parameters. In this case, it may be prohibitively expensive to run enough simulations to estimate the full covariance matrix throughout the parameter space. This compression may be particularly valuable for the next-generation of weak lensing surveys, such as proposed for Euclid and LSST, for which the number of summary data (such as band power or shear correlation estimates) is very large, $\sim 10^4$, due to the large number of tomographic redshift bins that the data will be divided into. In the pessimistic case where the covariance matrix is estimated separately for all points in an MCMC analysis, this may require an unfeasible $10^9$ simulations. We show here that MOPED can reduce this number by a factor of 1000, or a factor of $\sim 10^6$ if some regularity in the covariance matrix is assumed, reducing the number of simulations required to a manageable $10^3$, making an otherwise intractable analysis feasible.

[17] oai:arXiv.org:1704.03467 [pdf] - 1582495

No evidence for extensions to the standard cosmological model

Heavens, Alan; Fantaye, Yabebal; Sellentin, Elena; Eggers, Hans; Hosenie, Zafiirah; Kroon, Steve; Mootoovaloo, Arrykrishna

Comments: 5 pages. Accepted for publication in PRL. Effect of inclusion of recent H0 measurements is added

Submitted: 2017-04-11, last modified: 2017-08-09

We compute the Bayesian Evidence for models considered in the main analysis of Planck cosmic microwave background data. By utilising carefully-defined nearest-neighbour distances in parameter space, we reuse the Monte Carlo Markov Chains already produced for parameter inference to compute Bayes factors $B$ for many different model-dataset combinations. Standard 6-parameter flat $\Lambda$CDM model is favoured over all other models considered, with curvature being mildly favoured only when CMB lensing is not included. Many alternative models are strongly disfavoured by the data, including primordial correlated isocurvature models ($\ln B=-7.8$), non-zero scalar-to-tensor ratio ($\ln B=-4.3$), running of the spectral index ($\ln B = -4.7$), curvature ($\ln B=-3.6$), non-standard numbers of neutrinos ($\ln B=-3.1$), non-standard neutrino masses ($\ln B=-3.2$), non-standard lensing potential ($\ln B=-4.6$), evolving dark energy ($\ln B=-3.2$), sterile neutrinos ($\ln B=-6.9$), and extra sterile neutrinos with a non-zero scalar-to-tensor ratio ($\ln B=-10.8$). Other models are less strongly disfavoured with respect to flat $\Lambda$CDM. As with all analyses based on Bayesian Evidence, the final numbers depend on the widths of the parameter priors. We adopt the priors used in the Planck analysis, while performing a prior sensitivity analysis. Our quantitative conclusion is that extensions beyond the standard cosmological model are disfavoured by Planck data. Only when newer Hubble constant measurements are included does $\Lambda$CDM become disfavoured, and only mildly, compared with a dynamical dark energy model ($\ln B\sim +2$).

[18] oai:arXiv.org:1704.03472 [pdf] - 1562206

Marginal Likelihoods from Monte Carlo Markov Chains

Heavens, Alan; Fantaye, Yabebal; Mootoovaloo, Arrykrishna; Eggers, Hans; Hosenie, Zafiirah; Kroon, Steve; Sellentin, Elena

Comments:

Submitted: 2017-04-11

In this paper, we present a method for computing the marginal likelihood, also known as the model likelihood or Bayesian evidence, from Markov Chain Monte Carlo (MCMC), or other sampled posterior distributions. In order to do this, one needs to be able to estimate the density of points in parameter space, and this can be challenging in high numbers of dimensions. Here we present a Bayesian analysis, where we obtain the posterior for the marginal likelihood, using $k$th nearest-neighbour distances in parameter space, using the Mahalanobis distance metric, under the assumption that the points in the chain (thinned if required) are independent. We generalise the algorithm to apply to importance-sampled chains, where each point is assigned a weight. We illustrate this with an idealised posterior of known form with an analytic marginal likelihood, and show that for chains of length $\sim 10^5$ points, the technique is effective for parameter spaces with up to $\sim 20$ dimensions. We also argue that $k=1$ is the optimal choice, and discuss failure modes for the algorithm. In a companion paper (Heavens et al. 2017) we apply the technique to the main MCMC chains from the 2015 Planck analysis of cosmic background radiation data, to infer that quantitatively the simplest 6-parameter flat $\Lambda$CDM standard model of cosmology is preferred over all extensions considered.

[19] oai:arXiv.org:1609.00504 [pdf] - 1547687

Quantifying lost information due to covariance matrix estimation in parameter inference

Sellentin, Elena; Heavens, Alan F.

Comments: Figure 6 holds for all surveys; replacement to match published version

Submitted: 2016-09-02, last modified: 2017-03-15

Parameter inference with an estimated covariance matrix systematically loses information due to the remaining uncertainty of the covariance matrix. Here, we quantify this loss of precision and develop a framework to hypothetically restore it, which allows to judge how far away a given analysis is from the ideal case of a known covariance matrix. We point out that it is insufficient to estimate this loss by debiasing a Fisher matrix as previously done, due to a fundamental inequality that describes how biases arise in non-linear functions. We therefore develop direct estimators for parameter credibility contours and the figure of merit. We apply our results to DES Science Verification weak lensing data, detecting a 10% loss of information that increases their credibility contours. No significant loss of information is found for KiDS. For a Euclid-like survey, with about 10 nuisance parameters we find that 2900 simulations are sufficient to limit the systematically lost information to 1%, with an additional uncertainty of about 2%. Without any nuisance parameters 1900 simulations are sufficient to only lose 1% of information. We also derive an estimator for the Fisher matrix of the unknown true covariance matrix, two estimators of its inverse with different physical meanings, and an estimator for the optimally achievable figure of merit. The formalism here quantifies the gains to be made by running more simulated datasets, allowing decisions to be made about numbers of simulations in an informed way.

[20] oai:arXiv.org:1602.01746 [pdf] - 1359288

Optimizing parameter constraints: a new tool for Fisher matrix forecasts

Amendola, L.; Sellentin, E.

Comments: 6 pages, accepted for publication in MNRAS; v2: added an important earlier reference deriving the same formula for the case of a single parameter

Submitted: 2016-02-04, last modified: 2016-02-08

In a Bayesian context, theoretical parameters are correlated random variables. Then, the constraints on one parameter can be improved by either measuring this parameter more precisely - or by measuring the other parameters more precisely. Especially in the case of many parameters, a lengthy process of guesswork is then needed to determine the most efficient way to improve one parameter's constraints. In this short article, we highlight an extremely simple analytical expression that replaces the guesswork and that facilitates a deeper understanding of optimization with interdependent parameters.

[21] oai:arXiv.org:1511.05969 [pdf] - 1347646

Parameter inference with estimated covariance matrices

Sellentin, Elena; Heavens, Alan F.

Comments: Matches accepted MNRAS letter

Submitted: 2015-11-18, last modified: 2016-01-05

When inferring parameters from a Gaussian-distributed data set by computing a likelihood, a covariance matrix is needed that describes the data errors and their correlations. If the covariance matrix is not known a priori, it may be estimated and thereby becomes a random object with some intrinsic uncertainty itself. We show how to infer parameters in the presence of such an estimated covariance matrix, by marginalising over the true covariance matrix, conditioned on its estimated value. This leads to a likelihood function that is no longer Gaussian, but rather an adapted version of a multivariate t-distribution, which has the same numerical complexity as the multivariate Gaussian. As expected, marginalisation over the true covariance matrix improves inference when compared with Hartlap et al.'s method, which uses an unbiased estimate of the inverse covariance matrix but still assumes that the likelihood is Gaussian.

[22] oai:arXiv.org:1412.6427 [pdf] - 1280771

Detecting the cosmological neutrino background in the CMB

Sellentin, Elena; Durrer, Ruth

Comments: 10 pages 7 figures, version accepted for publication in PRD

Submitted: 2014-12-19, last modified: 2015-07-27

Three relativistic particles in addition to the photon are detected in the cosmic microwave background (CMB). In the standard model of cosmology, these are interpreted as the three neutrino species. However, at the time of CMB-decoupling, neutrinos are not only relativistic but they are also freestreaming. Here, we investigate, whether the CMB is sensitive to this defining feature of neutrinos, or whether the CMB-data allow to replace neutrinos with a relativistic fluid. We show that free streaming particles are preferred over a relativistic perfect fluid with $\Delta\chi^2\simeq 21$. We also study the possibility to replace the neutrinos by a viscous fluid and find that a relativistic viscous fluid with either the standard values $c_{\rm eff}^2=c_{\rm vis}^2=1/3$ or best fit values for $c_{\rm eff}^2$ and $c_{\rm vis}^2$ has $\Delta\chi^2=20$ and thus cannot provide a good fit to present CMB data either.

[23] oai:arXiv.org:1506.04866 [pdf] - 1273158

A fast, always positive definite and normalizable approximation of non-Gaussian likelihoods

Sellentin, Elena

Comments: accepted for publication in MNRAS

Submitted: 2015-06-16, last modified: 2015-07-22

In this paper we extent the previously published DALI-approximation for likelihoods to cases in which the parameter dependency is in the covariance matrix. The approximation recovers non-Gaussian likelihoods, and reduces to the Fisher matrix approach in the case of Gaussianity. It works with the minimal assumptions of having Gaussian errors on the data, and a covariance matrix that possesses a converging Taylor approximation. The resulting approximation works in cases of severe parameter degeneracies and in cases where the Fisher matrix is singular. It is at least $1000$ times faster than a typical Monte Carlo Markov Chain run over the same parameter space. Two example applications, to cases of extremely non-Gaussian likelihoods, are presented -- one demonstrates how the method succeeds in reconstructing completely a ring-shaped likelihood. A public code is released here: http://lnasellentin.github.io/DALI/

[24] oai:arXiv.org:1506.05356 [pdf] - 1347469

Non-Gaussian forecasts of weak lensing with and without priors

Sellentin, Elena; Schäfer, Björn Malte

Comments: 9 pages, 7 figures

Submitted: 2015-06-17

Assuming a Euclid-like weak lensing data set, we compare different methods of dealing with its inherent parameter degeneracies. Including priors into a data analysis can mask the information content of a given data set alone. However, since the information content of a data set is usually estimated with the Fisher matrix, priors are added in order to enforce an approximately Gaussian likelihood. Here, we compare priorless forecasts to more conventional forecasts that use priors. We find strongly non-Gaussian likelihoods for 2d-weak lensing if no priors are used, which we approximate with the DALI-expansion. Without priors, the Fisher matrix of the 2d-weak lensing likelihood includes unphysical values of $\Omega_m$ and $h$, since it does not capture the shape of the likelihood well. The Cramer-Rao inequality then does not need to apply. We find that DALI and Monte Carlo Markov Chains predict the presence of a dark energy with high significance, whereas a Fisher forecast of the same data set also allows decelerated expansion. We also find that a 2d-weak lensing analysis provides a sharp lower limit on the Hubble constant of $h > 0.4$, even if the equation of state of dark energy is jointly constrained by the data. This is not predicted by the Fisher matrix and usually masked in other works by a sharp prior on $h$. Additionally, we find that DALI estimates Figures of Merit in the presence of non-Gaussianities better than the Fisher matrix. We additionally demonstrate how DALI allows switching to a Hamiltonian Monte Carlo sampling of a highly curved likelihood with acceptance rates of $\approx 0.5$, an effective covering of the parameter space, and numerically effectively costless leapfrog steps. This shows how quick forecasts can be upgraded to accurate forecasts whenever needed. Results were gained with the public code from http://lnasellentin.github.io/DALI/

[25] oai:arXiv.org:1311.3498 [pdf] - 767138

A quantification of hydrodynamical effects on protoplanetary dust growth

Sellentin, E.; Ramsey, J. P.; Windmark, F.; Dullemond, C. P.

Comments: 9 pages, 6 figures; accepted for publication in A&A; v2 matches the manuscript sent to the publisher (very minor changes)

Submitted: 2013-11-14, last modified: 2013-11-18

Context. The growth process of dust particles in protoplanetary disks can be modeled via numerical dust coagulation codes. In this approach, physical effects that dominate the dust growth process often must be implemented in a parameterized form. Due to a lack of these parameterizations, existing studies of dust coagulation have ignored the effects a hydrodynamical gas flow can have on grain growth, even though it is often argued that the flow could significantly contribute either positively or negatively to the growth process. Aims. We intend to provide a quantification of hydrodynamical effects on the growth of dust particles, such that these effects can be parameterized and implemented in a dust coagulation code. Methods. We numerically integrate the trajectories of small dust particles in the flow of disk gas around a proto-planetesimal, sampling a large parameter space in proto-planetesimal radii, headwind velocities, and dust stopping times. Results. The gas flow deflects most particles away from the proto-planetesimal, such that its effective collisional cross section, and therefore the mass accretion rate, is reduced. The gas flow however also reduces the impact velocity of small dust particles onto a proto-planetesimal. This can be beneficial for its growth, since large impact velocities are known to lead to erosion. We also demonstrate why such a gas flow does not return collisional debris to the surface of a proto-planetesimal. Conclusions. We predict that a laminar hydrodynamical flow around a proto-planetesimal will have a significant effect on its growth. However, we cannot easily predict which result, the reduction of the impact velocity or the sweep-up cross section, will be more important. Therefore, we provide parameterizations ready for implementation into a dust coagulation code.