Normalized to: Sellentin, E.
[1]
oai:arXiv.org:2007.01844 [pdf] - 2127600
KiDS-1000 Methodology: Modelling and inference for joint weak
gravitational lensing and spectroscopic galaxy clustering analysis
Joachimi, B.;
Lin, C. -A.;
Asgari, M.;
Tröster, T.;
Heymans, C.;
Hildebrandt, H.;
Köhlinger, F.;
Sánchez, A. G.;
Wright, A. H.;
Bilicki, M.;
Blake, C.;
Busch, J. L. van den;
Crocce, M.;
Dvornik, A.;
Erben, T.;
Getman, F.;
Giblin, B.;
Hoekstra, H.;
Kannawadi, A.;
Kuijken, K.;
Napolitano, N. R.;
Schneider, P.;
Scoccimarro, R.;
Sellentin, E.;
Shan, H. Y.;
von Wietersheim-Kramsta, M.;
Zuntz, J.
Submitted: 2020-07-03
We present the methodology for a joint cosmological analysis of weak
gravitational lensing from the fourth data release of the ESO Kilo-Degree
Survey (KiDS-1000) and galaxy clustering from the partially overlapping BOSS
and 2dFLenS surveys. Cross-correlations between galaxy positions and
ellipticities are incorporated into the analysis, developing a hybrid model of
non-linear scales that blends perturbative and non-perturbative approaches, and
assessing signal contributions by astrophysical effects. All weak lensing
signals are measured consistently via Fourier-space statistics that are
insensitive to the survey mask and display low levels of mode mixing. The
calibration of photometric redshift distributions and multiplicative
gravitational shear bias is updated, and a more complete tally of residual
calibration uncertainties is propagated into the likelihood. A dedicated suite
of more than 20000 mocks is used to assess the performance of covariance models
and to quantify the impact of survey geometry and spatial variations of survey
depth on signals and their errors. The sampling distributions for the
likelihood and the $\chi^2$ goodness-of-fit statistic are validated, with
proposed changes to calculating the effective number of degrees of freedom.
Standard weak lensing point estimates on $S_8=\sigma_8\,(\Omega_{\rm
m}/0.3)^{1/2}$ derived from its marginal posterior are easily misinterpreted to
be biased low, and an alternative estimator and associated credible interval
are proposed. Known systematic effects pertaining to weak lensing modelling and
inference are shown to bias $S_8$ by no more than 0.1 standard deviations, with
the caveat that no conclusive validation data exist for models of intrinsic
galaxy alignments. Compared to the previous KiDS analyses, $S_8$ constraints
are expected to improve by 20% for weak lensing alone and by 29% for the joint
analysis. [abridged]
[2]
oai:arXiv.org:2007.00356 [pdf] - 2126348
Galactic potential constraints from clustering in action space of
combined stellar stream data
Submitted: 2020-07-01
Stream stars removed by tides from their progenitor satellite galaxy or
globular cluster act as a group of test particles on neighboring orbits,
probing the gravitational field of the Milky Way. While constraints from
individual streams have been shown to be susceptible to biases, combining
several streams from orbits with various distances reduces these biases. We fit
a common gravitational potential to multiple stellar streams simultaneously by
maximizing the clustering of the stream stars in action space. We apply this
technique to members of the GD-1, Pal 5, Orphan and Helmi streams, exploiting
both the individual and combined data sets. We describe the Galactic potential
with a St\"ackel model, and vary up to five parameters simultaneously. We find
that we can only constrain the enclosed mass, and that the strongest
constraints come from the GD-1 and Pal 5 streams whose combined data set yields
$M(< 20\ \mathrm{kpc}) = 3.47^{+0.95}_{-1.44} \times 10^{11} \ M_{\odot}$. When
including the Orphan and Helmi stream in the data set, the mass uncertainty
increases to $M(< 20\ \mathrm{kpc}) = 3.12^{+5.69}_{-1.07} \times 10^{11} \
M_{\odot}$, indicative of the hidden systematic errors in fits to individual
streams. These systematics are likely due to insufficiencies in our St\"ackel
model of the potential and to the limited phase space explored by the data set
of 4 streams, so the larger uncertainty of the combined fit is the more robust
measure of the actual uncertainty in the Milky Way potential.
[3]
oai:arXiv.org:2006.06706 [pdf] - 2113397
Extreme data compression while searching for new physics
Submitted: 2020-06-11
Bringing a high-dimensional dataset into science-ready shape is a formidable
challenge that often necessitates data compression. Compression has accordingly
become a key consideration for contemporary cosmology, affecting public data
releases, and reanalyses searching for new physics. However, data compression
optimized for a particular model can suppress signs of new physics, or even
remove them altogether. We therefore provide a solution for exploring new
physics \emph{during} data compression. In particular, we store additional
agnostic compressed data points, selected to enable precise constraints of
non-standard physics at a later date. Our procedure is based on the maximal
compression of the MOPED algorithm, which optimally filters the data with
respect to a baseline model. We select additional filters, based on a
generalised principal component analysis, which are carefully constructed to
scout for new physics at high precision and speed. We refer to the augmented
set of filters as MOPED-PC. They enable an analytic computation of Bayesian
evidences that may indicate the presence of new physics, and fast analytic
estimates of best-fitting parameters when adopting a specific non-standard
theory, without further expensive MCMC analysis. As there may be large numbers
of non-standard theories, the speed of the method becomes essential. Should no
new physics be found, then our approach preserves the precision of the standard
parameters. As a result, we achieve very rapid and maximally precise
constraints of standard and non-standard physics, with a technique that scales
well to large dimensional datasets.
[4]
oai:arXiv.org:2005.14049 [pdf] - 2103532
Trimodal structure of Hercules stream explained by originating from bar
resonances
Submitted: 2020-05-28
Gaia Data Release 2 revealed detailed structures of nearby stars in phase
space. These include the Hercules stream, whose origin is still debated.
Earlier numerical studies conjectured that the observed structures originate
from orbits in resonance with the bar, based on static potential models for the
Milky Way. We, in contrast, approach the problem via a self-consistent,
dynamic, and morphologically well-resolved model, namely a full $N$-body
simulation of the Milky Way. Our simulation comprises about 5.1 billion
particles in the galactic stellar bulge, bar, disk, and dark-matter halo and is
evolved to 10 Gyr. Our model's disk component is composed of 200 million
particles, and its simulation snapshots are stored every 10 Myr, enabling us to
resolve and classify resonant orbits of representative samples of stars. After
identifying the Sun's position in the simulation, we compare the distribution
of stars in its neighborhood with Gaia's astrometric data, thereby establishing
the role of identified resonantly trapped stars in the formation of
Hercules-like structures. From our orbital spectral-analysis we identify
multiple resonances and conclude that the Hercules stream is dominated by the
4:1 and 5:1 outer Lindblad and corotation resonances. In total, this yields a
trimodal structure of the Hercules stream. From the relation between resonances
and ridges in phase space, we conclude that the pattern speed of the Milky-Way
bar is 40$-$45 km s$^{-1}$ kpc$^{-1}$.
[5]
oai:arXiv.org:2005.07281 [pdf] - 2095676
Report from the Tri-Agency Cosmological Simulation Task Force
Battaglia, Nick;
Benson, Andrew;
Eifler, Tim;
Hearin, Andrew;
Heitmann, Katrin;
Ho, Shirley;
Kiessling, Alina;
Lukic, Zarija;
Schneider, Michael;
Sellentin, Elena;
Stadel, Joachim
Submitted: 2020-05-14
The Tri-Agency Cosmological Simulations (TACS) Task Force was formed when
Program Managers from the Department of Energy (DOE), the National Aeronautics
and Space Administration (NASA), and the National Science Foundation (NSF)
expressed an interest in receiving input into the cosmological simulations
landscape related to the upcoming DOE/NSF Vera Rubin Observatory (Rubin),
NASA/ESA's Euclid, and NASA's Wide Field Infrared Survey Telescope (WFIRST).
The Co-Chairs of TACS, Katrin Heitmann and Alina Kiessling, invited community
scientists from the USA and Europe who are each subject matter experts and are
also members of one or more of the surveys to contribute. The following report
represents the input from TACS that was delivered to the Agencies in December
2018.
[6]
oai:arXiv.org:1401.6892 [pdf] - 2044522
Breaking the spell of Gaussianity: forecasting with higher order Fisher
matrices
Submitted: 2014-01-27, last modified: 2020-02-06
We present the new method DALI (Derivative Approximation for LIkelihoods) for
reconstructing and forecasting posteriors. DALI extends the Fisher Matrix
formalism but allows for a much wider range of posterior shapes. While the
Fisher Matrix formalism is limited to yield ellipsoidal confidence contours,
our method can reproduce the often observed flexed, deformed or curved shapes
of known posteriors. This gain in shape fidelity is obtained by expanding the
posterior to higher order in derivatives with respect to parameters, such that
non-Gaussianity in the parameter space is taken into account. The resulting
expansion is positive definite and normalizable at every order. Here, we
present the new technique, highlight its advantages and limitations, and show a
representative application to a posterior of dark energy parameters from
supernovae measurements.
[7]
oai:arXiv.org:1907.05881 [pdf] - 2001403
Euclid-era cosmology for everyone: Neural net assisted MCMC sampling for
the joint 3x2 likelihood
Submitted: 2019-07-12, last modified: 2019-11-20
We develop a fully non-invasive use of machine learning in order to enable
open research on Euclid-sized data sets. Our algorithm leaves complete control
over theory and data analysis, unlike many black-box like uses of machine
learning. Focusing on a `3x2 analysis' which combines cosmic shear, galaxy
clustering and tangential shear at a Euclid-like sky coverage, we arrange a
total of 348000 data points into data matrices whose structure permits not only
an easy prediction by neural nets, but it additionally permits the essential
removal from the data of patterns which the neural nets could not `understand'.
The latter provides an often lacking mechanism to control and debias the
inference of physics. The theoretical backbone to our neural net training can
be any conventional (deterministic) theory code, where we chose CLASS. After
training, we infer the seven parameters of a $w$CDM cosmology by Monte Carlo
Markov sampling posteriors at Euclid-like precision within a day. We publicly
provide the neural nets which memorise and output all 3x2 power spectra at a
Euclid-like sky coverage and redshift binning.
[8]
oai:arXiv.org:1910.08533 [pdf] - 2030571
A blinding solution for inference from astronomical data
Submitted: 2019-10-18
This paper presents a joint blinding and deblinding strategy for inference of
physical laws from astronomical data. The strategy allows for up to three
blinding stages, where the data may be blinded, the computations of theoretical
physics may be blinded, and --assuming Gaussianly distributed data-- the
covariance matrix may be blinded. We found covariance blinding to be
particularly effective, as it enables the blinder to determine close to exactly
where the blinded posterior will peak. Accordingly, we present an algorithm
which induces posterior shifts in predetermined directions by hiding
untraceable biases in a covariance matrix. The associated deblinding takes the
form of a numerically lightweight post-processing step, where the blinded
posterior is multiplied with deblinding weights. We illustrate the blinding
strategy for cosmic shear from KiDS-450, and show that even though there is no
direct evidence of the KiDS-450 covariance matrix being biased, the famous
cosmic shear tension with Planck could easily be induced by a
mischaracterization of correlations between $\xi_-$ at the highest redshift and
all lower redshifts. The blinding algorithm illustrates the increasing
importance of accurate uncertainty assessment in astronomical inferences, as
otherwise involuntary blinding through biases occurs.
[9]
oai:arXiv.org:1902.00709 [pdf] - 1953401
Debiasing inference with approximate covariance matrices and other
unidentified biases
Submitted: 2019-02-02
When a posterior peaks in unexpected regions of parameter space, new physics
has either been discovered, or a bias has not been identified yet. To tell
these two cases apart is of paramount importance. We therefore present a method
to indicate and mitigate unrecognized biases: Our method runs any pipeline with
possibly unknown biases on both simulations and real data. It computes the
coverage probability of posteriors, which measures whether posterior volume is
a faithful representation of probability or not. If found to be necessary, the
posterior is then corrected. This is a non-parametric debiasing procedure which
complies with objective Bayesian inference. We use the method to debias
inference with approximate covariance matrices and redshift uncertainties. We
demonstrate why approximate covariance matrices bias physical constraints, and
how this bias can be mitigated. We show that for a Euclid-like survey, if a
traditional likelihood exists, then 25 end-to-end simulations suffice to
guarantee that the figure of merit deteriorates maximally by 22 percent, or by
10 percent for 225 simulations. Thus, even a pessimistic analysis of
Euclid-like data will still constitute an 25-fold increase in precision on the
dark energy parameters in comparison to the state of the art (2018) set by KiDS
and DES. We provide a public code of our method.
[10]
oai:arXiv.org:1801.02518 [pdf] - 1699778
General Relativistic corrections in density-shear correlations
Submitted: 2018-01-08, last modified: 2018-06-14
We investigate the corrections which relativistic light-cone computations
induce on the correlation of the tangential shear with galaxy number counts,
also known as galaxy-galaxy lensing. The standard-approach to galaxy-galaxy
lensing treats the number density of sources in a foreground bin as observable,
whereas it is in reality unobservable due to the presence of relativistic
corrections. We find that already in the redshift range covered by the DES
first year data, these currently neglected relativistic terms lead to a
systematic correction of up to 50% in the density-shear correlation function
for the highest redshift bins. This correction is dominated by the the fact
that a redshift bin of number counts does not only lens sources in a background
bin, but is itself again lensed by all masses between the observer and the
counted source population. Relativistic corrections are currently ignored in
the standard galaxy-galaxy analyses, and the additional lensing of a counted
source populations is only included in the error budget (via the covariance
matrix). At increasingly higher redshifts and larger scales, these relativistic
and lensing corrections become however increasingly more important, and we here
argue that it is then more efficient, and also cleaner, to account for these
corrections in the density-shear correlations.
[11]
oai:arXiv.org:1802.09450 [pdf] - 1674895
Objective Bayesian analysis of neutrino masses and hierarchy
Submitted: 2018-02-26, last modified: 2018-04-06
Given the precision of current neutrino data, priors still impact noticeably
the constraints on neutrino masses and their hierarchy. To avoid our
understanding of neutrinos being driven by prior assumptions, we construct a
prior that is mathematically minimally informative. Using the constructed
uninformative prior, we find that the normal hierarchy is favoured but with
inconclusive posterior odds of 5.1:1. Better data is hence needed before the
neutrino masses and their hierarchy can be well constrained. We find that the
next decade of cosmological data should provide conclusive evidence if the
normal hierarchy with negligible minimum mass is correct, and if the
uncertainty in the sum of neutrino masses drops below 0.025 eV. On the other
hand, if neutrinos obey the inverted hierarchy, achieving strong evidence will
be difficult with the same uncertainties. Our uninformative prior was
constructed from principles of the Objective Bayesian approach. The prior is
called a reference prior and is minimally informative in the specific sense
that the information gain after collection of data is maximised. The prior is
computed for the combination of neutrino oscillation data and cosmological data
and still applies if the data improve.
[12]
oai:arXiv.org:1708.00492 [pdf] - 1659534
The full-sky relativistic correlation function and power spectrum of
galaxy number counts: I. Theoretical aspects
Submitted: 2017-08-01, last modified: 2018-04-03
We derive an exact expression for the correlation function in redshift shells
including all the relativistic contributions. This expression, which does not
rely on the distant-observer or flat-sky approximation, is valid at all scales
and includes both local relativistic corrections and integrated contributions,
like gravitational lensing. We present two methods to calculate this
correlation function, one which makes use of the angular power spectrum
C_ell(z1,z2) and a second method which evades the costly calculations of the
angular power spectra. The correlation function is then used to define the
power spectrum as its Fourier transform. In this work theoretical aspects of
this procedure are presented, together with quantitative examples. In
particular, we show that gravitational lensing modifies the multipoles of the
correlation function and of the power spectrum by a few percent at redshift z=1
and by up to 30% and more at z=2. We also point out that large-scale
relativistic effects and wide-angle corrections generate contributions of the
same order of magnitude and have consequently to be treated in conjunction.
These corrections are particularly important at small redshift, z=0.1, where
they can reach 10%. This means in particular that a flat-sky treatment of
relativistic effects, using for example the power spectrum, is not consistent.
[13]
oai:arXiv.org:1712.04923 [pdf] - 1674800
The skewed weak lensing likelihood: why biases arise, despite data and
theory being sound
Submitted: 2017-12-13
We derive the essentials of the skewed weak lensing likelihood via a simple
Hierarchical Model. Our likelihood passes four objective and
cosmology-independent tests which a standard Gaussian likelihood fails. We
demonstrate that sound weak lensing analyses are naturally biased low, and this
does not indicate any new physics such as deviations from $\Lambda$CDM.
Mathematically, the biases arise because noisy two-point functions follow
skewed distributions. This form of bias is already known from CMB analyses,
where the low multipoles have asymmetric error bars. Weak lensing is more
strongly affected by this asymmetry as galaxies form a discrete set of shear
tracer particles, in contrast to a smooth shear field. We demonstrate that the
biases can be up to 30 percent of the standard deviation per data point,
dependent on the properties of the weak lensing survey. Our likelihood provides
a versatile framework with which to address this bias in future weak lensing
analyses.
[14]
oai:arXiv.org:1707.04488 [pdf] - 1585942
On the insufficiency of arbitrarily precise covariance matrices:
non-Gaussian weak lensing likelihoods
Submitted: 2017-07-14, last modified: 2017-09-25
We investigate whether a Gaussian likelihood, as routinely assumed in the
analysis of cosmological data, is supported by simulated survey data. We define
test statistics, based on a novel method that first destroys Gaussian
correlations in a dataset, and then measures the non-Gaussian correlations that
remain. This procedure flags pairs of datapoints which depend on each other in
a non-Gaussian fashion, and thereby identifies where the assumption of a
Gaussian likelihood breaks down. Using this diagnostic, we find that
non-Gaussian correlations in the CFHTLenS cosmic shear correlation functions
are significant. With a simple exclusion of the most contaminated datapoints,
the posterior for $s_8$ is shifted without broadening, but we find no
significant reduction in the tension with $s_8$ derived from Planck Cosmic
Microwave Background data. However, we also show that the one-point
distributions of the correlation statistics are noticeably skewed, such that
sound weak lensing data sets are intrinsically likely to lead to a
systematically low lensing amplitude being inferred. The detected
non-Gaussianities get larger with increasing angular scale such that for future
wide-angle surveys such as Euclid or LSST, with their very small statistical
errors, the large-scale modes are expected to be increasingly affected. The
shifts in posteriors may then not be negligible and we recommend that these
diagnostic tests be run as part of future analyses.
[15]
oai:arXiv.org:1709.03452 [pdf] - 1588202
On the use of the Edgeworth expansion in cosmology I: how to foresee and
evade its pitfalls
Submitted: 2017-09-11
Non-linear gravitational collapse introduces non-Gaussian statistics into the
matter fields of the late Universe. As the large-scale structure is the target
of current and future observational campaigns, one would ideally like to have
the full probability density function of these non-Gaussian fields. The only
viable way we see to achieve this analytically, at least approximately and in
the near future, is via the Edgeworth expansion. We hence rederive this
expansion for Fourier modes of non-Gaussian fields and then continue by putting
it into a wider statistical context than previously done. We show that in its
original form, the Edgeworth expansion only works if the non-Gaussian signal is
averaged away. This is counterproductive, since we target the
parameter-dependent non-Gaussianities as a signal of interest. We hence alter
the analysis at the decisive step and now provide a roadmap towards a
controlled and unadulterated analysis of non-Gaussianities in structure
formation (with the Edgeworth expansion). Our central result is that, although
the Edgeworth expansion has pathological properties, these can be predicted and
avoided in a careful manner. We also show that, despite the non-Gaussianity
coupling all modes, the Edgeworth series may be applied to any desired subset
of modes, since this is equivalent (to the level of the approximation) to
marginalising over the exlcuded modes. In this first paper of a series, we
restrict ourselves to the sampling properties of the Edgeworth expansion,
i.e.~how faithfully it reproduces the distribution of non-Gaussian data. A
follow-up paper will detail its Bayesian use, when parameters are to be
inferred.
[16]
oai:arXiv.org:1707.06529 [pdf] - 1586176
Massive data compression for parameter-dependent covariance matrices
Submitted: 2017-07-20, last modified: 2017-09-05
We show how the massive data compression algorithm MOPED can be used to
reduce, by orders of magnitude, the number of simulated datasets that are
required to estimate the covariance matrix required for the analysis of
gaussian-distributed data. This is relevant when the covariance matrix cannot
be calculated directly. The compression is especially valuable when the
covariance matrix varies with the model parameters. In this case, it may be
prohibitively expensive to run enough simulations to estimate the full
covariance matrix throughout the parameter space. This compression may be
particularly valuable for the next-generation of weak lensing surveys, such as
proposed for Euclid and LSST, for which the number of summary data (such as
band power or shear correlation estimates) is very large, $\sim 10^4$, due to
the large number of tomographic redshift bins that the data will be divided
into. In the pessimistic case where the covariance matrix is estimated
separately for all points in an MCMC analysis, this may require an unfeasible
$10^9$ simulations. We show here that MOPED can reduce this number by a factor
of 1000, or a factor of $\sim 10^6$ if some regularity in the covariance matrix
is assumed, reducing the number of simulations required to a manageable $10^3$,
making an otherwise intractable analysis feasible.
[17]
oai:arXiv.org:1704.03467 [pdf] - 1582495
No evidence for extensions to the standard cosmological model
Submitted: 2017-04-11, last modified: 2017-08-09
We compute the Bayesian Evidence for models considered in the main analysis
of Planck cosmic microwave background data. By utilising carefully-defined
nearest-neighbour distances in parameter space, we reuse the Monte Carlo Markov
Chains already produced for parameter inference to compute Bayes factors $B$
for many different model-dataset combinations. Standard 6-parameter flat
$\Lambda$CDM model is favoured over all other models considered, with curvature
being mildly favoured only when CMB lensing is not included. Many alternative
models are strongly disfavoured by the data, including primordial correlated
isocurvature models ($\ln B=-7.8$), non-zero scalar-to-tensor ratio ($\ln
B=-4.3$), running of the spectral index ($\ln B = -4.7$), curvature ($\ln
B=-3.6$), non-standard numbers of neutrinos ($\ln B=-3.1$), non-standard
neutrino masses ($\ln B=-3.2$), non-standard lensing potential ($\ln B=-4.6$),
evolving dark energy ($\ln B=-3.2$), sterile neutrinos ($\ln B=-6.9$), and
extra sterile neutrinos with a non-zero scalar-to-tensor ratio ($\ln B=-10.8$).
Other models are less strongly disfavoured with respect to flat $\Lambda$CDM.
As with all analyses based on Bayesian Evidence, the final numbers depend on
the widths of the parameter priors. We adopt the priors used in the Planck
analysis, while performing a prior sensitivity analysis. Our quantitative
conclusion is that extensions beyond the standard cosmological model are
disfavoured by Planck data. Only when newer Hubble constant measurements are
included does $\Lambda$CDM become disfavoured, and only mildly, compared with a
dynamical dark energy model ($\ln B\sim +2$).
[18]
oai:arXiv.org:1704.03472 [pdf] - 1562206
Marginal Likelihoods from Monte Carlo Markov Chains
Submitted: 2017-04-11
In this paper, we present a method for computing the marginal likelihood,
also known as the model likelihood or Bayesian evidence, from Markov Chain
Monte Carlo (MCMC), or other sampled posterior distributions. In order to do
this, one needs to be able to estimate the density of points in parameter
space, and this can be challenging in high numbers of dimensions. Here we
present a Bayesian analysis, where we obtain the posterior for the marginal
likelihood, using $k$th nearest-neighbour distances in parameter space, using
the Mahalanobis distance metric, under the assumption that the points in the
chain (thinned if required) are independent. We generalise the algorithm to
apply to importance-sampled chains, where each point is assigned a weight. We
illustrate this with an idealised posterior of known form with an analytic
marginal likelihood, and show that for chains of length $\sim 10^5$ points, the
technique is effective for parameter spaces with up to $\sim 20$ dimensions. We
also argue that $k=1$ is the optimal choice, and discuss failure modes for the
algorithm. In a companion paper (Heavens et al. 2017) we apply the technique to
the main MCMC chains from the 2015 Planck analysis of cosmic background
radiation data, to infer that quantitatively the simplest 6-parameter flat
$\Lambda$CDM standard model of cosmology is preferred over all extensions
considered.
[19]
oai:arXiv.org:1609.00504 [pdf] - 1547687
Quantifying lost information due to covariance matrix estimation in
parameter inference
Submitted: 2016-09-02, last modified: 2017-03-15
Parameter inference with an estimated covariance matrix systematically loses
information due to the remaining uncertainty of the covariance matrix. Here, we
quantify this loss of precision and develop a framework to hypothetically
restore it, which allows to judge how far away a given analysis is from the
ideal case of a known covariance matrix. We point out that it is insufficient
to estimate this loss by debiasing a Fisher matrix as previously done, due to a
fundamental inequality that describes how biases arise in non-linear functions.
We therefore develop direct estimators for parameter credibility contours and
the figure of merit. We apply our results to DES Science Verification weak
lensing data, detecting a 10% loss of information that increases their
credibility contours. No significant loss of information is found for KiDS. For
a Euclid-like survey, with about 10 nuisance parameters we find that 2900
simulations are sufficient to limit the systematically lost information to 1%,
with an additional uncertainty of about 2%. Without any nuisance parameters
1900 simulations are sufficient to only lose 1% of information. We also derive
an estimator for the Fisher matrix of the unknown true covariance matrix, two
estimators of its inverse with different physical meanings, and an estimator
for the optimally achievable figure of merit. The formalism here quantifies the
gains to be made by running more simulated datasets, allowing decisions to be
made about numbers of simulations in an informed way.
[20]
oai:arXiv.org:1602.01746 [pdf] - 1359288
Optimizing parameter constraints: a new tool for Fisher matrix forecasts
Submitted: 2016-02-04, last modified: 2016-02-08
In a Bayesian context, theoretical parameters are correlated random
variables. Then, the constraints on one parameter can be improved by either
measuring this parameter more precisely - or by measuring the other parameters
more precisely. Especially in the case of many parameters, a lengthy process of
guesswork is then needed to determine the most efficient way to improve one
parameter's constraints. In this short article, we highlight an extremely
simple analytical expression that replaces the guesswork and that facilitates a
deeper understanding of optimization with interdependent parameters.
[21]
oai:arXiv.org:1511.05969 [pdf] - 1347646
Parameter inference with estimated covariance matrices
Submitted: 2015-11-18, last modified: 2016-01-05
When inferring parameters from a Gaussian-distributed data set by computing a
likelihood, a covariance matrix is needed that describes the data errors and
their correlations. If the covariance matrix is not known a priori, it may be
estimated and thereby becomes a random object with some intrinsic uncertainty
itself. We show how to infer parameters in the presence of such an estimated
covariance matrix, by marginalising over the true covariance matrix,
conditioned on its estimated value. This leads to a likelihood function that is
no longer Gaussian, but rather an adapted version of a multivariate
t-distribution, which has the same numerical complexity as the multivariate
Gaussian. As expected, marginalisation over the true covariance matrix improves
inference when compared with Hartlap et al.'s method, which uses an unbiased
estimate of the inverse covariance matrix but still assumes that the likelihood
is Gaussian.
[22]
oai:arXiv.org:1412.6427 [pdf] - 1280771
Detecting the cosmological neutrino background in the CMB
Submitted: 2014-12-19, last modified: 2015-07-27
Three relativistic particles in addition to the photon are detected in the
cosmic microwave background (CMB). In the standard model of cosmology, these
are interpreted as the three neutrino species. However, at the time of
CMB-decoupling, neutrinos are not only relativistic but they are also
freestreaming. Here, we investigate, whether the CMB is sensitive to this
defining feature of neutrinos, or whether the CMB-data allow to replace
neutrinos with a relativistic fluid. We show that free streaming particles are
preferred over a relativistic perfect fluid with $\Delta\chi^2\simeq 21$. We
also study the possibility to replace the neutrinos by a viscous fluid and find
that a relativistic viscous fluid with either the standard values $c_{\rm
eff}^2=c_{\rm vis}^2=1/3$ or best fit values for $c_{\rm eff}^2$ and $c_{\rm
vis}^2$ has $\Delta\chi^2=20$ and thus cannot provide a good fit to present CMB
data either.
[23]
oai:arXiv.org:1506.04866 [pdf] - 1273158
A fast, always positive definite and normalizable approximation of
non-Gaussian likelihoods
Submitted: 2015-06-16, last modified: 2015-07-22
In this paper we extent the previously published DALI-approximation for
likelihoods to cases in which the parameter dependency is in the covariance
matrix. The approximation recovers non-Gaussian likelihoods, and reduces to the
Fisher matrix approach in the case of Gaussianity. It works with the minimal
assumptions of having Gaussian errors on the data, and a covariance matrix that
possesses a converging Taylor approximation. The resulting approximation works
in cases of severe parameter degeneracies and in cases where the Fisher matrix
is singular. It is at least $1000$ times faster than a typical Monte Carlo
Markov Chain run over the same parameter space. Two example applications, to
cases of extremely non-Gaussian likelihoods, are presented -- one demonstrates
how the method succeeds in reconstructing completely a ring-shaped likelihood.
A public code is released here: http://lnasellentin.github.io/DALI/
[24]
oai:arXiv.org:1506.05356 [pdf] - 1347469
Non-Gaussian forecasts of weak lensing with and without priors
Submitted: 2015-06-17
Assuming a Euclid-like weak lensing data set, we compare different methods of
dealing with its inherent parameter degeneracies. Including priors into a data
analysis can mask the information content of a given data set alone. However,
since the information content of a data set is usually estimated with the
Fisher matrix, priors are added in order to enforce an approximately Gaussian
likelihood. Here, we compare priorless forecasts to more conventional forecasts
that use priors. We find strongly non-Gaussian likelihoods for 2d-weak lensing
if no priors are used, which we approximate with the DALI-expansion. Without
priors, the Fisher matrix of the 2d-weak lensing likelihood includes unphysical
values of $\Omega_m$ and $h$, since it does not capture the shape of the
likelihood well. The Cramer-Rao inequality then does not need to apply. We find
that DALI and Monte Carlo Markov Chains predict the presence of a dark energy
with high significance, whereas a Fisher forecast of the same data set also
allows decelerated expansion. We also find that a 2d-weak lensing analysis
provides a sharp lower limit on the Hubble constant of $h > 0.4$, even if the
equation of state of dark energy is jointly constrained by the data. This is
not predicted by the Fisher matrix and usually masked in other works by a sharp
prior on $h$. Additionally, we find that DALI estimates Figures of Merit in the
presence of non-Gaussianities better than the Fisher matrix. We additionally
demonstrate how DALI allows switching to a Hamiltonian Monte Carlo sampling of
a highly curved likelihood with acceptance rates of $\approx 0.5$, an effective
covering of the parameter space, and numerically effectively costless leapfrog
steps. This shows how quick forecasts can be upgraded to accurate forecasts
whenever needed. Results were gained with the public code from
http://lnasellentin.github.io/DALI/
[25]
oai:arXiv.org:1311.3498 [pdf] - 767138
A quantification of hydrodynamical effects on protoplanetary dust growth
Submitted: 2013-11-14, last modified: 2013-11-18
Context. The growth process of dust particles in protoplanetary disks can be
modeled via numerical dust coagulation codes. In this approach, physical
effects that dominate the dust growth process often must be implemented in a
parameterized form. Due to a lack of these parameterizations, existing studies
of dust coagulation have ignored the effects a hydrodynamical gas flow can have
on grain growth, even though it is often argued that the flow could
significantly contribute either positively or negatively to the growth process.
Aims. We intend to provide a quantification of hydrodynamical effects on the
growth of dust particles, such that these effects can be parameterized and
implemented in a dust coagulation code.
Methods. We numerically integrate the trajectories of small dust particles in
the flow of disk gas around a proto-planetesimal, sampling a large parameter
space in proto-planetesimal radii, headwind velocities, and dust stopping
times.
Results. The gas flow deflects most particles away from the
proto-planetesimal, such that its effective collisional cross section, and
therefore the mass accretion rate, is reduced. The gas flow however also
reduces the impact velocity of small dust particles onto a proto-planetesimal.
This can be beneficial for its growth, since large impact velocities are known
to lead to erosion. We also demonstrate why such a gas flow does not return
collisional debris to the surface of a proto-planetesimal.
Conclusions. We predict that a laminar hydrodynamical flow around a
proto-planetesimal will have a significant effect on its growth. However, we
cannot easily predict which result, the reduction of the impact velocity or the
sweep-up cross section, will be more important. Therefore, we provide
parameterizations ready for implementation into a dust coagulation code.