Normalized to: Mootoovaloo, A.
[1]
oai:arXiv.org:2005.06551 [pdf] - 2095137
Parameter Inference for Weak Lensing using Gaussian Processes and MOPED
Submitted: 2020-05-13
In this paper, we propose a Gaussian Process (GP) emulator for the
calculation of a) tomographic weak lensing band-power spectra, and b)
coefficients of summary data massively compressed with the MOPED algorithm. In
the former case cosmological parameter inference is accelerated by a factor of
$\sim 10$-$30$ compared to explicit calls to the Boltzmann solver CLASS when
applied to KiDS-450 weak lensing data. Much larger gains will come with future
data, where with MOPED compression, the speed up can be up to a factor of $\sim
10^3$ when the common Limber approximation is used. Furthermore, the GP opens
up the possibility of dropping the Limber approximation, without which the
theoretical calculations may be unfeasibly slow. A potential advantage of GPs
is that an error on the emulated function can be computed and this uncertainty
incorporated into the likelihood. If speed is of the essence, then the mean of
the Gaussian Process can be used and the uncertainty ignored. We compute the
Kullback-Leibler divergence between the emulator likelihood and the CLASS
likelihood, and on the basis of this and from analysing the uncertainties on
the parameters, we find that in this case, the inclusion of the GP uncertainty
does not justify the extra computational expense in the test application. For
future weak lensing surveys such as Euclid and Legacy Survey of Space and
Telescope (LSST), the number of summary statistics will be large, up to $\sim
10^{4}$. The speed of MOPED is determined by the number of parameters, not the
number of summary data, so the gains are very large. In the non-Limber case,
the speed-up can be a factor of $\sim 10^5$, provided that a fast way to
compute the theoretical MOPED coefficients is available. The GP presented here
provides such a fast mechanism and enables MOPED to be employed.
[2]
oai:arXiv.org:2002.12386 [pdf] - 2065446
Imbalance Learning for Variable Star Classification
Submitted: 2020-02-27
The accurate automated classification of variable stars into their respective
sub-types is difficult. Machine learning based solutions often fall foul of the
imbalanced learning problem, which causes poor generalisation performance in
practice, especially on rare variable star sub-types. In previous work, we
attempted to overcome such deficiencies via the development of a hierarchical
machine learning classifier. This 'algorithm-level' approach to tackling
imbalance, yielded promising results on Catalina Real-Time Survey (CRTS) data,
outperforming the binary and multi-class classification schemes previously
applied in this area. In this work, we attempt to further improve hierarchical
classification performance by applying 'data-level' approaches to directly
augment the training data so that they better describe under-represented
classes. We apply and report results for three data augmentation methods in
particular: $\textit{R}$andomly $\textit{A}$ugmented $\textit{S}$ampled
$\textit{L}$ight curves from magnitude $\textit{E}$rror ($\texttt{RASLE}$),
augmenting light curves with Gaussian Process modelling ($\texttt{GpFit}$) and
the Synthetic Minority Over-sampling Technique ($\texttt{SMOTE}$). When
combining the 'algorithm-level' (i.e. the hierarchical scheme) together with
the 'data-level' approach, we further improve variable star classification
accuracy by 1-4$\%$. We found that a higher classification rate is obtained
when using $\texttt{GpFit}$ in the hierarchical model. Further improvement of
the metric scores requires a better standard set of correctly identified
variable stars and, perhaps enhanced features are needed.
[3]
oai:arXiv.org:1907.08189 [pdf] - 1925160
Comparing Multi-class, Binary and Hierarchical Machine Learning
Classification schemes for variable stars
Submitted: 2019-07-18
Upcoming synoptic surveys are set to generate an unprecedented amount of
data. This requires an automatic framework that can quickly and efficiently
provide classification labels for several new object classification challenges.
Using data describing 11 types of variable stars from the Catalina Real-Time
Transient Surveys (CRTS), we illustrate how to capture the most important
information from computed features and describe detailed methods of how to
robustly use Information Theory for feature selection and evaluation. We apply
three Machine Learning (ML) algorithms and demonstrate how to optimize these
classifiers via cross-validation techniques. For the CRTS dataset, we find that
the Random Forest (RF) classifier performs best in terms of balanced-accuracy
and geometric means. We demonstrate substantially improved classification
results by converting the multi-class problem into a binary classification
task, achieving a balanced-accuracy rate of $\sim$99 per cent for the
classification of ${\delta}$-Scuti and Anomalous Cepheids (ACEP). Additionally,
we describe how classification performance can be improved via converting a
'flat-multi-class' problem into a hierarchical taxonomy. We develop a new
hierarchical structure and propose a new set of classification features,
enabling the accurate identification of subtypes of cepheids, RR Lyrae and
eclipsing binary stars in CRTS data.
[4]
oai:arXiv.org:1704.03467 [pdf] - 1582495
No evidence for extensions to the standard cosmological model
Submitted: 2017-04-11, last modified: 2017-08-09
We compute the Bayesian Evidence for models considered in the main analysis
of Planck cosmic microwave background data. By utilising carefully-defined
nearest-neighbour distances in parameter space, we reuse the Monte Carlo Markov
Chains already produced for parameter inference to compute Bayes factors $B$
for many different model-dataset combinations. Standard 6-parameter flat
$\Lambda$CDM model is favoured over all other models considered, with curvature
being mildly favoured only when CMB lensing is not included. Many alternative
models are strongly disfavoured by the data, including primordial correlated
isocurvature models ($\ln B=-7.8$), non-zero scalar-to-tensor ratio ($\ln
B=-4.3$), running of the spectral index ($\ln B = -4.7$), curvature ($\ln
B=-3.6$), non-standard numbers of neutrinos ($\ln B=-3.1$), non-standard
neutrino masses ($\ln B=-3.2$), non-standard lensing potential ($\ln B=-4.6$),
evolving dark energy ($\ln B=-3.2$), sterile neutrinos ($\ln B=-6.9$), and
extra sterile neutrinos with a non-zero scalar-to-tensor ratio ($\ln B=-10.8$).
Other models are less strongly disfavoured with respect to flat $\Lambda$CDM.
As with all analyses based on Bayesian Evidence, the final numbers depend on
the widths of the parameter priors. We adopt the priors used in the Planck
analysis, while performing a prior sensitivity analysis. Our quantitative
conclusion is that extensions beyond the standard cosmological model are
disfavoured by Planck data. Only when newer Hubble constant measurements are
included does $\Lambda$CDM become disfavoured, and only mildly, compared with a
dynamical dark energy model ($\ln B\sim +2$).
[5]
oai:arXiv.org:1704.03472 [pdf] - 1562206
Marginal Likelihoods from Monte Carlo Markov Chains
Submitted: 2017-04-11
In this paper, we present a method for computing the marginal likelihood,
also known as the model likelihood or Bayesian evidence, from Markov Chain
Monte Carlo (MCMC), or other sampled posterior distributions. In order to do
this, one needs to be able to estimate the density of points in parameter
space, and this can be challenging in high numbers of dimensions. Here we
present a Bayesian analysis, where we obtain the posterior for the marginal
likelihood, using $k$th nearest-neighbour distances in parameter space, using
the Mahalanobis distance metric, under the assumption that the points in the
chain (thinned if required) are independent. We generalise the algorithm to
apply to importance-sampled chains, where each point is assigned a weight. We
illustrate this with an idealised posterior of known form with an analytic
marginal likelihood, and show that for chains of length $\sim 10^5$ points, the
technique is effective for parameter spaces with up to $\sim 20$ dimensions. We
also argue that $k=1$ is the optimal choice, and discuss failure modes for the
algorithm. In a companion paper (Heavens et al. 2017) we apply the technique to
the main MCMC chains from the 2015 Planck analysis of cosmic background
radiation data, to infer that quantitatively the simplest 6-parameter flat
$\Lambda$CDM standard model of cosmology is preferred over all extensions
considered.
[6]
oai:arXiv.org:1609.02186 [pdf] - 1477714
Bayes Factors via Savage-Dickey Supermodels
Submitted: 2016-09-07
We outline a new method to compute the Bayes Factor for model selection which
bypasses the Bayesian Evidence. Our method combines multiple models into a
single, nested, Supermodel using one or more hyperparameters. Since the models
are now nested the Bayes Factors between the models can be efficiently computed
using the Savage-Dickey Density Ratio (SDDR). In this way model selection
becomes a problem of parameter estimation. We consider two ways of constructing
the supermodel in detail: one based on combined models, and a second based on
combined likelihoods. We report on these two approaches for a Gaussian linear
model for which the Bayesian evidence can be calculated analytically and a toy
nonlinear problem. Unlike the combined model approach, where a standard Monte
Carlo Markov Chain (MCMC) struggles, the combined-likelihood approach fares
much better in providing a reliable estimate of the log-Bayes Factor. This
scheme potentially opens the way to computationally efficient ways to compute
Bayes Factors in high dimensions that exploit the good scaling properties of
MCMC, as compared to methods such as nested sampling that fail for high
dimensions.