Normalized to: Varughese, M.
[1]
oai:arXiv.org:1504.00015 [pdf] - 1298602
Nonparametric Transient Classification using Adaptive Wavelets
Submitted: 2015-03-31, last modified: 2015-10-23
Classifying transients based on multi band light curves is a challenging but
crucial problem in the era of GAIA and LSST since the sheer volume of
transients will make spectroscopic classification unfeasible. Here we present a
nonparametric classifier that uses the transient's light curve measurements to
predict its class given training data. It implements two novel components: the
first is the use of the BAGIDIS wavelet methodology - a characterization of
functional data using hierarchical wavelet coefficients. The second novelty is
the introduction of a ranked probability classifier on the wavelet coefficients
that handles both the heteroscedasticity of the data in addition to the
potential non-representativity of the training set. The ranked classifier is
simple and quick to implement while a major advantage of the BAGIDIS wavelets
is that they are translation invariant, hence they do not need the light curves
to be aligned to extract features. Further, BAGIDIS is nonparametric so it can
be used for blind searches for new objects. We demonstrate the effectiveness of
our ranked wavelet classifier against the well-tested Supernova Photometric
Classification Challenge dataset in which the challenge is to correctly
classify light curves as Type Ia or non-Ia supernovae. We train our ranked
probability classifier on the spectroscopically-confirmed subsample (which is
not representative) and show that it gives good results for all supernova with
observed light curve timespans greater than 100 days (roughly 55% of the
dataset). For such data, we obtain a Ia efficiency of 80.5% and a purity of
82.4% yielding a highly competitive score of 0.49 whilst implementing a truly
"model-blind" approach to supernova classification. Consequently this approach
may be particularly suitable for the classification of astronomical transients
in the era of large synoptic sky surveys.
[2]
oai:arXiv.org:1303.2061 [pdf] - 886825
Towards the Future of Supernova Cosmology
Submitted: 2013-03-08, last modified: 2014-10-23
For future surveys, spectroscopic follow-up for all supernovae will be
extremely difficult. However, one can use light curve fitters, to obtain the
probability that an object is a Type Ia. One may consider applying a
probability cut to the data, but we show that the resulting non-Ia
contamination can lead to biases in the estimation of cosmological parameters.
A different method, which allows the use of the full dataset and results in
unbiased cosmological parameter estimation, is Bayesian Estimation Applied to
Multiple Species (BEAMS). BEAMS is a Bayesian approach to the problem which
includes the uncertainty in the types in the evaluation of the posterior. Here
we outline the theory of BEAMS and demonstrate its effectiveness using both
simulated datasets and SDSS-II data. We also show that it is possible to use
BEAMS if the data are correlated, by introducing a numerical marginalisation
over the types of the objects. This is largely a pedagogical introduction to
BEAMS with references to the main BEAMS papers.
[3]
oai:arXiv.org:1205.3493 [pdf] - 968308
Extending BEAMS to incorporate correlated systematic uncertainties
Submitted: 2012-05-15, last modified: 2014-10-23
New supernova surveys such as the Dark Energy Survey, Pan-STARRS and the LSST
will produce an unprecedented number of photometric supernova candidates, most
with no spectroscopic data. Avoiding biases in cosmological parameters due to
the resulting inevitable contamination from non-Ia supernovae can be achieved
with the BEAMS formalism, allowing for fully photometric supernova cosmology
studies. Here we extend BEAMS to deal with the case in which the supernovae are
correlated by systematic uncertainties. The analytical form of the full BEAMS
posterior requires evaluating 2^N terms, where N is the number of supernova
candidates. This `exponential catastrophe' is computationally unfeasible even
for N of order 100. We circumvent the exponential catastrophe by marginalising
numerically instead of analytically over the possible supernova types: we
augment the cosmological parameters with nuisance parameters describing the
covariance matrix and the types of all the supernovae, \tau_i, that we include
in our MCMC analysis. We show that this method deals well even with large,
unknown systematic uncertainties without a major increase in computational
time, whereas ignoring the correlations can lead to significant biases and
incorrect credible contours. We then compare the numerical marginalisation
technique with a perturbative expansion of the posterior based on the insight
that future surveys will have exquisite light curves and hence the probability
that a given candidate is a Type Ia will be close to unity or zero, for most
objects. Although this perturbative approach changes computation of the
posterior from a 2^N problem into an N^2 or N^3 one, we show that it leads to
biases in general through a small number of misclassifications, implying that
numerical marginalisation is superior.
[4]
oai:arXiv.org:1210.7762 [pdf] - 1527987
BEAMS: separating the wheat from the chaff in supernova analysis
Submitted: 2012-10-29
We introduce Bayesian Estimation Applied to Multiple Species (BEAMS), an
algorithm designed to deal with parameter estimation when using contaminated
data. We present the algorithm and demonstrate how it works with the help of a
Gaussian simulation. We then apply it to supernova data from the Sloan Digital
Sky Survey (SDSS), showing how the resulting confidence contours of the
cosmological parameters shrink significantly.
[5]
oai:arXiv.org:1111.5328 [pdf] - 1091849
Photometric Supernova Cosmology with BEAMS and SDSS-II
Hlozek, Renée;
Kunz, Martin;
Bassett, Bruce;
Smith, Mat;
Newling, James;
Varughese, Melvin;
Kessler, Rick;
Bernstein, Joe;
Campbell, Heather;
Dilday, Ben;
Falck, Bridget;
Frieman, Joshua;
Kulhmann, Steve;
Lampeitl, Hubert;
Marriner, John;
Nichol, Robert C.;
Riess, Adam G.;
Sako, Masao;
Schneider, Donald P.
Submitted: 2011-11-22
Supernova cosmology without spectroscopic confirmation is an exciting new
frontier which we address here with the Bayesian Estimation Applied to Multiple
Species (BEAMS) algorithm and the full three years of data from the Sloan
Digital Sky Survey II Supernova Survey (SDSS-II SN). BEAMS is a Bayesian
framework for using data from multiple species in statistical inference when
one has the probability that each data point belongs to a given species,
corresponding in this context to different types of supernovae with their
probabilities derived from their multi-band lightcurves. We run the BEAMS
algorithm on both Gaussian and more realistic SNANA simulations with of order
10^4 supernovae, testing the algorithm against various pitfalls one might
expect in the new and somewhat uncharted territory of photometric supernova
cosmology. We compare the performance of BEAMS to that of both mock
spectroscopic surveys and photometric samples which have been cut using typical
selection criteria. The latter typically are either biased due to contamination
or have significantly larger contours in the cosmological parameters due to
small data-sets. We then apply BEAMS to the 792 SDSS-II photometric supernovae
with host spectroscopic redshifts. In this case, BEAMS reduces the area of the
(\Omega_m,\Omega_\Lambda) contours by a factor of three relative to the case
where only spectroscopically confirmed data are used (297 supernovae). In the
case of flatness, the constraints obtained on the matter density applying BEAMS
to the photometric SDSS-II data are \Omega_m(BEAMS)=0.194\pm0.07. This
illustrates the potential power of BEAMS for future large photometric supernova
surveys such as LSST.
[6]
oai:arXiv.org:1110.6178 [pdf] - 1366169
Parameter Estimation with BEAMS in the presence of biases and
correlations
Submitted: 2011-10-27
The original formulation of BEAMS - Bayesian Estimation Applied to Multiple
Species - showed how to use a dataset contaminated by points of multiple
underlying types to perform unbiased parameter estimation. An example is
cosmological parameter estimation from a photometric supernova sample
contaminated by unknown Type Ibc and II supernovae. Where other methods require
data cuts to increase purity, BEAMS uses all of the data points in conjunction
with their probabilities of being each type. Here we extend the BEAMS formalism
to allow for correlations between the data and the type probabilities of the
objects as can occur in realistic cases. We show with simple simulations that
this extension can be crucial, providing a 50% reduction in parameter
estimation variance when such correlations do exist. We then go on to perform
tests to quantify the importance of the type probabilities, one of which
illustrates the effect of biasing the probabilities in various ways. Finally, a
general presentation of the selection bias problem is given, and discussed in
the context of future photometric supernova surveys and BEAMS, which lead to
specific recommendations for future supernova surveys.
[7]
oai:arXiv.org:1008.1024 [pdf] - 368054
Results from the Supernova Photometric Classification Challenge
Kessler, Richard;
Bassett, Bruce;
Belov, Pavel;
Bhatnagar, Vasudha;
Campbell, Heather;
Conley, Alex;
Frieman, Joshua A.;
Glazov, Alexandre;
Gonzalez-Gaitan, Santiago;
Hlozek, Renee;
Jha, Saurabh;
Kuhlmann, Stephen;
Kunz, Martin;
Lampeitl, Hubert;
Mahabal, Ashish;
Newling, James;
Nichol, Robert C.;
Parkinson, David;
Philip, Ninan Sajeeth;
Poznanski, Dovi;
Richards, Joseph W.;
Rodney, Steven A.;
Sako, Masao;
Schneider, Donald P.;
Smith, Mathew;
Stritzinger, Maximilian;
Varughese, Melvin
Submitted: 2010-08-05, last modified: 2010-11-03
We report results from the Supernova Photometric Classification Challenge
(SNPCC), a publicly released mix of simulated supernovae (SNe), with types (Ia,
Ibc, and II) selected in proportion to their expected rate. The simulation was
realized in the griz filters of the Dark Energy Survey (DES) with realistic
observing conditions (sky noise, point-spread function and atmospheric
transparency) based on years of recorded conditions at the DES site.
Simulations of non-Ia type SNe are based on spectroscopically confirmed light
curves that include unpublished non-Ia samples donated from the Carnegie
Supernova Project (CSP), the Supernova Legacy Survey (SNLS), and the Sloan
Digital Sky Survey-II (SDSS-II). A spectroscopically confirmed subset was
provided for training. We challenged scientists to run their classification
algorithms and report a type and photo-z for each SN. Participants from 10
groups contributed 13 entries for the sample that included a host-galaxy
photo-z for each SN, and 9 entries for the sample that had no redshift
information. Several different classification strategies resulted in similar
performance, and for all entries the performance was significantly better for
the training subset than for the unconfirmed sample. For the spectroscopically
unconfirmed subset, the entry with the highest average figure of merit for
classifying SNe~Ia has an efficiency of 0.96 and an SN~Ia purity of 0.79. As a
public resource for the future development of photometric SN classification and
photo-z estimators, we have released updated simulations with improvements
based on our experience from the SNPCC, added samples corresponding to the
Large Synoptic Survey Telescope (LSST) and the SDSS, and provided the answer
keys so that developers can evaluate their own analysis.
[8]
oai:arXiv.org:1010.1005 [pdf] - 955471
Statistical Classification Techniques for Photometric Supernova Typing
Newling, James;
Varughese, Melvin;
Bassett, Bruce A.;
Campbell, Heather;
Hlozek, Renée;
Kunz, Martin;
Lampeitl, Hubert;
Martin, Bryony;
Nichol, Robert;
Parkinson, David;
Smith, Mathew
Submitted: 2010-10-05, last modified: 2010-10-08
Future photometric supernova surveys will produce vastly more candidates than
can be followed up spectroscopically, highlighting the need for effective
classification methods based on lightcurves alone. Here we introduce boosting
and kernel density estimation techniques which have minimal astrophysical
input, and compare their performance on 20,000 simulated Dark Energy Survey
lightcurves. We demonstrate that these methods are comparable to the best
template fitting methods currently used, and in particular do not require the
redshift of the host galaxy or candidate. However both methods require a
training sample that is representative of the full population, so typical
spectroscopic supernova subsamples will lead to poor performance. To enable the
full potential of such blind methods, we recommend that representative training
samples should be used and so specific attention should be given to their
creation in the design phase of future photometric surveys.