Normalized to: Contardo, G.
[1]
oai:arXiv.org:2007.04459 [pdf] - 2131626
Meta-Learning One-Class Classification with DeepSets: Application in the
Milky Way
Submitted: 2020-07-08
We explore in this paper the use of neural networks designed for point-clouds
and sets on a new meta-learning task. We present experiments on the
astronomical challenge of characterizing the stellar population of stellar
streams. Stellar streams are elongated structures of stars in the outskirts of
the Milky Way that form when a (small) galaxy breaks up under the Milky Way's
gravitational force. We consider that we obtain, for each stream, a small
'support set' of stars that belong to this stream. We aim to predict whether the
other stars in that region of the sky are from that stream or not, similar to
one-class classification. Each "stream task" could also be transformed into a
binary classification problem in a highly imbalanced regime (or supervised
anomaly detection) by using the much bigger set of "other" stars and
considering them as noisy negative examples. We propose to study the problem in
the meta-learning regime: we expect that we can learn general information on
characterizing a stream's stellar population by meta-learning across several
streams in a fully supervised regime, and transfer it to new streams using only
positive supervision. We present a novel use of Deep Sets, a model developed
for point clouds and sets, trained in a meta-learning fully supervised regime,
and evaluated in a one-class classification setting. We compare it against
Random Forests (with and without self-labeling) in the classic setting of
binary classification, retrained for each task. We show that our method
outperforms the Random Forests even though the Deep Sets model is not retrained on
the new tasks, and accesses only a small part of the data compared to the
Random Forest. We also show that the model performs well on a real-life stream
when including additional fine-tuning.
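
As a rough illustration of the set-based scoring described in this abstract, the following is a minimal sketch of a Deep Sets style scorer: each star in the support set is encoded independently, the encodings are pooled with a permutation-invariant mean, and a candidate star is scored against that summary. The function names, layer sizes, random (untrained) weights and the choice of mean pooling are all assumptions made for illustration, not details taken from the paper.

```python
import math
import random

def phi(star, w_phi):
    """Per-element encoder: one tanh layer applied to a single star's features."""
    return [math.tanh(sum(w * x for w, x in zip(row, star))) for row in w_phi]

def encode_set(stars, w_phi):
    """Permutation-invariant summary: mean of phi over the support set."""
    encoded = [phi(s, w_phi) for s in stars]
    n = len(encoded)
    return [sum(col) / n for col in zip(*encoded)]

def score(candidate, support, w_phi, w_rho):
    """rho: logistic score from the set summary joined with the candidate's encoding."""
    z = encode_set(support, w_phi) + phi(candidate, w_phi)
    logit = sum(w * v for w, v in zip(w_rho, z))
    return 1.0 / (1.0 + math.exp(-logit))

# Hypothetical sizes and untrained random weights, for illustration only.
rng = random.Random(0)
n_features, n_hidden = 3, 4
w_phi = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_hidden)]
w_rho = [rng.gauss(0, 1) for _ in range(2 * n_hidden)]
support = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(5)]
candidate = [rng.gauss(0, 1) for _ in range(n_features)]

p = score(candidate, support, w_phi, w_rho)
# Reordering the support set leaves the score unchanged (up to rounding).
p_shuffled = score(candidate, support[::-1], w_phi, w_rho)
```

Because the support set enters only through the pooled summary, the same weights can in principle be applied to a new stream's support set without retraining, which is the property the abstract relies on.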
[2]
oai:arXiv.org:2002.09491 [pdf] - 2128202
Gravitational wave population inference with deep flow-based generative
network
Submitted: 2020-02-21, last modified: 2020-07-04
We combine hierarchical Bayesian modeling with a flow-based deep generative
network, in order to demonstrate that one can efficiently constrain numerical
gravitational wave (GW) population models at a previously intractable
complexity. Existing techniques for comparing data to simulation, such as
discrete model selection and Gaussian process regression, can only be applied
efficiently to moderate-dimension data. This limits the number of observables
(e.g. chirp mass, spins) and hyper-parameters (e.g. common envelope
efficiency) one can use in a population inference. In this study, we train a
network to emulate a phenomenological model with 6 observables and 4
hyper-parameters, use it to infer the properties of a simulated catalogue and
compare the results to the phenomenological model. We find that a 10-layer
network can emulate the phenomenological model accurately and efficiently. Our
machine enables simulation-based GW population inferences to take on data at a
new complexity level.
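
The idea of using a flow as a density emulator conditioned on hyper-parameters can be sketched with a deliberately tiny example: a one-layer affine flow in one dimension, whose shift and scale depend on a single hyper-parameter. The conditioning functions, the grid search and the synthetic catalogue are hypothetical, and the sketch ignores the measurement uncertainties and selection effects that a full hierarchical analysis would include.

```python
import math
import random

def log_prob(x, lam):
    """Log-density of x under a one-layer conditional affine flow.

    The flow maps x to z = (x - shift(lam)) / scale(lam) with z ~ N(0, 1).
    Change of variables: log p(x | lam) = log N(z; 0, 1) - log(scale).
    """
    shift = 2.0 * lam           # hypothetical conditioning functions
    scale = 1.0 + 0.5 * lam
    z = (x - shift) / scale
    return -0.5 * z * z - 0.5 * math.log(2.0 * math.pi) - math.log(scale)

def catalogue_log_likelihood(catalogue, lam):
    """Sum of per-event log-densities for one value of the hyper-parameter."""
    return sum(log_prob(x, lam) for x in catalogue)

# Synthetic catalogue drawn from the same toy model at a known hyper-parameter.
rng = random.Random(1)
true_lam = 1.5
catalogue = [2.0 * true_lam + (1.0 + 0.5 * true_lam) * rng.gauss(0, 1)
             for _ in range(2000)]

# Scan a grid of hyper-parameter values and keep the most likely one.
grid = [0.1 * k for k in range(1, 31)]
best_lam = max(grid, key=lambda lam: catalogue_log_likelihood(catalogue, lam))
```

A real flow stacks many such invertible layers with learned conditioning networks; the likelihood evaluation keeps the same form, a base log-density plus a log-Jacobian term.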
[3]
oai:arXiv.org:2007.01868 [pdf] - 2128458
Dalek -- a deep-learning emulator for TARDIS
Submitted: 2020-07-03
Supernova spectral time series contain a wealth of information about the
progenitor and explosion process of these energetic events. The modeling of
these data requires the exploration of very high dimensional posterior
probabilities with expensive radiative transfer codes. Even modest
parametrizations of supernovae contain more than ten parameters and a detailed
exploration demands at least several million function evaluations. Physically
realistic models require at least tens of CPU minutes per evaluation, putting a
detailed reconstruction of the explosion out of reach of traditional
methodology. The advent of widely available libraries for the training of
neural networks combined with their ability to approximate almost arbitrary
functions with high precision allows for a new approach to this problem.
Instead of evaluating the radiative transfer model itself, one can build a
neural network proxy trained on the simulations but evaluating orders of
magnitude faster. Such a framework is called an emulator or surrogate model. In
this work, we present an emulator for the TARDIS supernova radiative transfer
code applied to Type Ia supernova spectra. We show that we can train an
emulator for this problem given a modest training set of a hundred thousand
spectra (easily calculable on modern supercomputers). The results show an
accuracy at the percent level (the errors are dominated by the Monte Carlo
nature of TARDIS and not by the emulator) with a speedup of several orders of magnitude.
This method has a much broader set of applications and is not limited to the
presented problem.
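
The emulator workflow described in this abstract (run the expensive code once on a training set, then answer new queries from a cheap proxy) can be sketched as follows. Note the substitution: linear interpolation over a precomputed grid stands in for the neural network, and `slow_simulator` is a hypothetical one-parameter toy function, not TARDIS.

```python
import bisect
import math

def slow_simulator(theta):
    """Stand-in for an expensive radiative transfer run (hypothetical toy function)."""
    return math.exp(-0.5 * theta) * (1.0 + 0.3 * math.sin(4.0 * theta))

class GridEmulator:
    """Cheap proxy: evaluate the simulator once on a grid, then interpolate."""

    def __init__(self, simulator, lo, hi, n_train):
        step = (hi - lo) / (n_train - 1)
        self.thetas = [lo + i * step for i in range(n_train)]
        # The expensive part, done once up front.
        self.outputs = [simulator(t) for t in self.thetas]

    def __call__(self, theta):
        # Find the two training points that bracket theta and interpolate linearly.
        i = bisect.bisect_right(self.thetas, theta)
        i = min(max(i, 1), len(self.thetas) - 1)
        t0, t1 = self.thetas[i - 1], self.thetas[i]
        y0, y1 = self.outputs[i - 1], self.outputs[i]
        return y0 + (y1 - y0) * (theta - t0) / (t1 - t0)

emulator = GridEmulator(slow_simulator, 0.0, 2.0, n_train=201)

# Validate on points that are not on the training grid.
test_points = [0.005 + 0.0137 * k for k in range(140)]
max_rel_error = max(abs(emulator(t) - slow_simulator(t)) / slow_simulator(t)
                    for t in test_points)
```

Grid interpolation scales poorly beyond a few parameters, which is one reason a neural network is attractive for the ten-plus parameter problem the abstract describes.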
[4]
oai:arXiv.org:1910.07813 [pdf] - 1981849
From Dark Matter to Galaxies with Convolutional Neural Networks
Yip, Jacky H. T.;
Zhang, Xinyue;
Wang, Yanfang;
Zhang, Wei;
Sun, Yueqiu;
Contardo, Gabriella;
Villaescusa-Navarro, Francisco;
He, Siyu;
Genel, Shy;
Ho, Shirley
Submitted: 2019-10-17
Cosmological simulations play an important role in the interpretation of
astronomical data, in particular in comparing observed data to our theoretical
expectations. However, to compare data with these simulations, the simulations
in principle need to include gravity, magneto-hydrodynamics, radiative
transfer, etc. These ideal large-volume simulations
(gravo-magneto-hydrodynamical) are extremely computationally expensive and
can cost tens of millions of CPU hours to run. In this paper, we propose a deep
learning approach to map from the dark-matter-only simulation (computationally
cheaper) to the galaxy distribution (from the much costlier cosmological
simulation). The main challenge of this task is the high sparsity in the target
galaxy distribution: space is mainly empty. We propose a cascade architecture
composed of a classification filter followed by a regression procedure. We show
that our result outperforms a state-of-the-art model used in the astronomical
community, and provides a good trade-off between computational cost and
prediction accuracy.
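
The cascade idea (a classification filter that removes empty regions, followed by a regression on what remains) can be illustrated with a toy one-dimensional example. This is only a sketch under strong simplifications: a density threshold and a straight-line fit stand in for the convolutional networks, and the synthetic voxel data and all names are hypothetical.

```python
import random

def fit_threshold(densities, occupied):
    """Stage 1 (classification filter): density cut with the best training accuracy."""
    best_cut, best_acc = None, -1.0
    for cut in sorted(set(densities)):
        acc = sum((d >= cut) == o for d, o in zip(densities, occupied)) / len(densities)
        if acc > best_acc:
            best_cut, best_acc = cut, acc
    return best_cut

def fit_line(xs, ys):
    """Stage 2 (regression): ordinary least squares fit of y = a * x + b."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
    return a, my - a * mx

def predict(density, cut, a, b):
    """Cascade: voxels rejected by the filter get exactly zero galaxies."""
    return max(a * density + b, 0.0) if density >= cut else 0.0

# Toy voxels: most have low dark matter density and contain no galaxies.
rng = random.Random(2)
densities = [rng.expovariate(1.0) for _ in range(1000)]
counts = [3.0 * (d - 2.0) + 1.0 + rng.gauss(0, 0.1) if d > 2.0 else 0.0
          for d in densities]
occupied = [c > 0 for c in counts]

cut = fit_threshold(densities, occupied)
# The regression is trained only on voxels that actually host galaxies.
a, b = fit_line([d for d, o in zip(densities, occupied) if o],
                [c for c, o in zip(counts, occupied) if o])
```

Training the regression only on occupied voxels keeps the many empty voxels from dominating the fit, which is the motivation the abstract gives for splitting the task in two.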
[5]
oai:arXiv.org:1909.05273 [pdf] - 1960408
The Quijote simulations
Villaescusa-Navarro, Francisco;
Hahn, ChangHoon;
Massara, Elena;
Banerjee, Arka;
Delgado, Ana Maria;
Ramanah, Doogesh Kodi;
Charnock, Tom;
Giusarma, Elena;
Li, Yin;
Allys, Erwan;
Brochard, Antoine;
Chiang, Chi-Ting;
He, Siyu;
Pisani, Alice;
Obuljen, Andrej;
Feng, Yu;
Castorina, Emanuele;
Contardo, Gabriella;
Kreisch, Christina D.;
Nicola, Andrina;
Scoccimarro, Roman;
Verde, Licia;
Viel, Matteo;
Ho, Shirley;
Mallat, Stephane;
Wandelt, Benjamin;
Spergel, David N.
Submitted: 2019-09-11
The Quijote simulations are a set of 43100 full N-body simulations spanning
more than 7000 cosmological models in the $\{\Omega_{\rm m}, \Omega_{\rm b}, h,
n_s, \sigma_8, M_\nu, w \}$ hyperplane. At a single redshift the simulations
contain more than 8.5 trillion particles over a combined volume of 43100
$(h^{-1}{\rm Gpc})^3$. Billions of dark matter halos and cosmic voids have been
identified in the simulations, whose runs required more than 35 million core
hours. The Quijote simulations have been designed for two main purposes: 1) to
quantify the information content on cosmological observables, and 2) to provide
enough data to train machine learning algorithms. In this paper we describe the
simulations and show a few of their applications. We also release the Petabyte
of data generated, comprising hundreds of thousands of simulation snapshots at
multiple redshifts, halo and void catalogs, together with millions of summary
statistics such as power spectra, bispectra, correlation functions, marked
power spectra, and estimated probability density functions.
[6]
oai:arXiv.org:1902.05965 [pdf] - 1859078
From Dark Matter to Galaxies with Convolutional Networks
Submitted: 2019-02-15, last modified: 2019-03-31
Cosmological surveys aim at answering fundamental questions about our
Universe, including the nature of dark matter or the reason for the unexpected
accelerated expansion of the Universe. In order to answer these questions, two
important ingredients are needed: 1) data from observations and 2) a
theoretical model that allows fast comparison between observation and theory.
Most of the cosmological surveys observe galaxies, which are very difficult to
model theoretically due to the complicated physics involved in their formation
and evolution; modeling realistic galaxies over cosmological volumes requires
running computationally expensive hydrodynamic simulations that can cost
millions of CPU hours. In this paper, we propose to use deep learning to
establish a mapping between the 3D galaxy distribution in hydrodynamic
simulations and its underlying dark matter distribution. One of the major
challenges in this pursuit is the very high sparsity in the predicted galaxy
distribution. To this end, we develop a two-phase convolutional neural network
architecture to generate fast galaxy catalogues, and compare our results
against a standard cosmological technique. We find that our proposed approach
either outperforms or is competitive with traditional cosmological techniques.
Compared to the common methods used in cosmology, our approach also provides a
nice trade-off between time consumption (comparable to the fastest benchmark in the
literature) and the quality and accuracy of the predicted simulation. In
combination with current and upcoming data from cosmological observations, our
method has the potential to answer fundamental questions about our Universe
with the highest accuracy.
[7]
oai:arXiv.org:astro-ph/0506415 [pdf] - 73815
Constraints on the Progenitor Systems of Type Ia Supernovae
Submitted: 2005-06-17, last modified: 2005-12-05
UVOIR bolometric light curves provide valuable insight into the nature of
type Ia supernovae. We present an analysis of sixteen well-observed SNe Ia.
Constraints are placed on several global parameters concerning the progenitor
system, explosion mechanism and subsequent radiation transport. By fitting a
radioactive decay energy deposition function to the quasi-exponential phase (50
to 100 days after maximum light), it is found that the ejected mass varies by
at least a factor of two. This result suggests that a sub-Chandrasekhar mass
model could be responsible for the progenitor system of some type Ia
supernovae. We find that the range in the amount of synthesized $^{56}$Ni
indicates a significant variation in the burning mechanism. In order to explain
a factor of ten range in the observed bolometric luminosity more detailed
modeling of the explosion mechanism is required.
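
To make the notion of a radioactive decay energy deposition function concrete, here is a minimal sketch, not the fitting function used in the paper. It assumes the commonly quoted two-exponential form for the energy released by the 56Ni to 56Co to 56Fe chain (e-folding times of 8.8 and 111.3 days, coefficients of 6.45 and 1.45 times 1e43 erg/s per solar mass of 56Ni) and a simple escape factor 1 - exp(-(t0/t)^2) applied to the whole output. The fiducial time `t0_days`, the nickel mass and the sampled epochs are hypothetical, and times are measured from explosion rather than from maximum light.

```python
import math

TAU_NI = 8.8     # 56Ni e-folding time in days (assumed, commonly quoted value)
TAU_CO = 111.3   # 56Co e-folding time in days (assumed, commonly quoted value)

def decay_luminosity(t_days, m_ni_msun):
    """Energy release rate in erg/s from the 56Ni -> 56Co -> 56Fe chain."""
    return m_ni_msun * 1e43 * (6.45 * math.exp(-t_days / TAU_NI)
                               + 1.45 * math.exp(-t_days / TAU_CO))

def deposited_luminosity(t_days, m_ni_msun, t0_days):
    """Deposited energy rate, using the escape factor 1 - exp(-(t0/t)^2).

    Simplification: the factor is applied to the total output, although
    positron kinetic energy would normally be deposited locally.
    """
    trapped = 1.0 - math.exp(-(t0_days / t_days) ** 2)
    return trapped * decay_luminosity(t_days, m_ni_msun)

# Hypothetical nickel mass (0.6 Msun) and fiducial time (35 days).
for t in (50, 75, 100):
    frac = deposited_luminosity(t, 0.6, 35.0) / decay_luminosity(t, 0.6)
    print(f"day {t}: deposited fraction = {frac:.2f}")
```

In this parametrization a larger `t0_days` keeps the ejecta opaque to gamma rays for longer, so the late-time slope of the light curve carries information about the ejecta, which is the kind of constraint the abstract describes.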
[8]
oai:arXiv.org:astro-ph/0005507 [pdf] - 1468120
Epochs of Maximum Light and Bolometric Light Curves of Type Ia
Supernovae
Submitted: 2000-05-25
We present empirical fits to the UBVRI light curves of type Ia supernovae.
These fits are used to objectively evaluate light curve parameters. We find
that the relative times of maximum light in the filter passbands are very
similar for most objects. Surprisingly the maximum at longer wavelengths is
reached earlier than in the B and V light curves. This clearly demonstrates the
complicated nature of the supernova emission. Bolometric light curves for a
small sample of well-observed SNe Ia are constructed by integration over the
optical filters. In most objects a plateau or inflection is observed in the
light curve about 20-40 days after bolometric maximum. The strength of this
plateau varies considerably among the individual objects in the sample.
Furthermore the rise times show a range of several days for the few objects
which have observations early enough for such an analysis. On the other hand,
the decline rate between 50 and 80 days past maximum is remarkably similar for
all objects, with the notable exception of SN 1991bg. The similar late decline
rates for the supernovae indicate that the energy release at late times is very
uniform; the differences at early times are likely due to the radiation
diffusing out of the ejecta. With the exception of SN 1991bg, the range of
absolute bolometric luminosities of SNe Ia is found to be at least a factor of
2.5. The nickel masses derived from this estimate range from 0.4 to 1.1 Msun.
It seems impossible to explain such a mass range by a single explosion
mechanism, especially since the rate of gamma-ray escape at late phases seems
to be very uniform.
[9]
oai:arXiv.org:astro-ph/9812042 [pdf] - 104149
The High-Redshift Supernova Search -- Evidence for a Positive
Cosmological Constant
Submitted: 1998-12-02
A new component of the Universe which leads to an accelerated cosmic
expansion is found from the measurements of distances to high-redshift type Ia
supernovae. We describe the method and the results obtained from the
observations of distant supernovae. The dependence on the understanding of the
local type Ia supernovae is stressed. The lack of a good understanding of the
stellar evolution leading to the explosion of the white dwarf, the exact
explosion physics and the current difficulties in calculating the emission from
the ejecta limit the theoretical support. Despite the current ignorance of some
of the basic physics of the explosions, the cosmological result is robust. The
empirical relations seem to hold for the distant supernovae the same way as for
the local ones and the spectral appearance is identical. The distances to the
high-redshift supernovae are larger than expected in a freely coasting, i.e.
empty, Universe. A positive cosmological constant is inferred from these
measurements.
[10]
oai:arXiv.org:astro-ph/9801278 [pdf] - 100127
Photometric Evolution of Galaxies in Cosmological Scenarios
Submitted: 1998-01-28
The photometric evolution of galaxies in a hierarchically clustering universe
is investigated. The study is based on high resolution numerical simulations
which include the effects of gas dynamics, shock heating, radiative cooling and
a heuristic star formation scheme. The outcome of the simulations is convolved
with photometric models, which enables us to predict the appearance of galaxies
in the broad band colors U, B, V, R, I and K. We demonstrate the effect of the
mutual interplay of the hierarchical build-up of galaxies, photometric
evolution, k-correction, and intervening absorption on the appearance of
forming disk galaxies at redshift one to three. We also discuss to what extent
the numerical resolution of current computer simulations is sufficient to make
quantitative predictions on surface density profiles and color gradients.