Normalized to: O'Beirne, M.
[1]
oai:arXiv.org:2003.02430 [pdf] - 2061799
Accurate Machine Learning Atmospheric Retrieval via a Neural Network
Surrogate Model for Radiative Transfer
Himes, Michael D.;
Harrington, Joseph;
Cobb, Adam D.;
Baydin, Atilim Gunes;
Soboczenski, Frank;
O'Beirne, Molly D.;
Zorzan, Simone;
Wright, David C.;
Scheffer, Zacchaeus;
Domagal-Goldman, Shawn D.;
Arney, Giada N.
Submitted: 2020-03-04, last modified: 2020-03-09
Atmospheric retrieval determines the properties of an atmosphere based on its
measured spectrum. The low signal-to-noise ratio of exoplanet observations
require a Bayesian approach to determine posterior probability distributions of
each model parameter, given observed spectra. This inference is computationally
expensive, as it requires many executions of a costly radiative transfer (RT)
simulation for each set of sampled model parameters. Machine learning (ML) has
recently been shown to provide a significant reduction in runtime for
retrievals, mainly by training inverse ML models that predict parameter
distributions, given observed spectra, albeit with reduced posterior accuracy.
Here we present a novel approach to retrieval by training a forward ML
surrogate model that predicts spectra given model parameters, providing a fast
approximate RT simulation that can be used in a conventional Bayesian retrieval
framework without significant loss of accuracy. We demonstrate our method on
the emission spectrum of HD 189733 b and find Bhattacharyya coefficients of
97.74 -- 99.74% between our 1D marginalized posterior distributions and those
of the Bayesian Atmospheric Radiative Transfer (BART) code. Our retrieval
method is ~20x faster than BART when run on an Intel i7-4770 central processing
unit (CPU). Neural-network computation using an NVIDIA Titan Xp graphics
processing unit is ~600x faster than BART on that CPU.
[2]
oai:arXiv.org:1905.10659 [pdf] - 1912877
An Ensemble of Bayesian Neural Networks for Exoplanetary Atmospheric
Retrieval
Cobb, Adam D.;
Himes, Michael D.;
Soboczenski, Frank;
Zorzan, Simone;
O'Beirne, Molly D.;
Baydin, Atılım Güneş;
Gal, Yarin;
Domagal-Goldman, Shawn D.;
Arney, Giada N.;
Angerhausen, Daniel
Submitted: 2019-05-25
Machine learning is now used in many areas of astrophysics, from detecting
exoplanets in Kepler transit signals to removing telescope systematics. Recent
work demonstrated the potential of using machine learning algorithms for
atmospheric retrieval by implementing a random forest to perform retrievals in
seconds that are consistent with the traditional, computationally-expensive
nested-sampling retrieval method. We expand upon their approach by presenting a
new machine learning model, \texttt{plan-net}, based on an ensemble of Bayesian
neural networks that yields more accurate inferences than the random forest for
the same data set of synthetic transmission spectra. We demonstrate that an
ensemble provides greater accuracy and more robust uncertainties than a single
model. In addition to being the first to use Bayesian neural networks for
atmospheric retrieval, we also introduce a new loss function for Bayesian
neural networks that learns correlations between the model outputs.
Importantly, we show that designing machine learning models to explicitly
incorporate domain-specific knowledge both improves performance and provides
additional insight by inferring the covariance of the retrieved atmospheric
parameters. We apply \texttt{plan-net} to the Hubble Space Telescope Wide Field
Camera 3 transmission spectrum for WASP-12b and retrieve an isothermal
temperature and water abundance consistent with the literature. We highlight
that our method is flexible and can be expanded to higher-resolution spectra
and a larger number of atmospheric parameters.
[3]
oai:arXiv.org:1811.03390 [pdf] - 1791343
Bayesian Deep Learning for Exoplanet Atmospheric Retrieval
Soboczenski, Frank;
Himes, Michael D.;
O'Beirne, Molly D.;
Zorzan, Simone;
Baydin, Atilim Gunes;
Cobb, Adam D.;
Gal, Yarin;
Angerhausen, Daniel;
Mascaro, Massimo;
Arney, Giada N.;
Domagal-Goldman, Shawn D.
Submitted: 2018-11-08, last modified: 2018-12-02
Over the past decade, the study of extrasolar planets has evolved rapidly
from plain detection and identification to comprehensive categorization and
characterization of exoplanet systems and their atmospheres. Atmospheric
retrieval, the inverse modeling technique used to determine an exoplanetary
atmosphere's temperature structure and composition from an observed spectrum,
is both time-consuming and compute-intensive, requiring complex algorithms that
compare thousands to millions of atmospheric models to the observational data
to find the most probable values and associated uncertainties for each model
parameter. For rocky, terrestrial planets, the retrieved atmospheric
composition can give insight into the surface fluxes of gaseous species
necessary to maintain the stability of that atmosphere, which may in turn
provide insight into the geological and/or biological processes active on the
planet. These atmospheres contain many molecules, some of them biosignatures,
spectral fingerprints indicative of biological activity, which will become
observable with the next generation of telescopes. Runtimes of traditional
retrieval models scale with the number of model parameters, so as more
molecular species are considered, runtimes can become prohibitively long.
Recent advances in machine learning (ML) and computer vision offer new ways to
reduce the time to perform a retrieval by orders of magnitude, given a
sufficient data set to train with. Here we present an ML-based retrieval
framework called Intelligent exoplaNet Atmospheric RetrievAl (INARA) that
consists of a Bayesian deep learning model for retrieval and a data set of
3,000,000 synthetic rocky exoplanetary spectra generated using the NASA
Planetary Spectrum Generator. Our work represents the first ML retrieval model
for rocky, terrestrial exoplanets and the first synthetic data set of
terrestrial spectra generated at this scale.