Full-text search for arXiv

O'Beirne, Molly D.

Normalized to: O'Beirne, M.

3 article(s) in total. 13 co-authors, from 1 to 3 common article(s). Median position in authors list is 5.0.

[1] oai:arXiv.org:2003.02430 [pdf] - 2061799

Accurate Machine Learning Atmospheric Retrieval via a Neural Network Surrogate Model for Radiative Transfer

Himes, Michael D.; Harrington, Joseph; Cobb, Adam D.; Baydin, Atilim Gunes; Soboczenski, Frank; O'Beirne, Molly D.; Zorzan, Simone; Wright, David C.; Scheffer, Zacchaeus; Domagal-Goldman, Shawn D.; Arney, Giada N.

Comments: 8 pages, 3 figures, submitted to PSJ 3/4/2020. Only added comment, no changes to text

Submitted: 2020-03-04, last modified: 2020-03-09

Atmospheric retrieval determines the properties of an atmosphere based on its measured spectrum. The low signal-to-noise ratio of exoplanet observations requires a Bayesian approach to determine posterior probability distributions of each model parameter, given observed spectra. This inference is computationally expensive, as it requires many executions of a costly radiative transfer (RT) simulation for each set of sampled model parameters. Machine learning (ML) has recently been shown to provide a significant reduction in runtime for retrievals, mainly by training inverse ML models that predict parameter distributions, given observed spectra, albeit with reduced posterior accuracy. Here we present a novel approach to retrieval by training a forward ML surrogate model that predicts spectra given model parameters, providing a fast approximate RT simulation that can be used in a conventional Bayesian retrieval framework without significant loss of accuracy. We demonstrate our method on the emission spectrum of HD 189733 b and find Bhattacharyya coefficients of 97.74–99.74% between our 1D marginalized posterior distributions and those of the Bayesian Atmospheric Radiative Transfer (BART) code. Our retrieval method is ~20x faster than BART when run on an Intel i7-4770 central processing unit (CPU). Neural-network computation using an NVIDIA Titan Xp graphics processing unit is ~600x faster than BART on that CPU.
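
Illustrative note (not part of the abstract): the Bhattacharyya coefficient quoted above measures the overlap between two distributions, with 1 (100%) for identical distributions and 0 for no overlap. The sketch below is a minimal pure-Python version for two sets of 1D posterior samples binned on a shared grid. The function names, bin count, and synthetic Gaussian samples are assumptions made for illustration; the paper's own binning and implementation are not described in this listing.

```python
import math
import random


def bin_probabilities(samples, lo, hi, nbins):
    """Histogram samples on [lo, hi] and normalize the counts to sum to 1."""
    counts = [0] * nbins
    width = (hi - lo) / nbins
    for x in samples:
        if lo <= x <= hi:
            # Clamp so that x == hi falls in the last bin.
            idx = min(int((x - lo) / width), nbins - 1)
            counts[idx] += 1
    total = sum(counts)
    if total == 0:
        raise ValueError("no samples fall inside [lo, hi]")
    return [c / total for c in counts]


def bhattacharyya_coefficient(p, q):
    """Discrete Bhattacharyya coefficient: sum over bins of sqrt(p_i * q_i)."""
    if len(p) != len(q):
        raise ValueError("p and q must use the same bins")
    return sum(math.sqrt(pi * qi) for pi, qi in zip(p, q))


# Hypothetical example: two sets of posterior samples for one parameter,
# drawn here from slightly different Gaussians purely for demonstration.
rng = random.Random(0)
reference = [rng.gauss(0.0, 1.0) for _ in range(20000)]
surrogate = [rng.gauss(0.1, 1.05) for _ in range(20000)]

p = bin_probabilities(reference, -5.0, 5.0, 50)
q = bin_probabilities(surrogate, -5.0, 5.0, 50)
print(f"Bhattacharyya coefficient: {bhattacharyya_coefficient(p, q):.4f}")
```

Both sample sets must be binned on the same edges, since the coefficient compares probabilities bin by bin. The value also depends somewhat on the number of bins and the number of samples.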

[2] oai:arXiv.org:1905.10659 [pdf] - 1912877

An Ensemble of Bayesian Neural Networks for Exoplanetary Atmospheric Retrieval

Cobb, Adam D.; Himes, Michael D.; Soboczenski, Frank; Zorzan, Simone; O'Beirne, Molly D.; Baydin, Atılım Güneş; Gal, Yarin; Domagal-Goldman, Shawn D.; Arney, Giada N.; Angerhausen, Daniel

Submitted: 2019-05-25

Machine learning is now used in many areas of astrophysics, from detecting exoplanets in Kepler transit signals to removing telescope systematics. Recent work demonstrated the potential of using machine learning algorithms for atmospheric retrieval by implementing a random forest to perform retrievals in seconds that are consistent with the traditional, computationally expensive nested-sampling retrieval method. We expand upon their approach by presenting a new machine learning model, plan-net, based on an ensemble of Bayesian neural networks that yields more accurate inferences than the random forest for the same data set of synthetic transmission spectra. We demonstrate that an ensemble provides greater accuracy and more robust uncertainties than a single model. In addition to being the first to use Bayesian neural networks for atmospheric retrieval, we also introduce a new loss function for Bayesian neural networks that learns correlations between the model outputs. Importantly, we show that designing machine learning models to explicitly incorporate domain-specific knowledge both improves performance and provides additional insight by inferring the covariance of the retrieved atmospheric parameters. We apply plan-net to the Hubble Space Telescope Wide Field Camera 3 transmission spectrum for WASP-12b and retrieve an isothermal temperature and water abundance consistent with the literature. We highlight that our method is flexible and can be expanded to higher-resolution spectra and a larger number of atmospheric parameters.
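
Illustrative note (not part of the abstract): one common way to turn an ensemble of probabilistic models into a single prediction is to treat the members as an equally weighted mixture and combine their per-member means and variances. The sketch below shows that generic rule for a single output parameter. It is an assumption made for illustration, not plan-net's documented procedure, and the function name and numbers are hypothetical.

```python
def combine_ensemble(means, variances):
    """Mean and variance of an equally weighted mixture of ensemble members.

    means[i] and variances[i] are the predictive mean and variance that
    ensemble member i reports for one output parameter.
    """
    if not means or len(means) != len(variances):
        raise ValueError("need one mean and one variance per ensemble member")
    n = len(means)
    mixture_mean = sum(means) / n
    # E[x^2] of the mixture is the average of each member's (variance + mean^2).
    second_moment = sum(v + m * m for m, v in zip(means, variances)) / n
    mixture_variance = second_moment - mixture_mean ** 2
    return mixture_mean, mixture_variance


# Hypothetical per-member predictions for one retrieved parameter.
member_means = [1010.0, 980.0, 1025.0]
member_variances = [400.0, 625.0, 484.0]

mean, variance = combine_ensemble(member_means, member_variances)
print(f"ensemble mean = {mean:.1f}, ensemble std = {variance ** 0.5:.1f}")
```

The combined variance includes both the average member variance and the spread of the member means, so disagreement between members widens the reported uncertainty.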

[3] oai:arXiv.org:1811.03390 [pdf] - 1791343

Bayesian Deep Learning for Exoplanet Atmospheric Retrieval

Soboczenski, Frank; Himes, Michael D.; O'Beirne, Molly D.; Zorzan, Simone; Baydin, Atilim Gunes; Cobb, Adam D.; Gal, Yarin; Angerhausen, Daniel; Mascaro, Massimo; Arney, Giada N.; Domagal-Goldman, Shawn D.

Comments: Third workshop on Bayesian Deep Learning (NeurIPS 2018), Montreal, Canada

Submitted: 2018-11-08, last modified: 2018-12-02

Over the past decade, the study of extrasolar planets has evolved rapidly from plain detection and identification to comprehensive categorization and characterization of exoplanet systems and their atmospheres. Atmospheric retrieval, the inverse modeling technique used to determine an exoplanetary atmosphere's temperature structure and composition from an observed spectrum, is both time-consuming and compute-intensive, requiring complex algorithms that compare thousands to millions of atmospheric models to the observational data to find the most probable values and associated uncertainties for each model parameter. For rocky, terrestrial planets, the retrieved atmospheric composition can give insight into the surface fluxes of gaseous species necessary to maintain the stability of that atmosphere, which may in turn provide insight into the geological and/or biological processes active on the planet. These atmospheres contain many molecules, some of them biosignatures, spectral fingerprints indicative of biological activity, which will become observable with the next generation of telescopes. Runtimes of traditional retrieval models scale with the number of model parameters, so as more molecular species are considered, runtimes can become prohibitively long. Recent advances in machine learning (ML) and computer vision offer new ways to reduce the time to perform a retrieval by orders of magnitude, given a sufficient data set to train with. Here we present an ML-based retrieval framework called Intelligent exoplaNet Atmospheric RetrievAl (INARA) that consists of a Bayesian deep learning model for retrieval and a data set of 3,000,000 synthetic rocky exoplanetary spectra generated using the NASA Planetary Spectrum Generator. Our work represents the first ML retrieval model for rocky, terrestrial exoplanets and the first synthetic data set of terrestrial spectra generated at this scale.