Normalized to: McAuliffe, J.
[1]
oai:arXiv.org:1803.00113 [pdf] - 1865287
Approximate Inference for Constructing Astronomical Catalogs from Images
Submitted: 2018-02-28, last modified: 2019-04-09
We present a new, fully generative model for constructing astronomical
catalogs from optical telescope image sets. Each pixel intensity is treated as
a random variable with parameters that depend on the latent properties of stars
and galaxies. These latent properties are themselves modeled as random. We
compare two procedures for posterior inference. One procedure is based on
Markov chain Monte Carlo (MCMC) while the other is based on variational
inference (VI). The MCMC procedure excels at quantifying uncertainty, while the
VI procedure is 1000 times faster. On a supercomputer, the VI procedure
efficiently uses 665,000 CPU cores to construct an astronomical catalog from 50
terabytes of images in 14.6 minutes, demonstrating the scaling characteristics
necessary to construct catalogs for upcoming astronomical surveys.
[2]
oai:arXiv.org:1801.10277 [pdf] - 1627837
Cataloging the Visible Universe through Bayesian Inference at Petascale
Regier, Jeffrey;
Pamnany, Kiran;
Fischer, Keno;
Noack, Andreas;
Lam, Maximilian;
Revels, Jarrett;
Howard, Steve;
Giordano, Ryan;
Schlegel, David;
McAuliffe, Jon;
Thomas, Rollin;
Prabhat
Submitted: 2018-01-30
Astronomical catalogs derived from wide-field imaging surveys are an
important tool for understanding the Universe. We construct an astronomical
catalog from 55 TB of imaging data using Celeste, a Bayesian variational
inference code written entirely in the high-productivity programming language
Julia. Using over 1.3 million threads on 650,000 Intel Xeon Phi cores of the
Cori Phase II supercomputer, Celeste achieves a peak rate of 1.54 DP PFLOP/s.
Celeste is able to jointly optimize parameters for 188M stars and galaxies,
loading and processing 178 TB across 8192 nodes in 14.6 minutes. To achieve
this, Celeste exploits parallelism at multiple levels (cluster, node, and
thread) and accelerates I/O through Cori's Burst Buffer. Julia's native
performance enables Celeste to employ high-level constructs without resorting
to hand-written or generated low-level code (C/C++/Fortran), and yet achieve
petascale performance.
[3]
oai:arXiv.org:1611.03404 [pdf] - 1511937
Learning an Astronomical Catalog of the Visible Universe through
Scalable Bayesian Inference
Submitted: 2016-11-10
Celeste is a procedure for inferring astronomical catalogs that attains
state-of-the-art scientific results. To date, Celeste has been scaled to at
most hundreds of megabytes of astronomical images: Bayesian posterior inference
is notoriously demanding computationally. In this paper, we report on a
scalable, parallel version of Celeste, suitable for learning catalogs from
modern large-scale astronomical datasets. Our algorithmic innovations include a
fast numerical optimization routine for Bayesian posterior inference and a
statistically efficient scheme for decomposing astronomical optimization
problems into subproblems.
Our scalable implementation is written entirely in Julia, a new high-level
dynamic programming language designed for scientific and numerical computing.
We use Julia's high-level constructs for shared and distributed memory
parallelism, and demonstrate effective load balancing and efficient scaling on
up to 8192 Xeon cores on the NERSC Cori supercomputer.
[4]
oai:arXiv.org:1506.01351 [pdf] - 1120672
Celeste: Variational inference for a generative model of astronomical
images
Submitted: 2015-06-03
We present a new, fully generative model of optical telescope image sets,
along with a variational procedure for inference. Each pixel intensity is
treated as a Poisson random variable, with a rate parameter dependent on latent
properties of stars and galaxies. Key latent properties are themselves random,
with scientific prior distributions constructed from large ancillary data sets.
We check our approach on synthetic images. We also run it on images from a
major sky survey, where it exceeds the performance of the current
state-of-the-art method for locating celestial bodies and measuring their
colors.