Normalized to: Ambikasaran, S.
[1]
oai:arXiv.org:1703.09710 [pdf] - 1582187
Fast and scalable Gaussian process modeling with applications to
astronomical time series
Submitted: 2017-03-28, last modified: 2017-07-19
The growing field of large-scale time domain astronomy requires methods for
probabilistic data analysis that are computationally tractable, even with large
datasets. Gaussian Processes are a popular class of models used for this
purpose but, since the computational cost scales, in general, as the cube of
the number of data points, their application has been limited to small
datasets. In this paper, we present a novel method for Gaussian Process
modeling in one-dimension where the computational requirements scale linearly
with the size of the dataset. We demonstrate the method by applying it to
simulated and real astronomical time series datasets. These demonstrations are
examples of probabilistic inference of stellar rotation periods, asteroseismic
oscillation spectra, and transiting planet parameters. The method exploits
structure in the problem when the covariance function is expressed as a mixture
of complex exponentials, without requiring evenly spaced observations or
uniform noise. This form of covariance arises naturally when the process is a
mixture of stochastically-driven damped harmonic oscillators -- providing a
physical motivation for and interpretation of this choice -- but we also
demonstrate that it can be a useful effective model in some other cases. We
present a mathematical description of the method and compare it to existing
scalable Gaussian Process methods. The method is fast and interpretable, with a
range of potential applications within astronomical data analysis and beyond.
We provide well-tested and documented open-source implementations of this
method in C++, Python, and Julia.
[2]
oai:arXiv.org:1403.6015 [pdf] - 975176
Fast Direct Methods for Gaussian Processes
Submitted: 2014-03-24, last modified: 2015-04-04
A number of problems in probability and statistics can be addressed using the
multivariate normal (Gaussian) distribution. In the one-dimensional case,
computing the probability for a given mean and variance simply requires the
evaluation of the corresponding Gaussian density. In the $n$-dimensional
setting, however, it requires the inversion of an $n \times n$ covariance
matrix, $C$, as well as the evaluation of its determinant, $\det(C)$. In many
cases, such as regression using Gaussian processes, the covariance matrix is of
the form $C = \sigma^2 I + K$, where $K$ is computed using a specified
covariance kernel which depends on the data and additional parameters
(hyperparameters). The matrix $C$ is typically dense, causing standard direct
methods for inversion and determinant evaluation to require $\mathcal O(n^3)$
work. This cost is prohibitive for large-scale modeling. Here, we show that for
the most commonly used covariance functions, the matrix $C$ can be
hierarchically factored into a product of block low-rank updates of the
identity matrix, yielding an $\mathcal O (n\log^2 n) $ algorithm for inversion.
More importantly, we show that this factorization enables the evaluation of the
determinant $\det(C)$, permitting the direct calculation of probabilities in
high dimensions under fairly broad assumptions on the kernel defining $K$. Our
fast algorithm brings many problems in marginalization and the adaptation of
hyperparameters within practical reach using a single CPU core. The combination
of nearly optimal scaling in terms of problem size with high-performance
computing resources will permit the modeling of previously intractable
problems. We illustrate the performance of the scheme on standard covariance
kernels.