Normalized to: Stenning, D.
[1]
oai:arXiv.org:1910.08857 [pdf] - 2097234
LRP2020: Astrostatistics in Canada
Eadie, Gwendolyn;
Bahramian, Arash;
Barmby, Pauline;
Craiu, Radu;
Bingham, Derek;
Hložek, Renée;
Kavelaars, JJ;
Stenning, David;
Benincasa, Samantha;
Thomas, Guillaume;
Thanjavur, Karun;
Bovy, Jo;
Cami, Jan;
Carlberg, Ray;
Lawler, Sam;
Liu, Adrian;
Ngo, Henry;
Rahman, Mubdi;
Rupen, Michael
Submitted: 2019-10-19
(Abridged from Executive Summary) This white paper focuses on the
interdisciplinary fields of astrostatistics and astroinformatics, in which
modern statistical and computational methods are applied to and developed for
astronomical data. Astrostatistics and astroinformatics have grown dramatically
in the past ten years, with international organizations, societies,
conferences, workshops, and summer schools becoming the norm. Canada's formal
role in astrostatistics and astroinformatics has been relatively limited, but
there is a great opportunity and necessity for growth in this area. We
conducted a survey of astronomers in Canada to gain information on the training
mechanisms through which we learn statistical methods and to identify areas for
improvement. In general, the results of our survey indicate that while
astronomers see statistical methods as critically important for their research,
they lack focused training in this area and wish they had received more formal
training during all stages of education and professional development. These
findings inform our recommendations for the LRP2020 on how to increase
interdisciplinary connections between astronomy and statistics at the
institutional, national, and international levels over the next ten years. We
recommend specific, actionable ways to increase these connections, and discuss
how interdisciplinary work can benefit not only research but also astronomy's
role in training Highly Qualified Personnel (HQP) in Canada.
[2]
oai:arXiv.org:1903.06796 [pdf] - 1850859
Astro2020 Science White Paper: The Next Decade of Astroinformatics and
Astrostatistics
Siemiginowska, A.;
Eadie, G.;
Czekala, I.;
Feigelson, E.;
Ford, E. B.;
Kashyap, V.;
Kuhn, M.;
Loredo, T.;
Ntampaka, M.;
Stevens, A.;
Avelino, A.;
Borne, K.;
Budavari, T.;
Burkhart, B.;
Cisewski-Kehe, J.;
Civano, F.;
Chilingarian, I.;
van Dyk, D. A.;
Fabbiano, G.;
Finkbeiner, D. P.;
Foreman-Mackey, D.;
Freeman, P.;
Fruscione, A.;
Goodman, A. A.;
Graham, M.;
Guenther, H. M.;
Hakkila, J.;
Hernquist, L.;
Huppenkothen, D.;
James, D. J.;
Law, C.;
Lazio, J.;
Lee, T.;
López-Morales, M.;
Mahabal, A. A.;
Mandel, K.;
Meng, X. L.;
Moustakas, J.;
Muna, D.;
Peek, J. E. G.;
Richards, G.;
Portillo, S. K. N.;
Scargle, J.;
de Souza, R. S.;
Speagle, J. S.;
Stassun, K. G.;
Stenning, D. C.;
Taylor, S. R.;
Tremblay, G. R.;
Trimble, V.;
Yanamandra-Fisher, P. A.;
Young, C. A.
Submitted: 2019-03-15
Over the past century, major advances in astronomy and astrophysics have been
largely driven by improvements in instrumentation and data collection. With the
amassing of high quality data from new telescopes, and especially with the
advent of deep and large astronomical surveys, it is becoming clear that future
advances will also rely heavily on how those data are analyzed and interpreted.
New methodologies derived from advances in statistics, computer science, and
machine learning are beginning to be employed in sophisticated investigations
that are not only bringing forth new discoveries, but are placing them on a
solid footing. Progress in wide-field sky surveys, interferometric imaging,
precision cosmology, exoplanet detection and characterization, and many
subfields of stellar, Galactic and extragalactic astronomy, has resulted in
complex data analysis challenges that must be solved to perform scientific
inference. Research in astrostatistics and astroinformatics will be necessary
to develop the state-of-the-art methodology needed in astronomy. Overcoming
these challenges requires dedicated, interdisciplinary research. We recommend:
(1) increasing funding for interdisciplinary projects in astrostatistics and
astroinformatics; (2) dedicating space and time at conferences for
interdisciplinary research and promotion; (3) developing sustainable funding
for long-term astrostatistics appointments; and (4) funding infrastructure
development for data archives and archive support, state-of-the-art algorithms,
and efficient computing.
[3]
oai:arXiv.org:1809.06173 [pdf] - 1775689
Incorporating Uncertainties in Atomic Data Into the Analysis of Solar
and Stellar Observations: A Case Study in Fe XIII
Submitted: 2018-09-17
Information about the physical properties of astrophysical objects cannot be
measured directly but is inferred by interpreting spectroscopic observations in
the context of atomic physics calculations. Ratios of emission lines, for
example, can be used to infer the electron density of the emitting plasma.
Similarly, the relative intensities of emission lines formed over a wide range
of temperatures yield information on the temperature structure. A critical
component of this analysis is understanding how uncertainties in the underlying
atomic physics propagate to the uncertainties in the inferred plasma
parameters. At present, however, atomic physics databases do not include
uncertainties on the atomic parameters and there is no established methodology
for using them even if they did. In this paper we develop simple models for the
uncertainties in the collision strengths and decay rates for Fe XIII and apply
them to the interpretation of density sensitive lines observed with the EUV
Imaging Spectrometer (EIS) on Hinode. We incorporate these uncertainties in a
Bayesian framework. We consider both a pragmatic Bayesian method where the
atomic physics information is unaffected by the observed data, and a fully
Bayesian method where the data can be used to probe the physics. The former
generally increases the uncertainty in the inferred density by about a factor
of 5 compared with models that incorporate only statistical uncertainties. The
latter reduces the uncertainties on the inferred densities, but identifies
areas of possible systematic problems with either the atomic physics or the
observed intensities.
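
As a rough illustration of the "pragmatic" idea described above, the following toy sketch (not the paper's Fe XIII model) averages the posterior of a plasma parameter over draws of an atomic-data offset taken from its prior, so the data never update the atomic physics. The linear model `y = n + a`, the flat prior on a grid, and all names and numerical values are assumptions made purely for illustration.

```python
import numpy as np

def conditional_posterior(grid, y_obs, noise_sd, atomic_offset):
    """Posterior of n on a grid for a FIXED atomic offset (flat prior on n)."""
    logp = -0.5 * ((y_obs - (grid + atomic_offset)) / noise_sd) ** 2
    p = np.exp(logp - logp.max())
    return p / p.sum()

def posterior_sd(grid, weights):
    """Standard deviation of a normalized discrete distribution on the grid."""
    mean = np.sum(weights * grid)
    return np.sqrt(np.sum(weights * (grid - mean) ** 2))

def compare_widths(y_obs=0.0, noise_sd=0.1, atomic_sd=0.3, n_draws=500, seed=0):
    rng = np.random.default_rng(seed)
    grid = np.linspace(-5.0, 5.0, 4001)
    # Statistical uncertainty only: atomic offset fixed at its nominal value.
    stat_only = conditional_posterior(grid, y_obs, noise_sd, 0.0)
    # Pragmatic Bayes (toy version): draw atomic offsets from their prior and
    # average the conditional posteriors; the data do not inform the offsets.
    draws = rng.normal(0.0, atomic_sd, n_draws)
    pragmatic = np.mean(
        [conditional_posterior(grid, y_obs, noise_sd, a) for a in draws], axis=0
    )
    return posterior_sd(grid, stat_only), posterior_sd(grid, pragmatic)

if __name__ == "__main__":
    sd_stat, sd_prag = compare_widths()
    print(f"statistical only: {sd_stat:.3f}, pragmatic: {sd_prag:.3f}")
```

In this toy setting the pragmatic posterior is broader than the statistical-only one because the atomic-data uncertainty is added in; the size of the broadening here depends entirely on the assumed values and is not the factor quoted in the abstract.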
[4]
oai:arXiv.org:1806.06733 [pdf] - 1717192
Bayesian Hierarchical Modelling of Initial-Final Mass Relations Across
Star Clusters
Submitted: 2018-06-18, last modified: 2018-07-17
The initial-final mass relation (IFMR) of white dwarfs (WDs) plays an
important role in stellar evolution. To derive precise estimates of IFMRs and
explore how they may vary among star clusters, we propose a Bayesian
hierarchical model that pools photometric data from multiple star clusters.
After performing a simulation study to show the benefits of the Bayesian
hierarchical model, we apply this model to five star clusters: the Hyades,
M67, NGC 188, NGC 2168, and NGC 2477, leading to reasonable and consistent
estimates of IFMRs for these clusters. We illustrate how a cluster-specific
analysis of NGC 188 using its own photometric data can produce an unreasonable
IFMR since its WDs have a narrow range of zero-age main sequence (ZAMS) masses.
However, the Bayesian hierarchical model corrects the cluster-specific analysis
by borrowing strength from other clusters, thus generating more reliable
estimates of IFMR parameters. The data analysis presents the benefits of
Bayesian hierarchical modelling over conventional cluster-specific methods,
which motivates us to elaborate the powerful statistical techniques in this
article.
[5]
oai:arXiv.org:1703.09164 [pdf] - 1700864
A Hierarchical Model for the Ages of Galactic Halo White Dwarfs
Submitted: 2017-03-27, last modified: 2018-06-18
In astrophysics, we often aim to estimate one or more parameters for each
member object in a population and study the distribution of the fitted
parameters across the population. In this paper, we develop novel methods that
allow us to take advantage of existing software designed for such case-by-case
analyses to simultaneously fit parameters of both the individual objects and
the parameters that quantify their distribution across the population. Our
methods are based on Bayesian hierarchical modelling which is known to produce
parameter estimators for the individual objects that are on average closer to
their true values than estimators based on case-by-case analyses. We verify
this in the context of estimating ages of Galactic halo white dwarfs (WDs) via
a series of simulation studies. Finally, we deploy our new techniques on
optical and near-infrared photometry of ten candidate halo WDs to obtain
estimates of their ages along with an estimate of the mean age of Galactic halo
WDs of [11.25, 12.96] Gyr. Although this sample is small, our technique lays
the groundwork for large-scale studies using data from the Gaia mission.
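
The claim that hierarchical estimates are on average closer to the truth than case-by-case estimates can be checked in a small simulation. The sketch below uses a simple normal-normal model with an empirical-Bayes plug-in for the population parameters, which is a simplification of the fully Bayesian approach in the paper; the population mean, spreads, and function name are illustrative assumptions, not values from the study.

```python
import numpy as np

def simulate_shrinkage(n_objects=200, pop_mean=12.0, pop_sd=0.5,
                       noise_sd=1.0, seed=0):
    """Compare case-by-case and shrinkage estimates on simulated data."""
    rng = np.random.default_rng(seed)
    true_vals = rng.normal(pop_mean, pop_sd, n_objects)      # e.g. true ages
    obs = true_vals + rng.normal(0.0, noise_sd, n_objects)   # noisy fits

    # Plug-in estimates of the population mean and variance.
    mu_hat = obs.mean()
    tau2_hat = max(obs.var(ddof=1) - noise_sd ** 2, 0.0)

    # Each estimate is pulled toward the population mean; the weight on the
    # individual measurement shrinks as the measurement noise grows.
    weight = tau2_hat / (tau2_hat + noise_sd ** 2)
    shrunk = mu_hat + weight * (obs - mu_hat)

    mse_case = np.mean((obs - true_vals) ** 2)
    mse_hier = np.mean((shrunk - true_vals) ** 2)
    return mse_case, mse_hier

if __name__ == "__main__":
    mse_case, mse_hier = simulate_shrinkage()
    print(f"case-by-case MSE: {mse_case:.3f}, hierarchical MSE: {mse_hier:.3f}")
```

The gain is largest when the measurement noise is large relative to the population spread, which is the regime where borrowing strength across objects matters most.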
[6]
oai:arXiv.org:1711.01318 [pdf] - 1597249
Improving Exoplanet Detection Power: Multivariate Gaussian Process
Models for Stellar Activity
Submitted: 2017-11-03, last modified: 2017-12-01
The radial velocity method is one of the most successful techniques for
detecting exoplanets. It works by detecting the velocity of a host star induced
by the gravitational effect of an orbiting planet, specifically the velocity
along our line of sight, which is called the radial velocity of the star. As
astronomical instrumentation has improved, radial velocity surveys have become
sensitive to low-mass planets that cause their host star to move with radial
velocities of 1 m/s or less. While analysis of a time series of stellar spectra
can in theory reveal such small radial velocities, in practice intrinsic
stellar variability (e.g., star spots, convective motion, pulsations) affects
the spectra and often mimics a radial velocity signal. This signal
contamination makes it difficult to reliably detect low mass planets and
planets orbiting magnetically active stars. A principled approach to recovering
planet radial velocity signals in the presence of stellar activity was proposed
by Rajpaul et al. (2015) and involves the use of a multivariate Gaussian
process model to jointly capture time series of the apparent radial velocity
and multiple indicators of stellar activity. We build on this work in two ways:
(i) we propose using dimension reduction techniques to construct more
informative stellar activity indicators that make use of a larger portion of
the stellar spectrum; (ii) we extend the Rajpaul et al. (2015) model to a
larger class of models and use a model comparison procedure to select the best
model for the particular stellar activity indicators at hand. By combining our
high-information stellar activity indicators, Gaussian process models, and
model selection procedure, we achieve substantially improved planet detection
power compared to previous state-of-the-art approaches.
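
To give a sense of scale for the "1 m/s or less" signals mentioned above, the standard expression for the radial velocity semi-amplitude of a star with one planet can be evaluated directly. This is a textbook formula rather than anything specific to the paper; the rounded physical constants and the function name are assumptions for illustration.

```python
import math

G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30     # kg
M_EARTH = 5.972e24   # kg
M_JUP = 1.898e27     # kg
YEAR = 3.156e7       # seconds

def rv_semi_amplitude(m_planet, m_star, period, ecc=0.0, sin_i=1.0):
    """Stellar reflex velocity semi-amplitude K in m/s (SI inputs)."""
    return ((2.0 * math.pi * G / period) ** (1.0 / 3.0)
            * m_planet * sin_i
            / (m_star + m_planet) ** (2.0 / 3.0)
            / math.sqrt(1.0 - ecc ** 2))

if __name__ == "__main__":
    k_earth = rv_semi_amplitude(M_EARTH, M_SUN, 1.0 * YEAR)
    k_jup = rv_semi_amplitude(M_JUP, M_SUN, 11.862 * YEAR)
    print(f"Earth analogue:   K = {k_earth:.3f} m/s")
    print(f"Jupiter analogue: K = {k_jup:.1f} m/s")
```

An Earth analogue around a Sun-like star yields a semi-amplitude well below 1 m/s, while a Jupiter analogue gives roughly ten metres per second, which is why stellar activity at the metre-per-second level is a limiting factor for low-mass planets.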
[7]
oai:arXiv.org:1702.08856 [pdf] - 1540614
The ACS Survey of Galactic Globular Clusters XIV: Bayesian
Single-Population Analysis of 69 Globular Clusters
Wagner-Kaiser, R.;
Sarajedini, A.;
von Hippel, T.;
Stenning, D. C.;
van Dyk, D. A.;
Jeffery, E.;
Robinson, E.;
Stein, N.;
Anderson, J.;
Jefferys, W. H.
Submitted: 2017-02-28
We use Hubble Space Telescope (HST) imaging from the ACS Treasury Survey to
determine fits for single population isochrones of 69 Galactic globular
clusters. Using robust Bayesian analysis techniques, we simultaneously
determine ages, distances, absorptions, and helium values for each cluster
under the scenario of a "single" stellar population on model grids with solar
ratio heavy element abundances. The set of cluster parameters is determined in
a consistent and reproducible manner for all clusters using the Bayesian
analysis suite BASE-9. Our results are used to re-visit the age-metallicity
relation. We find correlations with helium and several other parameters such as
metallicity, binary fraction, and proxies for cluster mass. The helium
abundances of the clusters are also considered in the context of CNO abundances
and the multiple population scenario.
[8]
oai:arXiv.org:1611.00835 [pdf] - 1510413
A Bayesian Analysis of the Ages of Four Open Clusters
Submitted: 2016-11-02
In this paper we apply a Bayesian technique to determine the best fit of
stellar evolution models to find the main sequence turn off age and other
cluster parameters of four intermediate-age open clusters: NGC 2360, NGC 2477,
NGC 2660, and NGC 3960. Our algorithm utilizes a Markov chain Monte Carlo
technique to fit these various parameters, objectively finding the best-fit
isochrone for each cluster. The result is a high-precision isochrone fit. We
compare these results with those of traditional "by-eye" isochrone fitting
methods. By applying this Bayesian technique to NGC 2360, NGC 2477, NGC 2660,
and NGC 3960, we determine the ages of these clusters to be 1.35 +/- 0.05, 1.02
+/- 0.02, 1.64 +/- 0.04, and 0.860 +/- 0.04 Gyr, respectively. The results of
this paper continue our effort to determine cluster ages to higher precision
than that offered by these traditional methods of isochrone fitting.
[9]
oai:arXiv.org:1609.01527 [pdf] - 1483531
Bayesian Analysis of Two Stellar Populations in Galactic Globular
Clusters III: Analysis of 30 Clusters
Submitted: 2016-09-06
We use Cycle 21 Hubble Space Telescope (HST) observations and HST archival
ACS Treasury observations of 30 Galactic Globular Clusters to characterize two
distinct stellar populations. A sophisticated Bayesian technique is employed to
simultaneously sample the joint posterior distribution of age, distance, and
extinction for each cluster, as well as unique helium values for two
populations within each cluster and the relative proportion of those
populations. We find the helium differences between the two populations in the
clusters fall in the range of ~0.04 to 0.11. Because adequate models varying in
CNO are not presently available, we view these spreads as upper limits and
present them with statistical rather than observational uncertainties. Evidence
supports previous studies suggesting an increase in helium content concurrent
with increasing mass of the cluster, and we also find that the proportion of the
first population of stars increases with mass as well. Our results are examined
in the context of proposed globular cluster formation scenarios. Additionally,
we leverage our Bayesian technique to shed light on inconsistencies between the
theoretical models and the observed data.
[10]
oai:arXiv.org:1605.02810 [pdf] - 1403836
The Power of Principled Bayesian Methods in the Study of Stellar
Evolution
Submitted: 2016-05-09
It takes years of effort employing the best telescopes and instruments to
obtain high-quality stellar photometry, astrometry, and spectroscopy. Stellar
evolution models contain the experience of lifetimes of theoretical
calculations and testing. Yet most astronomers fit these valuable models to
these precious datasets by eye. We show that a principled Bayesian approach to
fitting models to stellar data yields substantially more information over a
range of stellar astrophysics. We highlight advances in determining the ages of
star clusters, mass ratios of binary stars, limitations in the accuracy of
stellar models, post-main-sequence mass loss, and the ages of individual white
dwarfs. We also outline a number of unsolved problems that would benefit from
principled Bayesian analyses.
[11]
oai:arXiv.org:1604.06073 [pdf] - 1443898
Bayesian Analysis of Two Stellar Populations in Galactic Globular
Clusters I: Statistical and Computational Methods
Submitted: 2016-04-20, last modified: 2016-04-21
We develop a Bayesian model for globular clusters composed of multiple
stellar populations, extending earlier statistical models for open clusters
composed of simple (single) stellar populations (van Dyk et al. 2009, Stein et
al. 2013). Specifically, we model globular clusters with two populations that
differ in helium abundance. Our model assumes a hierarchical structuring of the
parameters in which physical properties---age, metallicity, helium abundance,
distance, absorption, and initial mass---are common to (i) the cluster as a
whole or to (ii) individual populations within a cluster, or are unique to
(iii) individual stars. An adaptive Markov chain Monte Carlo (MCMC) algorithm
is devised for model fitting that greatly improves convergence relative to its
precursor non-adaptive MCMC algorithm. Our model and computational tools are
incorporated into an open-source software suite known as BASE-9. We use
numerical studies to demonstrate that our method can recover parameters of
two-population clusters, and also show model misspecification can potentially
be identified. As a proof of concept, we analyze the two stellar populations of
globular cluster NGC 5272 using our model and methods. (BASE-9 is available
from GitHub: https://github.com/argiopetech/base/releases).
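
The general idea of an adaptive MCMC sampler can be sketched in a few lines. The example below is a generic one-dimensional random-walk Metropolis sampler whose proposal scale is tuned during burn-in and then frozen; it is not the algorithm implemented in BASE-9, and the target acceptance rate, batch size, and toy Gaussian posterior are all assumptions chosen for illustration.

```python
import numpy as np

def adaptive_metropolis(log_post, x0, n_burn=2000, n_keep=20000,
                        step=1.0, target_accept=0.44, batch=100, seed=0):
    """Random-walk Metropolis with proposal scale tuned during burn-in."""
    rng = np.random.default_rng(seed)
    x, lp = x0, log_post(x0)
    samples = np.empty(n_keep)
    accepted = 0
    for i in range(n_burn + n_keep):
        prop = x + step * rng.normal()
        lp_prop = log_post(prop)
        # Standard Metropolis accept/reject step for a symmetric proposal.
        if np.log(rng.uniform()) < lp_prop - lp:
            x, lp = prop, lp_prop
            accepted += 1
        if i < n_burn:
            # After each batch, widen the step if too many proposals were
            # accepted and narrow it if too few were.
            if (i + 1) % batch == 0:
                step *= np.exp(accepted / batch - target_accept)
                accepted = 0
        else:
            # Adaptation is frozen; only post-burn-in draws are stored.
            samples[i - n_burn] = x
    return samples, step

def toy_log_post(x):
    """Unnormalized log density of a Gaussian with mean 3 and sd 2."""
    return -0.5 * ((x - 3.0) / 2.0) ** 2

if __name__ == "__main__":
    draws, final_step = adaptive_metropolis(toy_log_post, x0=0.0)
    print(f"mean = {draws.mean():.2f}, sd = {draws.std():.2f}, "
          f"tuned step = {final_step:.2f}")
```

Freezing the step size after burn-in keeps the retained chain a standard Metropolis chain, which avoids having to justify the adaptation scheme theoretically in this sketch.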
[12]
oai:arXiv.org:1604.06074 [pdf] - 1443899
Bayesian Analysis of Two Stellar Populations in Galactic Globular
Clusters II: NGC 5024, NGC 5272, and NGC 6352
Submitted: 2016-04-20
We use Cycle 21 Hubble Space Telescope (HST) observations and HST archival
ACS Treasury observations of Galactic Globular Clusters to find and
characterize two stellar populations in NGC 5024 (M53), NGC 5272 (M3), and NGC
6352. For these three clusters, both single and double-population analyses are
used to determine a best fit isochrone(s). We employ a sophisticated Bayesian
analysis technique to simultaneously fit the cluster parameters (age, distance,
absorption, and metallicity) that characterize each cluster. For the
two-population analysis, unique population level helium values are also fit to
each distinct population of the cluster and the relative proportions of the
populations are determined. We find differences in helium ranging from
$\sim$0.05 to 0.11 for these three clusters. Model grids with solar
$\alpha$-element abundances ([$\alpha$/Fe] =0.0) and enhanced $\alpha$-elements
([$\alpha$/Fe]=0.4) are adopted.
[13]
oai:arXiv.org:1411.3786 [pdf] - 898736
Bayesian Analysis for Stellar Evolution with Nine Parameters (BASE-9):
User's Manual
Submitted: 2014-11-13
BASE-9 is a Bayesian software suite that recovers star cluster and stellar
parameters from photometry. BASE-9 is useful for analyzing single-age,
single-metallicity star clusters, binaries, or single stars, and for simulating
such systems. BASE-9 uses Markov chain Monte Carlo and brute-force numerical
integration techniques to estimate the posterior probability distributions for
the age, metallicity, helium abundance, distance modulus, and line-of-sight
absorption for a cluster, and the mass, binary mass ratio, and cluster
membership probability for every stellar object. BASE-9 is provided as open
source code on a version-controlled web server. The executables are also
available as Amazon Elastic Compute Cloud images. This manual provides
potential users with an overview of BASE-9, including instructions for
installation and use.