Normalized to: Hsu, D.
[1]
oai:arXiv.org:2007.06529 [pdf] - 2132089
Interpreting deep learning models for weak lensing
Submitted: 2020-07-13
Deep Neural Networks (DNNs) are powerful algorithms that have been proven
capable of extracting non-Gaussian information from weak lensing (WL) data
sets. Understanding which features in the data determine the output of these
nested, non-linear algorithms is an important but challenging task. We analyze
a DNN that has been found in previous work to accurately recover cosmological
parameters in simulated maps of the WL convergence ($\kappa$). We derive
constraints on the cosmological parameter pair $(\Omega_m,\sigma_8)$ from a
combination of three commonly used WL statistics (power spectrum, lensing
peaks, and Minkowski functionals), using ray-traced simulated $\kappa$ maps. We
show that the network can improve the inferred parameter constraints relative
to this combination by $20\%$ even in the presence of realistic levels of shape
noise. We apply a series of well established saliency methods to interpret the
DNN and find that the most relevant pixels are those with extreme $\kappa$
values. For noiseless maps, regions with negative $\kappa$ account for
$86-69\%$ of the attribution of the DNN output, defined as the square of the
saliency in input space. In the presence of shape nose, the attribution
concentrates in high convergence regions, with $36-68\%$ of the attribution in
regions with $\kappa > 3 \sigma_{\kappa}$.
[2]
oai:arXiv.org:2002.02573 [pdf] - 2104690
Occurrence Rates of Planets Orbiting M Stars: Applying ABC to Kepler
DR25, Gaia DR2, and 2MASS Data
Submitted: 2020-02-06, last modified: 2020-05-29
We present robust planet occurrence rates for Kepler planet candidates around
M stars for planet radii $R_p = 0.5-4~\textrm{R}_\oplus$ and orbital periods $P
= 0.5-256$ days using the approximate Bayesian computation (ABC) technique.
This work incorporates the final Kepler DR25 planet candidate catalog and data
products and augments them with updated stellar properties using Gaia DR2 and
2MASS PSC. We apply a set of selection criteria to select a sample of 1,746
Kepler M dwarf targets that host 89 associated planet candidates. These early M
dwarfs and late K dwarfs were selected from cross-referenced targets using
several photometric quality flags from Gaia DR2 and color-magnitude cuts using
2MASS magnitudes. We estimate a habitable zone occurrence rate of
$f_{\textrm{M,HZ}} = 0.33^{+0.10}_{-0.12}$ for planets with $0.75-1.5$
R$_\oplus$ size. We caution that occurrence rate estimates for Kepler M stars
are sensitive to the choice of prior due to the small sample of target stars
and planet candidates. For example, we find an occurrence rate of
$4.2^{+0.6}_{-0.6}$ or $8.4^{+1.2}_{-1.1}$ planets per M dwarf (integrating
over $R_p = 0.5-4~\textrm{R}_\oplus$ and $P = 0.5-256$ days) for our two
choices of prior. These occurrence rates are greater than those for FGK dwarf
target when compared at the same range of orbital periods, but similar to
occurrence rates when computed as a function of equivalent stellar insolation.
Combining our result with recent studies of exoplanet architectures indicates
that most, and potentially all, early-M dwarfs harbor planetary systems.
[3]
oai:arXiv.org:1902.01417 [pdf] - 2025390
Occurrence Rates of Planets orbiting FGK Stars: Combining Kepler DR25,
Gaia DR2 and Bayesian Inference
Submitted: 2019-02-04, last modified: 2020-01-07
We characterize the occurrence rate of planets, ranging in size from 0.5-16
R$_\oplus$, orbiting FGK stars with orbital periods from 0.5-500 days. Our
analysis is based on results from the "DR25" catalog of planet candidates
produced by NASA's Kepler mission and stellar radii from Gaia "DR2". We
incorporate additional Kepler data products to accurately characterize the
efficiency of planets being recognized as a "threshold crossing events" (TCE)
by Kepler's Transiting Planet Search pipeline and labeled as a planet candidate
by the robovetter. Using a hierarchical Bayesian model, we derive planet
occurrence rates for a wide range of planet sizes and orbital periods. For
planets with sizes $0.75-1.5$ R$_\oplus$ and orbital periods of 237-500 days,
we find a rate of planets per FGK star of $<0.27$ ($84.13$th percentile). While
the true rate of such planets could be lower by a factor of $\sim~2$ (primarily
due to potential contamination of planet candidates by false alarms), the upper
limits on the occurrence rate of such planets are robust to $\sim~10\%$. We
recommend that mission concepts aiming to characterize potentially rocky
planets in or near the habitable zone of sun-like stars prepare compelling
science programs that would be robust for a true rate in the range $f_{R,P} = $
$0.03-0.40$ for $0.75-1.5$ R$_\oplus$ planets with orbital periods in 237-500
days, or a differential rate of $\Gamma_\oplus \equiv (d^2 f)/[d(\ln P)~d(\ln
R_{p})] = $ $0.06-0.76$.
[4]
oai:arXiv.org:1908.00203 [pdf] - 2115086
Sensitivity Analyses of Exoplanet Occurrence Rates from Kepler and Gaia
Shabram, Megan I.;
Batalha, Natalie;
Thompson, Susan E.;
Hsu, Danley C.;
Ford, Eric B.;
Christiansen, Jessie L.;
Huber, Daniel;
Berger, Travis;
Catanzarite, Joseph;
Nelson, Benjamin E.;
Bryson, Steve;
Belikov, Ruslan;
Burke, Chris;
Caldwell, Doug
Submitted: 2019-08-01
We infer the number of planets-per-star as a function of orbital period and
planet size using $Kepler$ archival data products with updated stellar
properties from the $Gaia$ Data Release 2. Using hierarchical Bayesian modeling
and Hamiltonian Monte Carlo, we incorporate planet radius uncertainties into an
inhomogeneous Poisson point process model. We demonstrate that this model
captures the general features of the outcome of the planet formation process
around GK stars, and provides an infrastructure to use the $Kepler$ results to
constrain analytic planet distribution models. We report an increased mean and
variance in the marginal posterior distributions for the number of planets per
$GK$ star when including planet radius measurement uncertainties. We estimate
the number of planets-per-$GK$ star between 0.75 and 2.5 $R_{\oplus}$ and 50 to
300 day orbital periods to have a $68\%$ credible interval of $0.49$ to $0.77$
and a posterior mean of $0.63$. This posterior has a smaller mean and a larger
variance than the occurrence rate calculated in this work and in Burke et al.
(2015) for the same parameter space using the $Q1-Q16$ (previous $Kepler$
planet candidate and stellar catalog), and a larger mean and variance than when
using the $DR25$ (latest $Kepler$ planet candidate and stellar catalog). We
find that the accuracy and precision of our hierarchical Bayesian model
posterior distributions are less sensitive to the total number of planets in
the sample, and more so on the characteristics of the catalog completeness and
reliability and the span of the planet parameter space.
[5]
oai:arXiv.org:1902.03663 [pdf] - 1993978
Weak lensing cosmology with convolutional neural networks on noisy data
Submitted: 2019-02-10
Weak gravitational lensing is one of the most promising cosmological probes
of the late universe. Several large ongoing (DES, KiDS, HSC) and planned (LSST,
EUCLID, WFIRST) astronomical surveys attempt to collect even deeper and larger
scale data on weak lensing. Due to gravitational collapse, the distribution of
dark matter is non-Gaussian on small scales. However, observations are
typically evaluated through the two-point correlation function of galaxy shear,
which does not capture non-Gaussian features of the lensing maps. Previous
studies attempted to extract non-Gaussian information from weak lensing
observations through several higher-order statistics such as the three-point
correlation function, peak counts or Minkowski-functionals. Deep convolutional
neural networks (CNN) emerged in the field of computer vision with tremendous
success, and they offer a new and very promising framework to extract
information from 2 or 3-dimensional astronomical data sets, confirmed by recent
studies on weak lensing. We show that a CNN is able to yield significantly
stricter constraints of ($\sigma_8, \Omega_m$) cosmological parameters than the
power spectrum using convergence maps generated by full N-body simulations and
ray-tracing, at angular scales and shape noise levels relevant for future
observations. In a scenario mimicking LSST or Euclid, the CNN yields 2.4-2.8
times smaller credible contours than the power spectrum, and 3.5-4.2 times
smaller at noise levels corresponding to a deep space survey such as WFIRST. We
also show that at shape noise levels achievable in future space surveys the CNN
yields 1.4-2.1 times smaller contours than peak counts, a higher-order
statistic capable of extracting non-Gaussian information from weak lensing
maps.
[6]
oai:arXiv.org:1802.01212 [pdf] - 1686734
Non-Gaussian information from weak lensing data via deep learning
Submitted: 2018-02-04, last modified: 2018-05-01
Weak lensing maps contain information beyond two-point statistics on small
scales. Much recent work has tried to extract this information through a range
of different observables or via nonlinear transformations of the lensing field.
Here we train and apply a 2D convolutional neural network to simulated
noiseless lensing maps covering 96 different cosmological models over a range
of {$\Omega_m,\sigma_8$}. Using the area of the confidence contour in the
{$\Omega_m,\sigma_8$} plane as a figure-of-merit, derived from simulated
convergence maps smoothed on a scale of 1.0 arcmin, we show that the neural
network yields $\approx 5 \times$ tighter constraints than the power spectrum,
and $\approx 4 \times$ tighter than the lensing peaks. Such gains illustrate
the extent to which weak lensing data encode cosmological information not
accessible to the power spectrum or even other, non-Gaussian statistics such as
lensing peaks.
[7]
oai:arXiv.org:1803.10787 [pdf] - 1674992
Improving the Accuracy of Planet Occurrence Rates from Kepler using
Approximate Bayesian Computation
Submitted: 2018-03-28
We present a new framework to characterize the occurrence rates of planet
candidates identified by Kepler based on hierarchical Bayesian modeling,
Approximate Bayesian Computing (ABC), and sequential importance sampling. For
this study we adopt a simple 2-D grid in planet radius and orbital period as
our model and apply our algorithm to estimate occurrence rates for Q1-Q16
planet candidates orbiting around solar-type stars. We arrive at significantly
increased planet occurrence rates for small planet candidates ($R_p<1.25
R_{\oplus}$) at larger orbital periods ($P>80$d) compared to the rates
estimated by the more common inverse detection efficiency method. Our improved
methodology estimates that the occurrence rate density of small planet
candidates in the habitable zone of solar-type stars is $1.6^{+1.2}_{-0.5}$ per
factor of 2 in planet radius and orbital period. Additionally, we observe a
local minimum in the occurrence rate for strong planet candidates marginalized
over orbital period between 1.5 and 2$R_{\oplus}$ that is consistent with
previous studies. For future improvements, the forward modeling approach of ABC
is ideally suited to incorporating multiple populations, such as planets,
astrophysical false positives and pipeline false alarms, to provide accurate
planet occurrence rates and uncertainties. Furthermore, ABC provides a
practical statistical framework for answering complex questions (e.g.,
frequency of different planetary architectures) and providing sound
uncertainties, even in the face of complex selection effects, observational
biases, and follow-up strategies. In summary, ABC offers a powerful tool for
accurately characterizing a wide variety of astrophysical populations.
[8]
oai:arXiv.org:1609.03973 [pdf] - 1507629
Do dark matter halos explain lensing peaks?
Submitted: 2016-09-13
We have investigated a recently proposed halo-based model, Camelus, for
predicting weak-lensing peak counts, and compared its results over a collection
of 162 cosmologies with those from N-body simulations. While counts from both
models agree for peaks with $\mathcal{S/N}>1$ (where $\mathcal{S/N}$ is the
ratio of the peak height to the r.m.s. shape noise), we find $\approx 50\%$
fewer counts for peaks near $\mathcal{S/N}=0$ and significantly higher counts
in the negative $\mathcal{S/N}$ tail. Adding shape noise reduces the
differences to within $20\%$ for all cosmologies. We also found larger
covariances that are more sensitive to cosmological parameters. As a result,
credibility regions in the $\{\Omega_m, \sigma_8\}$ are $\approx 30\%$ larger.
Even though the credible contours are commensurate, each model draws its
predictive power from different types of peaks. Low peaks, especially those
with $2<\mathcal{S/N}<3$, convey important cosmological information in N-body
data, as shown in \cite{DietrichHartlap, Kratochvil2010}, but \textsc{Camelus}
constrains cosmology almost exclusively from high significance peaks
$(\mathcal{S/N}>3)$. Our results confirm the importance of using a
cosmology-dependent covariance with at least a 14\% improvement in parameter
constraints. We identified the covariance estimation as the main driver behind
differences in inference, and suggest possible ways to make Camelus even more
useful as a highly accurate peak count emulator.
[9]
oai:arXiv.org:1003.2136 [pdf] - 1025634
The Third US Naval Observatory CCD Astrograph Catalog (UCAC3)
Zacharias, N.;
Finch, C.;
Girard, T.;
Hambly, N.;
Wycoff, G.;
Zacharias, M.;
Castillo, D.;
Corbin, T.;
DiVittorio, M.;
Dutta, S.;
Gaume, R.;
Gauss, S.;
Germain, M.;
Hall, D.;
Hartkopf, W.;
Hsu, D.;
Holdenried, E.;
Makarov, V.;
Martines, M.;
Mason, B.;
Monet, D.;
Rafferty, T.;
Rhodes, A.;
Siemers, T.;
Smith, D.;
Tilleman, T.;
Urban, S.;
Wieder, G.;
Winter, L.;
Young, A.
Submitted: 2010-03-09
The third US Naval Observatory (USNO) CCD Astrograph Catalog, UCAC3 was
released at the IAU General Assembly on 2009 August 10. It is the first all-sky
release in this series and contains just over 100 million objects, about 95
million of them with proper motions, covering about R = 8 to 16 magnitudes.
Current epoch positions are obtained from the observations with the 20 cm
aperture USNO Astrograph's "red lens", equipped with a 4k by 4k CCD. Proper
motions are derived by combining these observations with over 140 ground- and
space-based catalogs, including Hipparcos/Tycho and the AC2000.2, as well as
unpublished measures of over 5000 plates from other astrographs. For most of
the faint stars in the Southern Hemisphere the Yale/San Juan first epoch plates
from the SPM program (YSJ1) form the basis for proper motions. These data are
supplemented by all-sky Schmidt plate survey astrometry and photometry obtained
from the SuperCOSMOS project, as well as 2MASS near-IR photometry. Major
differences of UCAC3 data as compared to UCAC2 include a completely new raw
data reduction with improved control over systematic errors in positions,
significantly improved photometry, slightly deeper limiting magnitude, coverage
of the north pole region, greater completeness by inclusion of double stars and
weak detections. This of course leads to a catalog which is not as "clean" as
UCAC2 and problem areas are outlined for the user in this paper. The positional
accuracy of stars in UCAC3 is about 15 to 100 mas per coordinate, depending on
magnitude, while the errors in proper motions range from 1 to 10 mas/yr
depending on magnitude and observing history, with a significant improvement
over UCAC2 achieved due to the re-reduced SPM data and inclusion of more
astrograph plate data unavailable at the time of UCAC2.