Full-text search for arXiv

Hsu, D.

Normalized to: Hsu, D.

9 article(s) in total. 54 co-authors, from 1 to 4 common article(s). Median position in authors list is 3,0.

[1] oai:arXiv.org:2007.06529 [pdf] - 2132089

Interpreting deep learning models for weak lensing

Matilla, José Manuel Zorrilla; Sharma, Manasi; Hsu, Daniel; Haiman, Zoltán

Comments: 12 pages, 6 figures, submitted to PRD, comments welcome

Submitted: 2020-07-13

Deep Neural Networks (DNNs) are powerful algorithms that have been proven capable of extracting non-Gaussian information from weak lensing (WL) data sets. Understanding which features in the data determine the output of these nested, non-linear algorithms is an important but challenging task. We analyze a DNN that has been found in previous work to accurately recover cosmological parameters in simulated maps of the WL convergence ($\kappa$). We derive constraints on the cosmological parameter pair $(\Omega_m,\sigma_8)$ from a combination of three commonly used WL statistics (power spectrum, lensing peaks, and Minkowski functionals), using ray-traced simulated $\kappa$ maps. We show that the network can improve the inferred parameter constraints relative to this combination by $20\%$ even in the presence of realistic levels of shape noise. We apply a series of well established saliency methods to interpret the DNN and find that the most relevant pixels are those with extreme $\kappa$ values. For noiseless maps, regions with negative $\kappa$ account for $86-69\%$ of the attribution of the DNN output, defined as the square of the saliency in input space. In the presence of shape nose, the attribution concentrates in high convergence regions, with $36-68\%$ of the attribution in regions with $\kappa > 3 \sigma_{\kappa}$.

[2] oai:arXiv.org:2002.02573 [pdf] - 2104690

Occurrence Rates of Planets Orbiting M Stars: Applying ABC to Kepler DR25, Gaia DR2, and 2MASS Data

Hsu, Danley C.; Ford, Eric B.; Terrien, Ryan

Comments: 15 pages, 3 figures; submitted to MNRAS

Submitted: 2020-02-06, last modified: 2020-05-29

We present robust planet occurrence rates for Kepler planet candidates around M stars for planet radii $R_p = 0.5-4~\textrm{R}_\oplus$ and orbital periods $P = 0.5-256$ days using the approximate Bayesian computation (ABC) technique. This work incorporates the final Kepler DR25 planet candidate catalog and data products and augments them with updated stellar properties using Gaia DR2 and 2MASS PSC. We apply a set of selection criteria to select a sample of 1,746 Kepler M dwarf targets that host 89 associated planet candidates. These early M dwarfs and late K dwarfs were selected from cross-referenced targets using several photometric quality flags from Gaia DR2 and color-magnitude cuts using 2MASS magnitudes. We estimate a habitable zone occurrence rate of $f_{\textrm{M,HZ}} = 0.33^{+0.10}_{-0.12}$ for planets with $0.75-1.5$ R$_\oplus$ size. We caution that occurrence rate estimates for Kepler M stars are sensitive to the choice of prior due to the small sample of target stars and planet candidates. For example, we find an occurrence rate of $4.2^{+0.6}_{-0.6}$ or $8.4^{+1.2}_{-1.1}$ planets per M dwarf (integrating over $R_p = 0.5-4~\textrm{R}_\oplus$ and $P = 0.5-256$ days) for our two choices of prior. These occurrence rates are greater than those for FGK dwarf target when compared at the same range of orbital periods, but similar to occurrence rates when computed as a function of equivalent stellar insolation. Combining our result with recent studies of exoplanet architectures indicates that most, and potentially all, early-M dwarfs harbor planetary systems.

[3] oai:arXiv.org:1902.01417 [pdf] - 2025390

Occurrence Rates of Planets orbiting FGK Stars: Combining Kepler DR25, Gaia DR2 and Bayesian Inference

Hsu, Danley C.; Ford, Eric B.; Ragozzine, Darin; Ashby, Keir

Comments: Published in AJ; 28 pages, 6 figures, 4 tables

Submitted: 2019-02-04, last modified: 2020-01-07

We characterize the occurrence rate of planets, ranging in size from 0.5-16 R$_\oplus$, orbiting FGK stars with orbital periods from 0.5-500 days. Our analysis is based on results from the "DR25" catalog of planet candidates produced by NASA's Kepler mission and stellar radii from Gaia "DR2". We incorporate additional Kepler data products to accurately characterize the efficiency of planets being recognized as a "threshold crossing events" (TCE) by Kepler's Transiting Planet Search pipeline and labeled as a planet candidate by the robovetter. Using a hierarchical Bayesian model, we derive planet occurrence rates for a wide range of planet sizes and orbital periods. For planets with sizes $0.75-1.5$ R$_\oplus$ and orbital periods of 237-500 days, we find a rate of planets per FGK star of $<0.27$ ($84.13$th percentile). While the true rate of such planets could be lower by a factor of $\sim~2$ (primarily due to potential contamination of planet candidates by false alarms), the upper limits on the occurrence rate of such planets are robust to $\sim~10\%$. We recommend that mission concepts aiming to characterize potentially rocky planets in or near the habitable zone of sun-like stars prepare compelling science programs that would be robust for a true rate in the range $f_{R,P} = $ $0.03-0.40$ for $0.75-1.5$ R$_\oplus$ planets with orbital periods in 237-500 days, or a differential rate of $\Gamma_\oplus \equiv (d^2 f)/[d(\ln P)~d(\ln R_{p})] = $ $0.06-0.76$.

[4] oai:arXiv.org:1908.00203 [pdf] - 2115086

Sensitivity Analyses of Exoplanet Occurrence Rates from Kepler and Gaia

Shabram, Megan I.; Batalha, Natalie; Thompson, Susan E.; Hsu, Danley C.; Ford, Eric B.; Christiansen, Jessie L.; Huber, Daniel; Berger, Travis; Catanzarite, Joseph; Nelson, Benjamin E.; Bryson, Steve; Belikov, Ruslan; Burke, Chris; Caldwell, Doug

Comments: 17 pages, 3 figures, revised for ApJ

Submitted: 2019-08-01

We infer the number of planets-per-star as a function of orbital period and planet size using $Kepler$ archival data products with updated stellar properties from the $Gaia$ Data Release 2. Using hierarchical Bayesian modeling and Hamiltonian Monte Carlo, we incorporate planet radius uncertainties into an inhomogeneous Poisson point process model. We demonstrate that this model captures the general features of the outcome of the planet formation process around GK stars, and provides an infrastructure to use the $Kepler$ results to constrain analytic planet distribution models. We report an increased mean and variance in the marginal posterior distributions for the number of planets per $GK$ star when including planet radius measurement uncertainties. We estimate the number of planets-per-$GK$ star between 0.75 and 2.5 $R_{\oplus}$ and 50 to 300 day orbital periods to have a $68\%$ credible interval of $0.49$ to $0.77$ and a posterior mean of $0.63$. This posterior has a smaller mean and a larger variance than the occurrence rate calculated in this work and in Burke et al. (2015) for the same parameter space using the $Q1-Q16$ (previous $Kepler$ planet candidate and stellar catalog), and a larger mean and variance than when using the $DR25$ (latest $Kepler$ planet candidate and stellar catalog). We find that the accuracy and precision of our hierarchical Bayesian model posterior distributions are less sensitive to the total number of planets in the sample, and more so on the characteristics of the catalog completeness and reliability and the span of the planet parameter space.

[5] oai:arXiv.org:1902.03663 [pdf] - 1993978

Weak lensing cosmology with convolutional neural networks on noisy data

Ribli, Dezső; Pataki, Bálint Ármin; Matilla, José Manuel Zorrilla; Hsu, Daniel; Haiman, Zoltán; Csabai, István

Comments:

Submitted: 2019-02-10

Weak gravitational lensing is one of the most promising cosmological probes of the late universe. Several large ongoing (DES, KiDS, HSC) and planned (LSST, EUCLID, WFIRST) astronomical surveys attempt to collect even deeper and larger scale data on weak lensing. Due to gravitational collapse, the distribution of dark matter is non-Gaussian on small scales. However, observations are typically evaluated through the two-point correlation function of galaxy shear, which does not capture non-Gaussian features of the lensing maps. Previous studies attempted to extract non-Gaussian information from weak lensing observations through several higher-order statistics such as the three-point correlation function, peak counts or Minkowski-functionals. Deep convolutional neural networks (CNN) emerged in the field of computer vision with tremendous success, and they offer a new and very promising framework to extract information from 2 or 3-dimensional astronomical data sets, confirmed by recent studies on weak lensing. We show that a CNN is able to yield significantly stricter constraints of ($\sigma_8, \Omega_m$) cosmological parameters than the power spectrum using convergence maps generated by full N-body simulations and ray-tracing, at angular scales and shape noise levels relevant for future observations. In a scenario mimicking LSST or Euclid, the CNN yields 2.4-2.8 times smaller credible contours than the power spectrum, and 3.5-4.2 times smaller at noise levels corresponding to a deep space survey such as WFIRST. We also show that at shape noise levels achievable in future space surveys the CNN yields 1.4-2.1 times smaller contours than peak counts, a higher-order statistic capable of extracting non-Gaussian information from weak lensing maps.

[6] oai:arXiv.org:1802.01212 [pdf] - 1686734

Non-Gaussian information from weak lensing data via deep learning

Gupta, Arushi; Matilla, José Manuel Zorrilla; Hsu, Daniel; Haiman, Zoltán

Comments: 15 pages, 13 figures, accepted to PRD

Submitted: 2018-02-04, last modified: 2018-05-01

Weak lensing maps contain information beyond two-point statistics on small scales. Much recent work has tried to extract this information through a range of different observables or via nonlinear transformations of the lensing field. Here we train and apply a 2D convolutional neural network to simulated noiseless lensing maps covering 96 different cosmological models over a range of {$\Omega_m,\sigma_8$}. Using the area of the confidence contour in the {$\Omega_m,\sigma_8$} plane as a figure-of-merit, derived from simulated convergence maps smoothed on a scale of 1.0 arcmin, we show that the neural network yields $\approx 5 \times$ tighter constraints than the power spectrum, and $\approx 4 \times$ tighter than the lensing peaks. Such gains illustrate the extent to which weak lensing data encode cosmological information not accessible to the power spectrum or even other, non-Gaussian statistics such as lensing peaks.

[7] oai:arXiv.org:1803.10787 [pdf] - 1674992

Improving the Accuracy of Planet Occurrence Rates from Kepler using Approximate Bayesian Computation

Hsu, Danley C.; Ford, Eric B.; Ragozzine, Darin; Morehead, Robert C.

Comments: Accepted by AJ; 27 pages, 8 figures

Submitted: 2018-03-28

We present a new framework to characterize the occurrence rates of planet candidates identified by Kepler based on hierarchical Bayesian modeling, Approximate Bayesian Computing (ABC), and sequential importance sampling. For this study we adopt a simple 2-D grid in planet radius and orbital period as our model and apply our algorithm to estimate occurrence rates for Q1-Q16 planet candidates orbiting around solar-type stars. We arrive at significantly increased planet occurrence rates for small planet candidates ($R_p<1.25 R_{\oplus}$) at larger orbital periods ($P>80$d) compared to the rates estimated by the more common inverse detection efficiency method. Our improved methodology estimates that the occurrence rate density of small planet candidates in the habitable zone of solar-type stars is $1.6^{+1.2}_{-0.5}$ per factor of 2 in planet radius and orbital period. Additionally, we observe a local minimum in the occurrence rate for strong planet candidates marginalized over orbital period between 1.5 and 2$R_{\oplus}$ that is consistent with previous studies. For future improvements, the forward modeling approach of ABC is ideally suited to incorporating multiple populations, such as planets, astrophysical false positives and pipeline false alarms, to provide accurate planet occurrence rates and uncertainties. Furthermore, ABC provides a practical statistical framework for answering complex questions (e.g., frequency of different planetary architectures) and providing sound uncertainties, even in the face of complex selection effects, observational biases, and follow-up strategies. In summary, ABC offers a powerful tool for accurately characterizing a wide variety of astrophysical populations.

[8] oai:arXiv.org:1609.03973 [pdf] - 1507629

Do dark matter halos explain lensing peaks?

Matilla, José Manuel Zorrilla; Haiman, Zoltán; Hsu, Daniel; Gupta, Arushi; Petri, Andrea

Comments:

Submitted: 2016-09-13

We have investigated a recently proposed halo-based model, Camelus, for predicting weak-lensing peak counts, and compared its results over a collection of 162 cosmologies with those from N-body simulations. While counts from both models agree for peaks with $\mathcal{S/N}>1$ (where $\mathcal{S/N}$ is the ratio of the peak height to the r.m.s. shape noise), we find $\approx 50\%$ fewer counts for peaks near $\mathcal{S/N}=0$ and significantly higher counts in the negative $\mathcal{S/N}$ tail. Adding shape noise reduces the differences to within $20\%$ for all cosmologies. We also found larger covariances that are more sensitive to cosmological parameters. As a result, credibility regions in the $\{\Omega_m, \sigma_8\}$ are $\approx 30\%$ larger. Even though the credible contours are commensurate, each model draws its predictive power from different types of peaks. Low peaks, especially those with $2<\mathcal{S/N}<3$, convey important cosmological information in N-body data, as shown in \cite{DietrichHartlap, Kratochvil2010}, but \textsc{Camelus} constrains cosmology almost exclusively from high significance peaks $(\mathcal{S/N}>3)$. Our results confirm the importance of using a cosmology-dependent covariance with at least a 14\% improvement in parameter constraints. We identified the covariance estimation as the main driver behind differences in inference, and suggest possible ways to make Camelus even more useful as a highly accurate peak count emulator.

[9] oai:arXiv.org:1003.2136 [pdf] - 1025634

The Third US Naval Observatory CCD Astrograph Catalog (UCAC3)

Comments: accepted by AJ, 24 pages, 34 figures, 3 tables

Submitted: 2010-03-09

The third US Naval Observatory (USNO) CCD Astrograph Catalog, UCAC3 was released at the IAU General Assembly on 2009 August 10. It is the first all-sky release in this series and contains just over 100 million objects, about 95 million of them with proper motions, covering about R = 8 to 16 magnitudes. Current epoch positions are obtained from the observations with the 20 cm aperture USNO Astrograph's "red lens", equipped with a 4k by 4k CCD. Proper motions are derived by combining these observations with over 140 ground- and space-based catalogs, including Hipparcos/Tycho and the AC2000.2, as well as unpublished measures of over 5000 plates from other astrographs. For most of the faint stars in the Southern Hemisphere the Yale/San Juan first epoch plates from the SPM program (YSJ1) form the basis for proper motions. These data are supplemented by all-sky Schmidt plate survey astrometry and photometry obtained from the SuperCOSMOS project, as well as 2MASS near-IR photometry. Major differences of UCAC3 data as compared to UCAC2 include a completely new raw data reduction with improved control over systematic errors in positions, significantly improved photometry, slightly deeper limiting magnitude, coverage of the north pole region, greater completeness by inclusion of double stars and weak detections. This of course leads to a catalog which is not as "clean" as UCAC2 and problem areas are outlined for the user in this paper. The positional accuracy of stars in UCAC3 is about 15 to 100 mas per coordinate, depending on magnitude, while the errors in proper motions range from 1 to 10 mas/yr depending on magnitude and observing history, with a significant improvement over UCAC2 achieved due to the re-reduced SPM data and inclusion of more astrograph plate data unavailable at the time of UCAC2.