Normalized to: Blocker, A.
[1]
oai:arXiv.org:1401.2134 [pdf] - 1202657
10 Simple Rules for the Care and Feeding of Scientific Data
Goodman, Alyssa;
Pepe, Alberto;
Blocker, Alexander W.;
Borgman, Christine L.;
Cranmer, Kyle;
Crosas, Mercè;
Di Stefano, Rosanne;
Gil, Yolanda;
Groth, Paul;
Hedstrom, Margaret;
Hogg, David W.;
Kashyap, Vinay;
Mahabal, Ashish;
Siemiginowska, Aneta;
Slavkovic, Aleksandra
Submitted: 2014-01-09
This article offers a short guide to the steps scientists can take to ensure
that their data and associated analyses continue to be of value and to be
recognized. In just the past few years, hundreds of scholarly papers and
reports have been written on questions of data sharing, data provenance,
research reproducibility, licensing, attribution, privacy, and more, but our
goal here is not to review that literature. Instead, we present a short guide
intended for researchers who want to know why it is important to "care for and
feed" data, with some practical advice on how to do that.
[2]
oai:arXiv.org:1301.3027 [pdf] - 616798
Semi-parametric Robust Event Detection for Massive Time-Domain Databases
Submitted: 2013-01-14, last modified: 2013-01-19
The detection and analysis of events within massive collections of
time series has become an extremely important task for time-domain astronomy.
In particular, many scientific investigations (e.g., the analysis of
microlensing and other transients) begin with the detection of isolated events
in irregularly sampled series with both non-linear trends and non-Gaussian
noise. We outline a semi-parametric, robust, parallel method for identifying
variability and isolated events at multiple scales in the presence of the above
complications. This approach harnesses the power of Bayesian modeling while
maintaining much of the speed and scalability of more ad hoc machine-learning
approaches. We also contrast this work with event detection methods from other
fields, highlighting the unique challenges posed by astronomical surveys.
Finally, we present results from the application of this method to 87.2 million
EROS-2 sources, where we have obtained a greater than 100-fold reduction in
candidates for certain types of phenomena while creating high-quality features
for subsequent analyses.
[3]
oai:arXiv.org:0904.0645 [pdf] - 315790
A Bayesian approach to the analysis of time symmetry in light curves:
Reconsidering Scorpius X-1 occultations
Submitted: 2009-04-04
We present a new approach to the analysis of time symmetry in light curves,
such as the X-ray light curves at the center of the Scorpius X-1 occultation
debate. Our method uses a new parameterization for such events (the bilogistic
event profile) and provides a clear, physically relevant characterization of
each event's key features. We also demonstrate a Markov chain Monte Carlo
algorithm to carry out this analysis, including a novel independence chain
configuration for the estimation of each event's location in the light curve.
These tools are applied to the Scorpius X-1 light curves presented in Chang et
al. (2007), providing additional evidence based on the time series that the
events detected thus far are most likely not occultations by trans-Neptunian
objects (TNOs).