Normalized to: Kubica, J.
[1]
oai:arXiv.org:1911.02479 [pdf] - 1994455
Algorithms and Statistical Models for Scientific Discovery in the
Petabyte Era
Nord, Brian;
Connolly, Andrew J.;
Kinney, Jamie;
Kubica, Jeremy;
Narayan, Gautaum;
Peek, Joshua E. G.;
Schafer, Chad;
Tollerud, Erik J.;
Avestruz, Camille;
Babu, G. Jogesh;
Birrer, Simon;
Burke, Douglas;
Caldeira, João;
Caldwell, Douglas A.;
Carlberg, Joleen K.;
Chen, Yen-Chi;
Dong, Chuanfei;
Feigelson, Eric D.;
Golkhou, V. Zach;
Kashyap, Vinay;
Li, T. S.;
Loredo, Thomas;
Lucie-Smith, Luisa;
Mandel, Kaisey S.;
Martínez-Galarza, J. R.;
Miller, Adam A.;
Natarajan, Priyamvada;
Ntampaka, Michelle;
Ptak, Andy;
Rapetti, David;
Shamir, Lior;
Siemiginowska, Aneta;
Sipőcz, Brigitta M.;
Smith, Arfon M.;
Tran, Nhan;
Vilalta, Ricardo;
Walkowicz, Lucianne M.;
ZuHone, John
Submitted: 2019-11-04
The field of astronomy has arrived at a turning point in terms of size and
complexity of both datasets and scientific collaboration. Commensurately,
algorithms and statistical models have begun to adapt (e.g., via the onset
of artificial intelligence), which itself presents new challenges and
opportunities for growth. This white paper aims to offer guidance and ideas for
how we can evolve our technical and collaborative frameworks to promote
efficient algorithmic development and take advantage of opportunities for
scientific discovery in the petabyte era. We discuss challenges for discovery
in large and complex data sets; challenges and requirements for the next stage
of development of statistical methodologies and algorithmic tool sets; how we
might change our paradigms of collaboration and education; and the ethical
implications of scientists' contributions to widely applicable algorithms and
computational modeling. We start with six distinct recommendations that are
supported by the commentary following them. This white paper is related to a
larger corpus of effort that has taken place within and around the Petabytes to
Science Workshops (https://petabytestoscience.github.io/).
[2]
oai:arXiv.org:1302.7281 [pdf] - 1164931
The Pan-STARRS Moving Object Processing System
Denneau, Larry;
Jedicke, Robert;
Grav, Tommy;
Granvik, Mikael;
Kubica, Jeremy;
Milani, Andrea;
Veres, Peter;
Wainscoat, Richard;
Chang, Daniel;
Pierfederici, Francesco;
Kaiser, N.;
Chambers, K. C.;
Heasley, J. N.;
Magnier, Eugene A.;
Price, P. A.;
Myers, Jonathan;
Kleyna, Jan;
Hsieh, Henry;
Farnocchia, Davide;
Waters, Chris;
Sweeney, W. H.;
Green, Denver;
Bolin, Bryce;
Burgett, W. S.;
Morgan, J. S.;
Tonry, John L.;
Hodapp, K. W.;
Chastel, Serge;
Chesley, Steve;
Fitzsimmons, Alan;
Holman, Matthew;
Spahr, Tim;
Tholen, David;
Williams, Gareth V.;
Abe, Shinsuke;
Armstrong, J. D.;
Bressi, Terry H.;
Holmes, Robert;
Lister, Tim;
McMillan, Robert S.;
Micheli, Marco;
Ryan, Eileen V.;
Ryan, William H.;
Scotti, James V.
Submitted: 2013-02-28
We describe the Pan-STARRS Moving Object Processing System (MOPS), a modern
software package that produces automatic asteroid discoveries and
identifications from catalogs of transient detections from next-generation
astronomical survey telescopes. MOPS achieves >99.5% efficiency in producing
orbits from a synthetic but realistic population of asteroids whose
measurements were simulated for a Pan-STARRS4-class telescope. Additionally,
using a non-physical grid population, we demonstrate that MOPS can detect
populations of currently unknown objects such as interstellar asteroids.
MOPS has been adapted successfully to the prototype Pan-STARRS1 telescope
despite differences in expected false detection rates, fill-factor loss and
relatively sparse observing cadence compared to a hypothetical Pan-STARRS4
telescope and survey. MOPS remains >99.5% efficient at detecting objects on a
single night but drops to 80% efficiency at producing orbits for objects
detected on multiple nights. This loss is primarily due to configurable MOPS
processing limits that are not yet tuned for the Pan-STARRS1 mission.
The core MOPS software package is the product of more than 15 person-years of
software development and incorporates countless additional years of effort in
third-party software to perform lower-level functions such as spatial searching
or orbit determination. We describe the high-level design of MOPS and its
essential subcomponents, discuss the suitability of MOPS for other survey
programs, and suggest a road map for future MOPS development.
[3]
oai:arXiv.org:astro-ph/0703475 [pdf] - 90284
Efficient intra- and inter-night linking of asteroid detections using
kd-trees
Kubica, Jeremy;
Denneau, Larry;
Grav, Tommy;
Heasley, James;
Jedicke, Robert;
Masiero, Joseph;
Milani, Andrea;
Moore, Andrew;
Tholen, David;
Wainscoat, Richard J.
Submitted: 2007-03-19
The Panoramic Survey Telescope And Rapid Response System (Pan-STARRS) under
development at the University of Hawaii's Institute for Astronomy is creating
the first fully automated end-to-end Moving Object Processing System (MOPS) in
the world. It will be capable of identifying detections of moving objects in
our solar system and linking those detections within and between nights,
attributing those detections to known objects, calculating initial and
differentially-corrected orbits for linked detections, precovering detections
when they exist, and performing orbit identification. Here we describe new
kd-tree and variable-tree algorithms that allow fast, efficient, scalable
linking of intra- and inter-night detections. Using a pseudo-realistic
simulation of the
Pan-STARRS survey strategy incorporating weather, astrometric accuracy and
false detections we have achieved nearly 100% efficiency and accuracy for
intra-night linking and nearly 100% efficiency for inter-night linking within a
lunation. At realistic sky-plane densities for both real and false detections
the intra-night linking of detections into 'tracks' currently has an accuracy
of 0.3%. Successful tests of the MOPS on real source detections from the
Spacewatch asteroid survey indicate that the MOPS is capable of identifying
asteroids in real data.
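
As a rough illustration of the intra-night linking idea summarized above, the
following is a minimal sketch, not the MOPS implementation. It assumes a flat
sky-plane approximation, two exposures that each share a single timestamp, and
a simple maximum-rate cut. The names Detection, build, within and link_pairs,
and the parameter max_rate, are hypothetical and chosen only for this example.
A 2-d kd-tree is built over the second exposure, and each detection in the
first exposure is paired with every second-exposure detection it could have
reached at or below the maximum rate.

```python
import math
from typing import List, NamedTuple, Optional, Tuple


class Detection(NamedTuple):
    det_id: int
    t: float  # observation time in days
    x: float  # sky-plane position in degrees (flat-sky assumption)
    y: float


class Node(NamedTuple):
    det: Detection
    axis: int  # 0 splits on x, 1 splits on y
    left: Optional["Node"]
    right: Optional["Node"]


def build(dets: List[Detection], depth: int = 0) -> Optional[Node]:
    """Build a 2-d kd-tree by splitting on the median, alternating axes."""
    if not dets:
        return None
    axis = depth % 2
    dets = sorted(dets, key=lambda d: (d.x, d.y)[axis])
    mid = len(dets) // 2
    return Node(dets[mid], axis,
                build(dets[:mid], depth + 1),
                build(dets[mid + 1:], depth + 1))


def within(node: Optional[Node], x: float, y: float, r: float,
           out: List[Detection]) -> None:
    """Append to `out` every detection in the tree within distance r of (x, y)."""
    if node is None:
        return
    d = node.det
    if math.hypot(d.x - x, d.y - y) <= r:
        out.append(d)
    delta = (x, y)[node.axis] - (d.x, d.y)[node.axis]
    near, far = (node.left, node.right) if delta < 0 else (node.right, node.left)
    within(near, x, y, r, out)
    # The far side can only hold matches if the splitting line is within r.
    if abs(delta) <= r:
        within(far, x, y, r, out)


def link_pairs(first: List[Detection], second: List[Detection],
               max_rate: float) -> List[Tuple[int, int]]:
    """Pair detections whose implied motion is at most max_rate (deg/day)."""
    if not first or not second:
        return []
    radius = max_rate * (second[0].t - first[0].t)
    tree = build(second)
    pairs = []
    for a in first:
        hits: List[Detection] = []
        within(tree, a.x, a.y, radius, hits)
        pairs.extend((a.det_id, b.det_id) for b in hits)
    return pairs


first = [Detection(0, 0.0, 10.0, 10.0), Detection(1, 0.0, 20.0, 20.0)]
second = [Detection(2, 0.02, 10.005, 10.0), Detection(3, 0.02, 20.5, 20.0),
          Detection(4, 0.02, 15.0, 15.0)]
print(link_pairs(first, second, max_rate=0.5))  # prints [(0, 2)]
```

The tree lets each query skip whole regions of the second exposure instead of
comparing against every detection, which is the property that makes this kind
of search scale to dense fields. A real pipeline would also have to handle
spherical geometry, astrometric error and false detections, none of which are
modeled here.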
[4]
oai:arXiv.org:astro-ph/0701506 [pdf] - 88635
LSST: Comprehensive NEO Detection, Characterization, and Orbits
Submitted: 2007-01-17
(Abridged) The Large Synoptic Survey Telescope (LSST) is currently by far the
most ambitious proposed ground-based optical survey. Solar System mapping is
one of the four key scientific design drivers, with emphasis on efficient
Near-Earth Object (NEO) and Potentially Hazardous Asteroid (PHA) detection,
orbit determination, and characterization. In a continuous observing campaign
of pairs of 15 second exposures of its 3,200 megapixel camera, LSST will cover
the entire available sky every three nights in two photometric bands to a depth
of V=25 per visit (two exposures), with exquisitely accurate astrometry and
photometry. Over the proposed survey lifetime of 10 years, each sky location
would be visited about 1000 times. The baseline design satisfies strong
constraints on the cadence of observations mandated by PHAs such as closely
spaced pairs of observations to link different detections and short exposures
to avoid trailing losses. Equally important, due to frequent repeat visits LSST
will effectively provide its own follow-up to derive orbits for detected moving
objects. Detailed modeling of LSST operations, incorporating real historical
weather and seeing data from the LSST site at Cerro Pachón, shows that LSST using
its baseline design cadence could find 90% of the PHAs with diameters larger
than 250 m, and 75% of those greater than 140 m within ten years. However, by
optimizing sky coverage, the ongoing simulations suggest that the LSST system,
with its first light in 2013, can reach the Congressional mandate of cataloging
90% of PHAs larger than 140 m by 2020.