Normalized to: Ruiz, I.
[1]
oai:arXiv.org:1504.00782 [pdf] - 1232162
A comparative study of four significance measures for periodicity
detection in astronomical surveys
Süveges, Maria;
Guy, Leanne P.;
Eyer, Laurent;
Cuypers, Jan;
Holl, Berry;
Lecoeur-Taïbi, Isabelle;
Mowlavi, Nami;
Nienartowicz, Krzysztof;
Blanco, Diego Ordóñez;
Rimoldini, Lorenzo;
Ruiz, Idoia
Submitted: 2015-04-03
We study the problem of periodicity detection in massive data sets of
photometric or radial velocity time series, as presented by ESA's Gaia mission.
Periodicity detection hinges on the estimation of the false alarm probability
(FAP) of the extremum of the periodogram of the time series. We consider the
problem of its estimation with two main issues in mind. First, for a given
number of observations and signal-to-noise ratio, the rate of correct
periodicity detections should be constant for all realized cadences of
observations regardless of the observational time patterns, in order to avoid
sky biases that are difficult to assess. Second, the computational loads should
be kept feasible even for millions of time series. Using the Gaia case, we
compare the $F^M$ method (Paltani 2004, Schwarzenberg-Czerny 2012), the Baluev
method (Baluev 2008) and the GEV method (S\"uveges 2014), as well as a method
for the direct estimation of a threshold. Three methods involve some unknown
parameters, which are obtained by fitting a regression-type predictive model
using easily obtainable covariates derived from observational time series. We
conclude that the GEV and the Baluev methods both provide good solutions to the
issues posed by a large-scale processing. The first of these yields the best
scientific quality at the price of some moderately costly pre-processing. When
this pre-processing is impossible for some reason (e.g. the computational costs
are prohibitive or good regression models cannot be constructed), the Baluev
method provides a computationally inexpensive alternative with slight biases in
regions where time samplings exhibit strong aliases.
[2]
oai:arXiv.org:1502.01165 [pdf] - 931035
Automated eclipsing binary detection: applying the Gaia CU7 pipeline to
Hipparcos
Holl, Berry;
Mowlavi, Nami;
Lecoeur-Taïbi, Isabelle;
Barblan, Fabio;
Rimoldini, Lorenzo;
Eyer, Laurent;
Süveges, Maria;
Guy, Leanne;
Ordoñez-Blanco, Diego;
Ruiz, Idoia;
Nienartowicz, Krzysztof
Submitted: 2015-02-04
We demonstrate the eclipsing binary detection performance of the Gaia
variability analysis and processing pipeline using Hipparcos data. The
automated pipeline classifies 1,067 (0.9%) of the 118,204 Hipparcos sources as
eclipsing binary candidates. The detection rate amounts to 89% (732 sources) in
a subset of 819 visually confirmed eclipsing binaries, with the period
correctly identified for 80% of them, and double or half periods obtained in 6%
of the cases.
[3]
oai:arXiv.org:1411.5943 [pdf] - 904189
Time series data mining for the Gaia variability analysis
Nienartowicz, Krzysztof;
Blanco, Diego Ordóñez;
Guy, Leanne;
Holl, Berry;
Lecoeur-Taïbi, Isabelle;
Mowlavi, Nami;
Rimoldini, Lorenzo;
Ruiz, Idoia;
Süveges, Maria;
Eyer, Laurent
Submitted: 2014-11-21
Gaia is an ESA cornerstone mission, which was successfully launched December
2013 and commenced operations in July 2014. Within the Gaia Data Processing and
Analysis consortium, Coordination Unit 7 (CU7) is responsible for the
variability analysis of over a billion celestial sources and nearly 4 billion
associated time series (photometric, spectrophotometric, and spectroscopic),
encoding information in over 800 billion observations during the 5 years of the
mission, resulting in a petabyte scale analytical problem. In this article, we
briefly describe the solutions we developed to address the challenges of time
series variability analysis: from the structure for a distributed data-oriented
scientific collaboration to architectural choices and specific components used.
Our approach is based on Open Source components with a distributed, partitioned
database as the core to handle incrementally: ingestion, distributed
processing, analysis, results and export in a constrained time window.