sort results by

Use logical operators AND, OR, NOT and round brackets to construct complex queries. Whitespace-separated words are treated as ANDed.

Show articles per page in mode

Ruiz, Idoia

Normalized to: Ruiz, I.

3 article(s) in total. 12 co-authors, from 1 to 3 common article(s). Median position in authors list is 10,0.

[1]  oai:arXiv.org:1504.00782  [pdf] - 1232162
A comparative study of four significance measures for periodicity detection in astronomical surveys
Comments: 16 pages, 14 figures, 1 table
Submitted: 2015-04-03
We study the problem of periodicity detection in massive data sets of photometric or radial velocity time series, as presented by ESA's Gaia mission. Periodicity detection hinges on the estimation of the false alarm probability (FAP) of the extremum of the periodogram of the time series. We consider the problem of its estimation with two main issues in mind. First, for a given number of observations and signal-to-noise ratio, the rate of correct periodicity detections should be constant for all realized cadences of observations regardless of the observational time patterns, in order to avoid sky biases that are difficult to assess. Second, the computational loads should be kept feasible even for millions of time series. Using the Gaia case, we compare the $F^M$ method (Paltani 2004, Schwarzenberg-Czerny 2012), the Baluev method (Baluev 2008) and the GEV method (S\"uveges 2014), as well as a method for the direct estimation of a threshold. Three methods involve some unknown parameters, which are obtained by fitting a regression-type predictive model using easily obtainable covariates derived from observational time series. We conclude that the GEV and the Baluev methods both provide good solutions to the issues posed by a large-scale processing. The first of these yields the best scientific quality at the price of some moderately costly pre-processing. When this pre-processing is impossible for some reason (e.g. the computational costs are prohibitive or good regression models cannot be constructed), the Baluev method provides a computationally inexpensive alternative with slight biases in regions where time samplings exhibit strong aliases.
[2]  oai:arXiv.org:1502.01165  [pdf] - 931035
Automated eclipsing binary detection: applying the Gaia CU7 pipeline to Hipparcos
Comments: 4 pages, 1 figure, to be published in conference proceedings: "The Milky Way Unravelled by Gaia" in "EAS Publications Series"
Submitted: 2015-02-04
We demonstrate the eclipsing binary detection performance of the Gaia variability analysis and processing pipeline using Hipparcos data. The automated pipeline classifies 1,067 (0.9%) of the 118,204 Hipparcos sources as eclipsing binary candidates. The detection rate amounts to 89% (732 sources) in a subset of 819 visually confirmed eclipsing binaries, with the period correctly identified for 80% of them, and double or half periods obtained in 6% of the cases.
[3]  oai:arXiv.org:1411.5943  [pdf] - 904189
Time series data mining for the Gaia variability analysis
Comments: 4 pages, 3 figures. appears in the Proc. of the 2014 conference on Big Data from Space (BiDS14), European Commission, Joint Research Centre, P. Soille, P. G. Marchetti (eds)
Submitted: 2014-11-21
Gaia is an ESA cornerstone mission, which was successfully launched December 2013 and commenced operations in July 2014. Within the Gaia Data Processing and Analysis consortium, Coordination Unit 7 (CU7) is responsible for the variability analysis of over a billion celestial sources and nearly 4 billion associated time series (photometric, spectrophotometric, and spectroscopic), encoding information in over 800 billion observations during the 5 years of the mission, resulting in a petabyte scale analytical problem. In this article, we briefly describe the solutions we developed to address the challenges of time series variability analysis: from the structure for a distributed data-oriented scientific collaboration to architectural choices and specific components used. Our approach is based on Open Source components with a distributed, partitioned database as the core to handle incrementally: ingestion, distributed processing, analysis, results and export in a constrained time window.