
Cabrera-Vives, Guillermo

Normalized to: Cabrera-Vives, G.

11 article(s) in total. 70 co-authors, from 1 to 7 common article(s). Median position in authors list is 4.0.

[1] oai:arXiv.org:2003.05499 [pdf] - 2063820

Asteroids' Size Distribution and Colors from HiTS

Peña, J.; Fuentes, C.; Förster, F.; Martínez-Palomera, J.; Cabrera-Vives, G.; Maureira, J. C.; Huijse, P.; Estévez, P. A.; Galbany, L.; González-Gaitán, S.; de Jaeger, Th.

Comments: 17 pages, 18 figures

Submitted: 2020-03-11, last modified: 2020-03-13

We report the observations of solar system objects during the 2015 campaign of the High cadence Transient Survey (HiTS). We found 5740 bodies (mostly Main Belt asteroids), 1203 of which were detected in different nights and in $g'$ and $r'$. Objects were linked in the barycenter system and their orbital parameters were computed assuming Keplerian motion. We identified 6 near Earth objects, 1738 Main Belt asteroids and 4 Trans-Neptunian objects. We did not find a $g'-r'$ color-size correlation for $14<H_{g'}<18$ ($1<D<10$ km) asteroids. We show asteroids' colors are disturbed by HiTS' 1.6 hour cadence and estimate that observations should be separated by at most 14 minutes to avoid confusion in future wide-field surveys like LSST. The size distribution for the Main Belt objects can be characterized as a simple power law with slope $\sim0.9$, steeper than in any other survey, while data from HiTS 2014's campaign is consistent with previous ones (slopes $\sim0.68$ at the bright end and $\sim0.34$ at the faint end). This difference is likely due to the ecliptic distribution of the Main Belt since 2015's campaign surveyed farther from the ecliptic than did 2014's and most previous surveys.
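As a rough illustration of how the quoted magnitude range ($14<H_{g'}<18$) maps onto the quoted size range ($1<D<10$ km), the sketch below uses the standard relation $D = 1329\,\mathrm{km}\; p^{-1/2}\, 10^{-H/5}$. That relation is defined for V-band absolute magnitudes and a geometric albedo $p$; applying it to $H_{g'}$, the albedo value of 0.05, and the function name `diameter_km` are all assumptions made here for illustration, not choices taken from the paper.

```python
import math


def diameter_km(abs_mag, albedo=0.05):
    """Approximate asteroid diameter in km from an absolute magnitude.

    Uses D = 1329 km / sqrt(albedo) * 10**(-H / 5).
    The default albedo is an illustrative assumption, not a value from the paper.
    """
    return 1329.0 / math.sqrt(albedo) * 10 ** (-abs_mag / 5.0)


if __name__ == "__main__":
    # Evaluate the two ends of the magnitude range quoted in the abstract.
    for h in (14.0, 18.0):
        print(f"H = {h:4.1f} -> D ~ {diameter_km(h):.1f} km")
```

With this assumed albedo the two ends of the magnitude range come out at roughly 9 km and 1.5 km, the same order as the range quoted in the abstract. A brighter assumed albedo would shift both values down.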

[2] oai:arXiv.org:1811.03577 [pdf] - 1791346

Labeling Bias in Galaxy Morphologies

Cabrera-Vives, Guillermo; Miller, Christopher J.; Schneider, Jeff

Comments:

Submitted: 2018-11-08

We present a metric to quantify systematic labeling bias in galaxy morphology data sets stemming from the quality of the labeled data. This labeling bias is independent from labeling errors and requires knowledge about the intrinsic properties of the data with respect to the observed properties. We conduct a relative comparison of label bias for different low redshift galaxy morphology data sets. We show our metric is able to recover previous de-biasing procedures based on redshift as biasing parameter. By using the image resolution instead, we find biases that have not been addressed. We find that the morphologies based on supervised machine-learning trained over features such as colors, shape, and concentration show significantly less bias than morphologies based on expert or citizen-science classifiers. This result holds even when there is underlying bias present in the training sets used in the supervised machine learning process. We use catalog simulations to validate our bias metric, and show how to bin the multidimensional intrinsic and observed galaxy properties used in the bias quantification. Our approach is designed to work on any other labeled multidimensional data sets and the code is publicly available.

[3] oai:arXiv.org:1807.03869 [pdf] - 1957097

Deep Learning for Image Sequence Classification of Astronomical Events

Carrasco-Davis, Rodrigo; Cabrera-Vives, Guillermo; Förster, Francisco; Estévez, Pablo A.; Huijse, Pablo; Protopapas, Pavlos; Reyes, Ignacio; Martínez-Palomera, Jorge; Donoso, Cristóbal

Comments: 20 pages, 20 figures (corrected compilation errors). This is an Accepted Manuscript version of an article accepted for publication in Publications of the Astronomical Society of the Pacific. Neither the Astronomical Society of the Pacific nor IOP Publishing Ltd is responsible for any errors or omissions in this version of the manuscript or any version derived from it

Submitted: 2018-07-10, last modified: 2018-11-07

We propose a new sequential classification model for astronomical objects based on a recurrent convolutional neural network (RCNN) which uses sequences of images as inputs. This approach avoids the computation of light curves or difference images. This is the first time that sequences of images are used directly for the classification of variable objects in astronomy. The second contribution of this work is the image simulation process. We generate synthetic image sequences that take into account the instrumental and observing conditions, obtaining a realistic set of movies for each astronomical object. The simulated dataset is used to train our RCNN classifier. This approach allows us to generate datasets to train and test our RCNN model for different astronomical surveys and telescopes. We aim at building a simulated dataset whose distribution is close enough to the real dataset, so that fine tuning could match the distributions between the real and simulated datasets. To test the RCNN classifier trained with the synthetic dataset, we used real-world data from the High cadence Transient Survey (HiTS), obtaining an average recall of 85%, improved to 94% after performing fine tuning with 10 real samples per class. We compare the results of our model with those of a light curve random forest classifier. The proposed RCNN with fine tuning has a similar performance on the HiTS dataset compared to the light curve classifier, trained on an augmented training set with 10 real samples per class. The RCNN approach presents several advantages in an alert stream classification scenario, such as a reduction of the data pre-processing, faster online evaluation and easier performance improvement using a few real data samples. These results encourage us to use this method for alert broker systems that will process alert streams generated by new telescopes such as the Large Synoptic Survey Telescope.

[4] oai:arXiv.org:1810.07857 [pdf] - 1953345

Multiband galaxy morphologies for CLASH: a convolutional neural network transferred from CANDELS

Pérez-Carrasco, Manuel; Cabrera-Vives, Guillermo; Martinez-Marín, Monserrat; Cerulo, Pierluigi; Demarco, Ricardo; Protopapas, Pavlos; Godoy, Julio; Huertas-Company, Marc

Comments: 11 pages, 11 figures, submitted to Publications of the Astronomical Society of the Pacific

Submitted: 2018-10-17

We present visual-like morphologies over 16 photometric bands, from ultra-violet to near infrared, for 8,412 galaxies in the Cluster Lensing And Supernova survey with Hubble (CLASH) obtained by a convolutional neural network (CNN) model. Our model follows the CANDELS main morphological classification scheme, obtaining the probability for each galaxy at each CLASH band of being spheroid, disk, irregular, point source, or unclassifiable. Our catalog contains morphologies for each galaxy with Hmag < 24.5 in every filter where the galaxy is observed. We trained an initial CNN model using approximately 7,500 expert eyeball labels from The Cosmic Assembly Near-IR Deep Extragalactic Legacy Survey (CANDELS). We created eyeball labels for 100 randomly selected galaxies in each of the 16 CLASH filters (1,600 galaxy images in total), where each image was classified by at least five of us. We use these labels to fine-tune the network in order to accurately predict labels for the CLASH data and to evaluate the performance of our model. We achieve a root-mean-square error of 0.0991 on the test set. We show that our proposed fine-tuning technique reduces the number of labeled images needed for training, as compared to directly training over the CLASH data, and achieves a better performance. This approach is very useful to minimize eyeball labeling efforts when classifying unlabeled data from new surveys. This will become particularly useful for massive datasets such as the ones coming from near-future surveys such as Euclid or the LSST. Our catalog consists of predicted probabilities for each galaxy by morphology in their different bands and is made publicly available at http://www.inf.udec.cl/~guille/data/Deep-CLASH.csv.

[5] oai:arXiv.org:1809.06379 [pdf] - 1752035

The delay of shock breakout due to circumstellar material seen in most Type II Supernovae

Comments: Published in Nature Astronomy (https://www.nature.com/articles/s41550-018-0563-4). 41 pages including methods. 5 figures in main text + 8 figures in methods

Submitted: 2018-09-17

Type II supernovae (SNe) originate from the explosion of hydrogen-rich supergiant massive stars. Their first electromagnetic signature is the shock breakout, a short-lived phenomenon which can last from hours to days depending on the density at shock emergence. We present 26 rising optical light curves of SN II candidates discovered shortly after explosion by the High cadence Transient Survey (HiTS) and derive physical parameters based on hydrodynamical models using a Bayesian approach. We observe a steep rise of a few days in 24 out of 26 SN II candidates, indicating the systematic detection of shock breakouts in a dense circumstellar matter consistent with a mass loss rate $\dot{M} > 10^{-4}\, M_\odot\, {\rm yr}^{-1}$ or a dense atmosphere. This implies that the characteristic hour timescale signature of stellar envelope SBOs may be rare in nature and could be delayed into longer-lived circumstellar material shock breakouts in most Type II SNe.

[6] oai:arXiv.org:1809.00763 [pdf] - 1767631

The High Cadence Transient Survey (HiTS): Compilation and characterization of light-curve catalogs

Martínez-Palomera, Jorge; Förster, Francisco; Protopapas, Pavlos; Maureira, Juan Carlos; Lira, Paulina; Cabrera-Vives, Guillermo; Huijse, Pablo; Galbany, Lluis; de Jaeger, Thomas; González-Gaitán, Santiago; Medina, Gustavo; Pignata, Giuliano; Martín, Jaime San; Hamuy, Mario; Muñoz, Ricardo R.

Comments: 22 pages including 10 figures and 9 tables. Accepted for publication in AJ. For associated files, see http://astro.cmm.uchile.cl/HiTS/

Submitted: 2018-09-03, last modified: 2018-09-07

The High Cadence Transient Survey (HiTS) aims to discover and study transient objects with characteristic timescales between hours and days, such as pulsating, eclipsing and exploding stars. This survey represents a unique laboratory to explore large etendue observations from cadences of about 0.1 days and to test new computational tools for the analysis of large data. This work follows a fully \textit{Data Science} approach: from the raw data to the analysis and classification of variable sources. We compile a catalog of ${\sim}15$ million object detections and a catalog of ${\sim}2.5$ million light-curves classified by variability. The typical depth of the survey is $24.2$, $24.3$, $24.1$ and $23.8$ in $u$, $g$, $r$ and $i$ bands, respectively. We classified all point-like non-moving sources by first extracting features from their light-curves and then applying a Random Forest classifier. For the classification, we used a training set constructed using a combination of cross-matched catalogs, visual inspection, transfer/active learning and data augmentation. The classification model consists of several Random Forest classifiers organized in a hierarchical scheme. The classifier accuracy estimated on a test set is approximately $97\%$. In the unlabeled data, $3\,485$ sources were classified as variables, of which $1\,321$ were classified as periodic. Among the periodic classes we discovered, with high confidence, 1 $\delta$ Scuti star, 39 eclipsing binaries, 48 rotational variables and 90 RR Lyrae stars, and among the non-periodic classes we discovered 1 cataclysmic variable, 630 QSOs and 1 supernova candidate. The first data release can be accessed in the project archive of HiTS.

[7] oai:arXiv.org:1808.03626 [pdf] - 1820132

Enhanced Rotational Invariant Convolutional Neural Network for Supernovae Detection

Reyes, Esteban; Estévez, Pablo A.; Reyes, Ignacio; Cabrera-Vives, Guillermo; Huijse, Pablo; Carrasco-Davis, Rodrigo; Förster, Francisco

Comments: 8 pages, 5 figures. Accepted for publication in proceedings of the IEEE World Congress on Computational Intelligence (IEEE WCCI), Rio de Janeiro, Brazil, 8-13 July, 2018

Submitted: 2018-08-10

In this paper, we propose an enhanced CNN model for detecting supernovae (SNe). This is done by applying a new method for obtaining rotational invariance that exploits cyclic symmetry. In addition, we use a visualization approach, the layer-wise relevance propagation (LRP) method, which allows finding the relevant pixels in each image that contribute to discriminate between SN candidates and artifacts. We introduce a measure to assess quantitatively the effect of the rotational invariant methods on the LRP relevance heatmaps. This allows comparing the proposed method, CAP, with the original Deep-HiTS model. The results show that the enhanced method presents an augmented capacity for achieving rotational invariance with respect to the original model. An ensemble of CAP models obtained the best results so far on the HiTS dataset, reaching an average accuracy of 99.53%. The improvement over Deep-HiTS is significant both statistically and in practice.
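One simple way to obtain invariance under the cyclic group of 90-degree rotations is to average a feature extractor's output over the four rotated copies of an image. The sketch below illustrates only that general idea. The abstract does not spell out how CAP works, so this should not be read as the paper's method; `toy_features` is a hypothetical, deliberately orientation-sensitive stand-in for a CNN, and all function names are assumptions.

```python
def rot90(img):
    """Rotate a square 2D list-of-lists image by 90 degrees counter-clockwise."""
    return [list(row) for row in zip(*img)][::-1]


def toy_features(img):
    """Hypothetical orientation-sensitive extractor (stand-in for a CNN).

    Returns the sum of the top half of the rows and the sum of the left
    half of the columns, so the output changes when the image is rotated.
    """
    n = len(img)
    top = sum(sum(row) for row in img[: n // 2])
    left = sum(row[j] for row in img for j in range(len(row) // 2))
    return [float(top), float(left)]


def cyclic_average(img, extractor=toy_features):
    """Average the extractor output over the four 90-degree rotations."""
    feats = []
    current = img
    for _ in range(4):
        feats.append(extractor(current))
        current = rot90(current)
    return [sum(col) / 4.0 for col in zip(*feats)]


if __name__ == "__main__":
    image = [[1, 2], [3, 4]]
    rotated = rot90(image)
    # The raw features differ between the two orientations...
    print(toy_features(image), toy_features(rotated))
    # ...while the cyclically averaged features agree.
    print(cyclic_average(image), cyclic_average(rotated))
```

Rotating the input only permutes the set of four rotated copies, so the average is unchanged. The invariance holds exactly for multiples of 90 degrees on square images and says nothing about arbitrary angles.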

[8] oai:arXiv.org:1806.03352 [pdf] - 1697179

Asteroids in the High cadence Transient Survey

Peña, J.; Fuentes, C.; Förster, F.; Maureira, J. C.; Martín, J. San; Littín, J.; Huijse, P.; Cabrera-Vives, G.; Estévez, P. A.; Galbany, L.; González-Gaitán, S.; Martínez, J.; de Jaeger, Th.; Hamuy, M.

Comments: 9 pages, 7 figures

Submitted: 2018-06-08

We report on the serendipitous observations of Solar System objects imaged during the High cadence Transient Survey (HiTS) 2014 observation campaign. Data from this high cadence, wide field survey was originally analyzed for finding variable static sources using Machine Learning to select the most-likely candidates. In this work we search for moving transients consistent with Solar System objects and derive their orbital parameters. We use a simple, custom detection algorithm to link trajectories and assume Keplerian motion to derive the asteroids' orbital parameters. We use known asteroids from the Minor Planet Center (MPC) database to assess the detection efficiency of the survey and our search algorithm. Trajectories have an average of nine detections spread over 2 days, and our fit yields typical errors of $\sigma_a\sim 0.07~{\rm AU}$, $\sigma_{\rm e} \sim 0.07$ and $\sigma_i\sim 0.5^{\circ}$ in semi-major axis, eccentricity, and inclination respectively for known asteroids in our sample. We extract 7,700 orbits from our trajectories, identifying 19 near Earth objects, 6,687 asteroids, 14 Centaurs, and 15 trans-Neptunian objects. This highlights the complementarity of supernova wide field surveys for Solar System research and the significance of machine learning to clean data of false detections. It is a good example of the data-driven science that LSST will deliver.

[9] oai:arXiv.org:1701.00458 [pdf] - 1534038

Deep-HiTS: Rotation Invariant Convolutional Neural Network for Transient Detection

Cabrera-Vives, Guillermo; Reyes, Ignacio; Förster, Francisco; Estévez, Pablo A.; Maureira, Juan-Carlos

Comments:

Submitted: 2017-01-02

We introduce Deep-HiTS, a rotation invariant convolutional neural network (CNN) model for classifying images of transient candidates into artifacts or real sources for the High cadence Transient Survey (HiTS). CNNs have the advantage of learning the features automatically from the data while achieving high performance. We compare our CNN model against a feature engineering approach using random forests (RF). We show that our CNN significantly outperforms the RF model, reducing the error by almost half. Furthermore, for a fixed number of approximately 2,000 allowed false transient candidates per night we are able to reduce the misclassified real transients by approximately 1/5. To the best of our knowledge, this is the first time CNNs have been used to detect astronomical transient events. Our approach will be very useful when processing images from next generation instruments such as the Large Synoptic Survey Telescope (LSST). We have made all our code and data available to the community for the sake of allowing further developments and comparisons at https://github.com/guille-c/Deep-HiTS.

[10] oai:arXiv.org:1509.05429 [pdf] - 1304133

A catalog of visual-like morphologies in the 5 CANDELS fields using deep-learning

Huertas-Company, M.; Gravet, R.; Cabrera-Vives, G.; Pérez-González, P. G.; Kartaltepe, J. S.; Barro, G.; Bernardi, M.; Mei, S.; Shankar, F.; Dimauro, P.; Bell, E. F.; Kocevski, D.; Koo, D. C.; Faber, S. M.; Mcintosh, D. H.

Comments: Accepted for publication in ApJS. Figure 10 summarizes the excellent agreement between our classification and a pure visual one. Table 3 shows the content of the catalogs. The catalogs are available from the Rainbow database (http://rainbowx.fis.ucm.es/Rainbow_navigator_public) based on the selections from the CANDELS team and cross-matched with 3D-HST v4.1 catalogs

Submitted: 2015-09-17

We present a catalog of visual-like H-band morphologies of $\sim50,000$ galaxies ($H_{f160w}<24.5$) in the 5 CANDELS fields (GOODS-N, GOODS-S, UDS, EGS and COSMOS). Morphologies are estimated with Convolutional Neural Networks (ConvNets). The median redshift of the sample is $\langle z\rangle\sim1.25$. The algorithm is trained on GOODS-S, for which visual classifications are publicly available, and then applied to the other 4 fields. Following the CANDELS main morphology classification scheme, our model retrieves the probabilities for each galaxy of having a spheroid, a disk, presenting an irregularity, being compact or a point source and being unclassifiable. ConvNets are able to predict the fractions of votes given a galaxy image with zero bias and $\sim10\%$ scatter. The fraction of misclassifications is less than $1\%$. Our classification scheme represents a major improvement with respect to CAS (Concentration-Asymmetry-Smoothness)-based methods, which hit a $20-30\%$ contamination limit at high z. The catalog is released with the present paper via the Rainbow database (http://rainbowx.fis.ucm.es/Rainbow_navigator_public).

[11] oai:arXiv.org:1506.03084 [pdf] - 1264041

The morphologies of massive galaxies from z~3 - Witnessing the 2 channels of bulge growth

Huertas-Company, Marc; Pérez-González, Pablo G.; Mei, Simona; Shankar, Francesco; Bernardi, Mariangela; Daddi, Emanuele; Barro, Guillermo; Cabrera-Vives, Guillermo; Cattaneo, Andrea; Dimauro, Paola; Gravet, Romaric

Comments: accepted for publication in ApJ - comments welcome

Submitted: 2015-06-09

[abridged] We quantify the morphological evolution of z~0 massive galaxies ($M_*/M_\odot\sim10^{11}$) from z~3 in the 5 CANDELS fields. The progenitors are selected using abundance matching techniques to account for the mass growth. The morphologies strongly evolve from z~3. At z<1, the population matches the massive end of the Hubble sequence, with 30% of spheroids, 50% of galaxies with equally dominant disk and bulge components and 20% of disks. At z~2-3 there is a majority of irregular systems (~60-70%) with still 30% of spheroids. We then analyze the SFRs, gas fractions and structural properties for the different morphologies independently. Our results suggest two distinct channels for the growth of bulges in massive galaxies. Around 30-40% were already bulges at z~2.5, with low average SFRs and gas fractions (10-15%), high Sersic indices (n>3-4) and small effective radii ($R_e$~1 kpc), pointing towards an early formation through gas-rich mergers or VDI. Between z~2.5 and z~0, they rapidly increase their size by a factor of ~4-5 and all become passive, but their global morphology remains unaltered. The structural evolution is independent of the gas fractions, suggesting that it is driven by ex-situ events. The remaining 60% experience a gradual morphological transformation, from clumpy disks to more regular bulge+disk systems, essentially happening at z>1. It results in the growth of a significant bulge component (n~3) for 2/3 of the systems, possibly through the migration of clumps, while the remaining 1/3 keeps a rather small bulge (n~1.5-2). The transition phase between disturbed and relaxed systems and the emergence of the bulge is correlated with a decrease of the star formation activity and the gas fractions. The growth of the effective radii scales roughly with $H(z)^{-1}$ and it is therefore consistent with the expected growth of disks in galaxy haloes.