Normalized to: Cabrera-Vives, G.
[1]
oai:arXiv.org:2003.05499 [pdf] - 2063820
Asteroids' Size Distribution and Colors from HiTS
Peña, J.;
Fuentes, C.;
Förster, F.;
Martínez-Palomera, J.;
Cabrera-Vives, G.;
Maureira, J. C.;
Huijse, P.;
Estévez, P. A.;
Galbany, L.;
González-Gaitán, S.;
de Jaeger, Th.
Submitted: 2020-03-11, last modified: 2020-03-13
We report the observations of solar system objects during the 2015 campaign
of the High cadence Transient Survey (HiTS). We found 5740 bodies (mostly Main
Belt asteroids), 1203 of which were detected in different nights and in $g'$
and $r'$. Objects were linked in the barycenter system and their orbital
parameters were computed assuming Keplerian motion. We identified 6 near Earth
objects, 1738 Main Belt asteroids and 4 Trans-Neptunian objects. We did not
find a $g'-r'$ color-size correlation for $14<H_{g'}<18$ ($1<D<10$ km)
asteroids. We show asteroids' colors are disturbed by HiTS' 1.6 hour cadence
and estimate that observations should be separated by at most 14 minutes to
avoid confusion in future wide-field surveys like LSST. The size distribution
for the Main Belt objects can be characterized as a simple power law with slope
$\sim0.9$, steeper than in any other survey, while data from HiTS 2014's
campaign is consistent with previous ones (slopes $\sim0.68$ at the bright end
and $\sim0.34$ at the faint end). This difference is likely due to the ecliptic
distribution of the Main Belt since 2015's campaign surveyed farther from the
ecliptic than did 2014's and most previous surveys.
[2]
oai:arXiv.org:1811.03577 [pdf] - 1791346
Labeling Bias in Galaxy Morphologies
Submitted: 2018-11-08
We present a metric to quantify systematic labeling bias in galaxy morphology
data sets stemming from the quality of the labeled data. This labeling bias is
independent from labeling errors and requires knowledge about the intrinsic
properties of the data with respect to the observed properties. We conduct a
relative comparison of label bias for different low redshift galaxy morphology
data sets. We show our metric is able to recover previous de-biasing procedures
based on redshift as biasing parameter. By using the image resolution instead,
we find biases that have not been addressed. We find that the morphologies
based on supervised machine-learning trained over features such as colors,
shape, and concentration show significantly less bias than morphologies based
on expert or citizen-science classifiers. This result holds even when there is
underlying bias present in the training sets used in the supervised machine
learning process. We use catalog simulations to validate our bias metric, and
show how to bin the multidimensional intrinsic and observed galaxy properties
used in the bias quantification. Our approach is designed to work on any other
labeled multidimensional data sets and the code is publicly available.
[3]
oai:arXiv.org:1807.03869 [pdf] - 1957097
Deep Learning for Image Sequence Classification of Astronomical Events
Submitted: 2018-07-10, last modified: 2018-11-07
We propose a new sequential classification model for astronomical objects
based on a recurrent convolutional neural network (RCNN) which uses sequences
of images as inputs. This approach avoids the computation of light curves or
difference images. This is the first time that sequences of images are used
directly for the classification of variable objects in astronomy. The second
contribution of this work is the image simulation process. We generate
synthetic image sequences that take into account the instrumental and observing
conditions, obtaining a realistic, set of movies for each astronomical object.
The simulated dataset is used to train our RCNN classifier. This approach
allows us to generate datasets to train and test our RCNN model for different
astronomical surveys and telescopes. We aim at building a simulated dataset
whose distribution is close enough to the real dataset, so that a fine tuning
could match the distributions between real and simulated dataset. To test the
RCNN classifier trained with the synthetic dataset, we used real-world data
from the High cadence Transient Survey (HiTS) obtaining an average recall of
85%, improved to 94% after performing fine tuning with 10 real samples per
class. We compare the results of our model with those of a light curve random
forest classifier. The proposed RCNN with fine tuning has a similar performance
on the HiTS dataset compared to the light curve classifier, trained on an
augmented training set with 10 real samples per class. The RCNN approach
presents several advantages in an alert stream classification scenario, such as
a reduction of the data pre-processing, faster online evaluation and easier
performance improvement using a few real data samples. These results encourage
us to use this method for alert brokers systems that will process alert streams
generated by new telescopes such as the Large Synoptic Survey Telescope.
[4]
oai:arXiv.org:1810.07857 [pdf] - 1953345
Multiband galaxy morphologies for CLASH: a convolutional neural network
transferred from CANDELS
Submitted: 2018-10-17
We present visual-like morphologies over 16 photometric bands, from
ultra-violet to near infrared, for 8,412 galaxies in the Cluster Lensing And
Supernova survey with Hubble (CLASH) obtained by a convolutional neural network
(CNN) model. Our model follows the CANDELS main morphological classification
scheme, obtaining the probability for each galaxy at each CLASH band of being
spheroid, disk, irregular, point source, or unclassifiable. Our catalog
contains morphologies for each galaxy with Hmag < 24.5 in every filter where
the galaxy is observed. We trained an initial CNN model using approximately
7,500 expert eyeball labels from The Cosmic Assembly Near-IR Deep Extragalactic
Legacy Survey (CANDELS). We created eyeball labels for 100 randomly selected
galaxies per each of the 16-filters set of CLASH (1,600 galaxy images in
total), where each image was classified by at least five of us. We use these
labels to fine-tune the network in order to accurately predict labels for the
CLASH data and to evaluate the performance of our model. We achieve a
root-mean-square error of 0.0991 on the test set. We show that our proposed
fine-tuning technique reduces the number of labeled images needed for training,
as compared to directly training over the CLASH data, and achieves a better
performance. This approach is very useful to minimize eyeball labeling efforts
when classifying unlabeled data from new surveys. This will become particularly
useful for massive datasets such as the ones coming from near future surveys
such as EUCLID or the LSST. Our catalog consists of prediction of probabilities
for each galaxy by morphology in their different bands and is made publicly
available at http://www.inf.udec.cl/~guille/data/Deep-CLASH.csv.
[5]
oai:arXiv.org:1809.06379 [pdf] - 1752035
The delay of shock breakout due to circumstellar material seen in most
Type II Supernovae
Förster, F.;
Moriya, T. J.;
Maureira, J. C.;
Anderson, J. P.;
Blinnikov, S.;
Bufano, F.;
Cabrera-Vives, G.;
Clocchiatti, A.;
de Jaeger, Th.;
Estévez, P. A.;
Galbany, L.;
González-Gaitán, S.;
Gräfener, G.;
Hamuy, M.;
Hsiao, E.;
Huentelemu, P.;
Huijse, P.;
Kuncarayakti, H.;
Martínez-Palomera, J.;
Medina, G.;
E., F. Olivares;
Pignata, G.;
Razza, A.;
Reyes, I.;
Martín, J. San;
Smith, R. C.;
Vera, E.;
Vivas, A. K.;
Postigo, A. de Ugarte;
Yoon, S. -C.;
Ashall, C.;
Fraser, M.;
Gal-Yam, A.;
Kankare, E.;
Guillou, L. Le;
Mazzali, P. A.;
Walton, N. A.;
Young, D. R.
Submitted: 2018-09-17
Type II supernovae (SNe) originate from the explosion of hydrogen-rich
supergiant massive stars. Their first electromagnetic signature is the shock
breakout, a short-lived phenomenon which can last from hours to days depending
on the density at shock emergence. We present 26 rising optical light curves of
SN II candidates discovered shortly after explosion by the High cadence
Transient Survey (HiTS) and derive physical parameters based on hydrodynamical
models using a Bayesian approach. We observe a steep rise of a few days in 24
out of 26 SN II candidates, indicating the systematic detection of shock
breakouts in a dense circumstellar matter consistent with a mass loss rate
$\dot{M} > 10^{-4} M_\odot yr^{-1}$ or a dense atmosphere. This implies that
the characteristic hour timescale signature of stellar envelope SBOs may be
rare in nature and could be delayed into longer-lived circumstellar material
shock breakouts in most Type II SNe.
[6]
oai:arXiv.org:1809.00763 [pdf] - 1767631
The High Cadence Transient Survey (HITS): Compilation and
characterization of light-curve catalogs
Martínez-Palomera, Jorge;
Förster, Francisco;
Protopapas, Pavlos;
Maureira, Juan Carlos;
Lira, Paulina;
Cabrera-Vives, Guillermo;
Huijse, Pablo;
Galbany, Lluis;
de Jaeger, Thomas;
González-Gaitán, Santiago;
Medina, Gustavo;
Pignata, Giuliano;
Martín, Jaime San;
Hamuy, Mario;
Muñoz, Ricardo R.
Submitted: 2018-09-03, last modified: 2018-09-07
The High Cadence Transient Survey (HiTS) aims to discover and study transient
objects with characteristic timescales between hours and days, such as
pulsating, eclipsing and exploding stars. This survey represents a unique
laboratory to explore large etendue observations from cadences of about 0.1
days and to test new computational tools for the analysis of large data. This
work follows a fully \textit{Data Science} approach: from the raw data to the
analysis and classification of variable sources. We compile a catalog of
${\sim}15$ million object detections and a catalog of ${\sim}2.5$ million
light-curves classified by variability. The typical depth of the survey is
$24.2$, $24.3$, $24.1$ and $23.8$ in $u$, $g$, $r$ and $i$ bands, respectively.
We classified all point-like non-moving sources by first extracting features
from their light-curves and then applying a Random Forest classifier. For the
classification, we used a training set constructed using a combination of
cross-matched catalogs, visual inspection, transfer/active learning and data
augmentation. The classification model consists of several Random Forest
classifiers organized in a hierarchical scheme. The classifier accuracy
estimated on a test set is approximately $97\%$. In the unlabeled data,
$3\,485$ sources were classified as variables, of which $1\,321$ were
classified as periodic. Among the periodic classes we discovered with high
confidence, 1 $\delta$-scutti, 39 eclipsing binaries, 48 rotational variables
and 90 RR-Lyrae and for the non-periodic classes we discovered 1 cataclysmic
variables, 630 QSO, and 1 supernova candidates. The first data release can be
accessed in the project archive of HiTS.
[7]
oai:arXiv.org:1808.03626 [pdf] - 1820132
Enhanced Rotational Invariant Convolutional Neural Network for
Supernovae Detection
Submitted: 2018-08-10
In this paper, we propose an enhanced CNN model for detecting supernovae
(SNe). This is done by applying a new method for obtaining rotational
invariance that exploits cyclic symmetry. In addition, we use a visualization
approach, the layer-wise relevance propagation (LRP) method, which allows
finding the relevant pixels in each image that contribute to discriminate
between SN candidates and artifacts. We introduce a measure to assess
quantitatively the effect of the rotational invariant methods on the LRP
relevance heatmaps. This allows comparing the proposed method, CAP, with the
original Deep-HiTS model. The results show that the enhanced method presents an
augmented capacity for achieving rotational invariance with respect to the
original model. An ensemble of CAP models obtained the best results so far on
the HiTS dataset, reaching an average accuracy of 99.53%. The improvement over
Deep-HiTS is significant both statistically and in practice.
[8]
oai:arXiv.org:1806.03352 [pdf] - 1697179
Asteroids in the High cadence Transient Survey
Peña, J.;
Fuentes, C.;
Förster, F.;
Maureira, J. C.;
Martín, J. San;
Littín, J.;
Huijse, P.;
Cabrera-Vives, G.;
Estévez, P. A.;
Galbany, L.;
González-Gaitán, S.;
Martínez, J.;
de Jaeger, Th.;
Hamuy, M.
Submitted: 2018-06-08
We report on the serendipitous observations of Solar System objects imaged
during the High cadence Transient Survey (HiTS) 2014 observation campaign. Data
from this high cadence, wide field survey was originally analyzed for finding
variable static sources using Machine Learning to select the most-likely
candidates. In this work we search for moving transients consistent with Solar
System objects and derive their orbital parameters.
We use a simple, custom detection algorithm to link trajectories and assume
Keplerian motion to derive the asteroid's orbital parameters. We use known
asteroids from the Minor Planet Center (MPC) database to assess the detection
efficiency of the survey and our search algorithm. Trajectories have an average
of nine detections spread over 2 days, and our fit yields typical errors of
$\sigma_a\sim 0.07 ~{\rm AU}$, $\sigma_{\rm e} \sim 0.07 $ and $\sigma_i\sim
0.^{\circ}5~ {\rm deg}$ in semi-major axis, eccentricity, and inclination
respectively for known asteroids in our sample. We extract 7,700 orbits from
our trajectories, identifying 19 near Earth objects, 6,687 asteroids, 14
Centaurs, and 15 trans-Neptunian objects. This highlights the complementarity
of supernova wide field surveys for Solar System research and the significance
of machine learning to clean data of false detections. It is a good example of
the data--driven science that LSST will deliver.
[9]
oai:arXiv.org:1701.00458 [pdf] - 1534038
Deep-HiTS: Rotation Invariant Convolutional Neural Network for Transient
Detection
Submitted: 2017-01-02
We introduce Deep-HiTS, a rotation invariant convolutional neural network
(CNN) model for classifying images of transients candidates into artifacts or
real sources for the High cadence Transient Survey (HiTS). CNNs have the
advantage of learning the features automatically from the data while achieving
high performance. We compare our CNN model against a feature engineering
approach using random forests (RF). We show that our CNN significantly
outperforms the RF model reducing the error by almost half. Furthermore, for a
fixed number of approximately 2,000 allowed false transient candidates per
night we are able to reduce the miss-classified real transients by
approximately 1/5. To the best of our knowledge, this is the first time CNNs
have been used to detect astronomical transient events. Our approach will be
very useful when processing images from next generation instruments such as the
Large Synoptic Survey Telescope (LSST). We have made all our code and data
available to the community for the sake of allowing further developments and
comparisons at https://github.com/guille-c/Deep-HiTS.
[10]
oai:arXiv.org:1509.05429 [pdf] - 1304133
A catalog of visual-like morphologies in the 5 CANDELS fields using
deep-learning
Huertas-Company, M.;
Gravet, R.;
Cabrera-Vives, G.;
Pérez-González, P. G.;
Kartaltepe, J. S.;
Barro, G.;
Bernardi, M.;
Mei, S.;
Shankar, F.;
Dimauro, P.;
Bell, E. F.;
Kocevski, D.;
Koo, D. C.;
Faber, S. M.;
Mcintosh, D. H.
Submitted: 2015-09-17
We present a catalog of visual like H-band morphologies of $\sim50.000$
galaxies ($H_{f160w}<24.5$) in the 5 CANDELS fields (GOODS-N, GOODS-S, UDS, EGS
and COSMOS). Morphologies are estimated with Convolutional Neural Networks
(ConvNets). The median redshift of the sample is $<z>\sim1.25$. The algorithm
is trained on GOODS-S for which visual classifications are publicly available
and then applied to the other 4 fields. Following the CANDELS main morphology
classification scheme, our model retrieves the probabilities for each galaxy of
having a spheroid, a disk, presenting an irregularity, being compact or point
source and being unclassifiable. ConvNets are able to predict the fractions of
votes given a galaxy image with zero bias and $\sim10\%$ scatter. The fraction
of miss-classifications is less than $1\%$. Our classification scheme
represents a major improvement with respect to CAS
(Concentration-Asymmetry-Smoothness)-based methods, which hit a $20-30\%$
contamination limit at high z. The catalog is released with the present paper
via the
$\href{http://rainbowx.fis.ucm.es/Rainbow_navigator_public}{Rainbow\,database}$
[11]
oai:arXiv.org:1506.03084 [pdf] - 1264041
The morphologies of massive galaxies from z~3 - Witnessing the 2
channels of bulge growth
Huertas-Company, Marc;
Pérez-González, Pablo G.;
Mei, Simona;
Shankar, Francesco;
Bernardi, Mariangela;
Daddi, Emanuele;
Barro, Guillermo;
Cabrera-Vives, Guillermo;
Cattaneo, Andrea;
Dimauro, Paola;
Gravet, Romaric
Submitted: 2015-06-09
[abridged] We quantify the morphological evolution of z~0 massive galaxies
($M*/M_\odot\sim10^{11}$) from z~3 in the 5 CANDELS fields. The progenitors are
selected using abundance matching techniques to account for the mass growth.
The morphologies strongly evolve from z~3. At z<1, the population matches the
massive end of the Hubble sequence, with 30% of spheroids, 50% of galaxies with
equally dominant disk and bulge components and 20% of disks. At z~2-3 there is
a majority of irregular systems (~60-70%) with still 30% of spheroids.
We then analyze the SFRs, gas fractions and structural properties for the
different morphologies independently. Our results suggest two distinct channels
for the growth of bulges in massive galaxies.
Around 30-40% were already bulges at z~2.5, with low average SFRs and
gas-fractions (10-15%), high Sersic indices (n>3-4) and small effective radii
($R_e$~1 kpc) pointing towards an early formation through gas-rich mergers or
VDI. Between z~ 2.5 and z~0, they rapidly increase their size by a factor of
~4-5, become all passive but their global morphology remains unaltered. The
structural evolution is independent of the gas fractions, suggesting that it is
driven by ex-situ events.
The remaining 60% experience a gradual morphological transformation, from
clumpy disks to more regular bulge+disks systems, essentially happening at z>1.
It results in the growth of a significant bulge component (n~3) for 2/3 of the
systems possibly through the migration of clumps while the remaining 1/3 keeps
a rather small bulge (n~1.5-2). The transition phase between disturbed and
relaxed systems and the emergence of the bulge is correlated with a decrease of
the star formation activity and the gas fractions. The growth of the effective
radii scales roughly with $H(z)^{-1}$ and it is therefore consistent with the
expected growth of disks in galaxy haloes.