Normalized to: Bialek, S.
[1]
oai:arXiv.org:2007.03109 [pdf] - 2129514
Cycle-StarNet: Bridging the gap between theory and data by leveraging
large datasets
Submitted: 2020-07-06
Spectroscopy provides an immense amount of information on stellar objects,
and this field continues to grow with recent developments in multi-object data
acquisition and rapid data analysis techniques. Current automated methods for
analyzing spectra are either (a) data-driven, requiring large
amounts of data with prior knowledge of stellar parameters and elemental
abundances, or (b) based on theoretical synthetic models that are susceptible
to the gap between theory and practice. In this study, we present a hybrid
generative domain adaptation method to turn simulated stellar spectra into
realistic spectra, learning from the large spectroscopic surveys. We use a
neural network to emulate computationally expensive stellar spectra
simulations, and then train a separate unsupervised domain-adaptation network
that learns to relate the generated synthetic spectra to observational spectra.
Consequently, the network essentially produces data-driven models without the
need for a labeled training set. As a proof of concept, two case studies are
presented. The first is the auto-calibration of synthetic models
without using any standard stars. To accomplish this, synthetic models are
morphed into spectra that resemble observations, thereby reducing the gap
between theory and observations. The second case study is the identification of
the elemental source of missing spectral lines in the synthetic modelling.
These sources are predicted by interpreting the differences between the
domain-adapted and original spectral models. To test our ability to identify
missing lines, we use a mock dataset and show that, even with noisy
observations, absorption lines can be recovered when they are absent in one of
the domains. While we focus on spectral analyses in this study, this method can
be applied to other fields that use large data sets and are currently limited
by modelling accuracy.
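
The second case study in this abstract reads missing lines off the difference between the domain-adapted and the original spectral models. The sketch below illustrates only that differencing step, on mock data. The function names, the threshold, the line positions, and the noise level are all hypothetical choices for illustration and are not taken from the paper.

```python
import numpy as np

def find_missing_lines(wavelength, original, adapted, threshold=0.05):
    """Return centres of regions where `adapted` absorbs more than `original`.

    All names and the threshold are hypothetical choices for this sketch.
    """
    residual = original - adapted  # > 0 where the adapted model is deeper
    flagged = np.flatnonzero(residual > threshold)
    if flagged.size == 0:
        return []
    # Split the flagged pixel indices into runs of contiguous pixels.
    runs = np.split(flagged, np.flatnonzero(np.diff(flagged) > 1) + 1)
    # Report a residual-weighted mean wavelength for each run.
    return [float(np.average(wavelength[r], weights=residual[r])) for r in runs]

def gaussian_line(wavelength, centre, depth, width=0.3):
    """Gaussian absorption profile (depth below a unit continuum)."""
    return depth * np.exp(-0.5 * ((wavelength - centre) / width) ** 2)

# Mock data: the original model has one line; the adapted model has an extra
# line at 5070 Angstrom plus a small amount of noise.
rng = np.random.default_rng(0)
wavelength = np.linspace(5000.0, 5100.0, 1001)
original = 1.0 - gaussian_line(wavelength, 5030.0, 0.3)
adapted = original - gaussian_line(wavelength, 5070.0, 0.2)
adapted = adapted + rng.normal(0.0, 0.005, size=wavelength.size)

detected = find_missing_lines(wavelength, original, adapted)
print(detected)
```

The line shared by both models cancels in the residual, so only the feature absent from the original model is flagged. Attributing a flagged feature to a specific element is a separate step that this sketch does not attempt.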
[2]
oai:arXiv.org:2007.03112 [pdf] - 2129515
Interpreting Stellar Spectra with Unsupervised Domain Adaptation
Submitted: 2020-07-06
We discuss how to achieve mapping from large sets of imperfect simulations
and observational data with unsupervised domain adaptation. Under the
hypothesis that simulated and observed data distributions share a common
underlying representation, we show how it is possible to transfer between
simulated and observed domains. Driven by an application to interpret stellar
spectroscopic sky surveys, we construct the domain transfer pipeline from two
adversarial autoencoders on each domain with a disentangling latent space, and
a cycle-consistency constraint. We then construct a differentiable pipeline
from physical stellar parameters to realistic observed spectra, aided by a
supplementary generative surrogate physics emulator network. We further
demonstrate the potential of the method through the quality of the
reconstructed spectra and the discovery of new spectral features associated
with elemental abundances.
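
The cycle-consistency constraint mentioned in this abstract can be illustrated with a toy example: encode a simulated spectrum into the shared latent space, decode it into the observed domain, then map it back and compare with the input. In the sketch below the encoders and decoders are fixed orthogonal matrices rather than trained networks, the adversarial terms are omitted entirely, and all names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
n_pix = 6  # toy spectrum length; latent size equals n_pix so the maps invert exactly

def random_orthogonal(n):
    """Random orthogonal matrix, standing in for a trained encoder."""
    q, _ = np.linalg.qr(rng.normal(size=(n, n)))
    return q

# Hypothetical linear encoders/decoders for the synthetic and observed domains.
enc_synth = random_orthogonal(n_pix)
dec_synth = enc_synth.T  # exact inverse of an orthogonal encoder
enc_obs = random_orthogonal(n_pix)
dec_obs = enc_obs.T

def cycle_loss(x_synth, dec_obs_matrix):
    """Mean absolute error after synthetic -> observed -> synthetic."""
    z = x_synth @ enc_synth            # encode into the shared latent space
    x_as_obs = z @ dec_obs_matrix      # decode into the observed domain
    z_back = x_as_obs @ enc_obs        # re-encode from the observed domain
    x_cycled = z_back @ dec_synth      # decode back into the synthetic domain
    return float(np.mean(np.abs(x_synth - x_cycled)))

x = rng.normal(size=(4, n_pix))  # batch of four toy "spectra"
loss_consistent = cycle_loss(x, dec_obs)
loss_broken = cycle_loss(x, dec_obs + 0.1 * np.eye(n_pix))
print(loss_consistent, loss_broken)
```

With mutually consistent maps the cycle loss vanishes to floating-point precision, while a perturbed decoder gives a clearly non-zero loss. In training, a term of this kind would be minimised alongside the other objectives.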
[3]
oai:arXiv.org:1911.08491 [pdf] - 2050263
The Pristine survey X: a large population of low-metallicity stars
permeates the Galactic disk
Sestito, Federico;
Martin, Nicolas F.;
Starkenburg, Else;
Arentsen, Anke;
Ibata, Rodrigo A.;
Longeard, Nicolas;
Kielty, Collin;
Youakim, Kristopher;
Venn, Kim A.;
Aguado, David S.;
Carlberg, Raymond G.;
Hernandez, Jonay I. Gonzalez;
Hill, Vanessa;
Jablonka, Pascale;
Kordopatis, Georges;
Malhan, Khyati;
Navarro, Julio F.;
Sanchez-Janssen, Ruben;
Thomas, Guillame;
Tolstoy, Eline;
Wilson, Thomas G.;
Palicio, Pedro Alonso;
Bialek, Spencer;
Garcia-Dias, Rafael;
Lucchesi, Romain;
North, Pierre;
Osorio, Yeisson;
Patrick, Lee R.;
de Arriba, Luis Peralta
Submitted: 2019-11-19
The orbits of the least chemically enriched stars open a window on the
formation of our Galaxy when it was still in its infancy. The common picture is
that these low-metallicity stars are distributed as an isotropic,
pressure-supported component since these stars were either accreted from the
early building blocks of the assembling Milky Way, or were later brought by the
accretion of faint dwarf galaxies. Combining the metallicities and radial
velocities from the Pristine and LAMOST surveys and Gaia DR2 parallaxes and
proper motions for an unprecedentedly large and unbiased sample of very
metal-poor stars at $[Fe/H]\leq-2.5$, we show that this picture is incomplete.
This sample shows strong statistical evidence (at the $5.0\sigma$ level) of
asymmetry in its kinematics, favouring prograde motion. Moreover, we find
that $31\%$ of the stars that currently reside in the disk do not venture
outside of the disk plane throughout their orbit. The discovery of this
population implies that a significant fraction of stars with iron abundances
$[Fe/H]\leq-2.5$ formed within or concurrently with the Milky Way disk and that
the history of the disk was quiet enough to allow them to retain their
disk-like orbital properties.
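
As a rough illustration of how a prograde/retrograde asymmetry can be expressed in units of sigma, the sketch below uses the normal approximation to a binomial null hypothesis in which each star is equally likely to be prograde or retrograde. This is an assumed, simplified test and not necessarily the statistic used in the paper. The function name and the star counts are hypothetical and are not the survey's numbers.

```python
import math

def asymmetry_sigma(n_prograde, n_retrograde):
    """Gaussian-equivalent significance of a prograde excess.

    Assumes a binomial null with p = 0.5 and uses its normal approximation.
    """
    n = n_prograde + n_retrograde
    expected = 0.5 * n          # mean prograde count under the null
    std = math.sqrt(0.25 * n)   # binomial standard deviation for p = 0.5
    return (n_prograde - expected) / std

# Hypothetical counts, chosen only to make the arithmetic easy to follow.
print(asymmetry_sigma(240, 160))  # 4.0
```

For 400 stars the null expectation is 200 prograde with a standard deviation of 10, so an excess of 40 corresponds to 4 sigma.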
[4]
oai:arXiv.org:1911.02602 [pdf] - 1995525
Deep learning analyses of synthetic spectral libraries with an
application to the Gaia-ESO database
Submitted: 2019-11-06
In the era of stellar spectroscopic surveys, synthetic spectral libraries
will form the basis for the derivation of the stellar parameters and chemical
abundances. In this paper, four popular synthetic grids (INTRIGOSS, FERRE,
AMBRE, and PHOENIX) are used in our deep learning prediction framework
(StarNet), and compared in an application to optical spectra from the Gaia-ESO
survey. The stellar parameters for temperature, surface gravity, metallicity,
radial velocity, rotational velocity, and $[\alpha/Fe]$ are determined
simultaneously for FGK type dwarfs and giants. StarNet was modified to mitigate
the differences in the sampling between the synthetic grids and the observed
spectra, by augmenting the grids with realistic observational signatures, in an
attempt to incorporate both modelling and statistical uncertainties as part of
the training. When applied to spectra from the Gaia-ESO spectroscopic survey
and the Gaia-ESO benchmark stars, the INTRIGOSS-trained StarNet showed the best
results with the least scatter. Training with the FERRE synthetic grid produces
similarly accurate predictions (followed closely by the AMBRE grid), but over a
wider range in stellar parameters and spectroscopic wavelengths. In the
future, improvements in the underlying physics that generates these synthetic
grids will be necessary for consistent high precision stellar parameters and
chemical abundances from machine learning and other sophisticated data analysis
tools.
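
The abstract describes augmenting the synthetic grids with realistic observational signatures but does not list them. The sketch below assumes two common ones, a radial-velocity shift and Gaussian noise at a chosen signal-to-noise ratio, applied to a continuum-normalised spectrum. The function name, parameters, and mock line are hypothetical.

```python
import numpy as np

C_KMS = 299792.458  # speed of light in km/s

def augment_spectrum(wavelength, flux, snr, rv_kms, rng):
    """Doppler-shift a continuum-normalised spectrum and add Gaussian noise."""
    # Non-relativistic Doppler shift of the wavelength axis.
    shifted_wave = wavelength * (1.0 + rv_kms / C_KMS)
    # Resample the shifted spectrum back onto the original pixel grid.
    shifted_flux = np.interp(wavelength, shifted_wave, flux)
    # For a unit continuum, the per-pixel noise level is roughly 1 / SNR.
    noise = rng.normal(0.0, 1.0 / snr, size=flux.shape)
    return shifted_flux + noise

# Mock synthetic spectrum with a single absorption line at 5050 Angstrom.
rng = np.random.default_rng(3)
wavelength = np.linspace(5000.0, 5100.0, 2001)
flux = 1.0 - 0.5 * np.exp(-0.5 * ((wavelength - 5050.0) / 0.3) ** 2)

augmented = augment_spectrum(wavelength, flux, snr=1e6, rv_kms=30.0, rng=rng)
line_centre = wavelength[np.argmin(augmented)]
print(line_centre)
```

A shift of 30 km/s moves the line redward by about 0.5 Angstrom at this wavelength. In a training loop, `snr` and `rv_kms` would typically be drawn at random for each spectrum.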
[5]
oai:arXiv.org:1709.09182 [pdf] - 1637529
An Application of Deep Neural Networks in the Analysis of Stellar
Spectra
Submitted: 2017-09-26, last modified: 2018-01-24
Spectroscopic surveys require fast and efficient analysis methods to maximize
their scientific impact. Here we apply a deep neural network architecture to
analyze both SDSS-III APOGEE DR13 and synthetic stellar spectra. When our
convolutional neural network model (StarNet) is trained on APOGEE spectra, we
show that the stellar parameters (temperature, gravity, and metallicity) are
determined with precision and accuracy similar to those of the APOGEE pipeline. StarNet
can also predict stellar parameters when trained on synthetic data, with
excellent precision and accuracy for both APOGEE data and synthetic data, over
a wide range of signal-to-noise ratios. In addition, the statistical
uncertainties in the stellar parameter determinations are comparable to the
differences between the APOGEE pipeline results and those determined
independently from optical spectra. We compare StarNet to other data-driven
methods. For example, StarNet and the Cannon 2 show similar behaviour when
trained with the same datasets; however, StarNet performs poorly on small
training sets like those used by the original Cannon. The influence of the
spectral features on the stellar parameters is examined via partial derivatives
of the StarNet model results with respect to the input spectra. While StarNet
was developed using the APOGEE observed spectra and corresponding ASSET
synthetic data, we suggest that this technique is applicable to other
wavelength ranges and other spectral surveys.
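
The partial derivatives of model outputs with respect to the input spectrum, mentioned in this abstract, form a Jacobian with one row per stellar parameter and one column per pixel. The sketch below estimates it by central finite differences for a small stand-in model. The toy model, its random weights, and all names are hypothetical; a trained network would normally obtain the same quantity by automatic differentiation.

```python
import numpy as np

rng = np.random.default_rng(2)
n_pix, n_labels = 20, 3
weights = rng.normal(scale=0.1, size=(n_labels, n_pix))

def toy_model(spectrum):
    """Hypothetical stand-in for a trained network: spectrum -> labels."""
    return np.tanh(weights @ spectrum)

def jacobian(model, spectrum, eps=1e-5):
    """Central finite-difference estimate of d(output_j) / d(pixel_i)."""
    n_out = model(spectrum).size
    jac = np.empty((n_out, spectrum.size))
    for i in range(spectrum.size):
        step = np.zeros_like(spectrum)
        step[i] = eps  # perturb one pixel at a time
        jac[:, i] = (model(spectrum + step) - model(spectrum - step)) / (2 * eps)
    return jac

spectrum = rng.uniform(0.5, 1.0, size=n_pix)
jac = jacobian(toy_model, spectrum)
# Pixel with the largest absolute influence on each output label.
most_influential = np.argmax(np.abs(jac), axis=1)
print(jac.shape, most_influential)
```

Each row of `jac` shows which pixels most affect one predicted parameter, which is the sense in which the derivatives indicate influential spectral features.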