Normalized to: Kügler, S.
[1]
oai:arXiv.org:1607.06059 [pdf] - 1441162
Modelling multimodal photometric redshift regression with noisy
observations
Submitted: 2016-07-20
In this work, we are trying to extent the existing photometric redshift
regression models from modeling pure photometric data back to the spectra
themselves. To that end, we developed a PCA that is capable of describing the
input uncertainty (including missing values) in a dimensionality reduction
framework. With this "spectrum generator" at hand, we are capable of treating
the redshift regression problem in a fully Bayesian framework, returning a
posterior distribution over the redshift. This approach allows therefore to
approach the multimodal regression problem in an adequate fashion. In addition,
input uncertainty on the magnitudes can be included quite naturally and lastly,
the proposed algorithm allows in principle to make predictions outside the
training values which makes it a fascinating opportunity for the detection of
high-redshifted quasars.
[2]
oai:arXiv.org:1601.05654 [pdf] - 1344835
Model-Coupled Autoencoder for Time Series Visualisation
Submitted: 2016-01-21
We present an approach for the visualisation of a set of time series that
combines an echo state network with an autoencoder. For each time series in the
dataset we train an echo state network, using a common and fixed reservoir of
hidden neurons, and use the optimised readout weights as the new
representation. Dimensionality reduction is then performed via an autoencoder
on the readout weight representations. The crux of the work is to equip the
autoencoder with a loss function that correctly interprets the reconstructed
readout weights by associating them with a reconstruction error measured in the
data space of sequences. This essentially amounts to measuring the predictive
performance that the reconstructed readout weights exhibit on their
corresponding sequences when plugged back into the echo state network with the
same fixed reservoir. We demonstrate that the proposed visualisation framework
can deal both with real valued sequences as well as binary sequences. We derive
magnification factors in order to analyse distance preservations and
distortions in the visualisation space. The versatility and advantages of the
proposed method are demonstrated on datasets of time series that originate from
diverse domains.
[3]
oai:arXiv.org:1508.03482 [pdf] - 1327459
An Explorative Approach for Inspecting Kepler Data
Submitted: 2015-08-14, last modified: 2015-11-04
The Kepler survey has provided a wealth of astrophysical knowledge by
continuously monitoring over 150,000 stars. The resulting database contains
thousands of examples of known variability types and at least as many that
cannot be classified yet. In order to reveal the knowledge hidden in the
database, we introduce a new visualisation method that allows us to inspect
time series exploratively. To that end, we propose dimensionality reduction on
the parameters of a model capable of representing time series as fixed-length
vector representation. We show that a more refined objective function can be
chosen by minimising the prediction error of the data reconstruction instead of
the reconstruction of the model parameters. The proposed visualisation exhibits
a strong correlation between the variability behaviour of the light curves and
their physical properties. As a consequence, temperature and surface gravity
can, for some stars, be directly inferred from non- (or quasi-) periodic light
curves.
[4]
oai:arXiv.org:1504.04455 [pdf] - 1044555
Featureless Classification of Light Curves
Submitted: 2015-04-17, last modified: 2015-05-20
In the era of rapidly increasing amounts of time series data, classification
of variable objects has become the main objective of time-domain astronomy.
Classification of irregularly sampled time series is particularly difficult
because the data cannot be represented naturally as a vector which can be
directly fed into a classifier. In the literature, various statistical features
serve as vector representations. In this work, we represent time series by a
density model. The density model captures all the information available,
including measurement errors. Hence, we view this model as a generalisation to
the static features which directly can be derived, e.g., as moments from the
density. Similarity between each pair of time series is quantified by the
distance between their respective models. Classification is performed on the
obtained distance matrix. In the numerical experiments, we use data from the
OGLE and ASAS surveys and demonstrate that the proposed representation performs
up to par with the best cur- rently used feature-based approaches. The density
representation preserves all static information present in the observational
data, in contrast to a less complete description by features. The density
representation is an upper boundary in terms of information made available to
the classifier. Consequently, the predictive power of the proposed
classification depends on the choice of similarity measure and classifier,
only. Due to its principled nature, we advocate that this new approach of
representing time series has potential in tasks beyond classification, e.g.,
unsupervised learning.
[5]
oai:arXiv.org:1409.8417 [pdf] - 1450526
Estimating Spectroscopic Redshifts by Using k Nearest Neighbors
Regression I. Description of Method and Analysis
Submitted: 2014-09-30, last modified: 2015-03-06
Context: In astronomy, new approaches to process and analyze the
exponentially increasing amount of data are inevitable. While classical
approaches (e.g. template fitting) are fine for objects of well-known classes,
alternative techniques have to be developed to determine those that do not fit.
Therefore a classification scheme should be based on individual properties
instead of fitting to a global model and therefore loose valuable information.
An important issue when dealing with large data sets is the outlier detection
which at the moment is often treated problem-orientated. Aims: In this paper we
present a method to statistically estimate the redshift z based on a similarity
approach. This allows us to determine redshifts in spectra in emission as well
as in absorption without using any predefined model. Additionally we show how
an estimate of the redshift based on single features is possible. As a
consequence we are e.g. able to filter objects which show multiple redshift
components. We propose to apply this general method to all similar problems in
order to identify objects where traditional approaches fail. Methods: The
redshift estimation is performed by comparing predefined regions in the spectra
and applying a k nearest neighbor regression model for every predefined
emission and absorption region, individually. Results: We estimated a redshift
for more than 50% of the analyzed 16,000 spectra of our reference and test
sample. The redshift estimate yields a precision for every individually tested
feature that is comparable with the overall precision of the redshifts of SDSS.
In 14 spectra we find a significant shift between emission and absorption or
emission and emission lines. The results show already the immense power of this
simple machine learning approach for investigating huge databases such as the
SDSS.
[6]
oai:arXiv.org:1409.8121 [pdf] - 873388
Properties of optically selected BL Lac candidates from the SDSS
Submitted: 2014-09-29
\textbf{Context.} Deep optical surveys open the avenue for find large numbers
of BL Lac objects that are hard to identify because they lack the unique
properties classifying them as such. While radio or X-ray surveys typically
reveal dozens of sources, recent compilations based on optical criteria alone
have increased the number of BL Lac candidates considerably. However, these
compilations are subject to biases and may contain a substantial number of
contaminating sources. \textbf{Aims.} In this paper we extend our analysis of
182 optically selected BL Lac object candidates from the SDSS with respect to
an earlier study. The main goal is to determine the number of bona fide BL Lac
objects in this sample. \textbf{Methods.} We examine their variability
characteristics, determine their broad-band radio-UV SEDs, and search for the
presence of a host galaxy. In addition we present new optical spectra for 27
targets with improved S/N with respect to the SDSS spectra. \textbf{Results.}
At least 59% of our targets have shown variability between SDSS DR2 and our
observations by more than 0.1-0.27 mag de- pending on the telescope used. A
host galaxy was detected in 36% of our targets. The host galaxy type and
luminosities are consistent with earlier studies of BL Lac host galaxies.
Simple fits to broad-band SEDS for 104 targets of our sample derived
synchrotron peak frequencies between $13.5 \leq
\mathrm{log}_{10}(\nu_{\mathrm{peak}}) \leq 16$ with a peak at
$\mathrm{log}_{10} \sim 14.5$. Our new optical spectra do not reveal any new
redshift for any of our objects. Thus the sample contains a large number of
bona fide BL Lac objects and seems to contain a substantial fraction of
intermediate-frequency peaked BL Lacs.