Normalized to: Dobrycheva, D.
[1]
oai:arXiv.org:1910.07317 [pdf] - 2068941
Machine-learning computation of distance modulus for local galaxies
Submitted: 2019-10-16, last modified: 2020-03-02
Rapidly growing computing facilities and an increasing number of
extragalactic observations encourage the application of data-driven approaches
to uncover hidden relations in astronomical data. In this work we address the
problem of distance reconstruction for a large number of galaxies from
available extensive observations. We propose a new data-driven approach for
computing distance moduli for local galaxies based on machine-learning
regression as an alternative to physically oriented methods. We use key
observable parameters for a large number of galaxies as input explanatory
variables for training: magnitudes in U, B, I, and K bands, corresponding
colour indices, surface brightness, angular size, radial velocity, and
coordinates. We performed detailed tests of five machine-learning
regression techniques for inference of $m-M$: linear, polynomial, k-nearest
neighbours, gradient boosting, and artificial neural network regression. As a
test set we selected 91 760 galaxies at $z<0.2$ from the NASA/IPAC
Extragalactic Database with distance moduli measured by various
redshift-independent methods. We find that the neural network regression model
with two hidden layers is the most effective and precise. The obtained root-mean-square
error of 0.35 mag, which corresponds to a relative distance error of 16\%, does
not depend on the distance to the galaxy and is comparable with that of methods
based on the Tully-Fisher and Fundamental Plane relations. The proposed model
shows a 0.44 mag (20\%) error when spectroscopic redshifts are unavailable and is
complementary to existing photometric redshift methodologies. Our approach has
great potential for obtaining distance moduli for around 250 000 galaxies at
$z<0.2$ for which the above-mentioned parameters are already observed.
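
The correspondence between the quoted magnitude errors and relative errors can be checked with a short calculation. The sketch below assumes the standard definition of the distance modulus, $m-M = 5\log_{10}(d/\mathrm{pc}) - 5$, and first-order error propagation, $\sigma_d/d = (\ln 10/5)\,\sigma_{m-M}$. The function names are illustrative and are not taken from the paper.

```python
import math

def modulus_to_distance_mpc(mu):
    """Distance in Mpc from a distance modulus mu = m - M = 5*log10(d/pc) - 5."""
    return 10 ** (mu / 5.0 + 1.0) / 1.0e6

def relative_distance_error(sigma_mu):
    """First-order relative distance error: sigma_d / d = (ln 10 / 5) * sigma_mu."""
    return math.log(10.0) / 5.0 * sigma_mu

# The two root-mean-square errors quoted in the abstract, in magnitudes.
for sigma_mu in (0.35, 0.44):
    percent = 100 * relative_distance_error(sigma_mu)
    print(f"{sigma_mu:.2f} mag -> {percent:.0f}% relative distance error")
```

Under these assumptions, 0.35 mag and 0.44 mag map to about 16% and 20%, in agreement with the values stated above.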
[2]
oai:arXiv.org:1712.08955 [pdf] - 1608784
Machine learning technique for morphological classification of galaxies
at z<0.1 from the SDSS
Submitted: 2017-12-24
A galaxy's morphological type correlates with its color indices, luminosity,
de Vaucouleurs radius, inverse concentration index, etc. Studying these
relations requires large samples of galaxies, for which visual morphological
inspection is not always possible. We evaluated a new approach.
Namely, we applied the "color--concentration index" diagram and machine
learning methods for the morphological classification of galaxies from the SDSS
at z<0.1. With this aim, we visually identified morphological T-types of about
1500 galaxies, which formed our training samples. Method 1. We plotted
diagrams of the color index g-i against one of the following parameters:
inverse concentration index, absolute magnitude, or de Vaucouleurs radius. We
found that these parameters may be used to classify galaxies into three classes:
E -- elliptical and lenticular, S -- types Sa-Scd, and L -- types Sd-Sdm and
irregulars. The accuracy is 98% for E, 88% for S, and 57% for L types. The
combinations of "color index g-i and inverse concentration index R50/R90" and
"color index g-i and absolute magnitude M_r" give the best results. We applied
this method to classify 317018 galaxies from SDSS DR5 (143263 E, 112578 S,
61177 L types). Method 2. We used a training sample classified visually into
two classes: early E (E, S0, S0a) and late L (Sa to Irr) types. We tested the
Naive Bayes, Random Forest, and Support Vector classifiers, using absolute
magnitudes, all the color indices, and inverse concentration indices as galaxy
attributes. To estimate the accuracy of the classifiers we applied 5-fold
cross-validation and found that Random Forest provides the highest accuracy:
91% of galaxies were correctly classified (96% for E and 80% for L types). We
applied it to classify 60561 galaxies from SDSS DR9 into two classes (47% E and
53% L types) with good accuracy.
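
The validation procedure described for Method 2 can be illustrated with a minimal sketch of k-fold cross-validation that reports overall and per-class accuracy. This is not the authors' pipeline. A simple nearest-centroid classifier stands in for Random Forest to keep the example dependency-free, and the (g-i, R50/R90) values are synthetic placeholders, not SDSS measurements. All function names are hypothetical.

```python
import random

def k_fold_indices(n, k=5, seed=0):
    """Shuffle indices 0..n-1 and split them into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def per_class_accuracy(y_true, y_pred):
    """Fraction of correctly classified objects, overall and per class."""
    acc = {"all": sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)}
    for c in sorted(set(y_true)):
        pairs = [(t, p) for t, p in zip(y_true, y_pred) if t == c]
        acc[c] = sum(t == p for t, p in pairs) / len(pairs)
    return acc

def fit_centroids(X, y):
    """Stand-in classifier (not Random Forest): mean feature vector per class."""
    cents = {}
    for c in set(y):
        rows = [x for x, t in zip(X, y) if t == c]
        cents[c] = [sum(col) / len(rows) for col in zip(*rows)]
    return cents

def predict_centroid(cents, x):
    """Assign x to the class with the nearest centroid (squared distance)."""
    return min(cents, key=lambda c: sum((a - b) ** 2 for a, b in zip(x, cents[c])))

def cross_validate(X, y, fit, predict, k=5):
    """Train on k-1 folds, predict the held-out fold, pool all predictions."""
    y_pred = [None] * len(y)
    for held_out in k_fold_indices(len(y), k):
        held = set(held_out)
        train = [i for i in range(len(y)) if i not in held]
        model = fit([X[i] for i in train], [y[i] for i in train])
        for i in held_out:
            y_pred[i] = predict(model, X[i])
    return per_class_accuracy(y, y_pred)

# Synthetic, well-separated toy points (g-i, R50/R90); placeholders only.
X = [(1.20 + 0.01 * j, 0.30) for j in range(10)] + \
    [(0.60 + 0.01 * j, 0.45) for j in range(10)]
y = ["E"] * 10 + ["L"] * 10

acc = cross_validate(X, y, fit_centroids, predict_centroid, k=5)
print(acc)
```

Pooling the held-out predictions before computing accuracy gives one overall figure and one figure per class, the same form as the 91%, 96%, and 80% values quoted above. The toy data are trivially separable, so the numbers printed here carry no scientific meaning.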