Full-text search for arXiv

2 article(s) in total. 4 co-authors, from 1 to 2 common article(s). Median position in authors list is 2,5.

[1] oai:arXiv.org:1910.07317 [pdf] - 2068941

Machine-learning computation of distance modulus for local galaxies

Elyiv, A.; Melnyk, O.; Vavilova, I.; Dobrycheva, D.; Karachentseva, V.

Comments: 8 pages, 5 figures, Accepted for publication in A&A

Submitted: 2019-10-16, last modified: 2020-03-02

Quickly growing computing facilities and an increasing number of extragalactic observations encourage the application of data-driven approaches to uncover hidden relations from astronomical data. In this work we raise the problem of distance reconstruction for a large number of galaxies from available extensive observations. We propose a new data-driven approach for computing distance moduli for local galaxies based on the machine-learning regression as an alternative to physically oriented methods. We use key observable parameters for a large number of galaxies as input explanatory variables for training: magnitudes in U, B, I, and K bands, corresponding colour indices, surface brightness, angular size, radial velocity, and coordinates. We performed detailed tests of the five machine-learning regression techniques for inference of $m-M$: linear, polynomial, k-nearest neighbours, gradient boosting, and artificial neural network regression. As a test set we selected 91 760 galaxies at $z<0.2$ from the NASA/IPAC extragalactic database with distance moduli measured by different independent redshift methods. We find that the most effective and precise is the neural network regression model with two hidden layers. The obtained root-mean-square error of 0.35 mag, which corresponds to a relative error of 16\%, does not depend on the distance to galaxy and is comparable with methods based on the Tully-Fisher and Fundamental Plane relations. The proposed model shows a 0.44 mag (20\%) error in the case of spectroscopic redshift absence and is complementary to existing photometric redshift methodologies. Our approach has great potential for obtaining distance moduli for around 250 000 galaxies at $z<0.2$ for which the above-mentioned parameters are already observed.

[2] oai:arXiv.org:1712.08955 [pdf] - 1608784

Machine learning technique for morphological classification of galaxies at z<0.1 from the SDSS

Dobrycheva, D. V.; Vavilova, I. B.; Melnyk, O. V.; Elyiv, A. A.

Comments: 4 pages, 5 figures. The presentation of these results was given during the EWASS-2017, Symposium "Astroinformatics: From Big Data to Understanding the Universe at Large". It is vailable through \url{http://space.asu.cas.cz/~ewass17-soc/Presentations/S14/Dobrycheva_987.pdf}

Submitted: 2017-12-24

A galaxy morphological type is correlated with the color indices, luminosity, de Vaucouleurs radius, inverse concentration index etc. To study these relations we have to operate with big samples of galaxies, so the visual morphological inspection is not always possible. We evaluated a new approach. Namely, we applied the "color--concentration index" diagram and machine learning methods for the morphological classification of galaxies from the SDSS at z<0.1. With this aim, we visually identified morphological T-types of about 1500 galaxies, which formed our training samples. Method 1. We plotted the diagrams of color indices g-i and one of such parameters as the inverse concentration index, absolute magnitude, de Vaucouleurs radius. We discovered that these parameters may be used for galaxy classification into three classes: E -- elliptical and lenticular, S -- types Sa-Scd, and L -- types Sd-Sdm and irregular s. The accuracy is 98% for E, 88% for S, and 57% for L types. The combinations of "color indices g-i and inverse concentration index R50/R90' and "color indices g-i and absolute magnitude M_r" give the best result. We applied this method to classify 317018 galaxies from SDSS DR5 (143263 E, 112 578 S, 61177 L types). Method 2. We used a training sample classified visually into two classes: early E (E, S0, S0a) and late L (Sa to Irr) types. We checked Naive Bayes, Random Forest, and Support Vector Classifier. We used absolute magnitudes, all the color indices and inverse concentration indexes as the attributes of galaxy. To define an accuracy of classifiers we applied the 5-folds validation and found that Random Forest provides the highest accuracy (91% of galaxies were correctly classified (96% for E and 80% for L types)). We tested it to classify 60561 galaxies from SDSS DR9 with a good accuracy onto two classes (47% E and 53% L types of galaxies).