Full-text search for arXiv

Amaro, Valeria

Normalized to: Amaro, V.

12 article(s) in total. 69 co-authors, from 1 to 12 common article(s). Median position in authors list is 3,5.

[1] oai:arXiv.org:2007.01840 [pdf] - 2127599

Rejection criteria based on outliers in the KiDS photometric redshifts and PDF distributions derived by machine learning

Amaro, Valeria; Cavuoti, Stefano; Brescia, Massimo; Riccio, Giuseppe; Tortora, Crescenzo; D'Addona, Maurizio; Veneri, Michele Delli; Napolitano, Nicola R.; Radovich, Mario; Longo, Giuseppe

Comments: Preprint version of the manuscript to appear in the Volume "Intelligent Astrophysics" of the series "Emergence, Complexity and Computation", Book eds. I. Zelinka, D. Baron, M. Brescia, Springer Nature Switzerland, ISSN: 2194-7287

Submitted: 2020-07-03

The Probability Density Function (PDF) provides an estimate of the photometric redshift (zphot) prediction error. It is crucial for current and future sky surveys, characterized by strict requirements on the zphot precision, reliability and completeness. The present work stands on the assumption that properly defined rejection criteria, capable of identifying and rejecting potential outliers, can increase the precision of zphot estimates and of their cumulative PDF, without sacrificing much in terms of completeness of the sample. We provide a way to assess rejection through proper cuts on the shape descriptors of a PDF, such as the width and the height of the maximum PDF's peak. In this work we tested these rejection criteria to galaxies with photometry extracted from the Kilo Degree Survey (KiDS) ESO Data Release 4, proving that such approach could lead to significant improvements to the zphot quality: e.g., for the clipped sample showing the best trade-off between precision and completeness, we achieve a reduction in outliers fraction of $\simeq 75\%$ and an improvement of $\simeq 6\%$ for NMAD, with respect to the original data set, preserving the $\simeq 93\%$ of its content.

[2] oai:arXiv.org:1810.09777 [pdf] - 1774830

Statistical analysis of probability density functions for photometric redshifts through the KiDS-ESO-DR3 galaxies

Amaro, Valeria; Cavuoti, Stefano; Brescia, Massimo; Vellucci, Civita; Longo, Giuseppe; Bilicki, Maciej; de Jong, Jelte T. A.; Tortora, Crescenzo; Radovich, Mario; Napolitano, Nicola R.; Buddelmeijer, Hugo

Comments: Accepted for publication by MNRAS, 20 pages, 14 figures

Submitted: 2018-10-23

Despite the high accuracy of photometric redshifts (zphot) derived using Machine Learning (ML) methods, the quantification of errors through reliable and accurate Probability Density Functions (PDFs) is still an open problem. First, because it is difficult to accurately assess the contribution from different sources of errors, namely internal to the method itself and from the photometric features defining the available parameter space. Second, because the problem of defining a robust statistical method, always able to quantify and qualify the PDF estimation validity, is still an open issue. We present a comparison among PDFs obtained using three different methods on the same data set: two ML techniques, METAPHOR (Machine-learning Estimation Tool for Accurate PHOtometric Redshifts) and ANNz2, plus the spectral energy distribution template fitting method, BPZ. The photometric data were extracted from the KiDS (Kilo Degree Survey) ESO Data Release 3, while the spectroscopy was obtained from the GAMA (Galaxy and Mass Assembly) Data Release 2. The statistical evaluation of both individual and stacked PDFs was done through quantitative and qualitative estimators, including a dummy PDF, useful to verify whether different statistical estimators can correctly assess PDF quality. We conclude that, in order to quantify the reliability and accuracy of any zphot PDF method, a combined set of statistical estimators is required.

[3] oai:arXiv.org:1807.06085 [pdf] - 1719591

Evolution of galaxy size--stellar mass relation from the Kilo Degree Survey

Roy, N.; Napolitano, N. R.; La Barbera, F.; Tortora, C.; Getman, F.; Radovich, M.; Capaccioli, M.; Brescia, M.; Cavuoti, S.; Longo, G.; Raj, M. A.; Puddu, E.; Covone, G.; Amaro, V.; Vellucci, C.; Grado, A.; Kuijken, K.; Kleijn, G. Verdoes; Valentijn, E.

Comments: accepted by MNRAS

Submitted: 2018-07-16

We have obtained structural parameters of about 340,000 galaxies from the Kilo Degree Survey (KiDS) in 153 square degrees of data release 1, 2 and 3. We have performed a seeing convolved 2D single S\'ersic fit to the galaxy images in the 4 photometric bands (u, g, r, i) observed by KiDS, by selecting high signal-to-noise ratio (S/N > 50) systems in every bands. We have classified galaxies as spheroids and disc-dominated by combining their spectral energy distribution properties and their S\'ersic index. Using photometric redshifts derived from a machine learning technique, we have determined the evolution of the effective radius, \Re\ and stellar mass, \mst, versus redshift, for both mass complete samples of spheroids and disc-dominated galaxies up to z ~ 0.6. Our results show a significant evolution of the structural quantities at intermediate redshift for the massive spheroids ($\mbox{Log}\ M_*/M_\odot>11$, Chabrier IMF), while almost no evolution has found for less massive ones ($\mbox{Log}\ M_*/M_\odot < 11$). On the other hand, disc dominated systems show a milder evolution in the less massive systems ($\mbox{Log}\ M_*/M_\odot < 11$) and possibly no evolution of the more massive systems. These trends are generally consistent with predictions from hydrodynamical simulations and independent datasets out to redshift z ~ 0.6, although in some cases the scatter of the data is large to drive final conclusions. These results, based on 1/10 of the expected KiDS area, reinforce precedent finding based on smaller statistical samples and show the route toward more accurate results, expected with the the next survey releases.

[4] oai:arXiv.org:1802.07683 [pdf] - 1715967

Data Deluge in Astrophysics: Photometric Redshifts as a Template Use Case

Brescia, Massimo; Cavuoti, Stefano; Amaro, Valeria; Riccio, Giuseppe; Angora, Giuseppe; Vellucci, Civita; Longo, Giuseppe

Comments: 13 pages, 3 figures, Springer's Communications in Computer and Information Science (CCIS), Vol. 822

Submitted: 2018-02-21, last modified: 2018-07-16

Astronomy has entered the big data era and Machine Learning based methods have found widespread use in a large variety of astronomical applications. This is demonstrated by the recent huge increase in the number of publications making use of this new approach. The usage of machine learning methods, however is still far from trivial and many problems still need to be solved. Using the evaluation of photometric redshifts as a case study, we outline the main problems and some ongoing efforts to solve them.

[5] oai:arXiv.org:1802.10282 [pdf] - 1699787

Weak Lensing Study in VOICE Survey I: Shear Measurement

Fu, Liping; Liu, Dezi; Radovich, Mario; Liu, Xiangkun; Pan, Chuzhong; Fan, Zuhui; Covone, Giovanni; Vaccari, Mattia; Amaro, Valeria; Brescia, Massimo; Capaccioli, Massimo; De Cicco, Demetra; Grado, Aniello; Limatola, Luca; Miller, Lance; Napolitano, Nicola R.; Paolillo, Maurizio; Pignata, Giuliano

Comments: 15 pages, 16 figures, 4 tables. MNRAS Accepted

Submitted: 2018-02-28, last modified: 2018-06-13

The VST Optical Imaging of the CDFS and ES1 Fields (VOICE) Survey is a Guaranteed Time program carried out with the ESO/VST telescope to provide deep optical imaging over two 4 deg$^2$ patches of the sky centred on the CDFS and ES1 pointings. We present the cosmic shear measurement over the 4 deg$^2$ covering the CDFS region in the $r$-band using LensFit. Each of the four tiles of 1 deg$^2$ has more than one hundred exposures, of which more than 50 exposures passed a series of image quality selection criteria for weak lensing study. The $5\sigma$ limiting magnitude in $r$- band is 26.1 for point sources, which is $\sim$1 mag deeper than other weak lensing survey in the literature (e.g. the Kilo Degree Survey, KiDS, at VST). The photometric redshifts are estimated using the VOICE $u,g,r,i$ together with near-infrared VIDEO data $Y,J,H,K_s$. The mean redshift of the shear catalogue is 0.87, considering the shear weight. The effective galaxy number density is 16.35 gal/arcmin$^2$, which is nearly twice the one of KiDS. The performance of LensFit on such a deep dataset was calibrated using VOICE-like mock image simulations. Furthermore, we have analyzed the reliability of the shear catalogue by calculating the star-galaxy cross-correlations, the tomographic shear correlations of two redshift bins and the contaminations of the blended galaxies. As a further sanity check, we have constrained cosmological parameters by exploring the parameter space with Population Monte Carlo sampling. For a flat $\Lambda$CDM model we have obtained $\Sigma_8$ = $\sigma_8(\Omega_m/0.3)^{0.5}$ = $0.68^{+0.11}_{-0.15}$.

[6] oai:arXiv.org:1709.04205 [pdf] - 1736187

Photometric redshifts for the Kilo-Degree Survey. Machine-learning analysis with artificial neural networks

Comments: A&A, in press. Data available from the KiDS website http://kids.strw.leidenuniv.nl/DR3/ml-photoz.php#annz2

Submitted: 2017-09-13, last modified: 2018-05-11

We present a machine-learning photometric redshift analysis of the Kilo-Degree Survey Data Release 3, using two neural-network based techniques: ANNz2 and MLPQNA. Despite limited coverage of spectroscopic training sets, these ML codes provide photo-zs of quality comparable to, if not better than, those from the BPZ code, at least up to zphot<0.9 and r<23.5. At the bright end of r<20, where very complete spectroscopic data overlapping with KiDS are available, the performance of the ML photo-zs clearly surpasses that of BPZ, currently the primary photo-z method for KiDS. Using the Galaxy And Mass Assembly (GAMA) spectroscopic survey as calibration, we furthermore study how photo-zs improve for bright sources when photometric parameters additional to magnitudes are included in the photo-z derivation, as well as when VIKING and WISE infrared bands are added. While the fiducial four-band ugri setup gives a photo-z bias $\delta z=-2e-4$ and scatter $\sigma_z<0.022$ at mean z = 0.23, combining magnitudes, colours, and galaxy sizes reduces the scatter by ~7% and the bias by an order of magnitude. Once the ugri and IR magnitudes are joined into 12-band photometry spanning up to 12 $\mu$, the scatter decreases by more than 10% over the fiducial case. Finally, using the 12 bands together with optical colours and linear sizes gives $\delta z<4e-5$ and $\sigma_z<0.019$. This paper also serves as a reference for two public photo-z catalogues accompanying KiDS DR3, both obtained using the ANNz2 code. The first one, of general purpose, includes all the 39 million KiDS sources with four-band ugri measurements in DR3. The second dataset, optimized for low-redshift studies such as galaxy-galaxy lensing, is limited to r<20, and provides photo-zs of much better quality than in the full-depth case thanks to incorporating optical magnitudes, colours, and sizes in the GAMA-calibrated photo-z derivation.

[7] oai:arXiv.org:1706.03501 [pdf] - 1584534

Probability density estimation of photometric redshifts based on machine learning

Cavuoti, Stefano; Brescia, Massimo; Amaro, Valeria; Vellucci, Civita; Longo, Giuseppe; Tortora, Crescenzo

Comments: 2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016 7849953

Submitted: 2017-06-12

Photometric redshifts (photo-z's) provide an alternative way to estimate the distances of large samples of galaxies and are therefore crucial to a large variety of cosmological problems. Among the various methods proposed over the years, supervised machine learning (ML) methods capable to interpolate the knowledge gained by means of spectroscopical data have proven to be very effective. METAPHOR (Machine-learning Estimation Tool for Accurate PHOtometric Redshifts) is a novel method designed to provide a reliable PDF (Probability density Function) of the error distribution of photometric redshifts predicted by ML methods. The method is implemented as a modular workflow, whose internal engine for photo-z estimation makes use of the MLPQNA neural network (Multi Layer Perceptron with Quasi Newton learning rule), with the possibility to easily replace the specific machine learning model chosen to predict photo-z's. After a short description of the software, we present a summary of results on public galaxy data (Sloan Digital Sky Survey - Data Release 9) and a comparison with a completely different method based on Spectral Energy Distribution (SED) template fitting.

[8] oai:arXiv.org:1703.02991 [pdf] - 1581832

The third data release of the Kilo-Degree Survey and associated data products

Comments: small modifications; 27 pages, 12 figures, accepted for publication in Astronomy & Astrophysics

Submitted: 2017-03-08, last modified: 2017-05-21

The Kilo-Degree Survey (KiDS) is an ongoing optical wide-field imaging survey with the OmegaCAM camera at the VLT Survey Telescope. It aims to image 1500 square degrees in four filters (ugri). The core science driver is mapping the large-scale matter distribution in the Universe, using weak lensing shear and photometric redshift measurements. Further science cases include galaxy evolution, Milky Way structure, detection of high-redshift clusters, and finding rare sources such as strong lenses and quasars. Here we present the third public data release (DR3) and several associated data products, adding further area, homogenized photometric calibration, photometric redshifts and weak lensing shear measurements to the first two releases. A dedicated pipeline embedded in the Astro-WISE information system is used for the production of the main release. Modifications with respect to earlier releases are described in detail. Photometric redshifts have been derived using both Bayesian template fitting, and machine-learning techniques. For the weak lensing measurements, optimized procedures based on the THELI data reduction and lensfit shear measurement packages are used. In DR3 stacked ugri images, weight maps, masks, and source lists for 292 new survey tiles (~300 sq.deg) are made available. The multi-band catalogue, including homogenized photometry and photometric redshifts, covers the combined DR1, DR2 and DR3 footprint of 440 survey tiles (447 sq.deg). Limiting magnitudes are typically 24.3, 25.1, 24.9, 23.8 (5 sigma in a 2 arcsec aperture) in ugri, respectively, and the typical r-band PSF size is less than 0.7 arcsec. The photometric homogenization scheme ensures accurate colors and an absolute calibration stable to ~2% for gri and ~3% in u. Separately released are a weak lensing shear catalogue and photometric redshifts based on two different machine-learning techniques.

[9] oai:arXiv.org:1703.02292 [pdf] - 1581787

METAPHOR: Probability density estimation for machine learning based photometric redshifts

Amaro, Valeria; Cavuoti, Stefano; Brescia, Massimo; Vellucci, Civita; Tortora, Crescenzo; Longo, Giuseppe

Comments: proceedings of the International Astronomical Union, IAU-325 symposium, Cambridge University press

Submitted: 2017-03-07

We present METAPHOR (Machine-learning Estimation Tool for Accurate PHOtometric Redshifts), a method able to provide a reliable PDF for photometric galaxy redshifts estimated through empirical techniques. METAPHOR is a modular workflow, mainly based on the MLPQNA neural network as internal engine to derive photometric galaxy redshifts, but giving the possibility to easily replace MLPQNA with any other method to predict photo-z's and their PDF. We present here the results about a validation test of the workflow on the galaxies from SDSS-DR9, showing also the universality of the method by replacing MLPQNA with KNN and Random Forest models. The validation test include also a comparison with the PDF's derived from a traditional SED template fitting method (Le Phare).

[10] oai:arXiv.org:1701.08120 [pdf] - 1581300

Cooperative photometric redshift estimation

Cavuoti, Stefano; Tortora, Crescenzo; Brescia, Massimo; Longo, Giuseppe; Radovich, Mario; Napolitano, Nicola R.; Amaro, Valeria; Vellucci, Civita

Comments: 6 pages, 1 figure, proceedings of the International Astronomical Union, IAU-325 symposium, Cambridge University press

Submitted: 2017-01-27

In the modern galaxy surveys photometric redshifts play a central role in a broad range of studies, from gravitational lensing and dark matter distribution to galaxy evolution. Using a dataset of about 25,000 galaxies from the second data release of the Kilo Degree Survey (KiDS) we obtain photometric redshifts with five different methods: (i) Random forest, (ii) Multi Layer Perceptron with Quasi Newton Algorithm, (iii) Multi Layer Perceptron with an optimization network based on the Levenberg-Marquardt learning rule, (iv) the Bayesian Photometric Redshift model (or BPZ) and (v) a classical SED template fitting procedure (Le Phare). We show how SED fitting techniques could provide useful information on the galaxy spectral type which can be used to improve the capability of machine learning methods constraining systematic errors and reduce the occurrence of catastrophic outliers. We use such classification to train specialized regression estimators, by demonstrating that such hybrid approach, involving SED fitting and machine learning in a single collaborative framework, is capable to improve the overall prediction accuracy of photometric redshifts.

[11] oai:arXiv.org:1612.02173 [pdf] - 1533068

A cooperative approach among methods for photometric redshifts estimation: an application to KiDS data

Cavuoti, Stefano; Tortora, Crescenzo; Brescia, Massimo; Longo, Giuseppe; Radovich, Mario; Napolitano, Nicola R.; Amaro, Valeria; Vellucci, Civita; La Barbera, Francesco; Getman, Fedor; Grado, Aniello

Comments: Accepted by MNRAS, 17 pages, 11 figures

Submitted: 2016-12-07

Photometric redshifts (photo-z's) are fundamental in galaxy surveys to address different topics, from gravitational lensing and dark matter distribution to galaxy evolution. The Kilo Degree Survey (KiDS), i.e. the ESO public survey on the VLT Survey Telescope (VST), provides the unprecedented opportunity to exploit a large galaxy dataset with an exceptional image quality and depth in the optical wavebands. Using a KiDS subset of about 25,000 galaxies with measured spectroscopic redshifts, we have derived photo-z's using i) three different empirical methods based on supervised machine learning, ii) the Bayesian Photometric Redshift model (or BPZ), and iii) a classical SED template fitting procedure (Le Phare). We confirm that, in the regions of the photometric parameter space properly sampled by the spectroscopic templates, machine learning methods provide better redshift estimates, with a lower scatter and a smaller fraction of outliers. SED fitting techniques, however, provide useful information on the galaxy spectral type which can be effectively used to constrain systematic errors and to better characterize potential catastrophic outliers. Such classification is then used to specialize the training of regression machine learning models, by demonstrating that a hybrid approach, involving SED fitting and machine learning in a single collaborative framework, can be effectively used to improve the accuracy of photo-z estimates.

[12] oai:arXiv.org:1611.02162 [pdf] - 1532474

METAPHOR: A machine learning based method for the probability density estimation of photometric redshifts

Cavuoti, Stefano; Amaro, Valeria; Brescia, Massimo; Vellucci, Civita; Tortora, Crescenzo; Longo, Giuseppe

Comments: Accepted from MNRAS, 17 pages, 16 figures

Submitted: 2016-11-07

A variety of fundamental astrophysical science topics require the determination of very accurate photometric redshifts (photo-z's). A wide plethora of methods have been developed, based either on template models fitting or on empirical explorations of the photometric parameter space. Machine learning based techniques are not explicitly dependent on the physical priors and able to produce accurate photo-z estimations within the photometric ranges derived from the spectroscopic training set. These estimates, however, are not easy to characterize in terms of a photo-z Probability Density Function (PDF), due to the fact that the analytical relation mapping the photometric parameters onto the redshift space is virtually unknown. We present METAPHOR (Machine-learning Estimation Tool for Accurate PHOtometric Redshifts), a method designed to provide a reliable PDF of the error distribution for empirical techniques. The method is implemented as a modular workflow, whose internal engine for photo-z estimation makes use of the MLPQNA neural network (Multi Layer Perceptron with Quasi Newton learning rule), with the possibility to easily replace the specific machine learning model chosen to predict photo-z's. We present a summary of results on SDSS-DR9 galaxy data, used also to perform a direct comparison with PDF's obtained by the Le Phare SED template fitting. We show that METAPHOR is capable to estimate the precision and reliability of photometric redshifts obtained with three different self-adaptive techniques, i.e. MLPQNA, Random Forest and the standard K-Nearest Neighbors models.