Normalized to: Amaro, V.
[1]
oai:arXiv.org:2007.01840 [pdf] - 2127599
Rejection criteria based on outliers in the KiDS photometric redshifts
and PDF distributions derived by machine learning
Amaro, Valeria;
Cavuoti, Stefano;
Brescia, Massimo;
Riccio, Giuseppe;
Tortora, Crescenzo;
D'Addona, Maurizio;
Veneri, Michele Delli;
Napolitano, Nicola R.;
Radovich, Mario;
Longo, Giuseppe
Submitted: 2020-07-03
The Probability Density Function (PDF) provides an estimate of the
photometric redshift (zphot) prediction error. It is crucial for current and
future sky surveys, characterized by strict requirements on the zphot
precision, reliability and completeness. The present work stands on the
assumption that properly defined rejection criteria, capable of identifying and
rejecting potential outliers, can increase the precision of zphot estimates and
of their cumulative PDF, without sacrificing much in terms of completeness of
the sample. We provide a way to assess rejection through proper cuts on the
shape descriptors of a PDF, such as the width and the height of the maximum
PDF's peak. In this work we tested these rejection criteria to galaxies with
photometry extracted from the Kilo Degree Survey (KiDS) ESO Data Release 4,
proving that such approach could lead to significant improvements to the zphot
quality: e.g., for the clipped sample showing the best trade-off between
precision and completeness, we achieve a reduction in outliers fraction of
$\simeq 75\%$ and an improvement of $\simeq 6\%$ for NMAD, with respect to the
original data set, preserving the $\simeq 93\%$ of its content.
[2]
oai:arXiv.org:1810.09777 [pdf] - 1774830
Statistical analysis of probability density functions for photometric
redshifts through the KiDS-ESO-DR3 galaxies
Amaro, Valeria;
Cavuoti, Stefano;
Brescia, Massimo;
Vellucci, Civita;
Longo, Giuseppe;
Bilicki, Maciej;
de Jong, Jelte T. A.;
Tortora, Crescenzo;
Radovich, Mario;
Napolitano, Nicola R.;
Buddelmeijer, Hugo
Submitted: 2018-10-23
Despite the high accuracy of photometric redshifts (zphot) derived using
Machine Learning (ML) methods, the quantification of errors through reliable
and accurate Probability Density Functions (PDFs) is still an open problem.
First, because it is difficult to accurately assess the contribution from
different sources of errors, namely internal to the method itself and from the
photometric features defining the available parameter space. Second, because
the problem of defining a robust statistical method, always able to quantify
and qualify the PDF estimation validity, is still an open issue. We present a
comparison among PDFs obtained using three different methods on the same data
set: two ML techniques, METAPHOR (Machine-learning Estimation Tool for Accurate
PHOtometric Redshifts) and ANNz2, plus the spectral energy distribution
template fitting method, BPZ. The photometric data were extracted from the KiDS
(Kilo Degree Survey) ESO Data Release 3, while the spectroscopy was obtained
from the GAMA (Galaxy and Mass Assembly) Data Release 2. The statistical
evaluation of both individual and stacked PDFs was done through quantitative
and qualitative estimators, including a dummy PDF, useful to verify whether
different statistical estimators can correctly assess PDF quality. We conclude
that, in order to quantify the reliability and accuracy of any zphot PDF
method, a combined set of statistical estimators is required.
[3]
oai:arXiv.org:1807.06085 [pdf] - 1719591
Evolution of galaxy size--stellar mass relation from the Kilo Degree
Survey
Roy, N.;
Napolitano, N. R.;
La Barbera, F.;
Tortora, C.;
Getman, F.;
Radovich, M.;
Capaccioli, M.;
Brescia, M.;
Cavuoti, S.;
Longo, G.;
Raj, M. A.;
Puddu, E.;
Covone, G.;
Amaro, V.;
Vellucci, C.;
Grado, A.;
Kuijken, K.;
Kleijn, G. Verdoes;
Valentijn, E.
Submitted: 2018-07-16
We have obtained structural parameters of about 340,000 galaxies from the
Kilo Degree Survey (KiDS) in 153 square degrees of data release 1, 2 and 3. We
have performed a seeing convolved 2D single S\'ersic fit to the galaxy images
in the 4 photometric bands (u, g, r, i) observed by KiDS, by selecting high
signal-to-noise ratio (S/N > 50) systems in every bands.
We have classified galaxies as spheroids and disc-dominated by combining
their spectral energy distribution properties and their S\'ersic index. Using
photometric redshifts derived from a machine learning technique, we have
determined the evolution of the effective radius, \Re\ and stellar mass, \mst,
versus redshift, for both mass complete samples of spheroids and disc-dominated
galaxies up to z ~ 0.6.
Our results show a significant evolution of the structural quantities at
intermediate redshift for the massive spheroids ($\mbox{Log}\ M_*/M_\odot>11$,
Chabrier IMF), while almost no evolution has found for less massive ones
($\mbox{Log}\ M_*/M_\odot < 11$). On the other hand, disc dominated systems
show a milder evolution in the less massive systems ($\mbox{Log}\ M_*/M_\odot <
11$) and possibly no evolution of the more massive systems. These trends are
generally consistent with predictions from hydrodynamical simulations and
independent datasets out to redshift z ~ 0.6, although in some cases the
scatter of the data is large to drive final conclusions.
These results, based on 1/10 of the expected KiDS area, reinforce precedent
finding based on smaller statistical samples and show the route toward more
accurate results, expected with the the next survey releases.
[4]
oai:arXiv.org:1802.07683 [pdf] - 1715967
Data Deluge in Astrophysics: Photometric Redshifts as a Template Use
Case
Submitted: 2018-02-21, last modified: 2018-07-16
Astronomy has entered the big data era and Machine Learning based methods
have found widespread use in a large variety of astronomical applications. This
is demonstrated by the recent huge increase in the number of publications
making use of this new approach. The usage of machine learning methods, however
is still far from trivial and many problems still need to be solved. Using the
evaluation of photometric redshifts as a case study, we outline the main
problems and some ongoing efforts to solve them.
[5]
oai:arXiv.org:1802.10282 [pdf] - 1699787
Weak Lensing Study in VOICE Survey I: Shear Measurement
Fu, Liping;
Liu, Dezi;
Radovich, Mario;
Liu, Xiangkun;
Pan, Chuzhong;
Fan, Zuhui;
Covone, Giovanni;
Vaccari, Mattia;
Amaro, Valeria;
Brescia, Massimo;
Capaccioli, Massimo;
De Cicco, Demetra;
Grado, Aniello;
Limatola, Luca;
Miller, Lance;
Napolitano, Nicola R.;
Paolillo, Maurizio;
Pignata, Giuliano
Submitted: 2018-02-28, last modified: 2018-06-13
The VST Optical Imaging of the CDFS and ES1 Fields (VOICE) Survey is a
Guaranteed Time program carried out with the ESO/VST telescope to provide deep
optical imaging over two 4 deg$^2$ patches of the sky centred on the CDFS and
ES1 pointings. We present the cosmic shear measurement over the 4 deg$^2$
covering the CDFS region in the $r$-band using LensFit. Each of the four tiles
of 1 deg$^2$ has more than one hundred exposures, of which more than 50
exposures passed a series of image quality selection criteria for weak lensing
study. The $5\sigma$ limiting magnitude in $r$- band is 26.1 for point sources,
which is $\sim$1 mag deeper than other weak lensing survey in the literature
(e.g. the Kilo Degree Survey, KiDS, at VST). The photometric redshifts are
estimated using the VOICE $u,g,r,i$ together with near-infrared VIDEO data
$Y,J,H,K_s$. The mean redshift of the shear catalogue is 0.87, considering the
shear weight. The effective galaxy number density is 16.35 gal/arcmin$^2$,
which is nearly twice the one of KiDS. The performance of LensFit on such a
deep dataset was calibrated using VOICE-like mock image simulations.
Furthermore, we have analyzed the reliability of the shear catalogue by
calculating the star-galaxy cross-correlations, the tomographic shear
correlations of two redshift bins and the contaminations of the blended
galaxies. As a further sanity check, we have constrained cosmological
parameters by exploring the parameter space with Population Monte Carlo
sampling. For a flat $\Lambda$CDM model we have obtained $\Sigma_8$ =
$\sigma_8(\Omega_m/0.3)^{0.5}$ = $0.68^{+0.11}_{-0.15}$.
[6]
oai:arXiv.org:1709.04205 [pdf] - 1736187
Photometric redshifts for the Kilo-Degree Survey. Machine-learning
analysis with artificial neural networks
Bilicki, M.;
Hoekstra, H.;
Brown, M. J. I.;
Amaro, V.;
Blake, C.;
Cavuoti, S.;
de Jong, J. T. A.;
Georgiou, C.;
Hildebrandt, H.;
Wolf, C.;
Amon, A.;
Brescia, M.;
Brough, S.;
Costa-Duarte, M. V.;
Erben, T.;
Glazebrook, K.;
Grado, A.;
Heymans, C.;
Jarrett, T.;
Joudaki, S.;
Kuijken, K.;
Longo, G.;
Napolitano, N.;
Parkinson, D.;
Vellucci, C.;
Kleijn, G. A. Verdoes;
Wang, L.
Submitted: 2017-09-13, last modified: 2018-05-11
We present a machine-learning photometric redshift analysis of the
Kilo-Degree Survey Data Release 3, using two neural-network based techniques:
ANNz2 and MLPQNA. Despite limited coverage of spectroscopic training sets,
these ML codes provide photo-zs of quality comparable to, if not better than,
those from the BPZ code, at least up to zphot<0.9 and r<23.5. At the bright end
of r<20, where very complete spectroscopic data overlapping with KiDS are
available, the performance of the ML photo-zs clearly surpasses that of BPZ,
currently the primary photo-z method for KiDS.
Using the Galaxy And Mass Assembly (GAMA) spectroscopic survey as
calibration, we furthermore study how photo-zs improve for bright sources when
photometric parameters additional to magnitudes are included in the photo-z
derivation, as well as when VIKING and WISE infrared bands are added. While the
fiducial four-band ugri setup gives a photo-z bias $\delta z=-2e-4$ and scatter
$\sigma_z<0.022$ at mean z = 0.23, combining magnitudes, colours, and galaxy
sizes reduces the scatter by ~7% and the bias by an order of magnitude. Once
the ugri and IR magnitudes are joined into 12-band photometry spanning up to 12
$\mu$, the scatter decreases by more than 10% over the fiducial case. Finally,
using the 12 bands together with optical colours and linear sizes gives $\delta
z<4e-5$ and $\sigma_z<0.019$.
This paper also serves as a reference for two public photo-z catalogues
accompanying KiDS DR3, both obtained using the ANNz2 code. The first one, of
general purpose, includes all the 39 million KiDS sources with four-band ugri
measurements in DR3. The second dataset, optimized for low-redshift studies
such as galaxy-galaxy lensing, is limited to r<20, and provides photo-zs of
much better quality than in the full-depth case thanks to incorporating optical
magnitudes, colours, and sizes in the GAMA-calibrated photo-z derivation.
[7]
oai:arXiv.org:1706.03501 [pdf] - 1584534
Probability density estimation of photometric redshifts based on machine
learning
Submitted: 2017-06-12
Photometric redshifts (photo-z's) provide an alternative way to estimate the
distances of large samples of galaxies and are therefore crucial to a large
variety of cosmological problems. Among the various methods proposed over the
years, supervised machine learning (ML) methods capable to interpolate the
knowledge gained by means of spectroscopical data have proven to be very
effective. METAPHOR (Machine-learning Estimation Tool for Accurate PHOtometric
Redshifts) is a novel method designed to provide a reliable PDF (Probability
density Function) of the error distribution of photometric redshifts predicted
by ML methods. The method is implemented as a modular workflow, whose internal
engine for photo-z estimation makes use of the MLPQNA neural network (Multi
Layer Perceptron with Quasi Newton learning rule), with the possibility to
easily replace the specific machine learning model chosen to predict photo-z's.
After a short description of the software, we present a summary of results on
public galaxy data (Sloan Digital Sky Survey - Data Release 9) and a comparison
with a completely different method based on Spectral Energy Distribution (SED)
template fitting.
[8]
oai:arXiv.org:1703.02991 [pdf] - 1581832
The third data release of the Kilo-Degree Survey and associated data
products
de Jong, J. T. A.;
Kleijn, G. A. Verdoes;
Erben, T.;
Hildebrandt, H.;
Kuijken, K.;
Sikkema, G.;
Brescia, M.;
Bilicki, M.;
Napolitano, N. R.;
Amaro, V.;
Begeman, K. G.;
Boxhoorn, D. R.;
Buddelmeijer, H.;
Cavuoti, S.;
Getman, F.;
Grado, A.;
Helmich, E.;
Huang, Z.;
Irisarri, N.;
La Barbera, F.;
Longo, G.;
McFarland, J. P.;
Nakajima, R.;
Paolillo, M.;
Puddu, E.;
Radovich, M.;
Rifatto, A.;
Tortora, C.;
Valentijn, E. A.;
Vellucci, C.;
Vriend, W-J.;
Amon, A.;
Blake, C.;
Choi, A.;
Conti, I. Fenech;
Herbonnet, R.;
Heymans, C.;
Hoekstra, H.;
Klaes, D.;
Merten, J.;
Miller, L.;
Schneider, P.;
Viola, M.
Submitted: 2017-03-08, last modified: 2017-05-21
The Kilo-Degree Survey (KiDS) is an ongoing optical wide-field imaging survey
with the OmegaCAM camera at the VLT Survey Telescope. It aims to image 1500
square degrees in four filters (ugri). The core science driver is mapping the
large-scale matter distribution in the Universe, using weak lensing shear and
photometric redshift measurements. Further science cases include galaxy
evolution, Milky Way structure, detection of high-redshift clusters, and
finding rare sources such as strong lenses and quasars. Here we present the
third public data release (DR3) and several associated data products, adding
further area, homogenized photometric calibration, photometric redshifts and
weak lensing shear measurements to the first two releases. A dedicated pipeline
embedded in the Astro-WISE information system is used for the production of the
main release. Modifications with respect to earlier releases are described in
detail. Photometric redshifts have been derived using both Bayesian template
fitting, and machine-learning techniques. For the weak lensing measurements,
optimized procedures based on the THELI data reduction and lensfit shear
measurement packages are used. In DR3 stacked ugri images, weight maps, masks,
and source lists for 292 new survey tiles (~300 sq.deg) are made available. The
multi-band catalogue, including homogenized photometry and photometric
redshifts, covers the combined DR1, DR2 and DR3 footprint of 440 survey tiles
(447 sq.deg). Limiting magnitudes are typically 24.3, 25.1, 24.9, 23.8 (5 sigma
in a 2 arcsec aperture) in ugri, respectively, and the typical r-band PSF size
is less than 0.7 arcsec. The photometric homogenization scheme ensures accurate
colors and an absolute calibration stable to ~2% for gri and ~3% in u.
Separately released are a weak lensing shear catalogue and photometric
redshifts based on two different machine-learning techniques.
[9]
oai:arXiv.org:1703.02292 [pdf] - 1581787
METAPHOR: Probability density estimation for machine learning based
photometric redshifts
Submitted: 2017-03-07
We present METAPHOR (Machine-learning Estimation Tool for Accurate
PHOtometric Redshifts), a method able to provide a reliable PDF for photometric
galaxy redshifts estimated through empirical techniques. METAPHOR is a modular
workflow, mainly based on the MLPQNA neural network as internal engine to
derive photometric galaxy redshifts, but giving the possibility to easily
replace MLPQNA with any other method to predict photo-z's and their PDF. We
present here the results about a validation test of the workflow on the
galaxies from SDSS-DR9, showing also the universality of the method by
replacing MLPQNA with KNN and Random Forest models. The validation test include
also a comparison with the PDF's derived from a traditional SED template
fitting method (Le Phare).
[10]
oai:arXiv.org:1701.08120 [pdf] - 1581300
Cooperative photometric redshift estimation
Submitted: 2017-01-27
In the modern galaxy surveys photometric redshifts play a central role in a
broad range of studies, from gravitational lensing and dark matter distribution
to galaxy evolution. Using a dataset of about 25,000 galaxies from the second
data release of the Kilo Degree Survey (KiDS) we obtain photometric redshifts
with five different methods: (i) Random forest, (ii) Multi Layer Perceptron
with Quasi Newton Algorithm, (iii) Multi Layer Perceptron with an optimization
network based on the Levenberg-Marquardt learning rule, (iv) the Bayesian
Photometric Redshift model (or BPZ) and (v) a classical SED template fitting
procedure (Le Phare). We show how SED fitting techniques could provide useful
information on the galaxy spectral type which can be used to improve the
capability of machine learning methods constraining systematic errors and
reduce the occurrence of catastrophic outliers. We use such classification to
train specialized regression estimators, by demonstrating that such hybrid
approach, involving SED fitting and machine learning in a single collaborative
framework, is capable to improve the overall prediction accuracy of photometric
redshifts.
[11]
oai:arXiv.org:1612.02173 [pdf] - 1533068
A cooperative approach among methods for photometric redshifts
estimation: an application to KiDS data
Cavuoti, Stefano;
Tortora, Crescenzo;
Brescia, Massimo;
Longo, Giuseppe;
Radovich, Mario;
Napolitano, Nicola R.;
Amaro, Valeria;
Vellucci, Civita;
La Barbera, Francesco;
Getman, Fedor;
Grado, Aniello
Submitted: 2016-12-07
Photometric redshifts (photo-z's) are fundamental in galaxy surveys to
address different topics, from gravitational lensing and dark matter
distribution to galaxy evolution. The Kilo Degree Survey (KiDS), i.e. the ESO
public survey on the VLT Survey Telescope (VST), provides the unprecedented
opportunity to exploit a large galaxy dataset with an exceptional image quality
and depth in the optical wavebands. Using a KiDS subset of about 25,000
galaxies with measured spectroscopic redshifts, we have derived photo-z's using
i) three different empirical methods based on supervised machine learning, ii)
the Bayesian Photometric Redshift model (or BPZ), and iii) a classical SED
template fitting procedure (Le Phare). We confirm that, in the regions of the
photometric parameter space properly sampled by the spectroscopic templates,
machine learning methods provide better redshift estimates, with a lower
scatter and a smaller fraction of outliers. SED fitting techniques, however,
provide useful information on the galaxy spectral type which can be effectively
used to constrain systematic errors and to better characterize potential
catastrophic outliers. Such classification is then used to specialize the
training of regression machine learning models, by demonstrating that a hybrid
approach, involving SED fitting and machine learning in a single collaborative
framework, can be effectively used to improve the accuracy of photo-z
estimates.
[12]
oai:arXiv.org:1611.02162 [pdf] - 1532474
METAPHOR: A machine learning based method for the probability density
estimation of photometric redshifts
Submitted: 2016-11-07
A variety of fundamental astrophysical science topics require the
determination of very accurate photometric redshifts (photo-z's). A wide
plethora of methods have been developed, based either on template models
fitting or on empirical explorations of the photometric parameter space.
Machine learning based techniques are not explicitly dependent on the physical
priors and able to produce accurate photo-z estimations within the photometric
ranges derived from the spectroscopic training set. These estimates, however,
are not easy to characterize in terms of a photo-z Probability Density Function
(PDF), due to the fact that the analytical relation mapping the photometric
parameters onto the redshift space is virtually unknown. We present METAPHOR
(Machine-learning Estimation Tool for Accurate PHOtometric Redshifts), a method
designed to provide a reliable PDF of the error distribution for empirical
techniques. The method is implemented as a modular workflow, whose internal
engine for photo-z estimation makes use of the MLPQNA neural network (Multi
Layer Perceptron with Quasi Newton learning rule), with the possibility to
easily replace the specific machine learning model chosen to predict photo-z's.
We present a summary of results on SDSS-DR9 galaxy data, used also to perform a
direct comparison with PDF's obtained by the Le Phare SED template fitting. We
show that METAPHOR is capable to estimate the precision and reliability of
photometric redshifts obtained with three different self-adaptive techniques,
i.e. MLPQNA, Random Forest and the standard K-Nearest Neighbors models.