Normalized to: Yanxia, Z.
[1]
oai:arXiv.org:1305.5023 [pdf] - 1550237
Estimating Photometric Redshifts of Quasars via K-nearest Neighbor
Approach Based on Large Survey Databases
Submitted: 2013-05-22
We apply the k-nearest neighbor algorithm (kNN), a lazy learning method, to
estimate the photometric redshifts of quasars, based on various
datasets from the Sloan Digital Sky Survey (SDSS), UKIRT Infrared Deep Sky
Survey (UKIDSS) and Wide-field Infrared Survey Explorer (WISE) (the SDSS
sample, the SDSS-UKIDSS sample, the SDSS-WISE sample and the SDSS-UKIDSS-WISE
sample). The influence of the k value and different input patterns on the
performance of kNN is discussed. The value of k at which kNN performs best
differs with the input pattern and the dataset. The best result is obtained
with the SDSS-UKIDSS-WISE sample. The experimental results show that, in
general, the more bands that contribute information, the better kNN performs
at photometric redshift estimation. The results also demonstrate that kNN
using multiband data can effectively solve the catastrophic failure problem of
photometric redshift estimation, which many machine learning methods encounter.
In a comparison of various methods for photometric redshift estimation of
quasars, kNN based on a KD-tree gives the best accuracy for our case.
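
As a rough illustration of the technique named in this abstract, the sketch
below estimates a redshift as the mean of the k nearest training redshifts
found with a KD-tree. It is not the authors' implementation: the function names
(build_kdtree, knn_photoz), the unweighted mean, the Euclidean distance and the
layout of the feature vectors (for example magnitudes or colors) are all
assumptions made for this example.

```python
import heapq


def build_kdtree(points, values, depth=0):
    """Recursively build a KD-tree over (feature vector, redshift) pairs.

    Hypothetical helper: points is a list of equal-length feature vectors,
    values the matching spectroscopic redshifts.
    """
    if not points:
        return None
    axis = depth % len(points[0])
    order = sorted(range(len(points)), key=lambda i: points[i][axis])
    mid = len(order) // 2
    left, right = order[:mid], order[mid + 1:]
    return {
        "point": points[order[mid]],
        "value": values[order[mid]],
        "axis": axis,
        "left": build_kdtree([points[i] for i in left],
                             [values[i] for i in left], depth + 1),
        "right": build_kdtree([points[i] for i in right],
                              [values[i] for i in right], depth + 1),
    }


def _search(node, query, k, heap):
    """Keep the k nearest neighbours in a max-heap keyed by -distance^2."""
    if node is None:
        return
    dist2 = sum((a - b) ** 2 for a, b in zip(node["point"], query))
    item = (-dist2, id(node), node["value"])
    if len(heap) < k:
        heapq.heappush(heap, item)
    elif dist2 < -heap[0][0]:
        heapq.heapreplace(heap, item)
    diff = query[node["axis"]] - node["point"][node["axis"]]
    if diff < 0:
        near, far = node["left"], node["right"]
    else:
        near, far = node["right"], node["left"]
    _search(near, query, k, heap)
    # Visit the far side only if the splitting plane is closer than the
    # current k-th nearest neighbour (or fewer than k have been found).
    if len(heap) < k or diff ** 2 < -heap[0][0]:
        _search(far, query, k, heap)


def knn_photoz(tree, query, k):
    """Assumed estimator: unweighted mean redshift of the k nearest objects."""
    heap = []
    _search(tree, query, k, heap)
    return sum(z for _, _, z in heap) / len(heap)
```

A call such as knn_photoz(build_kdtree(train_features, train_redshifts),
features_of_new_quasar, k=5) would return one estimate; the argument names here
are placeholders, and k and the input pattern would need tuning per dataset, as
the abstract notes.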
[2]
oai:arXiv.org:0802.0537 [pdf] - 9762
Support Vector Machines and Kd-tree for Separating Quasars from Large
Survey Databases
Submitted: 2008-02-04
We compare the performance of two automated classification algorithms,
k-dimensional tree (kd-tree) and support vector machines (SVMs), for separating
quasars from stars in the Sloan Digital Sky Survey (SDSS) and Two Micron All
Sky Survey (2MASS) catalogs. The two algorithms are trained
on subsets of SDSS and 2MASS objects whose nature is known via spectroscopy. We
choose different attribute combinations as input patterns to train the
classifiers using photometric data only and present the classification results
obtained by these two methods. Performance metrics such as precision and
recall, true positive rate and true negative rate, F-measure, G-mean and
Weighted Accuracy are computed to evaluate the performance of the two
algorithms. The study shows that both kd-tree and SVMs are effective automated
algorithms to classify point sources. SVMs show slightly higher accuracy, but
kd-tree requires less computation time. Given different input patterns based on
various parameters (e.g. magnitudes, color information), we conclude that both
kd-tree and SVMs show better performance with fewer features. In addition, our
results indicate that the highest accuracy is reached using the four colors
(u-g, g-r, r-i, i-z) and the r magnitude based on SDSS model magnitudes. The
classifiers trained by kd-tree and SVMs can be used to solve the
automated classification problems faced by the virtual observatory (VO);
they can also be applied to the photometric preselection of quasar candidates
for large survey projects in order to optimize the efficiency of telescopes.
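
The performance metrics listed in this abstract can all be derived from a
binary confusion matrix. The sketch below uses common textbook definitions,
which may differ from the ones used in the paper: F-measure is taken as the
harmonic mean of precision and recall, G-mean as the square root of the product
of the true positive and true negative rates, and weighted accuracy as a
weighted sum of those two rates. The function name, the label convention
(1 = quasar, 0 = star) and the default weight of 0.5 are assumptions.

```python
import math


def classification_metrics(y_true, y_pred, weight=0.5):
    """Compute binary classification metrics (assumed: 1 = quasar, 0 = star)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)

    def ratio(num, den):
        # Return 0.0 instead of raising when a class is absent.
        return num / den if den else 0.0

    precision = ratio(tp, tp + fp)
    recall = ratio(tp, tp + fn)   # identical to the true positive rate
    tnr = ratio(tn, tn + fp)      # true negative rate
    return {
        "precision": precision,
        "recall": recall,
        "true_positive_rate": recall,
        "true_negative_rate": tnr,
        "f_measure": ratio(2 * precision * recall, precision + recall),
        "g_mean": math.sqrt(recall * tnr),
        # Assumed form: weight on quasars, (1 - weight) on stars.
        "weighted_accuracy": weight * recall + (1 - weight) * tnr,
    }
```

Metrics such as G-mean and weighted accuracy are useful here because a plain
accuracy figure can look high on a sample dominated by one class even when the
other class is poorly recovered.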