Normalized to: Murtagh, F.
[1]
oai:arXiv.org:1612.03931 [pdf] - 1662265
Hierarchical Matching and Regression with Application to Photometric
Redshift Estimation
Submitted: 2016-12-12
This work emphasizes that heterogeneity, diversity, discontinuity, and
discreteness in data are to be exploited in classification and regression
problems. A global a priori model may not be desirable. For data analytics in
cosmology, this is motivated by the variety of cosmological objects such as
elliptical, spiral, active, and merging galaxies at a wide range of redshifts.
Our aim is matching and similarity-based analytics that takes account of
discrete relationships in the data. The information structure of the data is
represented by a hierarchy or tree where the branch structure, rather than just
the proximity, is important. The representation is related to p-adic number
theory. The clustering or binning of the data values, related to the precision
of the measurements, has a central role in this methodology. If used for
regression, our approach is a method of cluster-wise regression, generalizing
nearest neighbour regression. Both to exemplify this analytics approach, and to
demonstrate computational benefits, we address the well-known photometric
redshift or 'photo-z' problem, seeking to match Sloan Digital Sky Survey (SDSS)
spectroscopic and photometric redshifts.
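
The cluster-wise regression described in this abstract can be illustrated with a minimal sketch. It assumes redshifts lie in [0, 1), that bins are defined by shared leading decimal digits, and that the prediction is the mean spectroscopic value of the deepest non-empty bin containing the query. The function names (`digit_prefix`, `build_tree`, `predict`) and the choice of the mean are illustrative assumptions, not the authors' implementation.

```python
from collections import defaultdict
from decimal import Decimal
from statistics import mean


def digit_prefix(x, depth):
    """First `depth` decimal digits of x in [0, 1), truncated (not rounded)."""
    # Decimal(str(x)) sidesteps binary floating-point artefacts when shifting digits.
    return str(int(Decimal(str(x)).scaleb(depth))).zfill(depth)


def build_tree(photo_z, spec_z, max_depth):
    """Map (depth, prefix) to the spectroscopic values of training objects in that bin."""
    tree = defaultdict(list)
    for p, s in zip(photo_z, spec_z):
        for depth in range(1, max_depth + 1):
            tree[(depth, digit_prefix(p, depth))].append(s)
    return tree


def predict(tree, query, max_depth):
    """Return (mean spectroscopic value, depth) for the deepest bin holding the query."""
    for depth in range(max_depth, 0, -1):
        key = (depth, digit_prefix(query, depth))
        if key in tree:  # membership test does not create a new entry
            return mean(tree[key]), depth
    return None, 0  # no training object shares even the first digit
```

A query that matches no training object at full precision falls back to a coarser bin, which is how this sketch generalizes nearest neighbour regression: the returned depth indicates how precise the match was.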
[2]
oai:arXiv.org:1104.4063 [pdf] - 1579365
Fast redshift clustering with the Baire (ultra) metric
Submitted: 2011-04-20
The Baire metric induces an ultrametric on a dataset and is of linear
computational complexity, contrasted with the standard quadratic time
agglomerative hierarchical clustering algorithm. We apply the Baire distance to
spectrometric and photometric redshifts from the Sloan Digital Sky Survey
using, in this work, about half a million astronomical objects. We want to know
how well the (more costly to determine) spectrometric redshifts can predict
the (more easily obtained) photometric redshifts, i.e. we seek to regress the
spectrometric on the photometric redshifts, and we develop a clusterwise
nearest neighbor regression procedure for this.
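
As a rough illustration of the Baire distance named in this abstract, the sketch below treats each value in [0, 1) as a string of decimal digits and sets the distance to `base**(-k)`, where `k` is the length of the longest common digit prefix. The default base of 2, the truncation to a fixed precision, and the function names are assumptions made for this example. Grouping by prefix needs a single pass over the data, which is where the linear complexity comes from.

```python
from collections import defaultdict
from decimal import Decimal


def digit_string(x, precision):
    """Return the first `precision` decimal digits of x in [0, 1), truncated."""
    # Decimal(str(x)) avoids binary floating-point artefacts such as
    # 0.29 * 100 == 28.999999999999996.
    return str(int(Decimal(str(x)).scaleb(precision))).zfill(precision)


def baire_distance(x, y, precision, base=2):
    """Baire distance base**(-k), with k the longest common prefix length."""
    dx, dy = digit_string(x, precision), digit_string(y, precision)
    if dx == dy:
        return 0.0  # indistinguishable at the working precision (a choice made here)
    k = 0
    while dx[k] == dy[k]:  # terminates because the strings differ somewhere
        k += 1
    return float(base) ** (-k)


def prefix_clusters(values, depth):
    """Group values by their first `depth` digits in one pass over the data."""
    clusters = defaultdict(list)
    for v in values:
        clusters[digit_string(v, depth)].append(v)
    return dict(clusters)
```

Calling `prefix_clusters` at increasing depths yields nested partitions, that is, the levels of a hierarchy, without computing any pairwise distances.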
[3]
oai:arXiv.org:astro-ph/0504181 [pdf] - 72278
The application of a Trous wave filtering and Monte Carlo analysis on
SECIS 2001 solar eclipse observations
Submitted: 2005-04-07
8000 images of the solar corona were captured during the June 2001 total
solar eclipse. New software for the alignment of the images and an automated
technique for detecting intensity oscillations using multiscale wavelet
analysis were developed. Large areas of the images covered by the Moon and the
upper corona were scanned for oscillations and the statistical properties of
the atmospheric effects were determined. The à trous wavelet transform was used
for noise reduction and Monte Carlo analysis as a significance test of the
detections. The effectiveness of those techniques is discussed in detail.
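
For readers unfamiliar with the à trous transform mentioned in this abstract, here is a minimal one-dimensional sketch. It assumes the commonly used B3-spline kernel and mirrored boundaries; the abstract does not state which kernel or boundary handling the authors used, so both are assumptions, as are the function names.

```python
# B3-spline smoothing kernel (an assumed, commonly used choice).
B3_KERNEL = (1 / 16, 1 / 4, 3 / 8, 1 / 4, 1 / 16)


def _mirror(i, n):
    """Reflect an out-of-range index back into [0, n - 1]."""
    if n == 1:
        return 0
    period = 2 * (n - 1)
    i = abs(i) % period
    return i if i < n else period - i


def a_trous(signal, n_scales):
    """Return (wavelet planes, final smooth array) for a 1-D signal."""
    n = len(signal)
    smooth = [float(v) for v in signal]
    planes = []
    for j in range(n_scales):
        step = 2 ** j  # kernel taps are spread apart ("holes") at each scale
        next_smooth = [
            sum(w * smooth[_mirror(k + (m - 2) * step, n)]
                for m, w in enumerate(B3_KERNEL))
            for k in range(n)
        ]
        # Wavelet plane = detail lost between two successive smoothings.
        planes.append([a - b for a, b in zip(smooth, next_smooth)])
        smooth = next_smooth
    return planes, smooth
```

Because every plane is a difference of successive smoothings, summing all planes and the final smooth array recovers the input. Noise reduction is then typically done by suppressing small coefficients in the finest planes before summing.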
[4]
oai:arXiv.org:astro-ph/0411722 [pdf] - 69321
Initial results from SECIS observations of the 2001 eclipse
Submitted: 2004-11-26
SECIS observations of the June 2001 total solar eclipse were taken using an
Fe XIV 5303 Å filter. Automated tools based on wavelet analysis were
used to detect intensity oscillations in various areas of the images.
Statistical analysis of the detections found in the areas covered by the moon
and the upper corona allowed us to estimate the atmospheric and instrumental
effects on the detection of intensity oscillations. An area of the lower
corona, close to Active Region 9513, was found to show a statistically
significant number of intensity oscillations with a periodicity of $\sim7.5$ s.
The shape of the wavelet transform of those detections matches theoretical
predictions of sausage-mode perturbations and, for the first time in the SECIS
project, second-order oscillations were also detected.
[5]
oai:arXiv.org:astro-ph/0305225 [pdf] - 56698
Eclipse observations of high-frequency oscillations in active region
coronal loops
Submitted: 2003-05-13
One of the mechanisms proposed for heating the corona above solar active
regions is the damping of magnetohydrodynamic (MHD) waves. Continuing on
previous work, we provide observational evidence for the existence of
high-frequency MHD waves in coronal loops observed during the August 1999 total
solar eclipse. A wavelet analysis is used to identify twenty 4x4 arcsec$^2$ areas
showing intensity oscillations. All detections lie in the frequency Hz (5-3 s),
last for at least 3 periods at a confidence level of more than 99% and arise
just outside known coronal loops. This leads us to suggest that they occur in
low emission-measure or different temperature loops associated with the active
region.
[6]
oai:arXiv.org:astro-ph/0002113 [pdf] - 34456
Search and Discovery Tools for Astronomical On-line Resources and
Services
Submitted: 2000-02-04
A growing number of astronomical resources and data or information services
are made available through the Internet. However, valuable information is
frequently hidden in a deluge of non-pertinent or out-of-date documents. At
a first level, compilations of astronomical resources provide help for
selecting relevant sites. Combining yellow-page services and meta-databases of
active pointers may be an efficient solution to the data retrieval problem.
Responses generated by submission of queries to a set of heterogeneous
resources are difficult to merge or cross-match, because different data
providers generally use different data formats: new endeavors are under way to
tackle this problem. We review the technical challenges involved in trying to
provide general search and discovery tools, and to integrate them through upper
level interfaces.
[7]
oai:arXiv.org:astro-ph/9802085 [pdf] - 100256
Three types of gamma-ray bursts
Submitted: 1998-02-07
A multivariate analysis of gamma-ray burst (GRB) bulk properties is presented
to discriminate between distinct classes of GRBs. Several variables
representing burst duration, fluence and spectral hardness are considered. Two
multivariate clustering procedures are used on a sample of 797 bursts from the
Third BATSE Catalog: a nonparametric average linkage hierarchical agglomerative
clustering procedure validated with Wilks' $\Lambda^*$ and other MANOVA tests;
and a parametric maximum likelihood model-based clustering procedure assuming
multinormal populations calculated with the EM Algorithm and validated with the
Bayesian Information Criterion.
The two methods yield very similar results. The BATSE GRB population consists
of three classes with the following Duration/Fluence/Spectrum bulk properties:
Class I with long/bright/intermediate bursts, Class II with short/faint/hard
bursts, and Class III with intermediate/intermediate/soft bursts. One outlier
with poor data is also present. Classes I and II correspond to those reported
by Kouveliotou et al. (1993), but Class III is clearly defined here for the
first time.
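
The first of the two clustering procedures named in this abstract, average linkage hierarchical agglomerative clustering, can be sketched in a few lines. This is a naive illustration only: it assumes Euclidean distance, stops at a requested number of clusters, and omits the MANOVA validation step. The function name and the stopping rule are assumptions, not the authors' code.

```python
from itertools import combinations
from math import dist


def average_linkage(points, n_clusters):
    """Merge clusters by smallest mean pairwise distance until n_clusters remain."""
    clusters = [[i] for i in range(len(points))]  # start with one cluster per point

    def avg_dist(a, b):
        # Average linkage: mean distance over all cross-cluster pairs.
        return sum(dist(points[i], points[j]) for i in a for j in b) / (len(a) * len(b))

    while len(clusters) > n_clusters:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda ab: avg_dist(clusters[ab[0]], clusters[ab[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]  # a < b, so deleting b is safe
        del clusters[b]
    return clusters
```

The function returns lists of point indices, so each burst can be traced back to its row in whatever table of duration, fluence, and hardness variables is supplied.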
[8]
oai:arXiv.org:astro-ph/9411028 [pdf] - 92010
Network Resources for Astronomers
Submitted: 1994-11-07
The amount of data produced by large observational facilities and space
missions has led to the archiving and on-line accessibility of much of this
data, available to the entire astronomical community. This allows a much wider
multi-frequency approach to astronomical research than previously possible.
Here we provide an overview of these services, and give a basic description of
their contents and possibilities for accessing them. Apart from services
providing observational data, many of those providing general information, e.g.
on addresses, bibliographies, software etc. are also described. The field is
rapidly growing with improved network technology, and despite our attempt to
keep the report as complete and up to date as possible, it will inevitably be
outdated shortly. We will endeavor to maintain an updated version of this document
on-line.