Normalized to: Sekora, M.
[1]
oai:arXiv.org:0908.2664 [pdf] - 27428
Detecting Variability in Massive Astronomical Time-Series Data I:
application of an infinite Gaussian mixture model
Submitted: 2009-08-18
We present a new framework to detect various types of variable objects within
massive astronomical time-series data. Assuming that the dominant population of
objects is non-variable, we find outliers from this population by using a
non-parametric Bayesian clustering algorithm based on an infinite
GaussianMixtureModel (GMM) and the Dirichlet Process. The algorithm extracts
information from a given dataset, which is described by six variability
indices. The GMM uses those variability indices to recover clusters that are
described by six-dimensional multivariate Gaussian distributions, allowing our
approach to consider the sampling pattern of time-series data, systematic
biases, the number of data points for each light curve, and photometric
quality. Using the Northern Sky Variability Survey data, we test our approach
and prove that the infinite GMM is useful at detecting variable objects, while
providing statistical inference estimation that suppresses false detection. The
proposed approach will be effective in the exploration of future surveys such
as GAIA, Pan-Starrs, and LSST, which will produce massive time-series data.
[2]
oai:arXiv.org:0807.3762 [pdf] - 14798
The Size Distributions of Asteroid Families in the SDSS Moving Object
Catalog 4
Submitted: 2008-07-23
Asteroid families, traditionally defined as clusters of objects in orbital
parameter space, often have distinctive optical colors. We show that the
separation of family members from background interlopers can be improved with
the aid of SDSS colors as a qualifier for family membership. Based on an
~88,000 object subset of the Sloan Digital Sky Survey Moving Object Catalog 4
with available proper orbital elements, we define 37 statistically robust
asteroid families with at least 100 members using a simple Gaussian
distribution model in both orbital and color space. The interloper rejection
rate based on colors is typically ~10% for a given orbital family definition,
with four families that can be reliably isolated only with the aid of colors.
About 50% of all objects in this data set belong to families, and this fraction
varies from about 35% for objects brighter than an H magnitude of 13 and rises
to 60% for objects fainter than this. The fraction of C-type objects in
families decreases with increasing H magnitude for H > 13, while the fraction
of S-type objects above this limit remains effectively constant. This suggests
that S-type objects require a shorter timescale for equilibrating the
background and family size distributions via collisional processing. The size
distributions for 15 families display a well-defined change of slope and can be
modeled as a "broken" double power-law. Such "broken" size distributions are
twice as likely for S-type familes than for C-type families, and are dominated
by dynamically old families. The remaining families with size distributions
that can be modeled as a single power law are dominated by young families. When
size distribution requires a double power-law model, the two slopes are
correlated and are steeper for S-type families.