Normalized to: Pang, D.
[1]
oai:arXiv.org:1810.03190 [pdf] - 1762297
Scalable Solutions for Automated Single Pulse Identification and
Classification in Radio Astronomy
Submitted: 2018-10-07
Data collection for scientific applications is increasing exponentially and
is forecasted to soon reach peta- and exabyte scales. Applications which
process and analyze scientific data must be scalable and focus on execution
performance to keep pace. In the field of radio astronomy, in addition to
increasingly large datasets, tasks such as the identification of transient
radio signals from extrasolar sources are computationally expensive. We present
a scalable approach to radio pulsar detection written in Scala that
parallelizes candidate identification to take advantage of in-memory task
processing using Apache Spark on a YARN distributed system. Furthermore, we
introduce a novel automated multiclass supervised machine learning technique
that we combine with feature selection to reduce the time required for
candidate classification. Experimental testing on a Beowulf cluster with 15
data nodes shows that the parallel implementation of the identification
algorithm offers a speedup of up to 5X that of a similar multithreaded
implementation. Further, we show that the combination of automated multiclass
classification and feature selection speeds up the execution performance of the
RandomForest machine learning algorithm by an average of 54% with less than a
2% average reduction in the algorithm's ability to correctly classify pulsars.
The generalizability of these results is demonstrated by using two real-world
radio astronomy data sets.
[2]
oai:arXiv.org:1807.07164 [pdf] - 1727989
A novel single-pulse search approach to detection of dispersed radio
pulses using clustering and supervised machine learning
Submitted: 2018-07-18, last modified: 2018-08-05
We present a novel two-stage approach which combines unsupervised and
supervised machine learning to automatically identify and classify single
pulses in radio pulsar search data. In the first stage, we identify
astrophysical pulse candidates in the data, which were derived from the Pulsar
Arecibo L-Band Feed Array (PALFA) survey and contain 47,042 independent beams,
as trial single-pulse event groups (SPEGs) by clustering single-pulse events
and merging clusters that fall within the expected DM and time span of
astrophysical pulses. We also present a new peak scoring algorithm, to identify
astrophysical peaks in S/N versus DM curves. Furthermore, we group SPEGs
detected at a consistent DM for they were likely emitted by the same source. In
the second stage, we create a fully labelled benchmark data set by selecting a
subset of data with SPEGs identified (using stage 1 procedures), their features
extracted and individual SPEGs manually labelled, and then train classifiers
using supervised machine learning. Next, using the best trained classifier, we
automatically classify unlabelled SPEGs identified in the full data set. To aid
the examination of dim SPEGs, we develop an algorithm that searches for an
underlying periodicity among grouped SPEGs. The results showed that
RandomForest with SMOTE treatment was the best learner, with a recall of 95.6%
and a false positive rate of 2.0%. In total, besides all 60 known pulsars from
the benchmark data set, the model found 32 additional (i.e., not included in
the benchmark data set) known pulsars, and several potential discoveries.
[3]
oai:arXiv.org:1208.0714 [pdf] - 555415
New determination of the 13C(a, n)16O reaction rate and its influence on
the s-process nucleosynthesis in AGB stars
Guo, B.;
Li, Z. H.;
Lugaro, M.;
Buntain, J.;
Pang, D. Y.;
Li, Y. J.;
Su, J.;
Yan, S. Q.;
Bai, X. X.;
Chen, Y. S.;
Fan, Q. W.;
Jin, S. J.;
Karakas, A. I.;
Li, E. T.;
Li, Z. C.;
Lian, G.;
Liu, J. C.;
Liu, X.;
Shi, J. R.;
Shu, N. C.;
Wang, B. X.;
Wang, Y. B.;
Zeng, S.;
Liu, W. P.
Submitted: 2012-08-03
We present a new measurement of the $\alpha$-spectroscopic factor
($S_\alpha$) and the asymptotic normalization coefficient (ANC) for the 6.356
MeV 1/2$^+$ subthreshold state of $^{17}$O through the $^{13}$C($^{11}$B,
$^{7}$Li)$^{17}$O transfer reaction and we determine the $\alpha$-width of this
state. This is believed to have a strong effect on the rate of the
$^{13}$C($\alpha$, $n$)$^{16}$O reaction, the main neutron source for {\it
slow} neutron captures (the $s$-process) in asymptotic giant branch (AGB)
stars. Based on the new width we derive the astrophysical S-factor and the
stellar rate of the $^{13}$C($\alpha$, $n$)$^{16}$O reaction. At a temperature
of 100 MK our rate is roughly two times larger than that by \citet{cau88} and
two times smaller than that recommended by the NACRE compilation. We use the
new rate and different rates available in the literature as input in
simulations of AGB stars to study their influence on the abundances of selected
$s$-process elements and isotopic ratios. There are no changes in the final
results using the different rates for the $^{13}$C($\alpha$, $n$)$^{16}$O
reaction when the $^{13}$C burns completely in radiative conditions. When the
$^{13}$C burns in convective conditions, as in stars of initial mass lower than
$\sim$2 $M_\sun$ and in post-AGB stars, some changes are to be expected, e.g.,
of up to 25% for Pb in our models. These variations will have to be carefully
analyzed when more accurate stellar mixing models and more precise
observational constraints are available.