Normalized to: Narasimhamurthy, A.
[1]
oai:arXiv.org:1604.01722 [pdf] - 1386792
CD-HPF: New Habitability Score Via Data Analytic Modeling
Submitted: 2016-04-06
The search for life on the planets outside the Solar System can be broadly
classified into the following: looking for Earth-like conditions or the planets
similar to the Earth (Earth similarity), and looking for the possibility of
life in a form known or unknown to us (habitability). The two frequently used
indices, ESI and PHI, describe heuristic methods to score
similarity/habitability in the efforts to categorize different exoplanets or
exomoons. ESI, in particular, considers Earth as the reference frame for
habitability and is a quick screening tool to categorize and measure physical
similarity of any planetary body with the Earth. The PHI assesses the
probability that life in some form may exist on any given world, and is based
on the essential requirements of known life: a stable and protected substrate,
energy, appropriate chemistry and a liquid medium. We propose here a different
metric, a Cobb-Douglas Habitability Score (CDHS), based on Cobb-Douglas
habitability production function (CD-HPF), which computes the habitability
score by using measured and calculated planetary input parameters. The proposed
metric, with exponents accounting for metric elasticity, is endowed with
verifiable analytical properties that ensure global optima, and is scalable to
accommodate finitely many input parameters. The model is elastic, does not
suffer from curvature violations and, as we discovered, the standard PHI is a
special case of CDHS. Computed CDHS scores are fed to K-NN (K-Nearest
Neighbour) classification algorithm with probabilistic herding that facilitates
the assignment of exoplanets to appropriate classes via supervised feature
learning methods, producing granular clusters of habitability. The proposed
work describes a decision-theoretical model using the power of convex
optimization and algorithmic machine learning.
[2]
oai:arXiv.org:1504.07865 [pdf] - 986708
ASTROMLSKIT: A New Statistical Machine Learning Toolkit: A Platform for
Data Analytics in Astronomy
Submitted: 2015-04-29
Astroinformatics is a new impact area in the world of astronomy, occasionally
called the final frontier, where several astrophysicists, statisticians and
computer scientists work together to tackle various data intensive astronomical
problems. Exponential growth in the data volume and increased complexity of the
data augments difficult questions to the existing challenges. Classical
problems in Astronomy are compounded by accumulation of astronomical volume of
complex data, rendering the task of classification and interpretation
incredibly laborious. The presence of noise in the data makes analysis and
interpretation even more arduous. Machine learning algorithms and data analytic
techniques provide the right platform for the challenges posed by these
problems. A diverse range of open problem like star-galaxy separation,
detection and classification of exoplanets, classification of supernovae is
discussed. The focus of the paper is the applicability and efficacy of various
machine learning algorithms like K Nearest Neighbor (KNN), random forest (RF),
decision tree (DT), Support Vector Machine (SVM), Na\"ive Bayes and Linear
Discriminant Analysis (LDA) in analysis and inference of the decision theoretic
problems in Astronomy. The machine learning algorithms, integrated into
ASTROMLSKIT, a toolkit developed in the course of the work, have been used to
analyze HabCat data and supernovae data. Accuracy has been found to be
appreciably good.