Normalized to: Hero, A.
[1]
oai:arXiv.org:1912.06120 [pdf] - 2055372
Solar Flare Intensity Prediction with Machine Learning Models
Submitted: 2019-12-12, last modified: 2020-02-26
We develop a mixed Long Short Term Memory (LSTM) regression model to predict
the maximum solar flare intensity within a 24-hour time window 0$\sim$24,
6$\sim$30, 12$\sim$36 and 24$\sim$48 hours ahead of time using 6, 12, 24 and 48
hours of data (predictors) for each Helioseismic and Magnetic Imager (HMI)
Active Region Patch (HARP). The model makes use of (1) the Space-weather HMI
Active Region Patch (SHARP) parameters as predictors and (2) the exact flare
intensities instead of class labels recorded in the Geostationary Operational
Environmental Satellites (GOES) data set, which serves as the source of the
response variables. Compared to solar flare classification, the model offers us
more detailed information about the exact maximum flux level, i.e. intensity,
for each occurrence of a flare. We also consider classification models built on
top of the regression model and obtain better results in solar flare
classifications. Our results suggest that the most efficient time period for
predicting the solar activity is within 24 hours before the prediction time
using the SHARP parameters and the LSTM model.
[2]
oai:arXiv.org:1912.00502 [pdf] - 2101325
Predicting solar flares with machine learning: investigating solar cycle
dependence
Wang, Xiantong;
Chen, Yang;
Toth, Gabor;
Manchester, Ward B.;
Gombosi, Tamas I.;
Hero, Alfred O.;
Jiao, Zhenbang;
Sun, Hu;
Jin, Meng;
Liu, Yang
Submitted: 2019-12-01, last modified: 2020-01-22
A deep learning network, Long-Short Term Memory (LSTM) network, is used in
this work to predict whether the maximum flare class an active region (AR) will
produce in the next 24 hours is class $\Gamma$. We considered $\Gamma$ are $\ge
M$, $\ge C$ and any flare class. The essence of using LSTM, which is a
recurrent neural network, is its capability to capture temporal information of
the data samples. The input features are time sequences of 20 magnetic
parameters from SHARPs - Space-weather HMI Active Region Patches. We analyzed
active regions from June 2010 to Dec 2018, using the Geostationary Operational
Environmental Satellite (GOES) X-ray flare catalogs and label the data samples
with identified ARs in the GOES X-ray flare catalogs. Our results (i) shows
consistent skill scores with recently published results using LSTMs and better
than the previous work using single time input (eg. DeFN) (ii) The skill scores
from the model show essential differences when different years of data was
chosen for training and testing.
[3]
oai:arXiv.org:1904.00125 [pdf] - 2025437
Identifying Solar Flare Precursors Using Time Series of SDO/HMI Images
and SHARP Parameters
Chen, Yang;
Manchester, Ward B.;
Hero, Alfred O.;
Toth, Gabor;
DuFumier, Benoit;
Zhou, Tian;
Wang, Xiantong;
Zhu, Haonan;
Sun, Zeyu;
Gombosi, Tamas I.
Submitted: 2019-03-29, last modified: 2019-08-03
We present several methods towards construction of precursors, which show
great promise towards early predictions, of solar flare events in this paper. A
data pre-processing pipeline is built to extract useful data from multiple
sources, Geostationary Operational Environmental Satellites (GOES) and Solar
Dynamics Observatory (SDO)/Helioseismic and Magnetic Imager (HMI), to prepare
inputs for machine learning algorithms. Two classification models are
presented: classification of flares from quiet times for active regions and
classification of strong versus weak flare events. We adopt deep learning
algorithms to capture both the spatial and temporal information from HMI
magnetogram data. Effective feature extraction and feature selection with raw
magnetogram data using deep learning and statistical algorithms enable us to
train classification models to achieve almost as good performance as using
active region parameters provided in HMI/Space-Weather HMI-Active Region Patch
(SHARP) data files. Case studies show a significant increase in the prediction
score around 20 hours before strong solar flare events.
[4]
oai:arXiv.org:1503.04127 [pdf] - 1371971
Image patch analysis of sunspots and active regions. I. Intrinsic
dimension and correlation analysis
Submitted: 2015-03-13, last modified: 2015-12-14
The flare-productivity of an active region is observed to be related to its
spatial complexity. Mount Wilson or McIntosh sunspot classifications measure
such complexity but in a categorical way, and may therefore not use all the
information present in the observations. Moreover, such categorical schemes
hinder a systematic study of an active region's evolution for example. We
propose fine-scale quantitative descriptors for an active region's complexity
and relate them to the Mount Wilson classification. We analyze the local
correlation structure within continuum and magnetogram data, as well as the
cross-correlation between continuum and magnetogram data. We compute the
intrinsic dimension, partial correlation, and canonical correlation analysis
(CCA) of image patches of continuum and magnetogram active region images taken
from the SOHO-MDI instrument. We use masks of sunspots derived from continuum
as well as larger masks of magnetic active regions derived from the magnetogram
to analyze separately the core part of an active region from its surrounding
part. We find the relationship between complexity of an active region as
measured by Mount Wilson and the intrinsic dimension of its image patches.
Partial correlation patterns exhibit approximately a third-order Markov
structure. CCA reveals different patterns of correlation between continuum and
magnetogram within the sunspots and in the region surrounding the sunspots.
These results also pave the way for patch-based dictionary learning with a view
towards automatic clustering of active regions.
[5]
oai:arXiv.org:1504.02762 [pdf] - 1371975
Image patch analysis of sunspots and active regions. II. Clustering via
matrix factorization
Submitted: 2015-04-10, last modified: 2015-12-10
Separating active regions that are quiet from potentially eruptive ones is a
key issue in Space Weather applications. Traditional classification schemes
such as Mount Wilson and McIntosh have been effective in relating an active
region large scale magnetic configuration to its ability to produce eruptive
events. However, their qualitative nature prevents systematic studies of an
active region's evolution for example. We introduce a new clustering of active
regions that is based on the local geometry observed in Line of Sight
magnetogram and continuum images. We use a reduced-dimension representation of
an active region that is obtained by factoring the corresponding data matrix
comprised of local image patches. Two factorizations can be compared via the
definition of appropriate metrics on the resulting factors. The distances
obtained from these metrics are then used to cluster the active regions. We
find that these metrics result in natural clusterings of active regions. The
clusterings are related to large scale descriptors of an active region such as
its size, its local magnetic field distribution, and its complexity as measured
by the Mount Wilson classification scheme. We also find that including data
focused on the neutral line of an active region can result in an increased
correspondence between our clustering results and other active region
descriptors such as the Mount Wilson classifications and the $R$ value. We
provide some recommendations for which metrics, matrix factorization
techniques, and regions of interest to use to study active regions.
[6]
oai:arXiv.org:1504.07116 [pdf] - 1371977
Meta learning of bounds on the Bayes classifier error
Submitted: 2015-04-27, last modified: 2015-07-03
Meta learning uses information from base learners (e.g. classifiers or
estimators) as well as information about the learning problem to improve upon
the performance of a single base learner. For example, the Bayes error rate of
a given feature space, if known, can be used to aid in choosing a classifier,
as well as in feature selection and model selection for the base classifiers
and the meta classifier. Recent work in the field of f-divergence functional
estimation has led to the development of simple and rapidly converging
estimators that can be used to estimate various bounds on the Bayes error. We
estimate multiple bounds on the Bayes error using an estimator that applies
meta learning to slowly converging plug-in estimators to obtain the parametric
convergence rate. We compare the estimated bounds empirically on simulated data
and then estimate the tighter bounds on features extracted from an image patch
analysis of sunspot continuum and magnetogram images.
[7]
oai:arXiv.org:1406.6390 [pdf] - 953591
Image patch analysis and clustering of sunspots: a dimensionality
reduction approach
Submitted: 2014-06-24
Sunspots, as seen in white light or continuum images, are associated with
regions of high magnetic activity on the Sun, visible on magnetogram images.
Their complexity is correlated with explosive solar activity and so classifying
these active regions is useful for predicting future solar activity. Current
classification of sunspot groups is visually based and suffers from bias.
Supervised learning methods can reduce human bias but fail to optimally
capitalize on the information present in sunspot images. This paper uses two
image modalities (continuum and magnetogram) to characterize the spatial and
modal interactions of sunspot and magnetic active region images and presents a
new approach to cluster the images. Specifically, in the framework of image
patch analysis, we estimate the number of intrinsic parameters required to
describe the spatial and modal dependencies, the correlation between the two
modalities and the corresponding spatial patterns, and examine the phenomena at
different scales within the images. To do this, we use linear and nonlinear
intrinsic dimension estimators, canonical correlation analysis, and
multiresolution analysis of intrinsic dimension.
[8]
oai:arXiv.org:1110.3052 [pdf] - 622363
The First Stray Light Corrected EUV Images of Solar Coronal Holes
Submitted: 2011-10-13, last modified: 2012-03-07
Coronal holes are the source regions of the fast solar wind, which fills most
of the solar system volume near the cycle minimum. Removing stray light from
extreme ultraviolet (EUV) images of the Sun's corona is of high astrophysical
importance, as it is required to make meaningful determinations of temperatures
and densities of coronal holes. EUV images tend to be dominated by the
component of the stray light due to the long-range scatter caused by
microroughness of telescope mirror surfaces, and this component has proven very
difficult to measure in pre-flight characterization. In-flight characterization
heretofore has proven elusive due to the fact that the detected image is
simultaneously nonlinear in two unknown functions: the stray light pattern and
the true image which would be seen by an ideal telescope. Using a constrained
blind deconvolution technique that takes advantage of known zeros in the true
image provided by a fortuitous lunar transit, we have removed the stray light
from solar images seen by the EUVI instrument on STEREO-B in all four filter
bands (171, 195, 284, and 304 \AA). Uncertainty measures of the stray light
corrected images, which include the systematic error due to misestimation of
the scatter, are provided. It is shown that in EUVI, stray light contributes up
to 70% of the emission in coronal holes seen on the solar disk, which has
dramatic consequences for diagnostics of temperature and density and therefore
estimates of key plasma parameters such as the plasma $\beta$\ and ion-electron
collision rates.