Normalized to: Jiao, Z.
[1]
oai:arXiv.org:1912.12360 [pdf] - 2061638
Interpreting LSTM Prediction on Solar Flare Eruption with Time-series
Clustering
Submitted: 2019-12-27, last modified: 2020-03-09
We conduct a post hoc analysis of solar flare predictions made by a Long
Short Term Memory (LSTM) model employing data in the form of Space-weather HMI
Active Region Patches (SHARP) parameters calculated from data in proximity to
the magnetic polarity inversion line where the flares originate. We train the
LSTM model for binary classification to provide a prediction score for the
probability that an M/X class flare occurs in the next hour. We then develop a
dimension-reduction technique to reduce the dimensionality of the SHARP parameters (LSTM
inputs) and demonstrate the different patterns of SHARP parameters
corresponding to the transition from low to high prediction score. Our work
shows that a subset of SHARP parameters contains the key signals that strong
solar flare eruptions are imminent. The dynamics of these parameters have a
highly uniform trajectory for many events whose LSTM prediction scores for M/X
class flares transition from very low to very high. The results demonstrate the
existence of a few threshold values of SHARP parameters that, when surpassed,
indicate a high probability of the eruption of a strong flare. Our method
distills the knowledge of solar flare eruptions learned by the deep learning
model into a more interpretable approximation, which offers physical
insight into the processes driving solar flares.
[2]
oai:arXiv.org:1912.06120 [pdf] - 2055372
Solar Flare Intensity Prediction with Machine Learning Models
Submitted: 2019-12-12, last modified: 2020-02-26
We develop a mixed Long Short Term Memory (LSTM) regression model to predict
the maximum solar flare intensity within a 24-hour time window 0$\sim$24,
6$\sim$30, 12$\sim$36 and 24$\sim$48 hours ahead of time using 6, 12, 24 and 48
hours of data (predictors) for each Helioseismic and Magnetic Imager (HMI)
Active Region Patch (HARP). The model makes use of (1) the Space-weather HMI
Active Region Patch (SHARP) parameters as predictors and (2) the exact flare
intensities instead of class labels recorded in the Geostationary Operational
Environmental Satellites (GOES) data set, which serves as the source of the
response variables. Compared to solar flare classification, the model offers us
more detailed information about the exact maximum flux level, i.e. intensity,
for each occurrence of a flare. We also consider classification models built on
top of the regression model and obtain better results in solar flare
classifications. Our results suggest that the most efficient time period for
predicting solar activity is within 24 hours before the prediction time
using the SHARP parameters and the LSTM model.
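
The response variable described in this abstract can be illustrated with a short sketch. It assumes flares are given as hypothetical (time in hours, GOES peak flux in W/m^2) pairs and that the regression target is the base-10 logarithm of the maximum flux; both the data layout and the log transform are assumptions made for illustration, not details stated in the abstract. The class boundaries are the standard GOES soft X-ray definitions.

```python
import math

# Standard GOES soft X-ray class lower bounds, in W/m^2.
GOES_THRESHOLDS = [("X", 1e-4), ("M", 1e-5), ("C", 1e-6), ("B", 1e-7)]


def goes_class(peak_flux):
    """Map a GOES peak flux (W/m^2) to its class letter."""
    for letter, lower in GOES_THRESHOLDS:
        if peak_flux >= lower:
            return letter
    return "A"


def max_log_intensity(flares, t_pred, lead_hours, window_hours=24.0):
    """Regression target: log10 of the maximum peak flux in the window
    [t_pred + lead, t_pred + lead + window).

    `flares` is a hypothetical list of (time_hours, peak_flux) pairs.
    Returns None when no flare falls inside the window.
    """
    start = t_pred + lead_hours
    end = start + window_hours
    fluxes = [flux for t, flux in flares if start <= t < end]
    if not fluxes:
        return None
    return math.log10(max(fluxes))


def classify_from_regression(pred_log_flux, threshold_class="M"):
    """Turn a predicted log10 flux into a binary label (>= threshold class)."""
    lower = dict(GOES_THRESHOLDS)[threshold_class]
    return pred_log_flux >= math.log10(lower)


# Hypothetical flare list for one active region.
flares = [(5.0, 2e-6), (30.0, 3e-5), (50.0, 1.5e-4)]
# Lead times of 0, 6, 12 and 24 hours give the 0~24, 6~30, 12~36 and
# 24~48 hour windows mentioned in the abstract.
targets = {lead: max_log_intensity(flares, 0.0, lead) for lead in (0, 6, 12, 24)}
```

Thresholding the regression output, as in `classify_from_regression`, is one simple way a classifier could be built on top of a regression model; the abstract does not specify how the authors construct theirs.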
[3]
oai:arXiv.org:1912.00502 [pdf] - 2101325
Predicting solar flares with machine learning: investigating solar cycle
dependence
Wang, Xiantong;
Chen, Yang;
Toth, Gabor;
Manchester, Ward B.;
Gombosi, Tamas I.;
Hero, Alfred O.;
Jiao, Zhenbang;
Sun, Hu;
Jin, Meng;
Liu, Yang
Submitted: 2019-12-01, last modified: 2020-01-22
A deep learning network, the Long Short-Term Memory (LSTM) network, is used in
this work to predict whether the maximum flare class an active region (AR) will
produce in the next 24 hours is class $\Gamma$. We consider $\Gamma$ to be $\ge
M$, $\ge C$, and any flare class. The motivation for using an LSTM, which is a
recurrent neural network, is its capability to capture temporal information in
the data samples. The input features are time sequences of 20 magnetic
parameters from SHARPs (Space-weather HMI Active Region Patches). We analyzed
active regions from June 2010 to Dec 2018 and labeled the data samples using
the ARs identified in the Geostationary Operational Environmental Satellite
(GOES) X-ray flare catalogs. Our results show that (i) the skill scores are
consistent with recently published results using LSTMs and better than those
of previous work using single-time input (e.g. DeFN), and (ii) the skill scores
from the model show essential differences when different years of data are
chosen for training and testing.
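
The labeling scheme and the year-based train/test comparison described in this abstract can be sketched as follows. The sample layout (a dict with hypothetical `year` and `max_class` fields) and the function names are assumptions made for illustration; the abstract does not describe the authors' data structures.

```python
# Ordering of flare classes; "none" marks a sample with no flare in the window.
CLASS_ORDER = {"none": 0, "A": 1, "B": 2, "C": 3, "M": 4, "X": 5}


def label_sample(max_class_next_24h, gamma):
    """Return 1 if the maximum flare class in the next 24 hours meets the
    target class gamma, else 0.

    gamma is "M" (>= M), "C" (>= C), or "any" (any flare at all).
    """
    observed = CLASS_ORDER[max_class_next_24h]
    if gamma == "any":
        return int(observed > 0)
    return int(observed >= CLASS_ORDER[gamma])


def split_by_year(samples, train_years, test_years):
    """Split samples into train and test sets by calendar year, so that
    different phases of the solar cycle can be compared."""
    train = [s for s in samples if s["year"] in train_years]
    test = [s for s in samples if s["year"] in test_years]
    return train, test


# Hypothetical samples: one record per active-region time sequence.
samples = [
    {"year": 2011, "max_class": "M"},
    {"year": 2012, "max_class": "C"},
    {"year": 2015, "max_class": "none"},
    {"year": 2016, "max_class": "X"},
]
train, test = split_by_year(samples, {2011, 2012}, {2015, 2016})
labels_m = [label_sample(s["max_class"], "M") for s in samples]
```

Repeating the split with different year sets and comparing the resulting scores is one way to probe the solar cycle dependence that the abstract reports.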