Normalized to: Delouille, V.
[1]
oai:arXiv.org:2001.02808 [pdf] - 2054142
A Comparison of Flare Forecasting Methods. IV. Evaluating
Consecutive-Day Forecasting Patterns
Park, Sung-Hong;
Leka, K. D.;
Kusano, Kanya;
Andries, Jesse;
Barnes, Graham;
Bingham, Suzy;
Bloomfield, D. Shaun;
McCloskey, Aoife E.;
Delouille, Veronique;
Falconer, David;
Gallagher, Peter T.;
Georgoulis, Manolis K.;
Kubo, Yuki;
Lee, Kangjin;
Lee, Sangwoo;
Lobzin, Vasily;
Mun, JunChul;
Murray, Sophie A.;
Nageem, Tarek A. M. Hamad;
Qahwaji, Rami;
Sharpe, Michael;
Steenburgh, Rob A.;
Steward, Graham;
Terkildsen, Michael
Submitted: 2020-01-08, last modified: 2020-01-21
A crucial challenge to successful flare prediction is forecasting periods
that transition between "flare-quiet" and "flare-active". Building on earlier
studies in this series (Barnes et al. 2016; Leka et al. 2019a,b) in which we
describe methodology, details, and results of flare forecasting comparison
efforts, we focus here on patterns of forecast outcomes (success and failure)
over multi-day periods. A novel analysis is developed to evaluate forecasting
success in the context of catching the first event of flare-active periods, and
conversely, of correctly predicting declining flare activity. We demonstrate
these evaluation methods graphically and quantitatively as they provide both
quick comparative evaluations and options for detailed analysis. For the
testing interval 2016-2017, we determine the relative frequency distribution of
two-day dichotomous forecast outcomes for three different event histories
(i.e., event/event, no-event/event and event/no-event), and use it to highlight
performance differences between forecasting methods. A trend is identified
across all forecasting methods that a high/low forecast probability on day-1
remains high/low on day-2 even though flaring activity is transitioning. For
M-class and larger flares, we find that explicitly including persistence or
prior flare history in computing forecasts helps to improve overall forecast
performance. It is also found that using magnetic/modern data leads to
improvement in catching the first-event/first-no-event transitions. Finally,
15% of major (i.e., M-class or above) flare days over the testing interval were
effectively missed due to a lack of observations from instruments away from the
Earth-Sun line.
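As an illustration of the two-day tally described in this abstract, the minimal Python sketch below thresholds daily probabilistic forecasts into yes/no forecasts and computes the relative frequency of each two-day forecast pattern for the three event histories. The function name, the 0.5 threshold, and the toy data are assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter, defaultdict

# Event histories named in the abstract: (day-1 observed, day-2 observed).
HISTORIES = {
    (True, True): "event/event",
    (False, True): "no-event/event",
    (True, False): "event/no-event",
}

def two_day_outcome_frequencies(probs, events, threshold=0.5):
    """Relative frequency of two-day yes/no forecast patterns per event history.

    probs     : daily forecast probabilities for consecutive days
    events    : daily observed outcomes (True if an event occurred)
    threshold : hypothetical probability threshold for a "yes" forecast
    """
    if len(probs) != len(events):
        raise ValueError("probs and events must have the same length")
    counts = defaultdict(Counter)
    for i in range(len(probs) - 1):
        history = HISTORIES.get((bool(events[i]), bool(events[i + 1])))
        if history is None:  # no-event/no-event pairs are not tallied here
            continue
        pattern = (probs[i] >= threshold, probs[i + 1] >= threshold)
        counts[history][pattern] += 1
    freqs = {}
    for history, counter in counts.items():
        total = sum(counter.values())
        freqs[history] = {p: n / total for p, n in counter.items()}
    return freqs

# Toy data, invented for illustration only.
probs = [0.1, 0.2, 0.7, 0.8, 0.6, 0.1]
events = [False, True, True, False, False, False]
for history, dist in two_day_outcome_frequencies(probs, events).items():
    print(history, dist)
```

In this toy series the forecast stays "no" through the no-event/event transition and stays "yes" through the event/no-event transition, which is the kind of lagging pattern the abstract reports.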
[2]
oai:arXiv.org:1907.02909 [pdf] - 1953630
A Comparison of Flare Forecasting Methods. III. Systematic Behaviors of
Operational Solar Flare Forecasting Systems
Leka, K. D.;
Park, Sung-Hong;
Kusano, Kanya;
Andries, Jesse;
Barnes, Graham;
Bingham, Suzy;
Bloomfield, D. Shaun;
McCloskey, Aoife E.;
Delouille, Veronique;
Falconer, David;
Gallagher, Peter T.;
Georgoulis, Manolis K.;
Kubo, Yuki;
Lee, Kangjin;
Lee, Sangwoo;
Lobzin, Vasily;
Mun, JunChul;
Murray, Sophie A.;
Nageem, Tarek A. M. Hamad;
Qahwaji, Rami;
Sharpe, Michael;
Steenburgh, Rob;
Steward, Graham;
Terkildsen, Michael
Submitted: 2019-07-05
A workshop was recently held at Nagoya University (31 October - 02 November
2017), sponsored by the Center for International Collaborative Research, at the
Institute for Space-Earth Environmental Research, Nagoya University, Japan, to
quantitatively compare the performance of today's operational solar flare
forecasting facilities. Building upon Paper I of this series (Barnes et al.
2016), in Paper II (Leka et al. 2019) we described the participating methods
for this latest comparison effort, the evaluation methodology, and presented
quantitative comparisons. In this paper we focus on the behavior and
performance of the methods when evaluated in the context of broad
implementation differences. Acknowledging the short testing interval available
and the small number of methods available, we do find that forecast
performance: 1) appears to improve by including persistence or prior flare
activity, region evolution, and a human "forecaster in the loop"; 2) is hurt by
restricting data to disk-center observations; 3) may benefit from long-term
statistics, but mostly when combined with modern data sources and
statistical approaches. These trends are arguably weak and must be viewed with
numerous caveats, as discussed both here and in Paper II. Following this
present work, we present in Paper IV a novel analysis method to evaluate
temporal patterns of forecasting errors of both types (i.e., misses and false
alarms; Park et al. 2019). Hence, most importantly, with this series of papers
we demonstrate the techniques for facilitating comparisons in the interest of
establishing performance-positive methodologies.
[3]
oai:arXiv.org:1907.02905 [pdf] - 1953629
A Comparison of Flare Forecasting Methods. II. Benchmarks, Metrics and
Performance Results for Operational Solar Flare Forecasting Systems
Leka, K. D.;
Park, Sung-Hong;
Kusano, Kanya;
Andries, Jesse;
Barnes, Graham;
Bingham, Suzy;
Bloomfield, D. Shaun;
McCloskey, Aoife E.;
Delouille, Veronique;
Falconer, David;
Gallagher, Peter T.;
Georgoulis, Manolis K.;
Kubo, Yuki;
Lee, Kangjin;
Lee, Sangwoo;
Lobzin, Vasily;
Mun, JunChul;
Murray, Sophie A.;
Nageem, Tarek A. M. Hamad;
Qahwaji, Rami;
Sharpe, Michael;
Steenburgh, Rob;
Steward, Graham;
Terkildsen, Michael
Submitted: 2019-07-05
Solar flares are extremely energetic phenomena in our Solar System. Their
impulsive, often drastic radiative increases, in particular at short
wavelengths, bring immediate impacts that motivate solar physics and space
weather research to understand solar flares to the point of being able to
forecast them. As data and algorithms improve dramatically, questions must be
asked concerning how well the forecasting performs; crucially, we must ask how
to rigorously measure performance in order to critically gauge any
improvements. Building upon earlier-developed methodology (Barnes et al. 2016,
Paper I), international representatives of regional warning centers and
research facilities assembled in 2017 at the Institute for Space-Earth
Environmental Research, Nagoya University, Japan to - for the first time -
directly compare the performance of operational solar flare forecasting
methods. Multiple quantitative evaluation metrics are employed, with focus and
discussion on evaluation methodologies given the restrictions of operational
forecasting. Numerous methods performed consistently above the "no skill"
level, although which method scored top marks is decisively a function of flare
event definition and the metric used; there was no single winner. Following in
this paper series we ask why the performances differ by examining
implementation details (Leka et al. 2019, Paper III), and then we present a
novel analysis method to evaluate temporal patterns of forecasting errors
(Park et al. 2019, Paper IV). With these works, this team presents a
well-defined and robust methodology for evaluating solar flare forecasting
methods in both research and operational frameworks, and today's performance
benchmarks against which improvements and new methods may be compared.
[4]
oai:arXiv.org:1503.04127 [pdf] - 1371971
Image patch analysis of sunspots and active regions. I. Intrinsic
dimension and correlation analysis
Submitted: 2015-03-13, last modified: 2015-12-14
The flare-productivity of an active region is observed to be related to its
spatial complexity. Mount Wilson or McIntosh sunspot classifications measure
such complexity but in a categorical way, and may therefore not use all the
information present in the observations. Moreover, such categorical schemes
hinder, for example, a systematic study of an active region's evolution. We
propose fine-scale quantitative descriptors for an active region's complexity
and relate them to the Mount Wilson classification. We analyze the local
correlation structure within continuum and magnetogram data, as well as the
cross-correlation between continuum and magnetogram data. We compute the
intrinsic dimension, partial correlation, and canonical correlation analysis
(CCA) of image patches of continuum and magnetogram active region images taken
from the SOHO-MDI instrument. We use masks of sunspots derived from continuum
as well as larger masks of magnetic active regions derived from the magnetogram
to analyze separately the core part of an active region from its surrounding
part. We find a relationship between the complexity of an active region as
measured by Mount Wilson and the intrinsic dimension of its image patches.
Partial correlation patterns exhibit approximately a third-order Markov
structure. CCA reveals different patterns of correlation between continuum and
magnetogram within the sunspots and in the region surrounding the sunspots.
These results also pave the way for patch-based dictionary learning with a view
towards automatic clustering of active regions.
[5]
oai:arXiv.org:1504.02762 [pdf] - 1371975
Image patch analysis of sunspots and active regions. II. Clustering via
matrix factorization
Submitted: 2015-04-10, last modified: 2015-12-10
Separating active regions that are quiet from potentially eruptive ones is a
key issue in Space Weather applications. Traditional classification schemes
such as Mount Wilson and McIntosh have been effective in relating an active
region's large-scale magnetic configuration to its ability to produce eruptive
events. However, their qualitative nature prevents systematic studies of an
active region's evolution, for example. We introduce a new clustering of active
regions that is based on the local geometry observed in line-of-sight
magnetogram and continuum images. We use a reduced-dimension representation of
an active region that is obtained by factoring the corresponding data matrix
comprised of local image patches. Two factorizations can be compared via the
definition of appropriate metrics on the resulting factors. The distances
obtained from these metrics are then used to cluster the active regions. We
find that these metrics result in natural clusterings of active regions. The
clusterings are related to large-scale descriptors of an active region such as
its size, its local magnetic field distribution, and its complexity as measured
by the Mount Wilson classification scheme. We also find that including data
focused on the neutral line of an active region can result in an increased
correspondence between our clustering results and other active region
descriptors such as the Mount Wilson classifications and the $R$ value. We
provide some recommendations for which metrics, matrix factorization
techniques, and regions of interest to use to study active regions.
[6]
oai:arXiv.org:1412.6279 [pdf] - 1337942
Non-parametric PSF estimation from celestial transit solar images using
blind deconvolution
Submitted: 2014-12-19, last modified: 2015-09-29
Context: Characterization of instrumental effects in astronomical imaging is
important in order to extract accurate physical information from the
observations. The measured image in a real optical instrument is usually
represented by the convolution of an ideal image with a Point Spread Function
(PSF). Additionally, the image acquisition process is also contaminated by
other sources of noise (read-out, photon-counting). The problem of estimating
both the PSF and a denoised image is called blind deconvolution and is
ill-posed.
Aims: We propose a blind deconvolution scheme that relies on image
regularization. Contrary to most methods presented in the literature, our
method does not assume a parametric model of the PSF and can thus be applied to
any telescope.
Methods: Our scheme uses a wavelet analysis prior model on the image and weak
assumptions on the PSF. We use observations from a celestial transit, where the
occulting body can be assumed to be a black disk. These constraints allow us to
retain meaningful solutions for the filter and the image, eliminating trivial,
translated and interchanged solutions. Under an additive Gaussian noise
assumption, they also enforce noise canceling and avoid reconstruction
artifacts by promoting the whiteness of the residual between the blurred
observations and the cleaned data.
Results: Our method is applied to synthetic and experimental data. The PSF is
estimated for the SECCHI/EUVI instrument using the 2007 Lunar transit, and for
SDO/AIA using the 2012 Venus transit. Results show that the proposed
non-parametric blind deconvolution method is able to estimate the core of the
PSF with a similar quality to parametric methods proposed in the literature. We
also show that, if these parametric estimations are incorporated in the
acquisition model, the resulting PSF outperforms both the parametric and
non-parametric methods.
[7]
oai:arXiv.org:1504.07116 [pdf] - 1371977
Meta learning of bounds on the Bayes classifier error
Submitted: 2015-04-27, last modified: 2015-07-03
Meta learning uses information from base learners (e.g. classifiers or
estimators) as well as information about the learning problem to improve upon
the performance of a single base learner. For example, the Bayes error rate of
a given feature space, if known, can be used to aid in choosing a classifier,
as well as in feature selection and model selection for the base classifiers
and the meta classifier. Recent work in the field of f-divergence functional
estimation has led to the development of simple and rapidly converging
estimators that can be used to estimate various bounds on the Bayes error. We
estimate multiple bounds on the Bayes error using an estimator that applies
meta learning to slowly converging plug-in estimators to obtain the parametric
convergence rate. We compare the estimated bounds empirically on simulated data
and then estimate the tighter bounds on features extracted from an image patch
analysis of sunspot continuum and magnetogram images.
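To make the idea of divergence-based bounds on the Bayes error concrete, the sketch below computes the classical Bhattacharyya bounds for two known discrete class-conditional distributions and compares them with the exact Bayes error. This is a textbook illustration only, not the meta-learning estimator of the paper, which works from samples; the function names and toy distributions are assumptions.

```python
import math

def bayes_error(p, q, prior_p=0.5):
    """Exact Bayes error for two discrete class-conditional distributions."""
    prior_q = 1.0 - prior_p
    return sum(min(prior_p * pi, prior_q * qi) for pi, qi in zip(p, q))

def bhattacharyya_bounds(p, q, prior_p=0.5):
    """Lower and upper Bhattacharyya bounds on the Bayes error."""
    prior_q = 1.0 - prior_p
    # Bhattacharyya coefficient between the two class-conditional distributions.
    bc = sum(math.sqrt(pi * qi) for pi, qi in zip(p, q))
    inner = max(0.0, 1.0 - 4.0 * prior_p * prior_q * bc ** 2)
    lower = 0.5 - 0.5 * math.sqrt(inner)
    upper = math.sqrt(prior_p * prior_q) * bc
    return lower, upper

# Toy distributions over three feature values, invented for illustration.
p = [0.7, 0.2, 0.1]
q = [0.1, 0.3, 0.6]
lower, upper = bhattacharyya_bounds(p, q)
print(lower, bayes_error(p, q), upper)
```

With known distributions the exact error is available directly; bounds of this kind become useful when only samples are available and the divergence must itself be estimated.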
[8]
oai:arXiv.org:1506.06623 [pdf] - 1225013
Improvements on coronal hole detection in SDO/AIA images using
supervised classification
Submitted: 2015-06-22
We demonstrate the use of machine learning algorithms in combination with
segmentation techniques in order to distinguish coronal holes and filaments in
SDO/AIA EUV images of the Sun. Based on two coronal hole detection techniques
(intensity-based thresholding, SPoCA), we prepared data sets of manually
labeled coronal hole and filament channel regions present on the Sun during the
time range 2011 - 2013. By mapping the extracted regions from EUV observations
onto HMI line-of-sight magnetograms we also include their magnetic
characteristics. We computed shape measures from the segmented binary maps as
well as first order and second order texture statistics from the segmented
regions in the EUV images and magnetograms. These attributes were used for data
mining investigations to identify the most performant rule to differentiate
between coronal holes and filament channels. We applied several classifiers,
namely Support Vector Machine, Linear Support Vector Machine, Decision Tree,
and Random Forest and found that all classification rules achieve good results
in general, with linear SVM providing the best performances (with a true skill
statistic of ~0.90). Additional information from magnetic field data
systematically improves the performance across all four classifiers for the
SPoCA detection. Since the calculation is inexpensive in computing time, this
approach is well suited for applications on real-time data. This study
demonstrates how a machine learning approach may help improve upon an
unsupervised feature extraction method.
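For readers unfamiliar with the true skill statistic quoted above, the following minimal sketch computes it from the four entries of a binary contingency table. The function name and the example counts are assumptions for illustration and are not results from the paper.

```python
def true_skill_statistic(tp, fn, fp, tn):
    """TSS = hit rate minus false alarm rate, from a 2x2 contingency table.

    tp, fn : positives classified correctly / incorrectly
    fp, tn : negatives classified incorrectly / correctly
    """
    if tp + fn == 0 or fp + tn == 0:
        raise ValueError("both classes must be present")
    return tp / (tp + fn) - fp / (fp + tn)

# Hypothetical counts: 50 coronal holes and 50 filament channels.
print(true_skill_statistic(tp=45, fn=5, fp=2, tn=48))
```

The TSS ranges from -1 to 1, with 0 corresponding to no skill, and it does not depend on the ratio of the two class sizes, which makes it convenient for unbalanced samples.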
[9]
oai:arXiv.org:1406.6390 [pdf] - 953591
Image patch analysis and clustering of sunspots: a dimensionality
reduction approach
Submitted: 2014-06-24
Sunspots, as seen in white light or continuum images, are associated with
regions of high magnetic activity on the Sun, visible on magnetogram images.
Their complexity is correlated with explosive solar activity and so classifying
these active regions is useful for predicting future solar activity. Current
classification of sunspot groups is visually based and suffers from bias.
Supervised learning methods can reduce human bias but fail to optimally
capitalize on the information present in sunspot images. This paper uses two
image modalities (continuum and magnetogram) to characterize the spatial and
modal interactions of sunspot and magnetic active region images and presents a
new approach to cluster the images. Specifically, in the framework of image
patch analysis, we estimate the number of intrinsic parameters required to
describe the spatial and modal dependencies, the correlation between the two
modalities and the corresponding spatial patterns, and examine the phenomena at
different scales within the images. To do this, we use linear and nonlinear
intrinsic dimension estimators, canonical correlation analysis, and
multiresolution analysis of intrinsic dimension.
[10]
oai:arXiv.org:1208.1483 [pdf] - 546502
The SPOCA-suite: a software for extraction and tracking of Active
Regions and Coronal Holes on EUV images
Submitted: 2012-08-07
Precise localisation and characterization of active regions and coronal holes
as observed by EUV imagers are crucial for a wide range of solar and
helio-physics studies. We describe a segmentation procedure, the SPOCA-suite,
that produces catalogs of Active Regions (AR) and Coronal Holes (CH) on SDO-AIA
images. The method builds upon our previous work on 'Spatial Possibilistic
Clustering Algorithm' (SPOCA) and substantially improves it in several ways. The
SPOCA-suite is applied in near real time on the AIA archive and produces entries
into the AR and CH catalogs of the Heliophysics Event Knowledgebase (HEK) every
four hours. We give an illustration of the use of SPOCA for determination of
the CH filling factors. This report is intended as a reference guide for the
users of SPoCA output.
[11]
oai:arXiv.org:1109.0473 [pdf] - 405935
A Multi-Wavelength Analysis of Active Regions and Sunspots by Comparison
of Automated Detection Algorithms
Submitted: 2011-09-02
Since the Solar Dynamics Observatory (SDO) began recording ~ 1 TB of data per
day, there has been an increased need to automatically extract features and
events for further analysis. Here we compare the overall detection performance,
correlations between extracted properties, and usability for feature tracking
of four solar feature-detection algorithms: the Solar Monitor Active Region
Tracker (SMART) detects active regions in line-of-sight magnetograms; the
Automated Solar Activity Prediction code (ASAP) detects sunspots and pores in
white-light continuum images; the Sunspot Tracking And Recognition Algorithm
(STARA) detects sunspots in white-light continuum images; the Spatial
Possibilistic Clustering Algorithm (SPoCA) automatically segments solar EUV
images into active regions (AR), coronal holes (CH) and quiet Sun (QS). One
month of data from the SOHO/MDI and SOHO/EIT instruments during 12 May - 23
June 2003 is analysed. The overall detection performance of each algorithm is
benchmarked against National Oceanic and Atmospheric Administration (NOAA) and
Solar Influences Data Analysis Centre (SIDC) catalogues using various feature
properties such as total sunspot area, which shows good agreement, and the
number of features detected, which shows poor agreement. Principal Component
Analysis indicates a clear distinction between photospheric properties, which
are highly correlated to the first component and account for 52.86% of
variability in the data set, and coronal properties, which are moderately
correlated to both the first and second principal components. Finally, case
studies of NOAA 10377 and 10365 are conducted to determine algorithm stability
for tracking the evolution of individual features. We find that magnetic flux
and total sunspot area are the best indicators of active-region emergence.
[12]
oai:arXiv.org:0808.3068 [pdf] - 1000879
Quantifying and containing the curse of high resolution coronal imaging
Submitted: 2008-08-22
Future missions such as Solar Orbiter (SO), InterHelioprobe, or Solar Probe
aim to approach the Sun more closely than ever before, carrying high-resolution
imagers (HRI) with a subsecond cadence and a pixel area of about
$(80\,\mathrm{km})^2$ at the Sun during perihelion. In order to guarantee their
scientific success, it is necessary to evaluate whether the photon counts available
at this resolution and cadence will provide a sufficient signal-to-noise ratio (SNR).
We perform a first step in this direction by analyzing and characterizing the
spatial intermittency of Quiet Sun images thanks to a multifractal analysis.
We identify the parameters that specify the scale-invariance behavior. This
identification then allows us to select a family of multifractal processes, namely
the Compound Poisson Cascades, that can synthesize artificial images having
some of the scale-invariance properties observed on the recorded images.
The prevalence of self-similarity in Quiet Sun coronal images makes it
relevant to study the ratio between the SNR in SoHO/EIT images and that in
coarsened images. SoHO/EIT images thus play the role of 'high resolution'
images, whereas the 'low-resolution' coarsened images are rebinned so as to
simulate a smaller angular resolution and/or a larger distance to the Sun. For
a fixed difference in angular resolution and in Spacecraft-Sun distance, we
determine the proportion of pixels having a SNR preserved at high resolution
given a particular increase in effective area. If scale-invariance continues to
prevail at smaller scales, the conclusion reached with SoHO/EIT images can be
transposed to the situation where the resolution is increased from SoHO/EIT to
SO/HRI resolution at perihelion.
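The comparison between fine and coarsened images can be sketched as follows, under the simplifying assumption of pure photon (Poisson) noise, for which the SNR of a pixel equals the square root of its counts. The function names, the `area_gain` parameter, and the toy image are hypothetical; the paper's own analysis rests on a multifractal model rather than on this direct count.

```python
import math

def rebin_sum(image, factor):
    """Coarsen an image by summing counts over factor x factor blocks."""
    rows, cols = len(image), len(image[0])
    if rows % factor or cols % factor:
        raise ValueError("image dimensions must be divisible by factor")
    return [
        [
            sum(image[r + dr][c + dc]
                for dr in range(factor) for dc in range(factor))
            for c in range(0, cols, factor)
        ]
        for r in range(0, rows, factor)
    ]

def fraction_preserving_snr(image, factor, area_gain):
    """Fraction of fine pixels whose Poisson SNR, after multiplying counts by
    area_gain, reaches the SNR of the coarse pixel that contains them."""
    coarse = rebin_sum(image, factor)
    kept = 0
    total = 0
    for r, row in enumerate(image):
        for c, counts in enumerate(row):
            fine_snr = math.sqrt(area_gain * counts)
            coarse_snr = math.sqrt(coarse[r // factor][c // factor])
            kept += fine_snr >= coarse_snr
            total += 1
    return kept / total

# Toy 2x2 image of photon counts, invented for illustration.
image = [[1, 3], [4, 8]]
print(fraction_preserving_snr(image, factor=2, area_gain=4))
```

For a perfectly uniform image, a gain equal to `factor ** 2` preserves the SNR in every pixel. In an intermittent image, bright pixels need less gain and faint pixels need more, so the preserved fraction depends on the spatial distribution of the counts.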