Normalized to: Bingham, S.
[1]
oai:arXiv.org:2001.02808 [pdf] - 2054142
A Comparison of Flare Forecasting Methods. IV. Evaluating
Consecutive-Day Forecasting Patterns
Park, Sung-Hong;
Leka, K. D.;
Kusano, Kanya;
Andries, Jesse;
Barnes, Graham;
Bingham, Suzy;
Bloomfield, D. Shaun;
McCloskey, Aoife E.;
Delouille, Veronique;
Falconer, David;
Gallagher, Peter T.;
Georgoulis, Manolis K.;
Kubo, Yuki;
Lee, Kangjin;
Lee, Sangwoo;
Lobzin, Vasily;
Mun, JunChul;
Murray, Sophie A.;
Nageem, Tarek A. M. Hamad;
Qahwaji, Rami;
Sharpe, Michael;
Steenburgh, Rob A.;
Steward, Graham;
Terkildsen, Michael
Submitted: 2020-01-08, last modified: 2020-01-21
A crucial challenge to successful flare prediction is forecasting periods
that transition between "flare-quiet" and "flare-active". Building on earlier
studies in this series (Barnes et al. 2016; Leka et al. 2019a,b) in which we
describe methodology, details, and results of flare forecasting comparison
efforts, we focus here on patterns of forecast outcomes (success and failure)
over multi-day periods. A novel analysis is developed to evaluate forecasting
success in the context of catching the first event of flare-active periods, and
conversely, of correctly predicting declining flare activity. We demonstrate
these evaluation methods graphically and quantitatively as they provide both
quick comparative evaluations and options for detailed analysis. For the
testing interval 2016-2017, we determine the relative frequency distribution of
two-day dichotomous forecast outcomes for three different event histories
(i.e., event/event, no-event/event and event/no-event), and use it to highlight
performance differences between forecasting methods. A trend is identified
across all forecasting methods that a high/low forecast probability on day-1
remains high/low on day-2 even though flaring activity is transitioning. For
M-class and larger flares, we find that explicitly including persistence or
prior flare history in computing forecasts helps to improve overall forecast
performance. It is also found that using magnetic/modern data leads to
improvement in catching the first-event/first-no-event transitions. Finally,
15% of major (i.e., M-class or above) flare days over the testing interval were
effectively missed due to a lack of observations from instruments away from the
Earth-Sun line.
[2]
oai:arXiv.org:1908.05938 [pdf] - 1944327
Summary of the plenary sessions at European Space Weather Week 15: space
weather users and service providers working together now and in the future
Submitted: 2019-08-16
During European Space Weather Week 15 two plenary sessions were held to
review the status of operational space weather forecasting. The first session
addressed the topic of working with space weather service providers now and in
the future, the user perspective. The second session provided the service
perspective, addressing experiences in forecasting development and operations.
Presentations in both sessions provided an overview of international efforts on
these topics, and panel discussion topics arising in the first session were
used as a basis for panel discussion in the second session. Discussion topics
included experiences during the September 2017 space weather events, cross
domain impacts, timeliness of notifications, and provision of effective user
education. Users highlighted that a 'severe' space weather event did not
necessarily lead to severe impacts for each individual user across the
different sectors. Service providers were generally confident that timely and
reliable information could be provided during severe and extreme events,
although stressed that more research and funding were required in this
relatively new field of operational space weather forecasting, to ensure
continuation of capabilities and further development of services, in particular
improved forecasting targeting user needs. Here a summary of the sessions is
provided followed by a commentary on the current state-of-the-art and potential
next steps towards improvement of services.
[3]
oai:arXiv.org:1907.08663 [pdf] - 1920179
Application Usability Levels: A Framework for Tracking Project Product
Progress
Halford, Alexa J.;
Kellerman, Adam C.;
Garcia-Sage, Katherine;
Klenzing, Jeffrey;
Carter, Brett A.;
McGranaghan, Ryan M.;
Guild, Timothy;
Cid, Consuelo;
Henney, Carl J.;
Ganushkina, Natalia Y.;
Burrell, Angeline G.;
Terkildsen, Mike;
Welling, Daniel T.;
Murray, Sophie A.;
Leka, K. D.;
McCollough, James P.;
Thompson, Barbara J.;
Pulkkinen, Antti;
Fung, Shing F.;
Bingham, Suzy;
Bisi, Mario M.;
Liemohn, Michael W.;
Walsh, Brian M.;
Morley, Steven K.
Submitted: 2019-07-19
The space physics community continues to grow and become both more
interdisciplinary and more intertwined with commercial and government
operations. This has created a need for a framework to easily identify what
projects can be used for specific applications and how close the tool is to
routine autonomous or on-demand implementation and operation. We propose the
Application Usability Level (AUL) framework and publicizing AULs to help the
community quantify the progress of successful applications, metrics, and
validation efforts. This framework will also aid the scientific community by
supplying the type of information needed to build off of previously published
work and publicizing the applications and requirements needed by the user
communities. In this paper, we define the AUL framework, outline the milestones
required for progression to higher AULs, and provide example projects utilizing
the AUL framework. This work has been completed as part of the activities of
the Assessment of Understanding and Quantifying Progress working group which is
part of the International Forum for Space Weather Capabilities Assessment.
[4]
oai:arXiv.org:1907.02909 [pdf] - 1953630
A Comparison of Flare Forecasting Methods. III. Systematic Behaviors of
Operational Solar Flare Forecasting Systems
Leka, K. D.;
Park, Sung-Hong;
Kusano, Kanya;
Andries, Jesse;
Barnes, Graham;
Bingham, Suzy;
Bloomfield, D. Shaun;
McCloskey, Aoife E.;
Delouille, Veronique;
Falconer, David;
Gallagher, Peter T.;
Georgoulis, Manolis K.;
Kubo, Yuki;
Lee, Kangjin;
Lee, Sangwoo;
Lobzin, Vasily;
Mun, JunChul;
Murray, Sophie A.;
Nageem, Tarek A. M. Hamad;
Qahwaji, Rami;
Sharpe, Michael;
Steenburgh, Rob;
Steward, Graham;
Terkildsen, Michael
Submitted: 2019-07-05
A workshop was recently held at Nagoya University (31 October - 02 November
2017), sponsored by the Center for International Collaborative Research, at the
Institute for Space-Earth Environmental Research, Nagoya University, Japan, to
quantitatively compare the performance of today's operational solar flare
forecasting facilities. Building upon Paper I of this series (Barnes et al.
2016), in Paper II (Leka et al. 2019) we described the participating methods
for this latest comparison effort, the evaluation methodology, and presented
quantitative comparisons. In this paper we focus on the behavior and
performance of the methods when evaluated in the context of broad
implementation differences. Acknowledging the short testing interval available
and the small number of methods available, we do find that forecast
performance: 1) appears to improve by including persistence or prior flare
activity, region evolution, and a human "forecaster in the loop"; 2) is hurt by
restricting data to disk-center observations; 3) may benefit from long-term
statistics, but mostly when then combined with modern data sources and
statistical approaches. These trends are arguably weak and must be viewed with
numerous caveats, as discussed both here and in Paper II. Following this
present work, we present in Paper IV a novel analysis method to evaluate
temporal patterns of forecasting errors of both types (i.e., misses and false
alarms; Park et al. 2019). Hence, most importantly, with this series of papers
we demonstrate the techniques for facilitating comparisons in the interest of
establishing performance-positive methodologies.
[5]
oai:arXiv.org:1907.02905 [pdf] - 1953629
A Comparison of Flare Forecasting Methods. II. Benchmarks, Metrics and
Performance Results for Operational Solar Flare Forecasting Systems
Leka, K. D.;
Park, Sung-Hong;
Kusano, Kanya;
Andries, Jesse;
Barnes, Graham;
Bingham, Suzy;
Bloomfield, D. Shaun;
McCloskey, Aoife E.;
Delouille, Veronique;
Falconer, David;
Gallagher, Peter T.;
Georgoulis, Manolis K.;
Kubo, Yuki;
Lee, Kangjin;
Lee, Sangwoo;
Lobzin, Vasily;
Mun, JunChul;
Murray, Sophie A.;
Nageem, Tarek A. M. Hamad;
Qahwaji, Rami;
Sharpe, Michael;
Steenburgh, Rob;
Steward, Graham;
Terkildsen, Michael
Submitted: 2019-07-05
Solar flares are extremely energetic phenomena in our Solar System. Their
impulsive, often drastic radiative increases, in particular at short
wavelengths, bring immediate impacts that motivate solar physics and space
weather research to understand solar flares to the point of being able to
forecast them. As data and algorithms improve dramatically, questions must be
asked concerning how well the forecasting performs; crucially, we must ask how
to rigorously measure performance in order to critically gauge any
improvements. Building upon earlier-developed methodology (Barnes et al, 2016,
Paper I), international representatives of regional warning centers and
research facilities assembled in 2017 at the Institute for Space-Earth
Environmental Research, Nagoya University, Japan to - for the first time -
directly compare the performance of operational solar flare forecasting
methods. Multiple quantitative evaluation metrics are employed, with focus and
discussion on evaluation methodologies given the restrictions of operational
forecasting. Numerous methods performed consistently above the "no skill"
level, although which method scored top marks is decisively a function of flare
event definition and the metric used; there was no single winner. Following in
this paper series we ask why the performances differ by examining
implementation details (Leka et al. 2019, Paper III), and then we present a
novel analysis method to evaluate temporal patterns of forecasting errors in
(Park et al. 2019, Paper IV). With these works, this team presents a
well-defined and robust methodology for evaluating solar flare forecasting
methods in both research and operational frameworks, and today's performance
benchmarks against which improvements and new methods may be compared.
[6]
oai:arXiv.org:1811.10695 [pdf] - 1855665
Benchmarking CME Arrival Time and Impact: Progress on Metadata, Metrics,
and Events
Verbeke, C.;
Mays, M. L.;
Temmer, M.;
Bingham, S.;
Steenburgh, R.;
Dumbović, M.;
Núñez, M.;
Jian, L. K.;
Hess, P.;
Wiegand, C.;
Taktakishvili, A.;
Andries, J.
Submitted: 2018-11-26
Accurate forecasting of the arrival time and subsequent geomagnetic impacts
of Coronal Mass Ejections (CMEs) at Earth is an important objective for space
weather forecasting agencies. Recently, the CME Arrival and Impact working team
has made significant progress towards defining community-agreed metrics and
validation methods to assess the current state of CME modeling capabilities.
This will allow the community to quantify our current capabilities and track
progress in models over time. Firstly, it is crucial that the community focuses
on the collection of the necessary metadata for transparency and
reproducibility of results. Concerning CME arrival and impact we have
identified 6 different metadata types: 3D CME measurement, model description,
model input, CME (non-)arrival observation, model output data and metrics and
validation methods. Secondly, the working team has also identified a validation
time period, where all events within the following two periods will be
considered: 1 January 2011-31 December 2012 and January 2015-31 December 2015.
Those two periods amount to a total of about 100 hit events at Earth and a
large amount of misses. Considering a time period will remove any bias in
selecting events and the event set will represent a sample set that will not be
biased by user selection. Lastly, we have defined the basic metrics and skill
scores that the CME Arrival and Impact working team will focus on.
[7]
oai:arXiv.org:1703.06754 [pdf] - 1789380
Flare forecasting at the Met Office Space Weather Operations Centre
Submitted: 2017-03-20, last modified: 2017-09-25
The Met Office Space Weather Operations Centre produces 24/7/365 space
weather guidance, alerts, and forecasts to a wide range of government and
commercial end users across the United Kingdom. Solar flare forecasts are one
of its products, which are issued multiple times a day in two forms; forecasts
for each active region on the solar disk over the next 24 hours, and full-disk
forecasts for the next four days. Here the forecasting process is described in
detail, as well as first verification of archived forecasts using methods
commonly used in operational weather prediction. Real-time verification
available for operational flare forecasting use is also described. The
influence of human forecasters is highlighted, with human-edited forecasts
outperforming original model results, and forecasting skill decreasing over
longer forecast lead times.