Normalized to: Ganushkina, N.
[1]
oai:arXiv.org:2005.03542 [pdf] - 2091276
The STONE curve: A ROC-derived model performance assessment tool
Submitted: 2020-04-22
A new model validation and performance assessment tool is introduced, the
sliding threshold of observation for numeric evaluation (STONE) curve. It is
based on the relative operating characteristic (ROC) curve technique, but
instead of sorting all observations in a categorical classification, the STONE
tool uses the continuous nature of the observations. Rather than defining
events in the observations and then sliding the threshold only in the
classifier (model) data set, the threshold is changed simultaneously for both
the observational and model values, with the same threshold value for both data
and model. This is only possible if the observations are continuous and the
model output is in the same units and scale as the observations, that is, the
model is trying to exactly reproduce the data. The STONE curve has several
similarities with the ROC curve, plotting probability of detection against
probability of false detection, ranging from the (1,1) corner for low
thresholds to the (0,0) corner for high thresholds, and values above the
zero-intercept unity-slope line indicating better than random predictive
ability. The main difference is that the STONE curve can be nonmonotonic,
doubling back in both the x and y directions. These ripples reveal asymmetries
in the data-model value pairs. This new technique is applied to modeling output
of a common geomagnetic activity index as well as energetic electron fluxes in
the Earth's inner magnetosphere. It is not limited to space physics
applications but can be used for any scientific or engineering field where
numerical models are used to reproduce observations.
[2]
oai:arXiv.org:1907.08663 [pdf] - 1920179
Application Usability Levels: A Framework for Tracking Project Product
Progress
Halford, Alexa J.;
Kellerman, Adam C.;
Garcia-Sage, Katherine;
Klenzing, Jeffrey;
Carter, Brett A.;
McGranaghan, Ryan M.;
Guild, Timothy;
Cid, Consuelo;
Henney, Carl J.;
Ganushkina, Natalia Y.;
Burrell, Angeline G.;
Terkildsen, Mike;
Welling, Daniel T.;
Murray, Sophie A.;
Leka, K. D.;
McCollough, James P.;
Thompson, Barbara J.;
Pulkkinen, Antti;
Fung, Shing F.;
Bingham, Suzy;
Bisi, Mario M.;
Liemohn, Michael W.;
Walsh, Brian M.;
Morley, Steven K.
Submitted: 2019-07-19
The space physics community continues to grow and become both more
interdisciplinary and more intertwined with commercial and government
operations. This has created a need for a framework to easily identify what
projects can be used for specific applications and how close the tool is to
routine autonomous or on-demand implementation and operation. We propose the
Application Usability Level (AUL) framework and publicizing AULs to help the
community quantify the progress of successful applications, metrics, and
validation efforts. This framework will also aid the scientific community by
supplying the type of information needed to build off of previously published
work and publicizing the applications and requirements needed by the user
communities. In this paper, we define the AUL framework, outline the milestones
required for progression to higher AULs, and provide example projects utilizing
the AUL framework. This work has been completed as part of the activities of
the Assessment of Understanding and Quantifying Progress working group which is
part of the International Forum for Space Weather Capabilities Assessment.