Full-text search for arXiv

2 article(s) in total. 13 co-authors. Median position in authors list is 4,5.

[1] oai:arXiv.org:2005.03542 [pdf] - 2091276

The STONE curve: A ROC-derived model performance assessment tool

Liemohn, Michael W.; Azari, Abigail R.; Ganushkina, Natalia; Rastaetter, Lutz

Comments: 19 pages, including 4 figures. Currently in second-round review with "Earth and Space Science": https://agupubs.onlinelibrary.wiley.com/journal/23335084

Submitted: 2020-04-22

A new model validation and performance assessment tool is introduced, the sliding threshold of observation for numeric evaluation (STONE) curve. It is based on the relative operating characteristic (ROC) curve technique, but instead of sorting all observations in a categorical classification, the STONE tool uses the continuous nature of the observations. Rather than defining events in the observations and then sliding the threshold only in the classifier (model) data set, the threshold is changed simultaneously for both the observational and model values, with the same threshold value for both data and model. This is only possible if the observations are continuous and the model output is in the same units and scale as the observations, that is, the model is trying to exactly reproduce the data. The STONE curve has several similarities with the ROC curve, plotting probability of detection against probability of false detection, ranging from the (1,1) corner for low thresholds to the (0,0) corner for high thresholds, and values above the zero-intercept unity-slope line indicating better than random predictive ability. The main difference is that the STONE curve can be nonmonotonic, doubling back in both the x and y directions. These ripples reveal asymmetries in the data-model value pairs. This new technique is applied to modeling output of a common geomagnetic activity index as well as energetic electron fluxes in the Earth's inner magnetosphere. It is not limited to space physics applications but can be used for any scientific or engineering field where numerical models are used to reproduce observations.

[2] oai:arXiv.org:1504.04402 [pdf] - 1005697

Ensemble modeling of CMEs using the WSA-ENLIL+Cone model

Mays, M. L.; Taktakishvili, A.; Pulkkinen, A. A.; MacNeice, P. J.; Rastaetter, L.; Odstrcil, D.; Jian, L. K.; Richardson, I. G.; LaSota, J. A.; Zheng, Y.; Kuznetsova, M. M.

Comments: 37 pages, 22 figures

Submitted: 2015-04-16, last modified: 2015-05-11

Ensemble modeling of CMEs provides a probabilistic forecast of CME arrival time which includes an estimation of arrival time uncertainty from the spread and distribution of predictions and forecast confidence in the likelihood of CME arrival. The real-time ensemble modeling of CME propagation uses the WSA-ENLIL+Cone model installed at the CCMC and executed in real-time. The current implementation evaluates the sensitivity of WSA-ENLIL+Cone model simulations of CME propagation to initial CME parameters. We discuss the results of real-time ensemble simulations for a total of 35 CME events between January 2013 - July 2014. For the 17 events where the CME was predicted to arrive at Earth, the mean absolute arrival time prediction error was 12.3 hours, which is comparable to the errors reported in other studies. For predictions of CME arrival at Earth the correct rejection rate is 62% and the false-alarm rate is 38%. The arrival time was within the range of the ensemble arrival predictions for 8 out of 17 events. The Brier Score for CME arrival predictions is 0.15 (where 1 is a perfect forecast), indicating that on average, the predicted likelihood of CME arrival is fairly accurate. The reliability of ensemble CME arrival predictions is heavily dependent on the initial distribution of CME input parameters, particularly the median and spread. Preliminary analysis of the probabilistic forecasts suggests undervariability, indicating that these ensembles do not sample a wide enough spread in CME input parameters. Prediction errors can also arise from ambient model parameters, the accuracy of the solar wind background derived from coronal maps, or other model limitations. Finally, predictions of the Kp geomagnetic index differ from observed values by less than one for 11 out of 17 of the ensembles and Kp prediction errors computed from the mean predicted Kp show a mean absolute error of 1.3.