Full-text search for arXiv

Hero, Alfred O.

Normalized to: Hero, A.

8 article(s) in total. 21 co-authors, from 1 to 4 common article(s). Median position in authors list is 5,5.

[1] oai:arXiv.org:1912.06120 [pdf] - 2055372

Solar Flare Intensity Prediction with Machine Learning Models

Jiao, Zhenbang; Sun, Hu; Wang, Xiantong; Manchester, Ward; Gombosi, Tamas; Hero, Alfred; Chen, Yang

Comments: 28 pages, 15 figures

Submitted: 2019-12-12, last modified: 2020-02-26

We develop a mixed Long Short Term Memory (LSTM) regression model to predict the maximum solar flare intensity within a 24-hour time window 0$\sim$24, 6$\sim$30, 12$\sim$36 and 24$\sim$48 hours ahead of time using 6, 12, 24 and 48 hours of data (predictors) for each Helioseismic and Magnetic Imager (HMI) Active Region Patch (HARP). The model makes use of (1) the Space-weather HMI Active Region Patch (SHARP) parameters as predictors and (2) the exact flare intensities instead of class labels recorded in the Geostationary Operational Environmental Satellites (GOES) data set, which serves as the source of the response variables. Compared to solar flare classification, the model offers us more detailed information about the exact maximum flux level, i.e. intensity, for each occurrence of a flare. We also consider classification models built on top of the regression model and obtain better results in solar flare classifications. Our results suggest that the most efficient time period for predicting the solar activity is within 24 hours before the prediction time using the SHARP parameters and the LSTM model.

[2] oai:arXiv.org:1912.00502 [pdf] - 2101325

Predicting solar flares with machine learning: investigating solar cycle dependence

Wang, Xiantong; Chen, Yang; Toth, Gabor; Manchester, Ward B.; Gombosi, Tamas I.; Hero, Alfred O.; Jiao, Zhenbang; Sun, Hu; Jin, Meng; Liu, Yang

Comments:

Submitted: 2019-12-01, last modified: 2020-01-22

A deep learning network, Long-Short Term Memory (LSTM) network, is used in this work to predict whether the maximum flare class an active region (AR) will produce in the next 24 hours is class $\Gamma$. We considered $\Gamma$ are $\ge M$, $\ge C$ and any flare class. The essence of using LSTM, which is a recurrent neural network, is its capability to capture temporal information of the data samples. The input features are time sequences of 20 magnetic parameters from SHARPs - Space-weather HMI Active Region Patches. We analyzed active regions from June 2010 to Dec 2018, using the Geostationary Operational Environmental Satellite (GOES) X-ray flare catalogs and label the data samples with identified ARs in the GOES X-ray flare catalogs. Our results (i) shows consistent skill scores with recently published results using LSTMs and better than the previous work using single time input (eg. DeFN) (ii) The skill scores from the model show essential differences when different years of data was chosen for training and testing.

[3] oai:arXiv.org:1904.00125 [pdf] - 2025437

Identifying Solar Flare Precursors Using Time Series of SDO/HMI Images and SHARP Parameters

Chen, Yang; Manchester, Ward B.; Hero, Alfred O.; Toth, Gabor; DuFumier, Benoit; Zhou, Tian; Wang, Xiantong; Zhu, Haonan; Sun, Zeyu; Gombosi, Tamas I.

Comments:

Submitted: 2019-03-29, last modified: 2019-08-03

We present several methods towards construction of precursors, which show great promise towards early predictions, of solar flare events in this paper. A data pre-processing pipeline is built to extract useful data from multiple sources, Geostationary Operational Environmental Satellites (GOES) and Solar Dynamics Observatory (SDO)/Helioseismic and Magnetic Imager (HMI), to prepare inputs for machine learning algorithms. Two classification models are presented: classification of flares from quiet times for active regions and classification of strong versus weak flare events. We adopt deep learning algorithms to capture both the spatial and temporal information from HMI magnetogram data. Effective feature extraction and feature selection with raw magnetogram data using deep learning and statistical algorithms enable us to train classification models to achieve almost as good performance as using active region parameters provided in HMI/Space-Weather HMI-Active Region Patch (SHARP) data files. Case studies show a significant increase in the prediction score around 20 hours before strong solar flare events.

[4] oai:arXiv.org:1503.04127 [pdf] - 1371971

Image patch analysis of sunspots and active regions. I. Intrinsic dimension and correlation analysis

Moon, Kevin R.; Li, Jimmy J.; Delouille, Veronique; De Visscher, Ruben; Watson, Fraser; Hero, Alfred O.

Comments: Accepted for publication in the Journal of Space Weather and Space Climate (SWSC). 23 pages, 11 figures

Submitted: 2015-03-13, last modified: 2015-12-14

The flare-productivity of an active region is observed to be related to its spatial complexity. Mount Wilson or McIntosh sunspot classifications measure such complexity but in a categorical way, and may therefore not use all the information present in the observations. Moreover, such categorical schemes hinder a systematic study of an active region's evolution for example. We propose fine-scale quantitative descriptors for an active region's complexity and relate them to the Mount Wilson classification. We analyze the local correlation structure within continuum and magnetogram data, as well as the cross-correlation between continuum and magnetogram data. We compute the intrinsic dimension, partial correlation, and canonical correlation analysis (CCA) of image patches of continuum and magnetogram active region images taken from the SOHO-MDI instrument. We use masks of sunspots derived from continuum as well as larger masks of magnetic active regions derived from the magnetogram to analyze separately the core part of an active region from its surrounding part. We find the relationship between complexity of an active region as measured by Mount Wilson and the intrinsic dimension of its image patches. Partial correlation patterns exhibit approximately a third-order Markov structure. CCA reveals different patterns of correlation between continuum and magnetogram within the sunspots and in the region surrounding the sunspots. These results also pave the way for patch-based dictionary learning with a view towards automatic clustering of active regions.

[5] oai:arXiv.org:1504.02762 [pdf] - 1371975

Image patch analysis of sunspots and active regions. II. Clustering via matrix factorization

Moon, Kevin R.; Delouille, Veronique; Li, Jimmy J.; De Visscher, Ruben; Watson, Fraser; Hero, Alfred O.

Comments: Accepted for publication in the Journal of Space Weather and Space Climate (SWSC). 33 pages, 12 figures

Submitted: 2015-04-10, last modified: 2015-12-10

Separating active regions that are quiet from potentially eruptive ones is a key issue in Space Weather applications. Traditional classification schemes such as Mount Wilson and McIntosh have been effective in relating an active region large scale magnetic configuration to its ability to produce eruptive events. However, their qualitative nature prevents systematic studies of an active region's evolution for example. We introduce a new clustering of active regions that is based on the local geometry observed in Line of Sight magnetogram and continuum images. We use a reduced-dimension representation of an active region that is obtained by factoring the corresponding data matrix comprised of local image patches. Two factorizations can be compared via the definition of appropriate metrics on the resulting factors. The distances obtained from these metrics are then used to cluster the active regions. We find that these metrics result in natural clusterings of active regions. The clusterings are related to large scale descriptors of an active region such as its size, its local magnetic field distribution, and its complexity as measured by the Mount Wilson classification scheme. We also find that including data focused on the neutral line of an active region can result in an increased correspondence between our clustering results and other active region descriptors such as the Mount Wilson classifications and the $R$ value. We provide some recommendations for which metrics, matrix factorization techniques, and regions of interest to use to study active regions.

[6] oai:arXiv.org:1504.07116 [pdf] - 1371977

Meta learning of bounds on the Bayes classifier error

Moon, Kevin R.; Delouille, Veronique; Hero, Alfred O.

Comments: 6 pages, 3 figures, to appear in proceedings of 2015 IEEE Signal Processing and SP Education Workshop

Submitted: 2015-04-27, last modified: 2015-07-03

Meta learning uses information from base learners (e.g. classifiers or estimators) as well as information about the learning problem to improve upon the performance of a single base learner. For example, the Bayes error rate of a given feature space, if known, can be used to aid in choosing a classifier, as well as in feature selection and model selection for the base classifiers and the meta classifier. Recent work in the field of f-divergence functional estimation has led to the development of simple and rapidly converging estimators that can be used to estimate various bounds on the Bayes error. We estimate multiple bounds on the Bayes error using an estimator that applies meta learning to slowly converging plug-in estimators to obtain the parametric convergence rate. We compare the estimated bounds empirically on simulated data and then estimate the tighter bounds on features extracted from an image patch analysis of sunspot continuum and magnetogram images.

[7] oai:arXiv.org:1406.6390 [pdf] - 953591

Image patch analysis and clustering of sunspots: a dimensionality reduction approach

Moon, Kevin R.; Li, Jimmy J.; Delouille, Veronique; Watson, Fraser; Hero, Alfred O.

Comments: 5 pages, 7 figures, accepted to ICIP 2014

Submitted: 2014-06-24

Sunspots, as seen in white light or continuum images, are associated with regions of high magnetic activity on the Sun, visible on magnetogram images. Their complexity is correlated with explosive solar activity and so classifying these active regions is useful for predicting future solar activity. Current classification of sunspot groups is visually based and suffers from bias. Supervised learning methods can reduce human bias but fail to optimally capitalize on the information present in sunspot images. This paper uses two image modalities (continuum and magnetogram) to characterize the spatial and modal interactions of sunspot and magnetic active region images and presents a new approach to cluster the images. Specifically, in the framework of image patch analysis, we estimate the number of intrinsic parameters required to describe the spatial and modal dependencies, the correlation between the two modalities and the corresponding spatial patterns, and examine the phenomena at different scales within the images. To do this, we use linear and nonlinear intrinsic dimension estimators, canonical correlation analysis, and multiresolution analysis of intrinsic dimension.

[8] oai:arXiv.org:1110.3052 [pdf] - 622363

The First Stray Light Corrected EUV Images of Solar Coronal Holes

Shearer, Paul; Frazin, Richard A.; Hero, Alfred O.; Gilbert, Anna C.

Comments: Accepted to Astrophysical Journal Letters

Submitted: 2011-10-13, last modified: 2012-03-07

Coronal holes are the source regions of the fast solar wind, which fills most of the solar system volume near the cycle minimum. Removing stray light from extreme ultraviolet (EUV) images of the Sun's corona is of high astrophysical importance, as it is required to make meaningful determinations of temperatures and densities of coronal holes. EUV images tend to be dominated by the component of the stray light due to the long-range scatter caused by microroughness of telescope mirror surfaces, and this component has proven very difficult to measure in pre-flight characterization. In-flight characterization heretofore has proven elusive due to the fact that the detected image is simultaneously nonlinear in two unknown functions: the stray light pattern and the true image which would be seen by an ideal telescope. Using a constrained blind deconvolution technique that takes advantage of known zeros in the true image provided by a fortuitous lunar transit, we have removed the stray light from solar images seen by the EUVI instrument on STEREO-B in all four filter bands (171, 195, 284, and 304 \AA). Uncertainty measures of the stray light corrected images, which include the systematic error due to misestimation of the scatter, are provided. It is shown that in EUVI, stray light contributes up to 70% of the emission in coronal holes seen on the solar disk, which has dramatic consequences for diagnostics of temperature and density and therefore estimates of key plasma parameters such as the plasma $\beta$\ and ion-electron collision rates.