Normalized to: Polyakov, S.
[1]
oai:arXiv.org:1907.13303 [pdf] - 1926539
German-Russian Astroparticle Data Life Cycle Initiative
Haungs, Andreas;
Bychkov, Igor;
Dubenskaya, Julia;
Fedorov, Oleg;
Heiss, Andreas;
Kang, Donghwa;
Kazarina, Yulia;
Korosteleva, Elena;
Kostunin, Dmitriy;
Kryukov, Alexander;
Mikhailov, Andrey;
Nguyen, Minh-Duc;
Polgart, Frank;
Polyakov, Stanislav;
Postnikov, Evgeny;
Shigarov, Alexey;
Shipilov, Dmitry;
Streit, Achim;
Tokareva, Victoria;
Wochele, Doris;
Wochele, Jürgen;
Zhurov, Dmitry
Submitted: 2019-07-31
A data life cycle (DLC) is a high-level data processing pipeline that
involves data acquisition, event reconstruction, data analysis, publication,
archiving, and sharing. For astroparticle physics a DLC is particularly
important due to the geographical and content diversity of the research field.
A dedicated, experiment-spanning analysis and data centre would ensure that
multi-messenger analyses can be carried out using state-of-the-art methods. The
German-Russian Astroparticle Data Life Cycle Initiative (GRADLCI) is a joint
project of the KASCADE-Grande and TAIGA collaborations, aimed at developing a
concept and creating a DLC prototype that takes into account the data
processing features specific to the research field. An open science system
based on the KASCADE Cosmic Ray Data Centre (KCDC), a web-based
platform that provides astroparticle physics data to the general public, must
also include effective methods for distributed data storage, as well as
algorithms and techniques that allow the community to perform simulations and
analyses with sophisticated machine learning methods. The aim is to achieve
more efficient analyses of the data collected at different, globally dispersed
observatories, as well as modern training of Big Data scientists at the
interface between basic research and the information society. The contribution covers the status
and future plans of the initiative.
[2]
oai:arXiv.org:1907.10480 [pdf] - 1922562
Deep Learning for Energy Estimation and Particle Identification in
Gamma-ray Astronomy
Submitted: 2019-07-23
Deep learning techniques, namely convolutional neural networks (CNNs), have
previously been adapted to select gamma-ray events in the TAIGA experiment,
achieving good selection quality compared with the conventional
Hillas approach. Another important task in TAIGA data analysis has also been
addressed with CNNs: gamma-ray energy estimation showed some improvement in
comparison with the conventional method based on the Hillas analysis.
Furthermore, our software was completely redeveloped for the graphics
processing unit (GPU), which led to significantly faster calculations in both
of these tasks. All results have been obtained with data simulated by the
TAIGA Monte Carlo software; their experimental confirmation is envisaged for
the near future.
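For context, the conventional Hillas approach mentioned above describes a
camera image by its amplitude-weighted second moments. The following is a
minimal sketch of that idea, not the TAIGA software: the function name
`hillas_parameters` and the (x, y, amplitude) pixel layout are assumptions
made for illustration, and only three of the usual parameters are computed.

```python
import math

def hillas_parameters(pixels):
    """Return (size, length, width) of a camera image.

    `pixels` is a list of (x, y, amplitude) tuples; this layout is an
    assumption for illustration, not the TAIGA data format.
    """
    size = sum(a for _, _, a in pixels)
    if size <= 0:
        raise ValueError("image has no signal")
    # Amplitude-weighted centroid of the image.
    mx = sum(x * a for x, _, a in pixels) / size
    my = sum(y * a for _, y, a in pixels) / size
    # Amplitude-weighted second central moments.
    sxx = sum(a * (x - mx) ** 2 for x, _, a in pixels) / size
    syy = sum(a * (y - my) ** 2 for _, y, a in pixels) / size
    sxy = sum(a * (x - mx) * (y - my) for x, y, a in pixels) / size
    # Eigenvalues of the 2x2 moment matrix are the squared RMS spreads
    # along the major and minor axes of the image ellipse.
    half_trace = (sxx + syy) / 2.0
    root = math.sqrt(((sxx - syy) / 2.0) ** 2 + sxy ** 2)
    length = math.sqrt(half_trace + root)
    width = math.sqrt(max(half_trace - root, 0.0))
    return size, length, width
```

A CNN, by contrast, works on the pixel amplitudes directly instead of on a
handful of such summary parameters.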
[3]
oai:arXiv.org:1812.01906 [pdf] - 1794146
A distributed data warehouse system for astroparticle physics
Nguyen, Minh-Duc;
Kryukov, Alexander;
Dubenskaya, Julia;
Korosteleva, Elena;
Polyakov, Stanislav;
Postnikov, Evgeny;
Bychkov, Igor;
Mikhailov, Andrey;
Shigarov, Alexey;
Fedorov, Oleg;
Kazarina, Yulia;
Shipilov, Dmitry;
Zhurov, Dmitry
Submitted: 2018-12-05
Building a distributed data warehouse system is one of the pressing issues in
the field of astroparticle physics. Well-known experiments such as TAIGA and
KASCADE-Grande produce tens of terabytes of data measured by their instruments.
It is critical to have a smart data warehouse system on-site to store the
collected data effectively for further distribution. It is also vital to
provide scientists with a convenient, user-friendly interface to access the
collected data with proper permissions not only on-site but also online. The
latter is useful when scientists need to combine data from different
experiments for analysis. In
this work, we describe an approach to implementing a distributed data warehouse
system that allows scientists to acquire just the necessary data from different
experiments via the Internet on demand. The implementation is based on
CernVM-FS with additional components developed by us to search through the
whole available data sets and deliver their subsets to users' computers.
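The abstract does not specify how the search component works. As a rough
illustration of selecting a subset by metadata before delivery, the sketch
below filters a file catalogue by experiment and date. It is independent of
CernVM-FS, and the `FileEntry` fields, the `select_files` function, and the
paths in the usage note are all hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class FileEntry:
    # Hypothetical metadata fields, chosen for illustration only.
    path: str
    experiment: str
    run_date: str  # ISO 8601 date, so string comparison orders correctly

def select_files(catalog, experiments, start, end):
    """Return paths of files from the given experiments dated in [start, end]."""
    wanted = set(experiments)
    return [f.path for f in catalog
            if f.experiment in wanted and start <= f.run_date <= end]
```

For example, `select_files(catalog, ["TAIGA"], "2018-01-01", "2018-12-31")`
would return only the TAIGA files recorded in 2018, so that just those files
need to be transferred to the user's computer.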
[4]
oai:arXiv.org:1812.01324 [pdf] - 1791980
Using Binary File Format Description Languages for Documenting, Parsing,
and Verifying Raw Data in TAIGA Experiment
Bychkov, I.;
Demichev, A.;
Dubenskaya, J.;
Fedorov, O.;
Hmelnov, A.;
Kazarina, Y.;
Korosteleva, E.;
Kostunin, D.;
Kryukov, A.;
Mikhailov, A.;
Nguyen, M. D.;
Polyakov, S.;
Postnikov, E.;
Shigarov, A.;
Shipilov, D.;
Zhurov, D.
Submitted: 2018-12-04
This paper addresses the documentation, parsing, and verification of raw
binary data in the astroparticle data life cycle. The long-term preservation of
raw data of astroparticle experiments as originally generated is essential for
re-running analyses and reproducing research results. The selected high-quality
raw data should have detailed documentation and be accompanied by open software
tools for accessing them. We consider the applicability of binary file format
description languages to specify, parse and verify raw data of the Tunka
Advanced Instrument for cosmic rays and Gamma Astronomy (TAIGA) experiment. The
formal specifications are implemented for five data formats of the experiment
and provide automatic generation of source code for data reading libraries in
target programming languages (e.g. C++, Java, and Python). These libraries were
tested on TAIGA data. They showed good performance and helped us to locate the
parts with corrupted data. The format specifications can be used as metadata
for exchanging astroparticle raw data. They can also simplify software
development for data aggregation from various sources for the multi-messenger
analysis.
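To illustrate how a declarative layout can drive both parsing and
verification, here is a minimal sketch using Python's standard `struct`
module. The record layout is hypothetical and is not one of the five TAIGA
formats; real format description languages are considerably more expressive
than a single `struct` format string.

```python
import struct

# Hypothetical record layout (NOT a real TAIGA format): little-endian
# uint32 event id, float64 timestamp, uint16 sample count, followed by
# that many int16 samples.
HEADER = struct.Struct("<IdH")

def parse_records(blob):
    """Parse consecutive records; return (records, error_offset).

    error_offset is None if the whole blob parsed cleanly, otherwise the
    byte offset of the first truncated record.
    """
    records, offset = [], 0
    while offset < len(blob):
        if offset + HEADER.size > len(blob):
            return records, offset  # header is cut short
        event_id, timestamp, n = HEADER.unpack_from(blob, offset)
        body_end = offset + HEADER.size + 2 * n
        if body_end > len(blob):
            return records, offset  # samples are cut short
        samples = struct.unpack_from(f"<{n}h", blob, offset + HEADER.size)
        records.append({"event_id": event_id, "timestamp": timestamp,
                        "samples": list(samples)})
        offset = body_end
    return records, None
```

Returning the offset of the first record that fails to parse is one simple
way to point at a corrupted part of a file; this sketch only detects
truncation, not invalid field values.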
[5]
oai:arXiv.org:1811.11822 [pdf] - 1886463
Gamma/Hadron Separation in Imaging Air Cherenkov Telescopes Using Deep
Learning Libraries TensorFlow and PyTorch
Submitted: 2018-11-28, last modified: 2018-12-04
In this work we compare two open source machine learning libraries, PyTorch
and TensorFlow, as software platforms for rejecting hadron background events
detected by imaging air Cherenkov telescopes (IACTs). Monte Carlo simulation
for the TAIGA-IACT telescope is used to estimate background rejection quality.
A wide variety of neural network algorithms provided by both libraries can
easily be tested on various types of data, which is useful for various imaging
air Cherenkov experiments. The work is a component of the Astroparticle.online
project, which collaborates with the TAIGA and KASCADE experiments and welcomes
any astroparticle experiment to join.
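The abstract does not state which metric is used for background rejection
quality. One common figure of merit in gamma-ray astronomy is the quality
factor Q = eff_gamma / sqrt(eff_hadron), where eff_gamma and eff_hadron are
the fractions of gamma and hadron events that pass the cut. The sketch below
computes it from classifier scores; the function name and the score
convention (higher means more gamma-like) are assumptions.

```python
import math

def selection_quality(gamma_scores, hadron_scores, threshold):
    """Return (eff_gamma, eff_hadron, Q) for a cut at `threshold`.

    Events with score >= threshold are accepted as gamma candidates.
    """
    eff_gamma = sum(s >= threshold for s in gamma_scores) / len(gamma_scores)
    eff_hadron = sum(s >= threshold for s in hadron_scores) / len(hadron_scores)
    # If no hadron survives the cut, Q is reported as infinity; in practice
    # this usually means the test sample is too small.
    q = eff_gamma / math.sqrt(eff_hadron) if eff_hadron > 0 else math.inf
    return eff_gamma, eff_hadron, q
```

Because the function only needs lists of scores, the same evaluation can be
applied to the output of a PyTorch or a TensorFlow model.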
[6]
oai:arXiv.org:1812.01551 [pdf] - 1792007
Particle identification in ground-based gamma-ray astronomy using
convolutional neural networks
Postnikov, E. B.;
Bychkov, I. V.;
Dubenskaya, J. Y.;
Fedorov, O. L.;
Kazarina, Y. A.;
Korosteleva, E. E.;
Kryukov, A. P.;
Mikhailov, A. A.;
Nguyen, M. D.;
Polyakov, S. P.;
Shigarov, A. O.;
Shipilov, D. A.;
Zhurov, D. P.
Submitted: 2018-12-04
Modern detectors of cosmic gamma rays are a special type of imaging
telescope (air Cherenkov telescopes) equipped with cameras with a relatively
large number of photomultiplier-based pixels. For example, the camera of the
TAIGA-IACT telescope has 560 pixels of hexagonal structure. Images in such
cameras can be analysed by deep learning techniques to extract numerous
physical and geometrical parameters and/or for incoming particle
identification. The most powerful deep learning technique for image analysis,
the so-called convolutional neural network (CNN), was implemented in this
study. Two open source libraries for machine learning, PyTorch and TensorFlow,
were tested as possible software platforms for particle identification in
imaging air Cherenkov telescopes. Monte Carlo simulation was performed to
analyse images of gamma rays and background particles (protons) and to
estimate the identification accuracy. Further steps of implementation and
improvement of this technique are discussed.
[7]
oai:arXiv.org:1812.01212 [pdf] - 1791969
Application of HUBzero platform for the educational process in
astroparticle physics
Kazarina, Yulia;
Bychkov, Igor;
Kryukov, Alexander;
Dubenskaya, Julia;
Korosteleva, Elena;
Nguyen, Minh-Duc;
Polyakov, Stanislav;
Postnikov, Evgeny;
Mikhailov, Andrey;
Shigarov, Alexey;
Fedorov, Oleg;
Shipilov, Dmitry;
Zhurov, Dmitry
Submitted: 2018-12-03
Within the framework of the Karlsruhe-Russian Astroparticle Data Life Cycle
Initiative, it was proposed to deploy an educational resource,
astroparticle.online, for training students in the field of astroparticle
physics. This resource is based on HUBzero, an open-source software
platform for building powerful websites that support scientific discovery,
learning, and collaboration. HUBzero has been deployed on the servers of
the Matrosov Institute for System Dynamics and Control Theory. The educational
resource astroparticle.online is being populated with information on
cosmic messengers, astroparticle physics experiments, and educational courses
and schools on astroparticle physics. Furthermore, the educational resource
astroparticle.online can be used for online collaboration. We present the
current status of this project and our first experience of using this
service as a collaboration framework.
[8]
oai:arXiv.org:1811.12086 [pdf] - 1791770
Russian-German Astroparticle Data Life Cycle Initiative
Bychkov, Igor;
Demichev, Andrey;
Dubenskaya, Julia;
Fedorov, Oleg;
Haungs, Andreas;
Heiss, Andreas;
Kang, Donghwa;
Kazarina, Yulia;
Korosteleva, Elena;
Kostunin, Dmitriy;
Kryukov, Alexander;
Mikhailov, Andrey;
Nguyen, Minh-Duc;
Polyakov, Stanislav;
Postnikov, Evgeny;
Shigarov, Alexey;
Shipilov, Dmitry;
Streit, Achim;
Tokareva, Victoria;
Wochele, Doris;
Wochele, Jürgen;
Zhurov, Dmitry
Submitted: 2018-11-29
Modern large-scale astroparticle setups measure high-energy particles, gamma
rays, neutrinos, radio waves, and the recently discovered gravitational waves.
Ongoing and future experiments are located worldwide. The data acquired have
different formats, storage concepts, and publication policies. Such differences
are a crucial point in the era of Big Data and of multi-messenger analysis in
astroparticle physics. We propose an open science web platform called
ASTROPARTICLE.ONLINE which enables us to publish, store, search, select, and
analyze astroparticle data. In the first stage of the project, the following
components of a full data life cycle concept are under development: describing,
storing, and reusing astroparticle data; software to perform multi-messenger
analysis using deep learning; and outreach for students, post-graduate
students, and others who are interested in astroparticle physics. Here we
describe the concepts of the web platform and the first results obtained,
including the metadata structure for astroparticle data, data analysis
using convolutional neural networks, description of the binary data, and the
outreach platform for those interested in astroparticle physics. The
KASCADE-Grande and TAIGA cosmic-ray experiments were chosen as pilot examples.