Normalized to: Streit, A.
[1]
oai:arXiv.org:1907.13303 [pdf] - 1926539
German-Russian Astroparticle Data Life Cycle Initiative
Haungs, Andreas;
Bychkov, Igor;
Dubenskaya, Julia;
Fedorov, Oleg;
Heiss, Andreas;
Kang, Donghwa;
Kazarina, Yulia;
Korosteleva, Elena;
Kostunin, Dmitriy;
Kryukov, Alexander;
Mikhailov, Andrey;
Nguyen, Minh-Duc;
Polgart, Frank;
Polyakov, Stanislav;
Postnikov, Evgeny;
Shigarov, Alexey;
Shipilov, Dmitry;
Streit, Achim;
Tokareva, Victoria;
Wochele, Doris;
Wochele, Jürgen;
Zhurov, Dmitry
Submitted: 2019-07-31
A data life cycle (DLC) is a high-level data processing pipeline that
involves data acquisition, event reconstruction, data analysis, publication,
archiving, and sharing. For astroparticle physics a DLC is particularly
important due to the geographical and content diversity of the research field.
A dedicated and experiment spanning analysis and data centre would ensure that
multi-messenger analyses can be carried out using state-of-the-art methods. The
German-Russian Astroparticle Data Life Cycle Initiative (GRADLCI) is a joint
project of the KASCADE-Grande and TAIGA collaborations, aimed at developing a
concept and creating a DLC prototype that takes into account the data
processing features specific for the research field. An open science system
based on the KASCADE Cosmic Ray Data Centre (KCDC), which is a web-based
platform to provide the astroparticle physics data for the general public, must
also include effective methods for distributed data storage algorithms and
techniques to allow the community to perform simulations and analyses with
sophisticated machine learning methods. The aim is to achieve more efficient
analyses of the data collected in different, globally dispersed observatories,
as well as a modern education to Big Data Scientist in the synergy between
basic research and the information society. The contribution covers the status
and future plans of the initiative.
[2]
oai:arXiv.org:1811.12086 [pdf] - 1791770
Russian-German Astroparticle Data Life Cycle Initiative
Bychkov, Igor;
Demichev, Andrey;
Dubenskaya, Julia;
Fedorov, Oleg;
Haungs, Andreas;
Heiss, Andreas;
Kang, Donghwa;
Kazarina, Yulia;
Korosteleva, Elena;
Kostunin, Dmitriy;
Kryukov, Alexander;
Mikhailov, Andrey;
Nguyen, Minh-Duc;
Polyakov, Stanislav;
Postnikov, Evgeny;
Shigarov, Alexey;
Shipilov, Dmitry;
Streit, Achim;
Tokareva, Victoria;
Wochele, Doris;
Wochele, Jürgen;
Zhurov, Dmitry
Submitted: 2018-11-29
Modern large-scale astroparticle setups measure high-energy particles, gamma
rays, neutrinos, radio waves, and the recently discovered gravitational waves.
Ongoing and future experiments are located worldwide. The data acquired have
different formats, storage concepts, and publication policies. Such differences
are a crucial point in the era of Big Data and of multi-messenger analysis in
astroparticle physics. We propose an open science web platform called
ASTROPARTICLE.ONLINE which enables us to publish, store, search, select, and
analyze astroparticle data. In the first stage of the project, the following
components of a full data life cycle concept are under development: describing,
storing, and reusing astroparticle data; software to perform multi-messenger
analysis using deep learning; and outreach for students, post-graduate
students, and others who are interested in astroparticle physics. Here we
describe the concepts of the web platform and the first obtained results,
including the meta data structure for astroparticle data, data analysis by
using convolution neural networks, description of the binary data, and the
outreach platform for those interested in astroparticle physics. The
KASCADE-Grande and TAIGA cosmic-ray experiments were chosen as pilot examples.
[3]
oai:arXiv.org:1410.3677 [pdf] - 929168
Architecture, implementation and parallelization of the software to
search for periodic gravitational wave signals
Submitted: 2014-10-14
The parallelization, design and scalability of the \sky code to search for
periodic gravitational waves from rotating neutron stars is discussed. The code
is based on an efficient implementation of the F-statistic using the Fast
Fourier Transform algorithm. To perform an analysis of data from the advanced
LIGO and Virgo gravitational wave detectors' network, which will start
operating in 2015, hundreds of millions of CPU hours will be required - the
code utilizing the potential of massively parallel supercomputers is therefore
mandatory. We have parallelized the code using the Message Passing Interface
standard, implemented a mechanism for combining the searches at different
sky-positions and frequency bands into one extremely scalable program. The
parallel I/O interface is used to escape bottlenecks, when writing the
generated data into file system. This allowed to develop a highly scalable
computation code, which would enable the data analysis at large scales on
acceptable time scales. Benchmarking of the code on a Cray XE6 system was
performed to show efficiency of our parallelization concept and to demonstrate
scaling up to 50 thousand cores in parallel.