Normalized to: Ragagnin, A.
[1]
oai:arXiv.org:2007.05199 [pdf] - 2131968
On the phase-space structure of galaxy clusters from cosmological
simulations
Marini, I.;
Saro, A.;
Borgani, S.;
Murante, G.;
Rasia, E.;
Dolag, K.;
Lin, W.;
Napolitano, N. R.;
Ragagnin, A.;
Tornatore, L.;
Wang, Y.
Submitted: 2020-07-10
Cosmological N-body simulations represent an excellent tool to study the
formation and evolution of dark matter (DM) halos and the mechanisms that have
originated the universal profile at the largest mass scales in the Universe. In
particular, the combination of the velocity dispersion $\sigma_\mathrm{v}$ with
the density $\rho$ can be used to define the pseudo-entropy
$S(r)=\sigma_\mathrm{v}^2/\rho^{2/3}$, whose profile is well-described by a
simple power-law $S\propto r^{\alpha}$. We analyze a set of cosmological
hydrodynamical re-simulations of massive galaxy clusters and study the
pseudo-entropy profiles as traced by different collisionless components in
simulated galaxy clusters: DM, stars, and substructures. We analyze four sets
of simulations, exploring different resolution and physics (N-body and full
hydrodynamical simulations) to investigate convergence and the impact of
baryons. We find that baryons significantly affect the inner region of
pseudo-entropy profile as traced by substructures, while pseudo-entropy
profiles as traced by DM particles are characterized by an almost universal
behavior and can be used as a low-scatter ($\sim6-7\%$) mass-proxy. We compare
observed and simulated pseudo-entropy profiles and find good agreement in both
normalization and slope. We demonstrate, however, that the method used to
derive observed pseudo-entropy profiles could introduce biases and
underestimate the impact of mergers. Finally, we investigate the pseudo-entropy
traced by the stars focusing our interest in the dynamical distinction between
intracluster light (ICL) and the stars bound to the brightest cluster galaxy
(BCG): the combination of these two pseudo-entropy profiles is well-described
by a single power-law out to almost the entire cluster virial radius.
[2]
oai:arXiv.org:2006.13951 [pdf] - 2122107
The DIANOGA simulations of galaxy clusters: characterizing star
formation in proto-clusters
Bassini, L.;
Rasia, E.;
Borgani, S.;
Granato, G. L.;
Ragone-Figueroa, C.;
Biffi, V.;
Ragagnin, A.;
Dolag, K.;
Murante, G.;
Taffoni, G.;
Tornatore, L.
Submitted: 2020-06-24
We studied the star formation rate (SFR) in cosmological hydrodynamical
simulations of galaxy (proto-)clusters in the redshift range $0<z<4$, comparing
them to recent observational studies; we also investigated the effect of
varying the parameters of the star formation model on galaxy properties such as
SFR, star-formation efficiency, and gas fraction. We analyze a set of zoom-in
cosmological hydrodynamical simulations centred on twelve clusters. The
simulations are carried out with the GADGET-3 TreePM/SPH code which includes
various subgrid models to treat unresolved baryonic physics, including AGN
feedback. Simulations do not reproduce the high values of SFR observed within
protoclusters cores, where the values of SFR are underpredicted by a factor
$\gtrsim 4$ both at $z\sim2$ and $z\sim 4$. The difference arises as
simulations are unable to reproduce the observed starburst population and is
worsened at $z\sim 2$ because simulations underpredict the normalization of the
main sequence of star forming galaxies (i.e., the correlation between stellar
mass and SFR) by a factor of $\sim 3$. As the low normalization of the main
sequence seems to be driven by an underestimated gas fraction, it remains
unclear whether numerical simulations miss starburst galaxies due to a too low
predicted gas fractions or too low star formation efficiencies. Our results are
stable against varying several parameters of the star formation subgrid model
and do not depend on the details of the AGN feedback.
[3]
oai:arXiv.org:2003.03283 [pdf] - 2077028
Performance and energy footprint assessment of FPGAs and GPUs on HPC
systems using Astrophysics application
Goz, David;
Ieronymakis, Georgios;
Papaefstathiou, Vassilis;
Dimou, Nikolaos;
Bertocco, Sara;
Simula, Francesco;
Ragagnin, Antonio;
Tornatore, Luca;
Coretti, Igor;
Taffoni, Giuliano
Submitted: 2020-03-06, last modified: 2020-04-10
New challenges in Astronomy and Astrophysics (AA) are urging the need for a
large number of exceptionally computationally intensive simulations. "Exascale"
(and beyond) computational facilities are mandatory to address the size of
theoretical problems and data coming from the new generation of observational
facilities in AA. Currently, the High Performance Computing (HPC) sector is
undergoing a profound phase of innovation, in which the primary challenge to
the achievement of the "Exascale" is the power-consumption. The goal of this
work is to give some insights about performance and energy footprint of
contemporary architectures for a real astrophysical application in an HPC
context. We use a state-of-the-art N-body application that we re-engineered and
optimized to exploit the heterogeneous underlying hardware fully. We
quantitatively evaluate the impact of computation on energy consumption when
running on four different platforms. Two of them represent the current HPC
systems (Intel-based and equipped with NVIDIA GPUs), one is a micro-cluster
based on ARM-MPSoC, and one is a "prototype towards Exascale" equipped with
ARM-MPSoCs tightly coupled with FPGAs. We investigate the behavior of the
different devices where the high-end GPUs excel in terms of time-to-solution
while MPSoC-FPGA systems outperform GPUs in power consumption. Our experience
reveals that considering FPGAs for computationally intensive application seems
very promising, as their performance is improving to meet the requirements of
scientific applications. This work can be a reference for future platforms
development for astrophysics applications where computationally intensive
calculations are required.
[4]
oai:arXiv.org:2003.10850 [pdf] - 2069198
Gadget3 on GPUs with OpenACC
Submitted: 2020-03-24
We present preliminary results of a GPU porting of all main Gadget3 modules
(gravity computation, SPH density computation, SPH hydrodynamic force, and
thermal conduction) using OpenACC directives. Here we assign one GPU to each
MPI rank and exploit both the host and accellerator capabilities by overlapping
computations on the CPUs and GPUs: while GPUs asynchronously compute
interactions between particles within their MPI ranks, CPUs perform tree-walks
and MPI communications of neighbouring particles. We profile various portions
of the code to understand the origin of our speedup, where we find that a peak
speedup is not achieved because of time-steps with few active particles. We run
a hydrodynamic cosmological simulation from the Magneticum project, with
$2\cdot10^{7}$ particles, where we find a final total speedup of $\approx 2.$
We also present the results of an encouraging scaling test of a preliminary
gravity-only OpenACC porting, run in the context of the EuroHack17 event, where
the prototype of the porting proved to keep a constant speedup up to $1024$
GPUs.
[5]
oai:arXiv.org:1912.05340 [pdf] - 2013400
INAF Trieste Astronomical Observatory Information Technology Framework
Submitted: 2019-12-11
INAF Trieste Astronomical Observatory (OATs) has a long tradition in
information technology applied to Astronomical and Astrophysical use cases,
particularly for what regards computing for data reduction, analysis and
simulations; data and archives management; space missions data processing;
design and software development for ground-based instruments. The ensemble of
these activities, in the last years, pushed the need to acquire new computing
resources and technologies and to deep competences in theirs management. In
this paper we describe INAF-OATs computing centre technological stuff, our
involvement in different EU Projects both in the path of building of EOSC, the
European Open Science Cloud; in the design and prototyping of new Exascale
supercomputers in Europe and the main research activities carried on using our
computing centre.
[6]
oai:arXiv.org:1910.14496 [pdf] - 1989177
Direct N-body application on low-power and energy-efficient parallel
architectures
Submitted: 2019-10-31
The aim of this work is to quantitatively evaluate the impact of computation
on the energy consumption on ARM MPSoC platforms, exploiting CPUs, embedded
GPUs and FPGAs. One of them possibly represents the future of High Performance
Computing systems: a prototype of an Exascale supercomputer. Performance and
energy measurements are made using a state-of-the-art direct $N$-body code from
the astrophysical domain. We provide a comparison of the time-to-solution and
energy delay product metrics, for different software configurations. We have
shown that FPGA technologies can be used for application kernel acceleration
and are emerging as a promising alternative to "traditional" technologies for
HPC, which purely focus on peak-performance than on power-efficiency.
[7]
oai:arXiv.org:1810.08212 [pdf] - 1878664
Dependency of halo concentration on mass, redshift and fossilness in
Magneticum hydrodynamic simulations
Submitted: 2018-10-18, last modified: 2019-05-07
We study the dependency of the concentration on mass and redshift using three
large N-body cosmological hydrodynamic simulations carried out by the
Magneticum project. We constrain the slope of the mass-concentration relation
with an unprecedented mass range for hydrodynamic simulations and find a
negative trend on the mass-concentration plane and a slightly negative redshift
dependency, in agreement with observations and other numerical works. We also
show how the concentration correlates with the fossil parameter, defined as the
stellar mass ratio between the central galaxy and the most massive satellite,
in agreement with observations. We find that haloes with high fossil parameter
have systematically higher concentration and investigate the cause in two
different ways. First we study the evolution of haloes that lives unperturbed
for a long period of time, where we find that the internal region keeps
accreting satellites as the fossil parameter increases and the scale radius
decreases (which increases the concentration). We also study the dependency of
the concentration on the virial ratio and the energy term from the surface
pressure $E_s$. We conclude that fossil objects have higher concentration
because they are dynamically relaxed, with no in-fall/out-fall material and had
time to accrete their satellites.
[8]
oai:arXiv.org:1810.09898 [pdf] - 1779727
Exploiting the Space Filling Curve Ordering of Particles in the
Neighbour Search of Gadget3
Submitted: 2018-10-23
Gadget3 is nowadays one of the most frequently used high performing parallel
codes for cosmological hydrodynamical simulations. Recent analyses have shown
t\ hat the Neighbour Search process of Gadget3 is one of the most
time-consuming parts. Thus, a considerable speedup can be expected from
improvements of the u\ nderlying algorithms. In this work we propose a novel
approach for speeding up the Neighbour Search which takes advantage of the
space-filling-curve particle ordering. Instead of performing Neighbour Search
for all particles individually, nearby active particles can be grouped and one
single Neighbour Search can be performed to obta\ in a common superset of
neighbours. Thus, with this approach we reduce the number of searches. On the
other hand, tree walks are performed within a larger searching radius. There is
an optimal size of grouping that maximize the speedup, which we found by
numerical experiments. We tested the algorithm within the boxes of the
Magneticum project. As a result we obtained a speedup of $1.65$ in the Density
and of $1.30$ in the Hydrodynamics computation, respectively, and a total
speedup of $1.34.$
[9]
oai:arXiv.org:1612.06380 [pdf] - 1769924
A web portal for hydrodynamical, cosmological simulations
Submitted: 2016-12-19, last modified: 2018-10-19
This article describes a data center hosting a web portal for accessing and
sharing the output of large, cosmological, hydro-dynamical simulations with a
broad scientific community. It also allows users to receive related scientific
data products by directly processing the raw simulation data on a remote
computing cluster. The data center has a multi-layer structure: a web portal, a
job control layer, a computing cluster and a HPC storage system. The outer
layer enables users to choose an object from the simulations. Objects can be
selected by visually inspecting 2D maps of the simulation data, by performing
highly compounded and elaborated queries or graphically by plotting arbitrary
combinations of properties. The user can run analysis tools on a chosen object.
These services allow users to run analysis tools on the raw simulation data.
The job control layer is responsible for handling and performing the analysis
jobs, which are executed on a computing cluster. The innermost layer is formed
by a HPC storage system which hosts the large, raw simulation data. The
following services are available for the users: (I) {\sc ClusterInspect}
visualizes properties of member galaxies of a selected galaxy cluster; (II)
{\sc SimCut} returns the raw data of a sub-volume around a selected object from
a simulation, containing all the original, hydro-dynamical quantities; (III)
{\sc Smac} creates idealised 2D maps of various, physical quantities and
observables of a selected object; (IV) {\sc Phox} generates virtual X-ray
observations with specifications of various current and upcoming instruments.
[10]
oai:arXiv.org:1609.01507 [pdf] - 1475588
Extreme Scale-out SuperMUC Phase 2 - lessons learned
Hammer, Nicolay;
Jamitzky, Ferdinand;
Satzger, Helmut;
Allalen, Momme;
Block, Alexander;
Karmakar, Anupam;
Brehm, Matthias;
Bader, Reinhold;
Iapichino, Luigi;
Ragagnin, Antonio;
Karakasis, Vasilios;
Kranzlmüller, Dieter;
Bode, Arndt;
Huber, Herbert;
Kühn, Martin;
Machado, Rui;
Grünewald, Daniel;
Edelmann, Philipp V. F.;
Röpke, Friedrich K.;
Wittmann, Markus;
Zeiser, Thomas;
Wellein, Gerhard;
Mathias, Gerald;
Schwörer, Magnus;
Lorenzen, Konstantin;
Federrath, Christoph;
Klessen, Ralf;
Bamberg, Karl-Ulrich;
Ruhl, Hartmut;
Schornbaum, Florian;
Bauer, Martin;
Nikhil, Anand;
Qi, Jiaxing;
Klimach, Harald;
Stüben, Hinnerk;
Deshmukh, Abhishek;
Falkenstein, Tobias;
Dolag, Klaus;
Petkova, Margarita
Submitted: 2016-09-06
In spring 2015, the Leibniz Supercomputing Centre (Leibniz-Rechenzentrum,
LRZ), installed their new Peta-Scale System SuperMUC Phase2. Selected users
were invited for a 28 day extreme scale-out block operation during which they
were allowed to use the full system for their applications. The following
projects participated in the extreme scale-out workshop: BQCD (Quantum
Physics), SeisSol (Geophysics, Seismics), GPI-2/GASPI (Toolkit for HPC),
Seven-League Hydro (Astrophysics), ILBDC (Lattice Boltzmann CFD), Iphigenie
(Molecular Dynamic), FLASH (Astrophysics), GADGET (Cosmological Dynamics), PSC
(Plasma Physics), waLBerla (Lattice Boltzmann CFD), Musubi (Lattice Boltzmann
CFD), Vertex3D (Stellar Astrophysics), CIAO (Combustion CFD), and LS1-Mardyn
(Material Science). The projects were allowed to use the machine exclusively
during the 28 day period, which corresponds to a total of 63.4 million
core-hours, of which 43.8 million core-hours were used by the applications,
resulting in a utilization of 69%. The top 3 users were using 15.2, 6.4, and
4.7 million core-hours, respectively.
[11]
oai:arXiv.org:1411.1409 [pdf] - 1223025
Galactic outflow and diffuse gas properties at z>=1 using different
baryonic feedback models
Submitted: 2014-11-05
We measure and quantify properties of galactic outflows and diffuse gas at $z
\geq 1$ in cosmological hydrodynamical simulations. Our novel sub-resolution
model, MUPPI, implements supernova feedback using fully local gas properties,
where the wind velocity and mass loading are not given as input. We find the
following trends at $z = 2$ by analysing central galaxies having a stellar mass
higher than $10^{9} M_{\odot}$. The outflow velocity and mass outflow rate
($\dot{M}_{\rm out}$) exhibit positive correlations with galaxy mass and with
the star formation rate (SFR). However, most of the relations present a large
scatter. The outflow mass loading factor ($\eta$) is between $0.2 - 10$. The
comparison Effective model generates a constant outflow velocity, and a
negative correlation of $\eta$ with halo mass. The number fraction of galaxies
where outflow is detected decreases at lower redshifts, but remains more than
$80 \%$ over $z = 1 - 5$. High SF activity at $z \sim 2 - 4$ drives strong
outflows, causing the positive and steep correlations of velocity and
$\dot{M}_{\rm out}$ with SFR. The outflow velocity correlation with SFR becomes
flatter at $z = 1$, and $\eta$ displays a negative correlation with halo mass
in massive galaxies. Our study demonstrates that both the MUPPI and Effective
models produce significant outflows at $\sim 1 / 10$ of the virial radius; at
the same time shows that the properties of outflows generated can be different
from the input speed and mass loading in the Effective model. Our MUPPI model,
using local properties of gas in the sub-resolution recipe, is able to develop
galactic outflows whose properties correlate with global galaxy properties, and
consistent with observations.