Normalized to: Bédorf, J.
[1]
oai:arXiv.org:2005.14049 [pdf] - 2103532
Trimodal structure of Hercules stream explained by originating from bar
resonances
Submitted: 2020-05-28
Gaia Data Release 2 revealed detailed structures of nearby stars in phase
space. These include the Hercules stream, whose origin is still debated.
Earlier numerical studies conjectured that the observed structures originate
from orbits in resonance with the bar, based on static potential models for the
Milky Way. We, in contrast, approach the problem via a self-consistent,
dynamic, and morphologically well-resolved model, namely a full $N$-body
simulation of the Milky Way. Our simulation comprises about 5.1 billion
particles in the galactic stellar bulge, bar, disk, and dark-matter halo and is
evolved to 10 Gyr. Our model's disk component is composed of 200 million
particles, and its simulation snapshots are stored every 10 Myr, enabling us to
resolve and classify resonant orbits of representative samples of stars. After
identifying the Sun's position in the simulation, we compare the distribution
of stars in its neighborhood with Gaia's astrometric data, thereby establishing
the role of identified resonantly trapped stars in the formation of
Hercules-like structures. From our orbital spectral-analysis we identify
multiple resonances and conclude that the Hercules stream is dominated by the
4:1 and 5:1 outer Lindblad and corotation resonances. In total, this yields a
trimodal structure of the Hercules stream. From the relation between resonances
and ridges in phase space, we conclude that the pattern speed of the Milky-Way
bar is 40$-$45 km s$^{-1}$ kpc$^{-1}$.
[2]
oai:arXiv.org:1909.07439 [pdf] - 2048536
Bonsai-SPH: A GPU accelerated astrophysical Smoothed Particle
Hydrodynamics code
Submitted: 2019-09-16, last modified: 2020-02-14
We present the smoothed-particle hydrodynamics simulation code, Bonsai-SPH,
which is a continuation of our previously developed gravity-only hierarchical
$N$-body code (called Bonsai). The code is optimized for Graphics Processing
Unit (GPU) accelerators which enables researchers to take advantage of these
powerful computational resources. Bonsa-SPH produces simulation results
comparable with state-of-the-art, CPU based, codes, but using an order of
magnitude less computation time. The code is freely available online and the
details are described in this work.
[3]
oai:arXiv.org:1807.10019 [pdf] - 1767541
Modeling the Milky Way as a Dry Galaxy
Submitted: 2018-07-26, last modified: 2018-10-10
We construct a model for the Milky Way Galaxy composed of a stellar disc and
bulge embedded in a dark-matter halo. All components are modelled as $N$-body
systems with up to 8 billion equal-mass particles and integrated up to an age
of 10\,Gyr. We find that net angular-momentum of the dark-matter halo with a
spin parameter of $\lambda=0.06$ is required to form a relatively short bar
($\sim 4$\,kpc) with a high pattern speed (40--50\,km\,s$^{-1}$). By comparing
our model with observations of the Milky Way Galaxy, we conclude that a disc
mass of $\sim 3.7\times10^{10}M_{\odot}$ and an initial bulge scale length and
velocity of $\sim 1$\,kpc and $\sim 300$\,km\,s$^{-1}$, respectively, fit best
to the observations. The disc-to-total mass fraction ($f_{\rm d}$) appears to
be an important parameter for the evolution of the Galaxy and models with
$f_{\rm d}\sim 0.45$ are most similar to the Milky Way Galaxy. In addition, we
compare the velocity distribution in the solar neighbourhood in our simulations
with observations in the Milky Way Galaxy. In our simulations the observed gap
in the velocity distribution, which is expected to be caused by the outer
Lindblad resonance (the so-called Hercules stream), appears to be a
time-dependent structure. The velocity distribution changes on a time scale of
20--30\,Myr and therefore it is difficult to estimate the pattern speed of the
bar from the shape of the local velocity distribution alone.
[4]
oai:arXiv.org:1712.00058 [pdf] - 1659611
The dynamics of stellar disks in live dark-matter halos
Submitted: 2017-11-30, last modified: 2018-03-16
Recent developments in computer hardware and software enable researchers to
simulate the self-gravitating evolution of galaxies at a resolution comparable
to the actual number of stars. Here we present the results of a series of such
simulations. We performed $N$-body simulations of disk galaxies with between
100 and 500 million particles over a wide range of initial conditions. Our
calculations include a live bulge, disk, and dark matter halo, each of which is
represented by self-gravitating particles in the $N$-body code. The simulations
are performed using the gravitational $N$-body tree-code Bonsai running on the
Piz Daint supercomputer. We find that the time scale over which the bar forms
increases exponentially with decreasing disk-mass fraction and that the bar
formation epoch exceeds a Hubble time when the disk-mass fraction is
$\sim0.35$. These results can be explained with the swing-amplification theory.
The condition for the formation of $m=2$ spirals is consistent with that for
the formation of the bar, which is also an $m=2$ phenomenon. We further argue
that the non-barred grand-design spiral galaxies are transitional, and that
they evolve to barred galaxies on a dynamical timescale. We also confirm that
the disk-mass fraction and shear rate are important parameters for the
morphology of disk galaxies. The former affects the number of spiral arms and
the bar formation epoch, and the latter determines the pitch angle of the
spiral arms.
[5]
oai:arXiv.org:1510.04068 [pdf] - 1293676
Sapporo2: A versatile direct $N$-body library
Submitted: 2015-10-14
Astrophysical direct $N$-body methods have been one of the first production
algorithms to be implemented using NVIDIA's CUDA architecture. Now, almost
seven years later, the GPU is the most used accelerator device in astronomy for
simulating stellar systems. In this paper we present the implementation of the
Sapporo2 $N$-body library, which allows researchers to use the GPU for $N$-body
simulations with little to no effort. The first version, released five years
ago, is actively used, but lacks advanced features and versatility in numerical
precision and support for higher order integrators. In this updated version we
have rebuilt the code from scratch and added support for OpenCL,
multi-precision and higher order integrators. We show how to tune these codes
for different GPU architectures and present how to continue utilizing the GPU
optimal even when only a small number of particles ($N < 100$) is integrated.
This careful tuning allows Sapporo2 to be faster than Sapporo1 even with the
added options and double precision data loads. The code runs on a range of
NVIDIA and AMD GPUs in single and double precision accuracy. With the addition
of OpenCL support the library is also able to run on CPUs and other
accelerators that support OpenCL.
[6]
oai:arXiv.org:1412.0659 [pdf] - 904525
24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way
Galaxy with 18600 GPUs
Submitted: 2014-12-01
We have simulated, for the first time, the long term evolution of the Milky
Way Galaxy using 51 billion particles on the Swiss Piz Daint supercomputer with
our $N$-body gravitational tree-code Bonsai. Herein, we describe the scientific
motivation and numerical algorithms. The Milky Way model was simulated for 6
billion years, during which the bar structure and spiral arms were fully
formed. This improves upon previous simulations by using 1000 times more
particles, and provides a wealth of new data that can be directly compared with
observations. We also report the scalability on both the Swiss Piz Daint and
the US ORNL Titan. On Piz Daint the parallel efficiency of Bonsai was above
95%. The highest performance was achieved with a 242 billion particle Milky Way
model using 18600 GPUs on Titan, thereby reaching a sustained GPU and
application performance of 33.49 Pflops and 24.77 Pflops respectively.
[7]
oai:arXiv.org:1409.5474 [pdf] - 869668
Computational Gravitational Dynamics with Modern Numerical Accelerators
Submitted: 2014-09-18
We review the recent optimizations of gravitational $N$-body kernels for
running them on graphics processing units (GPUs), on single hosts and massive
parallel platforms. For each of the two main $N$-body techniques, direct
summation and tree-codes, we discuss the optimization strategy, which is
different for each algorithm. Because both the accuracy as well as the
performance characteristics differ, hybridizing the two algorithms is essential
when simulating a large $N$-body system with high-density structures containing
few particles, and with low-density structures containing many particles. We
demonstrate how this can be realized by splitting the underlying Hamiltonian,
and we subsequently demonstrate the efficiency and accuracy of the hybrid code
by simulating a group of 11 merging galaxies with massive black holes in the
nuclei.
[8]
oai:arXiv.org:1301.6784 [pdf] - 1159336
The Effect of Many Minor Mergers on the Size Growth of Compact Quiescent
Galaxies
Submitted: 2013-01-28
Massive galaxies with a half-mass radius <~ 1kpc are observed in the early
universe (z~>2), but not in the local universe. In the local universe
similar-mass (within a factor of two) galaxies tend to be a factor of 4 to 5
larger. Dry minor mergers are known to drive the evolution of the size of a
galaxy without much increasing the mass, but it is unclear if the growth in
size is sufficient to explain the observations. We test the hypothesis that
galaxies grow through dry minor mergers by simulating merging galaxies with
mass ratios of q=1:1 (equal mass) to q=1:160. In our N-body simulations the
total mass of the parent galaxy doubles. We confirm that major mergers do not
cause a sufficient growth in size. The observation can be explained with
mergers with a mass ratio of q=1:5--1:10. Smaller mass ratios cause a more
dramatic growth in size, up to a factor of ~17 for mergers with a mass ratio of
1:80. For relatively massive minor mergers q ~> 1:20 the mass of the incoming
child galaxies tend to settle in the halo of the parent galaxy. This is caused
by the tidal stripping of the child galaxies by the time they enter the central
portion of the parent. When the accretion of minor galaxies becomes more
continuous, when q <~ 1:40, the foreign mass tends to concentrate more in the
central region of the parent galaxy. We speculate that this is caused by
dynamic interactions between the child galaxies inside the merger remnant and
the longer merging times when the difference in mass is larger. These
interactions cause dynamical heating which results in accretion of mass inside
the galaxy core and a reduction of the parent's circular velocity and density.
[9]
oai:arXiv.org:1204.3106 [pdf] - 1117972
A pilgrimage to gravity on GPUs
Submitted: 2012-04-13
In this short review we present the developments over the last 5 decades that
have led to the use of Graphics Processing Units (GPUs) for astrophysical
simulations. Since the introduction of NVIDIA's Compute Unified Device
Architecture (CUDA) in 2007 the GPU has become a valuable tool for N-body
simulations and is so popular these days that almost all papers about high
precision N-body simulations use methods that are accelerated by GPUs. With the
GPU hardware becoming more advanced and being used for more advanced algorithms
like gravitational tree-codes we see a bright future for GPU like hardware in
computational astrophysics.
[10]
oai:arXiv.org:1204.2280 [pdf] - 498955
Bonsai: A GPU Tree-Code
Submitted: 2012-04-10
We present a gravitational hierarchical N-body code that is designed to run
efficiently on Graphics Processing Units (GPUs). All parts of the algorithm are
executed on the GPU which eliminates the need for data transfer between the
Central Processing Unit (CPU) and the GPU. Our tests indicate that the
gravitational tree-code outperforms tuned CPU code for all parts of the
algorithm and show an overall performance improvement of more than a factor 20,
resulting in a processing rate of more than 2.8 million particles per second.
[11]
oai:arXiv.org:1106.1900 [pdf] - 497851
A sparse octree gravitational N-body code that runs entirely on the GPU
processor
Submitted: 2011-06-09, last modified: 2012-04-10
We present parallel algorithms for constructing and traversing sparse octrees
on graphics processing units (GPUs). The algorithms are based on parallel-scan
and sort methods. To test the performance and feasibility, we implemented them
in CUDA in the form of a gravitational tree-code which completely runs on the
GPU.(The code is publicly available at:
http://castle.strw.leidenuniv.nl/software.html) The tree construction and
traverse algorithms are portable to many-core devices which have support for
CUDA or OpenCL programming languages. The gravitational tree-code outperforms
tuned CPU code during the tree-construction and shows a performance improvement
of more than a factor 20 overall, resulting in a processing rate of more than
2.8 million particles per second.
[12]
oai:arXiv.org:1005.5384 [pdf] - 242129
Gravitational tree-code on graphics processing units: implementation in
CUDA
Submitted: 2010-05-28
We present a new very fast tree-code which runs on massively parallel
Graphical Processing Units (GPU) with NVIDIA CUDA architecture. The
tree-construction and calculation of multipole moments is carried out on the
host CPU, while the force calculation which consists of tree walks and
evaluation of interaction list is carried out on the GPU. In this way we
achieve a sustained performance of about 100GFLOP/s and data transfer rates of
about 50GB/s. It takes about a second to compute forces on a million particles
with an opening angle of $\theta \approx 0.5$. The code has a convenient user
interface and is freely available for use\footnote{{\tt
http://castle.strw.leidenuniv.nl/software/octgrav.html}}.