Normalized to: Gaburov, E.
[1]
oai:arXiv.org:1510.04068 [pdf] - 1293676
Sapporo2: A versatile direct $N$-body library
Submitted: 2015-10-14
Astrophysical direct $N$-body methods have been one of the first production
algorithms to be implemented using NVIDIA's CUDA architecture. Now, almost
seven years later, the GPU is the most used accelerator device in astronomy for
simulating stellar systems. In this paper we present the implementation of the
Sapporo2 $N$-body library, which allows researchers to use the GPU for $N$-body
simulations with little to no effort. The first version, released five years
ago, is actively used, but lacks advanced features and versatility in numerical
precision and support for higher order integrators. In this updated version we
have rebuilt the code from scratch and added support for OpenCL,
multi-precision and higher order integrators. We show how to tune these codes
for different GPU architectures and present how to continue utilizing the GPU
optimal even when only a small number of particles ($N < 100$) is integrated.
This careful tuning allows Sapporo2 to be faster than Sapporo1 even with the
added options and double precision data loads. The code runs on a range of
NVIDIA and AMD GPUs in single and double precision accuracy. With the addition
of OpenCL support the library is also able to run on CPUs and other
accelerators that support OpenCL.
[2]
oai:arXiv.org:1412.0659 [pdf] - 904525
24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way
Galaxy with 18600 GPUs
Submitted: 2014-12-01
We have simulated, for the first time, the long term evolution of the Milky
Way Galaxy using 51 billion particles on the Swiss Piz Daint supercomputer with
our $N$-body gravitational tree-code Bonsai. Herein, we describe the scientific
motivation and numerical algorithms. The Milky Way model was simulated for 6
billion years, during which the bar structure and spiral arms were fully
formed. This improves upon previous simulations by using 1000 times more
particles, and provides a wealth of new data that can be directly compared with
observations. We also report the scalability on both the Swiss Piz Daint and
the US ORNL Titan. On Piz Daint the parallel efficiency of Bonsai was above
95%. The highest performance was achieved with a 242 billion particle Milky Way
model using 18600 GPUs on Titan, thereby reaching a sustained GPU and
application performance of 33.49 Pflops and 24.77 Pflops respectively.
[3]
oai:arXiv.org:1307.2445 [pdf] - 1172591
Structure and evolution of high-mass stellar mergers
Submitted: 2013-07-09
In young dense clusters repeated collisions between massive stars may lead to
the formation of a very massive star (above 100 Msun). In the past the study of
the long-term evolution of merger remnants has mostly focussed on collisions
between low-mass stars (up to about 2 Msun) in the context of blue-straggler
formation. The evolution of collision products of more massive stars has not
been as thoroughly investigated. In this paper we study the long-term evolution
of a number of stellar mergers formed by the head-on collision of a primary
star with a mass of 5-40 Msun with a lower mass star at three points in its
evolution in order to better understand their evolution.
We use smooth particle hydrodynamics (SPH) calculations to model the
collision between the stars. The outcome of this calculation is reduced to one
dimension and imported into a stellar evolution code. We follow the subsequent
evolution of the collision product through the main sequence at least until the
onset of helium burning.
We find that little hydrogen is mixed into the core of the collision
products, in agreement with previous studies of collisions between low-mass
stars. For collisions involving evolved stars we find that during the merger
the surface nitrogen abundance can be strongly enhanced. The evolution of most
of the collision products proceeds analogously to that of normal stars with the
same mass, but with a larger radius and luminosity. However, the evolution of
collision products that form with a hydrogen depleted core is markedly
different from that of normal stars with the same mass. They undergo a
long-lived period of hydrogen shell burning close to the main-sequence band in
the Hertzsprung-Russell diagram and spend the initial part of core helium
burning as compact blue supergiants.
[4]
oai:arXiv.org:1209.4302 [pdf] - 1151501
Common Envelope Evolution: Where we stand and how we can move forward
Ivanova, N.;
Justham, S.;
Chen, X.;
De Marco, O.;
Fryer, C. L.;
Gaburov, E.;
Ge, H.;
Glebbeek, E.;
Han, Z.;
Li, X. -D.;
Lu, G.;
Marsh, T.;
Podsiadlowski, Ph.;
Potter, A.;
Soker, N.;
Taam, R.;
Tauris, T. M.;
Heuvel, E. P. J. van den;
Webbink, R. F.
Submitted: 2012-09-19, last modified: 2012-12-06
This work aims to present our current best physical understanding of
common-envelope evolution (CEE). We highlight areas of consensus and
disagreement, and stress ideas which should point the way forward for progress
in this important but long-standing and largely unconquered problem. Unusually
for CEE-related work, we mostly try to avoid relying on results from population
synthesis or observations, in order to avoid potentially being misled by
previous misunderstandings. As far as possible we debate all the relevant
issues starting from physics alone, all the way from the evolution of the
binary system immediately before CEE begins to the processes which might occur
just after the ejection of the envelope. In particular, we include extensive
discussion about the energy sources and sinks operating in CEE, and hence
examine the foundations of the standard energy formalism. Special attention is
also given to comparing the results of hydrodynamic simulations from different
groups and to discussing the potential effect of initial conditions on the
differences in the outcomes. We compare current numerical techniques for the
problem of CEE and also whether more appropriate tools could and should be
produced (including new formulations of computational hydrodynamics, and
attempts to include 3D processes within 1D codes). Finally we explore new ways
to link CEE with observations. We compare previous simulations of CEE to the
recent outburst from V1309 Sco, and discuss to what extent post-common-envelope
binaries and nebulae can provide information, e.g. from binary eccentricities,
which is not currently being fully exploited.
[5]
oai:arXiv.org:1201.4873 [pdf] - 1093190
Magnetically-levitating disks around supermassive black holes
Submitted: 2012-01-23, last modified: 2012-08-16
In this paper we report on the formation of magnetically-levitating accretion
disks around supermassive black holes. The structure of these disks is
calculated by numerically modelling tidal disruption of magnetized interstellar
gas clouds. We find that the resulting disks are entirely supported by the
pressure of the magnetic fields against the component of gravitational force
directed perpendicular to the disks. The magnetic field shows ordered
large-scale geometry that remains stable for the duration of our numerical
experiments extending over 10% of the disk lifetime. Strong magnetic pressure
allows high accretion rate and inhibits disk fragmentation. This in combination
with the repeated feeding of manetized molecular clouds to a supermassive black
hole yields a possible solution to the long-standing puzzle of black hole
growth in the centres of galaxies.
[6]
oai:arXiv.org:1204.2280 [pdf] - 498955
Bonsai: A GPU Tree-Code
Submitted: 2012-04-10
We present a gravitational hierarchical N-body code that is designed to run
efficiently on Graphics Processing Units (GPUs). All parts of the algorithm are
executed on the GPU which eliminates the need for data transfer between the
Central Processing Unit (CPU) and the GPU. Our tests indicate that the
gravitational tree-code outperforms tuned CPU code for all parts of the
algorithm and show an overall performance improvement of more than a factor 20,
resulting in a processing rate of more than 2.8 million particles per second.
[7]
oai:arXiv.org:1106.1900 [pdf] - 497851
A sparse octree gravitational N-body code that runs entirely on the GPU
processor
Submitted: 2011-06-09, last modified: 2012-04-10
We present parallel algorithms for constructing and traversing sparse octrees
on graphics processing units (GPUs). The algorithms are based on parallel-scan
and sort methods. To test the performance and feasibility, we implemented them
in CUDA in the form of a gravitational tree-code which completely runs on the
GPU.(The code is publicly available at:
http://castle.strw.leidenuniv.nl/software.html) The tree construction and
traverse algorithms are portable to many-core devices which have support for
CUDA or OpenCL programming languages. The gravitational tree-code outperforms
tuned CPU code during the tree-construction and shows a performance improvement
of more than a factor 20 overall, resulting in a processing rate of more than
2.8 million particles per second.
[8]
oai:arXiv.org:1201.5692 [pdf] - 1093295
How well do STARLAB and NBODY compare? II: Hardware and accuracy
Submitted: 2012-01-26
Most recent progress in understanding the dynamical evolution of star
clusters relies on direct N-body simulations. Owing to the computational
demands, and the desire to model more complex and more massive star clusters,
hardware calculational accelerators, such as GRAPE special-purpose hardware or,
more recently, GPUs (i.e. graphics cards), are generally utilised. In addition,
simulations can be accelerated by adjusting parameters determining the
calculation accuracy (i.e. changing the internal simulation time step used for
each star).
We extend our previous thorough comparison (Anders et al. 2009) of basic
quantities as derived from simulations performed either with STARLAB/KIRA or
NBODY6. Here we focus on differences arising from using different hardware
accelerations (including the increasingly popular graphic card
accelerations/GPUs) and different calculation accuracy settings.
We use the large number of star cluster models (for a fixed stellar mass
function, without stellar/binary evolution, primordial binaries, external tidal
fields etc) already used in the previous paper, evolve them with STARLAB/KIRA
(and NBODY6, where required), analyse them in a consistent way and compare the
averaged results quantitatively. For this quantitative comparison, we apply the
bootstrap algorithm for functional dependencies developed in our previous
study.
In general we find very high comparability of the simulation results,
independent of the used computer hardware (including the hardware accelerators)
and the used N-body code. For the tested accuracy settings we find that for
reduced accuracy (i.e. time step at least a factor 2.5 larger than the standard
setting) most simulation results deviate significantly from the results using
standard settings. The remaining deviations are comprehensible and explicable.
[9]
oai:arXiv.org:1006.4159 [pdf] - 1033223
Astrophysical Weighted Particle Magnetohydrodynamics
Submitted: 2010-06-21
This paper presents applications of weighted meshless scheme for conservation
laws to the Euler equations and the equations of ideal magnetohydrodynamics.
The divergence constraint of the latter is maintained to the truncation error
by a new meshless divergence cleaning procedure. The physics of the interaction
between the particles is described by an one-dimensional Riemann problem in a
moving frame. As a result, necessary diffusion which is required to treat
dissipative processes is added automatically. As a result, our scheme has no
free parameters that controls the physics of inter-particle interaction, with
the exception of the number of the interacting neighbours which control the
resolution and accuracy. The resulting equations have the form similar to SPH
equations, and therefore existing SPH codes can be used to implement the
weighed particle scheme. The scheme is validated in several hydrodynamic and
MHD test cases. In particular, we demonstrate for the first time the ability of
a meshless MHD scheme to model magneto-rotational instability in accretion
disks.
[10]
oai:arXiv.org:1005.5384 [pdf] - 242129
Gravitational tree-code on graphics processing units: implementation in
CUDA
Submitted: 2010-05-28
We present a new very fast tree-code which runs on massively parallel
Graphical Processing Units (GPU) with NVIDIA CUDA architecture. The
tree-construction and calculation of multipole moments is carried out on the
host CPU, while the force calculation which consists of tree walks and
evaluation of interaction list is carried out on the GPU. In this way we
achieve a sustained performance of about 100GFLOP/s and data transfer rates of
about 50GB/s. It takes about a second to compute forces on a million particles
with an opening angle of $\theta \approx 0.5$. The code has a convenient user
interface and is freely available for use\footnote{{\tt
http://castle.strw.leidenuniv.nl/software/octgrav.html}}.
[11]
oai:arXiv.org:0904.0997 [pdf] - 1001878
On the onset of runaway stellar collisions in dense star clusters - II.
Hydrodynamics of three-body interactions
Submitted: 2009-04-06
The onset of runaway stellar collisions in young star clusters is more likely
to initiate with an encounter between a binary and a third star than between
two single stars. Using the initial conditions of such three-star encounters
from direct $N$-body simulations, we model the resulting interaction by means
of Smoothed Particle Hydrodynamics (SPH). We find that, in the majority of the
cases considered, all three stars merge together, and in such three star
mergers, the hydrodynamic simulations reveal that: (1) mass lost as ejecta can
be a considerable fraction of the total mass in the system (up to $\sim25$%);
(2) due to asymmetric mass loss, the collision product can sometimes receive a
kick velocity that exceeds 10 km/s, large enough to allow the collision product
to escape the core of the cluster; and (3) the energy of the ejected matter can
be large enough (up to $\sim 3\times 10^{50}$ erg) to remove or disturb the
inter cluster gas appreciably.
[12]
oai:arXiv.org:0902.4463 [pdf] - 1001524
SAPPORO: A way to turn your graphics cards into a GRAPE-6
Submitted: 2009-02-25, last modified: 2009-03-05
We present Sapporo, a library for performing high-precision gravitational
N-body simulations on NVIDIA Graphical Processing Units (GPUs). Our library
mimics the GRAPE-6 library, and N-body codes currently running on GRAPE-6 can
switch to Sapporo by a simple relinking of the library. The precision of our
library is comparable to that of GRAPE-6, even though internally the GPU
hardware is limited to single precision arithmetics. This limitation is
effectively overcome by emulating double precision for calculating the distance
between particles. The performance loss of this operation is small (< 20%)
compared to the advantage of being able to run at high precision. We tested the
library using several GRAPE-6-enabled N-body codes, in particular with Starlab
and phiGRAPE. We measured peak performance of 800 Gflop/s for running with 10^6
particles on a PC with four commercial G92 architecture GPUs (two GeForce
9800GX2). As a production test, we simulated a 32k Plummer model with equal
mass stars well beyond core collapse. The simulation took 41 days, during which
the mean performance was 113 Gflop/s. The GPU did not show any problems from
running in a production environment for such an extended period of time.
[13]
oai:arXiv.org:0902.1753 [pdf] - 21222
The evolution of runaway stellar collision products
Submitted: 2009-02-10, last modified: 2009-02-13
In the cores of young dense star clusters repeated stellar collisions
involving the same object can occur, which has been suggested to lead to the
formation of an intermediate-mass black hole. In order to verify this scenario
we compute the detailed evolution of the merger remnant of three sequences. We
follow the evolution until the onset of carbon burning and estimate the final
remnant mass to determine the ultimate fate of a runaway merger sequence.
We use a detailed stellar evolution code to follow the evolution of the
collision product. At each collision, we mix the two colliding stars, taking
account of mass loss during the collision. During the stellar evolution we
apply mass loss rates from the literature, as appropriate for the evolutionary
stage of the merger remnant. We compute models for high ($Z=0.02$) and low
($Z=0.001$) metallicity to quantify metallicity effects.
We find that the merger remnant becomes a Wolf-Rayet star before the end of
core hydrogen burning. Mass loss from stellar winds dominates over the mass
increase due to repeated mergers for all three merger sequences that we
consider. In none of our high metallicity models an intermediate-mass black
hole is formed, instead our models have a mass of 10--14 \Msun{} at the onset
of carbon burning. For low metallicity we expect the final remnant of the
merger sequence to explode as a pair creation supernova. We find that our
metal-rich models become inflated as a result of developing an extended
low-density envelope. This may increase the probability of further collisions,
but self-consistent $N$-body calculations with detailed evolution of runaway
mergers are required to verify this.
[14]
oai:arXiv.org:0801.0596 [pdf] - 8631
Mass segregation in young star clusters: can it be detected from the
integrated photometric properties?
Submitted: 2008-01-03, last modified: 2008-08-18
We consider the effect of mass segregation on the observable integrated
properties of star clusters. The measurable properties depend on a combination
of the dynamical age of the cluster and the physical age of the stars in the
cluster. To investigate all possible combinations of these two quantities we
propose an analytical model for the mass function of segregated star clusters
that agrees with the results of N-body simulations, in which any combination
can be specified. For a realistic degree of mass segregation and a fixed
density profile we find with increasing age an increase in the measured core
radii and a central surface brightness that decreases in all filters more
rapidly than what is expected from stellar evolution alone. Within a Gyr the
measured core radius increases by a factor of two and the central surface
density in all filters of a segregated cluster will be overestimated by a
similar factor when not taking into account mass segregation in the conversion
from light to mass. We find that the $V-I$ colour of mass segregated clusters
decreases with radius by about 0.1-0.2 mag, which could be observable. From
recent observations of partially resolved extra-galactic clusters a decreasing
half-light radius with increasing wavelength was observed, which was attributed
to mass segregation. These observations can not be reproduced by our models. We
find that the differences between measured radii in different filters are
always smaller than 5%.
[15]
oai:arXiv.org:0707.0406 [pdf] - 2758
On the onset of runaway stellar collisions in dense star clusters I.
Dynamics of the first collision
Submitted: 2007-07-03, last modified: 2007-12-05
We study the circumstances under which first collisions occur in young and
dense star clusters. The initial conditions for our direct $N$-body simulations
are chosen such that the clusters experience core collapse within a few million
years, before the most massive stars have left the main-sequence. It turns out
that the first collision is typically driven by the most massive stars in the
cluster. Upon arrival in the cluster core, by dynamical friction, massive stars
tend to form binaries. The enhanced cross section of the binary compared to a
single star causes other stars to engage the binary. A collision between one of
the binary components and the incoming third star is then mediated by the
encounters between the binary and other cluster members. Due to the geometry of
the binary-single star engagement the relative velocity at the moment of impact
is substantially different than in a two-body encounter. This may have profound
consequences for the further evolution of the collision product.
[16]
oai:arXiv.org:0710.1473 [pdf] - 5772
Integrated properties of mass segregated star clusters
Submitted: 2007-10-08, last modified: 2007-10-09
In this contribution we study integrated properties of dynamically segregated
star clusters. The observed core radii of segregated clusters can be 50%
smaller than the ``true'' core radius. In addition, the measured radius in the
red filters is smaller than those measured in blue filters. However, these
difference are small ($\lesssim10%$), making it observationally challenging to
detect mass segregation in extra-galactic clusters based on such a comparison.
Our results follow naturally from the fact that in nearly all filters most of
the light comes from the most massive stars. Therefore, the observed surface
brightness profile is dominated by stars of similar mass, which are centrally
concentrated and have a similar spatial distribution.
[17]
oai:arXiv.org:0707.3021 [pdf] - 3282
Mixing in massive stellar mergers
Submitted: 2007-07-20, last modified: 2007-10-02
The early evolution of dense star clusters is possibly dominated by close
interactions between stars, and physical collisions between stars may occur
quite frequently. Simulating a stellar collision event can be an intensive
numerical task, as detailed calculations of this process require hydrodynamic
simulations in three dimensions. We present a computationally inexpensive
method in which we approximate the merger process, including shock heating,
hydrodynamic mixing and mass loss, with a simple algorithm based on
conservation laws and a basic qualitative understanding of the hydrodynamics of
stellar mergers. The algorithm relies on Archimedes' principle to dictate the
distribution of the fluid in the stable equilibrium situation. We calibrate and
apply the method to mergers of massive stars, as these are expected to occur in
young and dense star clusters. We find that without the effects of microscopic
mixing, the temperature and chemical composition profiles in a collision
product can become double-valued functions of enclosed mass. Such an unphysical
situation is mended by simulating microscopic mixing as a post-collision
effect. In this way we find that head-on collisions between stars of the same
spectral type result in substantial mixing, while mergers between stars of
different spectral type, such as type B and O stars ($\sim$10 and $\sim$40\msun
respectively), are subject to relatively little hydrodynamic mixing.
[18]
oai:arXiv.org:astro-ph/0702693 [pdf] - 89747
The present day mass function in the central region of the Arches
cluster
Submitted: 2007-02-26
We study the evolution of the mass function in young and dense star clusters
by means of direct N-body simulations. Our main aim is to explain the recent
observations of the relatively flat mass function observed near the centre of
the Arches star cluster. In this region, the power law index of the mass
function for stars more massive than about 5-6 solar mass, is larger than the
Salpeter value by about unity; whereas further out, and for the lower mass
stars, the mass function resembles the Salpeter distribution. We show that the
peculiarities in the Arches mass function can be explained satisfactorily
without primordial mass segregation. We draw two conclusions from our
simulations: 1) The Arches initial mass function is consistent with a Salpeter
slope down to ~1 solar mass, 2) The cluster is about half way towards core
collapse. The cores of other star clusters with characteristics similar to
those of the Arches are expected to show similar flattening in the mass
functions for the high mass (>5 solar mass) stars.
[19]
oai:arXiv.org:hep-th/0108065 [pdf] - 116823
Anisotropic Black Holes in Einstein and Brane Gravity
Submitted: 2001-08-09, last modified: 2001-12-30
We consider exact solutions of Einstein equations defining static black holes
parametrized by off-diagonal metrics which by anholonomic mappings can be
equivalently transformed into some diagonal metrics with coefficients being
very similar to those from the Schwarzschild and/or Reissner-N\"ordstrom
solutions with anisotropic renormalizations of constants. We emphasize that
such classes of solutions, for instance, with ellipsoidal symmetry of horizons,
can be constructed even in general relativity theory if off-diagonal metrics
and anholonomic frames are introduced into considerations. Such solutions do
not violate the Israel's uniqueness theorems on static black hole
configurations because at long radial distances one holds the usual
Schwarzschild limit. We show that anisotropic deformations of the
Reissner-N\"ordstrom metric can be an exact solution on the brane,
re-interpreted as a black hole with an effective electromagnetic like charge
anisotropically induced and polarized by higher dimension gravitational
interactions.