Normalized to: Kawai, A.
[1]
oai:arXiv.org:1602.02832 [pdf] - 1378912
Hierarchical Tree Algorithm for Collisional N-body Simulations on GRAPE
Submitted: 2016-02-08
We present an implementation of the hierarchical tree algorithm on the
individual timestep algorithm (the Hermite scheme) for collisional $N$-body
simulations, running on GRAPE-9 system, a special-purpose hardware accelerator
for gravitational many-body simulations. Such combination of the tree algorithm
and the individual timestep algorithm was not easy on the previous GRAPE system
mainly because its memory addressing scheme was limited only to sequential
access to a full set of particle data. The present GRAPE-9 system has an
indirect memory addressing unit and a particle memory large enough to store all
particles data and also tree nodes data. The indirect memory addressing unit
stores interaction lists for the tree algorithm, which is constructed on host
computer, and, according to the interaction lists, force pipelines calculate
only the interactions necessary. In our implementation, the interaction
calculations are significantly reduced compared to direct $N^2$ summation in
the original Hermite scheme. For example, we can archive about a factor 30 of
speedup (equivalent to about 17 teraflops) against the Hermite scheme for a
simulation of $N=10^6$ system, using hardware of a peak speed of 0.6 teraflops
for the Hermite scheme.
[2]
oai:arXiv.org:1101.4933 [pdf] - 1051679
Spatially Resolved Spectroscopic Observations of a Possible E+A
Progenitor SDSS J160241.00+521426.9
Submitted: 2011-01-25
In order to investigate the evolution of E+A galaxies, we observed a galaxy
SDSS J160241.00+521426.9, a possible E+A progenitor which shows both emission
and strong Balmer absorptions, and its neighbor galaxy. We used the integral
field spectroscopic mode of the Kyoto Tridimensional Spectrograph (Kyoto3DII),
mounted on the University of Hawaii 88-inch telescope located on Mauna Kea, and
the slit-spectroscopic mode of the Faint Object Camera and Spectrograph (FOCAS)
on the Subaru Telescope. We found a strong Balmer absorption region in the
center of the galaxy and an emission-line region located 2 kpc from the center,
in the direction of its neighbor galaxy. The recession velocities of the galaxy
and its neighbor galaxy differ only by 100 km s^-1, which suggests that they
are a physical pair and would have been interacting. Comparing observed Lick
indices of Balmer lines and color indices with those predicted from stellar
population synthesis models, we find that a suddenly quenched star-formation
scenario is plausible for the star-formation history of the central region. We
consider that star formation started in the galaxy due to galaxy interactions
and was quenched in the central region, whereas star formation in a region
offset from the center still continues or has begun recently. This work is the
first study of a possible E+A progenitor using spatially resolved spectroscopy.
[3]
oai:arXiv.org:0907.2012 [pdf] - 315982
Galactic Wind in the Nearby Starburst Galaxy NGC 253 Observed with the
Kyoto3DII Fabry-Perot Mode
Submitted: 2009-07-12
We have observed the central region of the nearby starburst galaxy NGC 253
with the Kyoto Tridimensional Spectrograph II (Kyoto3DII) Fabry-Perot mode in
order to investigate the properties of its galactic wind. Since this galaxy has
a large inclination, it is easy to observe its galactic wind. We produced the
Ha, [N II]6583, and [S II]6716,6731 images, as well as those line ratio maps.
The [N II]/Ha ratio in the galactic wind region is larger than those in H II
regions in the galactic disk. The [N II]/Ha ratio in the southeastern filament,
a part of the galactic wind, is the largest and reaches about 1.5. These large
[N II]/Ha ratios are explained by shock ionization/excitation. Using the [S
II]/Ha ratio map, we spatially separate the galactic wind region from the
starburst region. The kinetic energy of the galactic wind can be sufficiently
supplied by supernovae in a starburst region in the galactic center. The shape
of the galactic wind and the line ratio maps are non-axisymmetric about the
galactic minor axis, which is also seen in M82. In the [N II]6583/[S
II]6716,6731 map, the positions with large ratios coincide with the positions
of star clusters found in the Hubble Space Telescope (HST) observation. This
means that intense star formation causes strong nitrogen enrichment in these
regions. Our unique data of the line ratio maps including [S II] lines have
demonstrated their effectiveness for clearly distinguishing between shocked gas
regions and starburst regions, determining the extent of galactic wind and its
mass and kinetic energy, and discovering regions with enhanced nitrogen
abundance.
[4]
oai:arXiv.org:0801.1109 [pdf] - 8750
Integrated field spectroscopy of E+A (post-starburst) galaxies with the
Kyoto3DII
Submitted: 2008-01-07
We have performed a two-dimensional spectroscopy of three nearby E+A
(post-starburst) galaxies with the Kyoto3DII integral field spectrograph. In
all the cases, Hdelta absorption is stronger at the centre of the galaxies, but
significantly extended in a few kpc scale. For one galaxy (J1656), we found a
close companion galaxy at the same redshift. The galaxy turned out to be a
star-forming galaxy with a strong emission in Hgamma. For the other two
galaxies, we have found that the central post-starburst regions possibly extend
toward the direction of the tidal tails. Our results are consistent with the
merger/interaction origin of E+A galaxies, where the infalling-gas possibly
caused by a galaxy-galaxy merging creates a central-starburst, succeeded by a
post-starburst (E+A) phase once the gas is depleted.
[5]
oai:arXiv.org:astro-ph/0702392 [pdf] - 316846
Integral Field Spectroscopy of the Quadruply Lensed Quasar 1RXS
J1131-1231: New Light on Lens Substructures
Submitted: 2007-02-14
We have observed the quadruply lensed quasar 1RXS J1131-1231 with the
integral field spectrograph mode of the Kyoto Tridimensional Spectrograph II
mounted on the Subaru telescope. Its field of view has covered simultaneously
the three brighter lensed images A, B, and C, which are known to exhibit
anomalous flux ratios in their continuum emission. We have found that the
[OIII] line flux ratios among these lensed images are consistent with those
predicted by smooth-lens models. The absence of both microlensing and
millilensing effects on this [OIII] narrow line region sets important limits on
the mass of any substructures along the line of sight, which is expressed as
M_E < 10^5 M_solar for the mass inside an Einstein radius. In contrast, the
H_beta line emission, which originates from the broad line region, shows an
anomaly in the flux ratio between images B and C, i.e., a factor two smaller
C/B ratio than predicted by smooth-lens models. The ratio of A/B in the H_beta
line is well reproduced. We show that the anomalous C/B ratio for the H_beta
line is caused most likely by micro/milli-lensing of image C. This is because
other effects, such as the differential dust extinction and/or arrival time
difference between images B and C, or the simultaneous lensing of another pair
of images A and B, are all unlikely. In addition, we have found that the broad
H_beta line of image A shows a slight asymmetry in its profile compared with
those in the other images, which suggests the presence of a small microlensing
effect on this line emitting region of image A.
[6]
oai:arXiv.org:astro-ph/0504407 [pdf] - 1233525
GRAPE-6A: A single-card GRAPE-6 for parallel PC-GRAPE cluster system
Submitted: 2005-04-19
In this paper, we describe the design and performance of GRAPE-6A, a
special-purpose computer for gravitational many-body simulations. It was
designed to be used with a PC cluster, in which each node has one GRAPE-6A.
Such configuration is particularly effective in running parallel tree
algorithm. Though the use of parallel tree algorithm was possible with the
original GRAPE-6 hardware, it was not very cost-effective since a single
GRAPE-6 board was still too fast and too expensive. Therefore, we designed
GRAPE-6A as a single PCI card to minimize the reproduction cost and optimize
the computing speed. The peak performance is 130 Gflops for one GRAPE-6A board
and 3.1 Tflops for our 24 node cluster. We describe the implementation of the
tree, TreePM and individual timestep algorithms on both a single GRAPE-6A
system and GRAPE-6A cluster. Using the tree algorithm on our 16-node GRAPE-6A
system, we can complete a collisionless simulation with 100 million particles
(8000 steps) within 10 days.
[7]
oai:arXiv.org:astro-ph/0311179 [pdf] - 60759
A Study of the Distribution of Star-Forming Regions in Luminous Infrared
Galaxies by Means of H$\alpha$ Imaging Observations
Hattori, T.;
Yoshida, M.;
Ohtani, H.;
Sugai, H.;
Ishigaki, T.;
Sasaki, M.;
Hayashi, T.;
Ozaki, S.;
Ishii, M.;
Kawai, A.
Submitted: 2003-11-07
We performed H-alpha imaging observations of 22 luminous infrared galaxies to
investigate how the distribution of star-forming regions in these galaxies is
related to galaxy interactions. Based on correlation diagrams between H-alpha
flux and continuum emission for individual galaxies, a sequence for the
distribution of star-forming regions was found: very compact (~100 pc) nuclear
starbursts with almost no star-forming activity in the outer regions (type 1),
dominant nuclear starbursts < 1 kpc in size and a negligible contribution from
the outer regions (type 2), nuclear starbursts > 1 kpc in size and a
significant contribution from the outer regions (type 3), and extended
starbursts with relatively faint nuclei (type 4). These classes of star-forming
region were found to be strongly related to global star-forming properties such
as star-formation efficiency, far-infrared color, and dust extinction. There
was a clear tendency for the objects with more compact distributions of
star-forming regions to show a higher star-formation efficiency and hotter
far-infrared color. An appreciable fraction of the sample objects were
dominated by extended starbursts (type 4), which is unexpected in the standard
scenario of interaction-induced starburst galaxies. We also found that the
distribution of star-forming regions was weakly but clearly related to galaxy
morphology: severely disturbed objects had a more concentrated distribution of
star-forming regions. This suggests that the properties of galaxy interactions,
such as dynamical phase and orbital parameters, play a more important role than
the internal properties of progenitor galaxies, such as dynamical structure or
gas mass fraction. We also discuss the evolution of the distribution of
star-forming regions in interacting galaxies.
[8]
oai:arXiv.org:astro-ph/0306203 [pdf] - 57284
Structure of Dark Matter Halos From Hierarchical Clustering. III.
Shallowing of The Inner Cusp
Submitted: 2003-06-10
We investigate the structure of the dark matter halo formed in the cold dark
matter scenarios by N-body simulations with parallel treecode on GRAPE cluster
systems. We simulated 8 halos with the mass of $4.4\times 10^{14}M_{\odot}$ to
$1.6\times 10^{15}M_{\odot}$ in the SCDM and LCDM model using up to 30 million
particles. With the resolution of our simulations, the density profile is
reliable down to 0.2 percent of the virial radius. Our results show that the
slope of inner cusp within 1 percent virial radius is shallower than -1.5, and
the radius where the shallowing starts exhibits run-to-run variation, which
means the innermost profile is not universal.
[9]
oai:arXiv.org:astro-ph/0012041 [pdf] - 39630
Pseudoparticle Multipole Method: A Simple Method to Implement
High-Accuracy Treecode
Submitted: 2000-12-02
In this letter we describe the pseudoparticle multipole method (P2M2), a new
method to express multipole expansion by a distribution of pseudoparticles. We
can use this distribution of particles to calculate high order terms in both
the Barnes-Hut treecode and FMM. The primary advantage of P2M2 is that it works
on GRAPE. GRAPE is a special-purpose hardware for the calculation of
gravitational force between particles. Although the treecode has been
implemented on GRAPE, we could handle terms only up to dipole, since GRAPE can
calculate forces from point-mass particles only. Thus the calculation cost
grows quickly when high accuracy is required. With P2M2, the multipole
expansion is expressed by particles, and thus GRAPE can calculate high order
terms. Using P2M2, we implemented an arbitrary-order treecode on GRAPE-4.
Timing result shows GRAPE-4 accelerates the calculation by a factor between 10
(for low accuracy) to 150 (for high accuracy). Even on general-purpose
programmable computers, our method offers the advantage that the mathematical
formulae and therefore the actual program is much simpler than that of the
direct implementation of multipole expansion.
[10]
oai:arXiv.org:astro-ph/9905101 [pdf] - 106427
7.0/Mflops Astrophysical N-Body Simulation with Treecode on GRAPE-5
Submitted: 1999-05-08, last modified: 1999-11-24
As an entry for the 1999 Gordon Bell price/performance prize, we report an
astrophysical N-body simulation performed with a treecode on GRAPE-5 (Gravity
Pipe 5) system, a special-purpose computer for astrophysical N-body
simulations. The GRAPE-5 system has 32 pipeline processors specialized for the
gravitational force calculation. Other operations, such as tree construction,
tree traverse and time integration, are performed on a general purpose
workstation. The total cost for the GRAPE-5 system is 40,900 dollars. We
performed a cosmological N-body simulation with 2.1 million particles, which
sustained a performance of 5.92 Gflops averaged over 8.37 hours. The price per
performance obtained is 7.0 dollars per Mflops.
[11]
oai:arXiv.org:astro-ph/9909116 [pdf] - 1235380
GRAPE-5: A Special-Purpose Computer for N-body Simulation
Submitted: 1999-09-07
We have developed a special-purpose computer for gravitational many-body
simulations, GRAPE-5. GRAPE-5 is the successor of GRAPE-3. Both consist of
eight custom pipeline chips (G5 chip and GRAPE chip). The difference between
GRAPE-5 and GRAPE-3 are: (1) The G5 chip contains two pipelines operating at 80
MHz, while the GRAPE chip had one at 20 MHz. Thus, the calculation speed of the
G5 chip and that of GRAPE-5 board are 8 times faster than that of GRAPE chip
and GRAPE-3 board. (2) The GRAPE-5 board adopted PCI bus as the interface to
the host computer instead of VME of GRAPE-3, resulting in the communication
speed one order of magnitude faster. (3) In addition to the pure 1/r potential,
the G5 chip can calculate forces with arbitrary cutoff functions, so that it
can be applied to Ewald or P^3M methods. (4) The pairwise force calculated on
GRAPE-5 is about 10 times more accurate than that on GRAPE-3. On one GRAPE-5
board, one timestep of 128k-body simulation with direct summation algorithm
takes 14 seconds. With Barnes-Hut tree algorithm (theta = 0.75), one timestep
of 10^6-body simulation can be done in 16 seconds.
[12]
oai:arXiv.org:astro-ph/9906419 [pdf] - 1235354
PROGRAPE-1: A Programmable, Multi-Purpose Computer for Many-Body
Simulations
Submitted: 1999-06-25, last modified: 1999-07-08
We have developed PROGRAPE-1 (PROgrammable GRAPE-1), a programmable
multi-purpose computer for many-body simulations. The main difference between
PROGRAPE-1 and "traditional" GRAPE systems is that the former uses FPGA (Field
Programmable Gate Array) chips as the processing elements, while the latter
rely on the hardwired pipeline processor specialized to gravitational
interactions. Since the logic implemented in FPGA chips can be reconfigured, we
can use PROGRAPE-1 to calculate not only gravitational interactions but also
other forms of interactions such as van der Waals force, hydrodynamical
interactions in SPH calculation and so on. PROGRAPE-1 comprises two Altera
EPF10K100 FPGA chips, each of which contains nominally 100,000 gates. To
evaluate the programmability and performance of PROGRAPE-1, we implemented a
pipeline for gravitational interaction similar to that of GRAPE-3. One pipeline
fitted into a single FPGA chip, which operated at 16 MHz clock. Thus, for
gravitational interaction, PROGRAPE-1 provided the speed of 0.96
Gflops-equivalent. PROGRAPE will prove to be useful for wide-range of
particle-based simulations in which the calculation cost of interactions other
than gravity is high, such as the evaluation of SPH interactions.
[13]
oai:arXiv.org:astro-ph/9812431 [pdf] - 104538
A Simple Formulation of the Fast Multipole Method: Pseudo-Particle
Multipole Method
Submitted: 1998-12-23
We present the pseudo-particle multipole method (P2M2), a new method to
handle multipole expansion in fast multipole method and treecode. This method
uses a small number of pseudo-particles to express multipole expansion. With
this method, the implementation of FMM and treecode with high-order multipole
terms is greatly simplified. We applied P2M2 to treecode and combined it with
special-purpose computer GRAPE. Extensive tests on the accuracy and calculation
cost demonstrate that the new method is quite attractive.
[14]
oai:arXiv.org:astro-ph/9707079 [pdf] - 1235040
The PCI Interface for GRAPE Systems: PCI-HIB
Submitted: 1997-07-07, last modified: 1997-07-16
We developed a PCI interface for GRAPE systems. GRAPE(GRAvity piPE) is a
special-purpose computer for gravitational N-body simulations. A GRAPE system
consists of GRAPE processor boards and a host computer. GRAPE processors
perform the calculation of gravitational forces between particles. The host
computer performs the rest of calculations. The newest of GRAPE machines, the
GRAPE-4, achieved the peak performance of 1.08 Tflops. The GRAPE-4 system uses
TURBOChannel for the interface to the host, which limits the selection of the
host computer. The TURBOChannel bus is not supported by any of recent
workstations. We developed a new host interface board which adopts the PCI bus
instead of the TURBOChannel. PCI is an I/O bus standard developed by Intel. It
has fairly high peak transfer speed, and is available on wide range of
computers, from PCs to supercomputers. Thus, the new interface allows us to
connect GRAPE-4 to a wide variety of host computers. In test runs with a
Barnes-Hut treecode, we found that the performance of new system with PCI
interface is 40% better than that of the original system.