Normalized to: Ferrell, R.
[1]
oai:arXiv.org:astro-ph/9503042 [pdf] - 92459
A Parallel Processing Algorithm for Computing Short-Range Particle
Forces with Inhomogeneous Particle Distributions
Submitted: 1995-03-09
We present a computational algorithm for computing short range forces between
particles. The algorithm has two distinguishing features. First, it is
optimized for multi-processor computers, and will use as many processors as are
available. Second, it is optimized for inhomogeneous, dynamic particle
distributions; for any distribution the computational load is distributed
evenly to all processors, and the communication time is less than 15\% of the
total run time.
In this talk we present our new algorithm. We developed the program for a
grand-challenge problem in cosmology, simulation of the formation of
large-scale structure in the universe. This simulation, run on the Thinking
Machines Corporation CM-5, uses the particle-particle/particle-mesh (PPPM)
\cite{hock_east} algorithm. The particle-particle phase is computed using the
algorithm we describe in this paper. We discuss this and other applications.
[2]
oai:arXiv.org:comp-gas/9310002 [pdf] - 110258
Particle-Mesh Methods on the Connection Machine
Submitted: 1993-11-02
We describe an efficient Particle-Mesh algorithm for the Connection Machine
CM-5. Our particular method parallelizes well and the computation time per time
step decreases as the particles become more clustered. We achieve
floating-point computation rates of 4--5 MFlops/sec/processing node and total
operations (the sum of floating-point and integer arithmetic plus
communications) of 5--10 MOps/sec/processing node. The rates scale almost
linearly from 32 to 256 processors. Although some of what we discuss is
specific to the CM-5, many aspects (e.g., the computation of the force on a
mesh) are generic to all implementations, and other aspects (e.g., the
algorithm for assignment of the density to the mesh) are useful on any parallel
computer.