Ferrell, Robert C.

Normalized to: Ferrell, R.

2 articles in total. 1 co-author. Median position in author list: 1.0.

[1]  oai:arXiv.org:astro-ph/9503042  [pdf] - 92459
A Parallel Processing Algorithm for Computing Short-Range Particle Forces with Inhomogeneous Particle Distributions
Comments: 8 pages of uuencoded compressed postscript, figures included, to be published in Proceedings of the 1995 Society for Computer Simulation Multiconference, April 1995.
Submitted: 1995-03-09
We present a computational algorithm for computing short-range forces between particles. The algorithm has two distinguishing features. First, it is optimized for multi-processor computers, and will use as many processors as are available. Second, it is optimized for inhomogeneous, dynamic particle distributions; for any distribution the computational load is distributed evenly to all processors, and the communication time is less than 15% of the total run time. In this talk we present our new algorithm. We developed the program for a grand-challenge problem in cosmology, simulation of the formation of large-scale structure in the universe. This simulation, run on the Thinking Machines Corporation CM-5, uses the particle-particle/particle-mesh (PPPM) \cite{hock_east} algorithm. The particle-particle phase is computed using the algorithm we describe in this paper. We discuss this and other applications.
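The abstract above does not spell out how the short-range forces are organized, so the following is only a minimal serial sketch of one standard approach, a cell list, and is not the authors' parallel algorithm. It bins particles into cells whose side equals the cutoff radius, so that each particle only needs to be compared against its own and adjacent cells. The function names `pairs_within_cutoff` and `brute_force_pairs`, the 2D setting, and the non-periodic box are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import product
import math


def pairs_within_cutoff(positions, cutoff):
    """Return the set of index pairs (i, j), i < j, closer than `cutoff`.

    Hypothetical helper: a serial 2D cell-list search, not the method
    of the paper.
    """
    # Bin particles into square cells whose side equals the cutoff.
    cells = defaultdict(list)
    for idx, (x, y) in enumerate(positions):
        cells[(math.floor(x / cutoff), math.floor(y / cutoff))].append(idx)

    pairs = set()
    for (cx, cy), members in cells.items():
        # Only the cell itself and its 8 neighbours can hold partners.
        for dx, dy in product((-1, 0, 1), repeat=2):
            # .get() avoids creating empty cells while iterating.
            for j in cells.get((cx + dx, cy + dy), ()):
                for i in members:
                    if i < j and math.dist(positions[i], positions[j]) < cutoff:
                        pairs.add((i, j))
    return pairs


def brute_force_pairs(positions, cutoff):
    """Reference O(N^2) search used to check the cell-list result."""
    n = len(positions)
    return {
        (i, j)
        for i in range(n)
        for j in range(i + 1, n)
        if math.dist(positions[i], positions[j]) < cutoff
    }
```

In a clustered distribution a few cells hold most of the particles, which is why a naive one-cell-per-processor split balances poorly; the abstract's claim of even load for any distribution is about exactly that difficulty, which this serial sketch does not address.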
[2]  oai:arXiv.org:comp-gas/9310002  [pdf] - 110258
Particle-Mesh Methods on the Connection Machine
Comments: 22 pages, uuencoded compressed tarred postscript, 224 kB, available by anonymous ftp to arcturus.mit.edu as Preprints/pm.ps.Z
Submitted: 1993-11-02
We describe an efficient Particle-Mesh algorithm for the Connection Machine CM-5. Our particular method parallelizes well and the computation time per time step decreases as the particles become more clustered. We achieve floating-point computation rates of 4–5 MFlops/sec/processing node and total operations (the sum of floating-point and integer arithmetic plus communications) of 5–10 MOps/sec/processing node. The rates scale almost linearly from 32 to 256 processors. Although some of what we discuss is specific to the CM-5, many aspects (e.g., the computation of the force on a mesh) are generic to all implementations, and other aspects (e.g., the algorithm for assignment of the density to the mesh) are useful on any parallel computer.
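The abstract mentions assigning the particle density to the mesh but does not say which interpolation scheme is used. As a rough illustration only, the sketch below assumes cloud-in-cell (linear) weighting on a periodic 1D mesh, a common choice in particle-mesh codes. The function name `cic_density_1d` and its parameters are hypothetical, and nothing here reflects the CM-5 implementation.

```python
import math


def cic_density_1d(positions, masses, n_cells, box_length):
    """Deposit particle masses onto a periodic 1D mesh with cloud-in-cell weights.

    Hypothetical helper: assumes cell centres at (k + 0.5) * dx and
    positions inside [0, box_length).
    """
    dx = box_length / n_cells
    density = [0.0] * n_cells
    for x, m in zip(positions, masses):
        # Position in units of cells, measured from the cell centres.
        s = x / dx - 0.5
        left = math.floor(s)
        frac = s - left  # share of the mass given to the right-hand cell
        # Split the mass linearly between the two nearest cell centres,
        # wrapping around the periodic boundary.
        density[left % n_cells] += m * (1.0 - frac) / dx
        density[(left + 1) % n_cells] += m * frac / dx
    return density
```

A particle sitting exactly on a cell centre deposits all of its mass in that cell, while one halfway between two centres splits it evenly; in both cases the total mass on the mesh equals the total particle mass.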