Normalized to: Adsuara, J.
[1]
oai:arXiv.org:1705.00103 [pdf] - 1570378
Speeding up a few orders of magnitude the Jacobi method: high order
Chebyshev-Jacobi over GPUs
Submitted: 2017-04-28
In this technical note we show how to reach a remarkable speed up when
solving elliptic partial differential equations with finite differences thanks
to the joint use of the Chebyshev-Jacobi method with high order discretizations
and its parallel implementation over GPUs.
[2]
oai:arXiv.org:1607.03712 [pdf] - 1531020
On the equivalence between the Scheduled Relaxation Jacobi method and
Richardson's non-stationary method
Submitted: 2016-07-13
The Scheduled Relaxation Jacobi (SRJ) method is an extension of the classical
Jacobi iterative method to solve linear systems of equations ($Au=b$)
associated with elliptic problems. It inherits its robustness and accelerates
its convergence rate computing a set of $P$ relaxation factors that result from
a minimization problem. In a typical SRJ scheme, the former set of factors is
employed in cycles of $M$ consecutive iterations until a prescribed tolerance
is reached. We present the analytic form for the optimal set of relaxation
factors for the case in which all of them are different, and find that the
resulting algorithm is equivalent to a non-stationary generalized Richardson's
method. Our method to estimate the weights has the advantage that the explicit
computation of the maximum and minimum eigenvalues of the matrix $A$ is
replaced by the (much easier) calculation of the maximum and minimum
frequencies derived from a von Neumann analysis. This set of weights is also
optimal for the general problem, resulting in the fastest convergence of all
possible SRJ schemes for a given grid structure. We also show that with the set
of weights computed for the optimal SRJ scheme for a fixed cycle size it is
possible to estimate numerically the optimal value of the parameter $\omega$ in
the Successive Overtaxation (SOR) method in some cases. Finally, we demonstrate
with practical examples that our method also works very well for Poisson-like
problems in which a high-order discretization of the Laplacian operator is
employed. This is of interest since the former discretizations do not yield
consistently ordered $A$ matrices. Furthermore, the optimal SRJ schemes here
deduced, are advantageous over existing SOR implementations for high-order
discretizations of the Laplacian operator in as much as they do not need to
resort to multi-coloring schemes for their parallel implementation. (abridged)
[3]
oai:arXiv.org:1511.04292 [pdf] - 1460796
Scheduled Relaxation Jacobi method: improvements and applications
Submitted: 2015-11-11
Elliptic partial differential equations (ePDEs) appear in a wide variety of
areas of mathematics, physics and engineering. Typically, ePDEs must be solved
numerically, which sets an ever growing demand for efficient and highly
parallel algorithms to tackle their computational solution. The Scheduled
Relaxation Jacobi (SRJ) is a promising class of methods, atypical for combining
simplicity and efficiency, that has been recently introduced for solving linear
Poisson-like ePDEs. The SRJ methodology relies on computing the appropriate
parameters of a multilevel approach with the goal of minimizing the number of
iterations needed to cut down the residuals below specified tolerances. The
efficiency in the reduction of the residual increases with the number of levels
employed in the algorithm. Applying the original methodology to compute the
algorithm parameters with more than 5 levels notably hinders obtaining optimal
SRJ schemes, as the mixed (non-linear) algebraic-differential equations from
which they result become notably stiff. Here we present a new methodology for
obtaining the parameters of SRJ schemes that overcomes the limitations of the
original algorithm and provide parameters for SRJ schemes with up to 15 levels
and resolutions of up to $2^{15}$ points per dimension, allowing for
acceleration factors larger than several hundreds with respect to the Jacobi
method for typical resolutions and, in some high resolution cases, close to
1000. Furthermore, we extend the original algorithm to apply it to certain
systems of non-linear ePDEs.