Fourestey, Gilles
Normalized to: Fourestey, G.
2 article(s) in total. 3 co-authors, from 1 to 2 common article(s). Median position in authors list is 2.5.
[1]
oai:arXiv.org:2004.06352 [pdf] - 2077500
Lenstool-HPC: A High Performance Computing based mass modelling tool for
cluster-scale gravitational lenses
Submitted: 2020-04-14
With the upcoming generation of telescopes, cluster scale strong
gravitational lenses will act as an increasingly relevant probe of cosmology
and dark matter. The better resolved data produced by current and future
facilities requires faster and more efficient lens modeling software.
Consequently, we present Lenstool-HPC, a strong gravitational lens modeling
and map generation tool based on High Performance Computing (HPC) techniques
and the renowned Lenstool software. We also showcase the HPC concepts needed
for astronomers to increase computation speed through massively parallel
execution on supercomputers.
Lenstool-HPC was developed using lens modelling algorithms with high amounts
of parallelism. Each algorithm was implemented as a highly optimised CPU, GPU
and Hybrid CPU-GPU version. The software was deployed and tested on the Piz
Daint cluster of the Swiss National Supercomputing Centre (CSCS).
Lenstool-HPC's perfectly parallel lens map generation and derivative
computation achieve a factor 30 speed-up using only 1 GPU compared to
Lenstool. Lenstool-HPC's hybrid lens-model fit generation, tested at Hubble
Space Telescope precision, is scalable up to 200 CPU-GPU nodes and is faster
than Lenstool using only 4 CPU-GPU nodes.
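
The "perfectly parallel" map generation mentioned in the abstract above can
be illustrated with a minimal sketch. It is not Lenstool-HPC code: the
singular isothermal sphere model, the function names deflection_rows and
deflection_map, and the use of Python threads in place of GPU threads or
cluster nodes are all assumptions made for illustration. The point it shows
is that each block of pixels is computed without reference to any other
block, so the blocks can be processed in any order and simply stacked.

```python
from concurrent.futures import ThreadPoolExecutor

import numpy as np


def deflection_rows(y_rows, x_axis, theta_e=1.0):
    """Deflection-angle map for a block of image rows.

    Assumed toy model: singular isothermal sphere, alpha = theta_e * theta / |theta|.
    The grid must not contain the origin (division by zero there).
    """
    x, y = np.meshgrid(x_axis, y_rows)
    r = np.hypot(x, y)
    return theta_e * x / r, theta_e * y / r


def deflection_map(x_axis, y_axis, n_blocks=4, theta_e=1.0):
    """Split the rows into independent blocks, compute each, and stack them."""
    blocks = np.array_split(y_axis, n_blocks)
    with ThreadPoolExecutor() as pool:
        # No block depends on another, so the work is perfectly parallel.
        parts = list(pool.map(lambda rows: deflection_rows(rows, x_axis, theta_e), blocks))
    alpha_x = np.vstack([p[0] for p in parts])
    alpha_y = np.vstack([p[1] for p in parts])
    return alpha_x, alpha_y
```

For example, deflection_map(axis, axis) with axis = np.linspace(-2, 2, 64)
returns two 64 x 64 arrays that match a single-pass call to deflection_rows
on the full grid.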
[2]
oai:arXiv.org:1902.03252 [pdf] - 1829724
High Performance Computing for gravitational lens modeling: single vs
double precision on GPUs and CPUs
Submitted: 2019-02-08
Strong gravitational lensing is a powerful probe of cosmology and the dark
matter distribution. Efficient lensing software is already a necessity to fully
use its potential and the performance demands will only increase with the
upcoming generation of telescopes. In this paper, we study the possible impact
of High Performance Computing techniques on a performance-critical part of the
widely used lens modeling software LENSTOOL. We implement the algorithm once as
a highly optimized CPU version and once with graphics card acceleration for a
simple parametric lens model. In addition, we study the impact of finite
machine precision on the lensing algorithm. While double precision is the
default choice for scientific applications, we find that single precision can
be sufficiently accurate for our purposes and lead to a large speedup.
Therefore, we develop and present a mixed precision algorithm which only uses
double precision when necessary. We measure the performance of the different
implementations and find that the use of High Performance Computing techniques
dramatically improves the code performance both on CPUs and GPUs. Compared to
the current LENSTOOL implementation on 12 CPU cores, we obtain speedup factors
of up to 170. We achieve this optimal performance by using our mixed precision
algorithm on a high-end GPU which is common in modern supercomputers. We also
show that these techniques reduce the energy consumption by up to 98%.
Furthermore, we demonstrate that a highly competitive speedup can be reached
with consumer GPUs. While they are an order of magnitude cheaper than the
high-end graphics cards, they are rarely used for scientific computations due
to their low double precision performance. Our mixed precision algorithm
unlocks their full potential. The consumer GPU delivers a speedup which is only
a factor of four lower than the best speedup achieved by a high-end GPU.
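
The idea of a mixed precision algorithm "which only uses double precision
when necessary" can be sketched as follows. This is an illustration under
stated assumptions, not the algorithm from the paper: the singular isothermal
sphere model, the function names, and in particular the flagging criterion
(recompute where the source position is much smaller than the image position,
so that the subtraction in the lens equation loses significant digits) are
all hypothetical choices made here.

```python
import numpy as np


def sis_source_plane(x, y, theta_e, dtype):
    """Lens equation beta = theta - alpha(theta), evaluated in the given precision.

    Assumed toy model: singular isothermal sphere. The origin must be excluded.
    """
    x = np.asarray(x, dtype=dtype)
    y = np.asarray(y, dtype=dtype)
    te = dtype(theta_e)
    r = np.hypot(x, y)
    return x - te * x / r, y - te * y / r


def mixed_precision_source_plane(x, y, theta_e, rel_tol=1e-3):
    """Single precision everywhere, double precision only for flagged points."""
    bx, by = sis_source_plane(x, y, theta_e, np.float32)
    bx = bx.astype(np.float64)
    by = by.astype(np.float64)
    # Hypothetical criterion: |beta| << |theta| signals cancellation,
    # where the float32 result has lost most of its significant digits.
    flagged = np.hypot(bx, by) < rel_tol * np.hypot(x, y)
    if flagged.any():
        bx[flagged], by[flagged] = sis_source_plane(
            x[flagged], y[flagged], theta_e, np.float64
        )
    return bx, by, flagged
```

Here x and y are expected to be float64 NumPy arrays. Only the flagged subset
pays the cost of double precision; every other point keeps its single
precision result.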