sort results by

Use logical operators AND, OR, NOT and round brackets to construct complex queries. Whitespace-separated words are treated as ANDed.

Show articles per page in mode

Mohammed, Amjed

Normalized to: Mohammed, A.

3 article(s) in total. 8 co-authors. Median position in authors list is 3,0.

[1]  oai:arXiv.org:1911.06714  [pdf] - 2000312
Two-level Dynamic Load Balancing for High Performance Scientific Applications
Comments:
Submitted: 2019-11-15
Scientific applications are often complex, irregular, and computationally-intensive. To accommodate the ever-increasing computational demands of scientific applications, high-performance computing (HPC) systems have become larger and more complex, offering parallelism at multiple levels (e.g., nodes, cores per node, threads per core). Scientific applications need to exploit all the available multilevel hardware parallelism to harness the available computational power. The performance of applications executing on such HPC systems may adversely be affected by load imbalance at multiple levels, caused by problem, algorithmic, and systemic characteristics. Nevertheless, most existing load balancing methods do not simultaneously address load imbalance at multiple levels. This work investigates the impact of load imbalance on the performance of three scientific applications at the thread and process levels. We jointly apply and evaluate selected dynamic loop self-scheduling (DLS) techniques to both levels. Specifically, we employ the extended LaPeSD OpenMP runtime library at the thread level and extend the DLS4LB MPI-based dynamic load balancing library at the process level. This approach is generic and applicable to any multiprocess-multithreaded computationally-intensive application (programmed using MPI and OpenMP). We conduct an exhaustive set of experiments to assess and compare six DLS techniques at the thread level and eleven at the process level. The results show that improved application performance, by up to 21%, can only be achieved by jointly addressing load imbalance at the two levels. We offer insights into the performance of the selected DLS techniques and discuss the interplay of load balancing at the thread level and process level.
[2]  oai:arXiv.org:1208.6014  [pdf] - 1150980
MESAFace, a graphical interface to analyze the MESA output
Comments: 16 pages, 4 figures, submitted to Computer Physics Communications
Submitted: 2012-08-29
MESA (Modules for Experiments in Stellar Astrophysics) has become very popular among astrophysicists as a powerful and reliable code to simulate stellar evolution. Analyzing the output data thoroughly may, however, present some challenges and be rather time-consuming. Here we describe MESAFace, a graphical and dynamical interface which provides an intuitive, efficient and quick way to analyze the MESA output.
[3]  oai:arXiv.org:astro-ph/0306521  [pdf] - 57602
Non-Fickian diffusion and tau approximation from numerical turbulence
Comments: 10 pages, 7 figs, Physics of Fluids (final version)
Submitted: 2003-06-25, last modified: 2004-03-09
Evidence for non-Fickian diffusion of a passive scalar is presented using direct simulations of homogeneous isotropic turbulence. The results compare favorably with an explicitly time-dependent closure model based on the tau approximation. In the numerical experiments three different cases are considered: (i) zero mean concentration with finite initial concentration flux, (ii) an initial top hat profile for the concentration, and (iii) an imposed background concentration gradient. All cases agree in the resulting relaxation time in the tau approximation relating the triple correlation to the concentration flux. The first order smoothing approximation is shown to be inapplicable.