GROMACS version: 2023.2
GROMACS modification: No
Hi,
I am running some simulations in GROMACS using GPU and CPU. the system has 8 GPUs and I use 1 node of CPU with 128 CPU cores. The simulations run with no problem when using single MPI. But when I try to use multiple MPIs I get the following error. I don’t know if there is any limitation about number of MPIs that can be run with GPU or there is a problem with installation of the software.
[hipSYCL Error] from /tmp/hellsvik/hipsycl/0.9.4/cpeGNU-22.06-rocm-5.3.3-llvm/hipSYCL-0.9.4/src/runtime/hip/hip_queue.cpp:355 @ submit_memset()
: hip_queue: hipMemsetAsync() failed (error code = HIP:1)
============== hipSYCL error report ==============
hipSYCL has caught the following undhandled asynchronous errors:
- from /tmp/hellsvik/hipsycl/0.9.4/cpeGNU-22.06-rocm-5.3.3-llvm/hipSYCL-0.9.4/src/runtime/hip/hip_queue.cpp:355 @ submit_memset(): hip_queue: hipMem
setAsync() failed (error code = HIP:1)
The application will now be terminated.
terminate called without an active exception
srun: error: nid002892: task 39: Aborted (core dumped)
srun: launch/slurm: _step_signal: Terminating StepId=2789083.0
slurmstepd: error: *** STEP 2789083.0 ON nid002892 CANCELLED AT 2023-11-03T01:03:07 ***
srun: error: nid002892: tasks 0-38,40-63: Killed
srun: Force Terminated StepId=2789083.0
Bests,
Maryam Majd