I have been running MD simulations on our HPC cluster. So far we have been able to carry out our work without any problems. However, we had to stop our last couple of simulations due to a warning popping up saying “Your jobs are running below the expected efficiency, there might be some problems with your input files, libraries, parameters or with applications that you use. Finish your queued jobs and send them back to the queue with the appropriate configuration.”
-u: username -p: servername -N:1 -n:28 Eff:3.56%
What could be the cause of the problem here? I can provide the required files in case of need.
Do you happen to know where exactly that warning is coming from? It’s not coming from GROMACS :)
Do you have reason to believe that your MD performance is a lot lower than expected? Can you compare your MD performances to other settings that ran fine?
I’m using a SLURM-based cluster to perform the simulations. I’m not sure, but maybe it’s coming from some problem on the server. When we contacted the specialist, she stated that the problem might be related to the -ntmpi and -ntomp parameters. I haven’t used these parameters before; actually, I’m new to GROMACS and could not figure it out.
Actually, the settings of the MD simulations that worked properly are similar to the problematic ones. I did not change anything.
There are two unusual things here which could potentially lead to problems:
- You set OMP_NUM_THREADS=1, although later you instruct mdrun to use 4 OpenMP threads (-ntomp 4).
- The name gmx_mpi suggests that this executable is linked against a real MPI library, although then it would choke on the -ntmpi 2 command line argument. Maybe it’s just the naming, or is there something on stdout/stderr?
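One way to check which flavor your binary actually is (a sketch, assuming your executable is really called gmx_mpi) is to look at the version header, which reports the MPI library the build was configured with:

```shell
# Print the build information and filter for the MPI line
# (binary name is an assumption - adapt to your installation):
gmx_mpi -version 2>&1 | grep -i "MPI library"
# A thread-MPI build reports "MPI library: thread_mpi",
# a real MPI build reports "MPI library: MPI".
```

A thread-MPI build accepts -ntmpi; a real MPI build gets its rank count from the launcher (mpirun/srun) instead.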
How big is your system in terms of number of atoms?
Actually, I was facing the efficiency problem before adding these parameters (-ntmpi and -ntomp). The problem didn’t automatically stop the simulation, but I had to terminate it manually so it wouldn’t cause any trouble. And I should state that I directly used these figures (2 and 4), which were taken from the “examples for mdrun on one node” section of the manual page entitled “getting good performance from mdrun”.
My command was like this before adding the parameters:
After adding the parameters, I faced the following error and couldn’t run the simulation.
> Fatal error:
> Setting the number of thread-MPI ranks is only supported with thread-MPI and
> GROMACS was compiled without thread-MPI
Since I am a fresh GROMACS user, I can’t say that I am very familiar with the terminology, and I may not have fully answered your question. What values should I use to run the simulation properly and increase the efficiency of my work?
My system is around 9,300 atoms, composed of a protein and a ligand.
A total of 8 cores seems ok for the small system size that you have, although for the sake of performance I would try to use either only MPI or only OpenMP parallelization, to get rid of one source of parallelization overhead.
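For illustration, the single-parallelization alternatives could look like this (a sketch only; binary name and -deffnm file names are assumptions to adapt to your setup):

```shell
# OpenMP only: 1 rank with 8 threads (thread-MPI build)
gmx mdrun -ntmpi 1 -ntomp 8 -deffnm md

# (thread-)MPI only: 8 ranks with 1 thread each
gmx mdrun -ntmpi 8 -ntomp 1 -deffnm md

# With a real MPI build, the launcher sets the rank count instead:
srun -n 8 gmx_mpi mdrun -ntomp 1 -deffnm md
```

Either way, the total (ranks × threads) should match the cores you requested from SLURM.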
In the header of your job script there should be some command to instruct SLURM how many MPI ranks to start - can you share that line?
Ah, now I understand why you get the efficiency warning. With #SBATCH -n 28 you are asking for 28 compute cores, but you are only using 8 of them (2 ranks x 4 threads), leaving the remaining 20 cores unused. You should only ask for 8 cores if that is possible on your cluster.
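If your cluster allows partial-node allocations, a matching job header could look roughly like this (a hypothetical sketch; partition/account lines omitted, adapt to your site):

```shell
#!/bin/bash
#SBATCH -N 1               # one node
#SBATCH -n 2               # 2 MPI ranks
#SBATCH --cpus-per-task=4  # 4 OpenMP threads per rank -> 8 cores total

# Match OpenMP threads to what SLURM allocated per rank
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
srun gmx_mpi mdrun -ntomp $SLURM_CPUS_PER_TASK -deffnm md
```

The key point is that ranks × threads (2 × 4 = 8) equals the cores requested, so the efficiency monitor sees all allocated cores in use.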
I see, but my cluster requires that the number of cores per job be 28 or a multiple of 28 :) Then, if I set these parameters to add up to 28, can I run the simulation without problems? For example, would settings like (2 ranks x 14 threads) or (4 ranks x 7 threads, I’m not sure if odd numbers are valid) or something like these be suitable?
I’m really sorry that I asked so many naive questions but I just need to make my mind more clear on this.
The problem is that such a small simulation will likely run significantly slower with 28 threads than with 8. You could, however, make better use of the compute power of the node by running several similar simulations for better statistics. For example, you could generate 4 input .tpr files with different initial velocities and then run them as a multi-simulation, where each rank works on one simulation with 7 OpenMP threads. More information about this can be found here: https://manual.gromacs.org/current/user-guide/mdrun-features.html#running-multi-simulations
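A rough sketch of such a multi-simulation setup filling the 28-core node (directory and file names are placeholders; each .tpr should be generated with a different gen-seed, and gen-vel = yes, in its .mdp so the runs get different initial velocities):

```shell
# One directory per replica, each holding its own topol.tpr
mkdir sim1 sim2 sim3 sim4
# ... run gmx grompp once per directory with a different gen-seed ...

# 4 ranks x 7 OpenMP threads = 28 cores; each rank runs one replica
mpirun -np 4 gmx_mpi mdrun -multidir sim1 sim2 sim3 sim4 -ntomp 7
```

This way the whole node is busy, but each individual simulation keeps the modest thread count that suits a 9,300-atom system.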