Writing checkpoints only and not running

GROMACS version:2020.2
GROMACS modification: No
I am running simulations with 25000 atoms. It was running okay. But now it keeps giving the following error

Writing checkpoint, step 28700 at Sun Jan 9 14:39:12 2022
Writing checkpoint, step 28900 at Sun Jan 9 14:51:27 2022
Writing checkpoint, step 29200 at Sun Jan 9 15:09:45 2022
Writing checkpoint, step 29400 at Sun Jan 9 15:22:00 2022
Writing checkpoint, step 29600 at Sun Jan 9 15:34:22 2022
Writing checkpoint, step 29900 at Sun Jan 9 15:52:49 2022
Writing checkpoint, step 30100 at Sun Jan 9 16:05:05 2022
Writing checkpoint, step 30400 at Sun Jan 9 16:23:21 2022
Writing checkpoint, step 30600 at Sun Jan 9 16:35:34 2022
Writing checkpoint, step 30900 at Sun Jan 9 16:53:49 2022
Writing checkpoint, step 31100 at Sun Jan 9 17:06:07 2022

Received the TERM signal, stopping within 100 steps

It shows as running. This happens either at the beginning or after some time. Why does this happen and what can I do to avoid this?
If I keep submitting a few times it runs normally (cancel the job and run again several times). So I’m not sure what the error is. Is it memory? I run several jobs in parallel

Thank you
Best
Chathu

Sounds like a hardware problem, like other processes are fighting with yours. The performance looks quite bad. You’ve got a small system yet you’re only doing 200-300 integration steps every 12-16 minutes. That’s very slow.

1 Like