A series of performance benchmarks for MD Apps, including GROMACS

A few points related to what you seem to be assuming:

  • (synthetic) graphics benchmarks do not necessarily reflect MD performance;
  • MD codes use different algorithms and implementations and are likely tuned to a different extent on different GPU architectures; an alternative explanation could be that some applications are less efficient on NVIDIA than on AMD; e.g. GROMACS is well tuned on NVIDIA and has been optimized on CDNA but not much effort has gone into RDNA optimizations;
  • native HIP use equals better performance is not a given, algorithms, optimizations, “luck” with compilers often matter more; at least for our algorithms and implementation currently there is at most 10-15% performance left on the table vs plain HIP (on single GPU for large system sized) and that is on CDNA2 GPUs, I’m not sure about RDNA but I guess that number is smaller.

Thanks for the pointer. The pictures don’t seem to load for me (or are very slow to load), will try again later. Do you have a PDF version of your post – that may be easier to load?

I would recommend using “application performance” or just “performance” instead of “efficiency” since the latter implies that you are using a relative metric (e.g. performance relative to the best achievable would be an “efficiency”).

Cheers,
Szilárd