MSD curve has a large non-linear part

GROMACS version: 2022.4
GROMACS modification: No

Dear gromacs users,

In a micelle-water system, I hope to calculate the MSD and diffusion coefficient of surfactants. The overall surfactant number is ~120 and I simulated for 50ns,
I used command gmx msd -f md.xtc -s md.tpr -sel 'resname AE3 SDS BTN' -o surf.xvg -trestart 2 to calulate the msd curve and D, but the end of curve shows an abnormal drop, which makes the curve have a bad linear correlation.

I am awared that the fit for D only use the 10~90% of time range data for fit by defalut, but the the non-linear part seems to exceed 10% of simulation duration.

I have also referred previous posts like https://gromacs.bioexcel.eu/t/curve-in-water-msd/10999/9 and I thought things are a bit different here as I run the simulation for a longer time for better statistic average.

So here should I just use the linear part for fit, or is there any approaches to get a linear line for that? Any suggestions on this issue will be welcomed. Thank you.

You should use far less than 90%. I would never use more that up to 50%. In this case 5-25 ns seems reasonable.

Oh I see, thanks for your point Hess!

Hi Hess, I have another question about MSD and could you please offer me some suggestions?

In another simulation, an inflection point was observed in the MSD curve (see fig below) as the micelle initially placed at box center moved across the box boundary. The slope is not identical at different sides. In this case, which part should I choose for D calculation and is there any command that could eliminate this effect? Thank you.

If you use the standard per atom MSD, which is the case in your command line in your first post, then PBC is correctly handled. I don’t know what can cause the inflection point. Statistics is extremely low, as are atoms move in a correlated fashion, so it could just be “noise”.

I got it, much appreciate for your help, thanks hess!