VASP 6.5.0 Data not present on GPU in rc_add
Posted: Thu Jan 16, 2025 10:04 am
by thibautvery
Hello,
I compiled VASP with the following environment: nvidia-compilers/24.3, cuda/12.2.0, openmpi/4.1.5-cuda.
The options I included are:
- OpenACC
- OpenMP
- MKL
- python plugins
When I run a test on a V100, I get a message that the variable c in rc_add (std/mgrid.f90, line 1773) is not present on the GPU.
I attached the makefile.include and the file to reproduce the error.
Best regards
Re: VASP 6.5.0 Data not present on GPU in rc_add
Posted: Thu Jan 16, 2025 1:27 pm
by ahampel
Hi,
thank you for posting your question in the official VASP forum.
I tried to reproduce your problem without success. I used the released VASP 6.5.0 version and compiled it with the NVIDIA compilers we currently have available: nvhpc/24.1, intel-oneapi-mkl/2023.2.0, and openmpi/4.1.6-cuda. I do not have a V100 card available, but I ran the job on two A30 cards with MPI and OpenMP threading without any problem:
running 2 mpi-ranks, with 8 threads/rank, on 1 nodes
distrk: each k-point on 2 cores, 1 groups
distr: one band on 1 cores, 2 groups
Offloading initialized ... 2 GPUs detected
vasp.6.5.0 16Dec24 (build Jan 16 2025 14:04:17) complex
POSCAR found type information on POSCAR Ti
POSCAR found : 2 types and 333 ions
scaLAPACK will be used selectively (only on CPU)
WARNING: type information on POSCAR and POTCAR are incompatible
POTCAR overwrites the type information in POSCAR
typ 2 type information: O
LDA part: xc-table for (Slater+PW92), standard interpolation
POSCAR, INCAR and KPOINTS ok, starting setup
FFT: planning ... GRIDC
FFT: planning ... GRID_SOFT
FFT: planning ... GRID
WAVECAR not read
initial charge from wavefunction
entering main loop
N E dE d eps ncg rms rms(c)
DAV: 1 -0.150681060769E+07 -0.15068E+07 -0.26885E+07 3326 0.836E+03
DAV: 2 -0.182111908472E+07 -0.31431E+06 -0.27999E+06 3880 0.514E+03
DAV: 3 -0.196552237558E+07 -0.14440E+06 -0.13482E+06 4008 0.408E+03
DAV: 4 -0.203824784578E+07 -0.72725E+05 -0.68104E+05 4264 0.252E+03
DAV: 5 -0.208701008822E+07 -0.48762E+05 -0.44843E+05 4200 0.191E+03 0.292E+03
RMM: 6 -0.140572163328E+07 0.68129E+06 -0.32376E+04 2178 0.913E+02 0.159E+03
RMM: 7 -0.523878390546E+06 0.88184E+06 0.53564E+04 3910 0.106E+03 0.148E+03
...
without making any adjustments to your input files. Note that your POTCAR contains more elements than the POSCAR, but this does not matter here.
The only changes I made were to remove the new ML interface (which, by the way, is still very experimental, so use it with great care) and the Python plugins, and to change the compute capability to cc80 for the A30 cards. Could you try removing the MLFF and Python plugin options from your build to see whether that makes a difference? I attached my makefile.include in case you spot a difference that I do not see. You can ignore the gcc_toolchain settings.
Best regards,
Alex
Re: VASP 6.5.0 Data not present on GPU in rc_add
Posted: Thu Jan 16, 2025 1:52 pm
by ahampel
The problem is related to the Python plugins; this is indeed a bug in the current code. For now, please remove the -DPLUGINS option when building the OpenACC NVIDIA port. I will create a known-bug report on the wiki, and we will fix this in an upcoming bugfix release.
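For anyone who prefers to script the workaround, here is a minimal sketch of a helper that strips a preprocessor flag such as -DPLUGINS from the text of a makefile.include. The function name drop_cpp_flag is hypothetical and not part of VASP. It assumes the flag appears as a whitespace-separated token on an uncommented line, and it leaves commented lines alone. Any plugin-related linker or compiler settings in the same file are not touched, so check the result by hand before rebuilding.

```python
import re


def drop_cpp_flag(makefile_text: str, flag: str = "-DPLUGINS") -> str:
    """Return makefile text with `flag` removed from every uncommented line.

    Hypothetical helper for illustration only; it is not shipped with VASP.
    """
    # Match the flag as a whole token, together with the whitespace before it,
    # so that e.g. -DPLUGINS_EXTRA is not affected.
    pattern = re.compile(r"[ \t]+" + re.escape(flag) + r"(?=\s|$)")
    cleaned_lines = []
    for line in makefile_text.splitlines():
        if line.lstrip().startswith("#"):
            # Commented-out settings are left exactly as they are.
            cleaned_lines.append(line)
        else:
            cleaned_lines.append(pattern.sub("", line))
    result = "\n".join(cleaned_lines)
    # Preserve a trailing newline if the input had one.
    if makefile_text.endswith("\n"):
        result += "\n"
    return result
```

A typical use would be to read makefile.include, pass its contents through drop_cpp_flag, write the result back, and then do a clean rebuild so that no objects compiled with the flag remain.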
Best regards,
Alex
Re: VASP 6.5.0 Data not present on GPU in rc_add
Posted: Fri Jan 17, 2025 9:04 am
by thibautvery
I confirm that disabling the plugins solves the problem.
Thanks a lot.
Re: VASP 6.5.0 Data not present on GPU in rc_add
Posted: Fri Jan 17, 2025 10:16 am
by ahampel
Perfect. I will leave this topic open until we have a patch in place. I will confirm here once the fix is ready, so that you can use the plugins together with the OpenACC NVIDIA port.
Best,
Alex