[DFTB-Plus-User] Possible bug in DFTB+ with MPI
Ben Hourahine
benjamin.hourahine at strath.ac.uk
Sat Jun 22 15:54:40 CEST 2019
Hello David,
Thoughts so far:
1) Which BLAS are you using? OpenBLAS sometimes starts extra threads of
its own. Is your top output shown with thread reporting enabled (H)?
2) Is the OMP_NUM_THREADS shell variable definitely being passed to each
of the instances?
3) If this is related to the i-PI mailing list question, is there any
reason not to run 8 serial instances with 1 thread each? The MPI
parallelism overhead on just one process is probably not worth it.
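
As a rough illustration of points 2) and 3), here is a minimal sketch of
starting several serial instances, each in its own folder, with the
thread count set explicitly in the environment handed to every process.
This is not part of DFTB+ or i-PI: the helper names (single_thread_env,
launch_serial, wait_all) and the output.log file name are hypothetical,
and setting OPENBLAS_NUM_THREADS assumes that OpenBLAS is the BLAS in use.

```python
import os
import subprocess
from pathlib import Path


def single_thread_env(base=None):
    """Return a copy of the environment with threading limited to one thread."""
    env = dict(os.environ if base is None else base)
    env["OMP_NUM_THREADS"] = "1"
    # Assumption: OpenBLAS is the BLAS library; it reads this variable.
    env["OPENBLAS_NUM_THREADS"] = "1"
    return env


def launch_serial(command, workdirs, env=None):
    """Start one process per working directory without waiting for it.

    Returns a list of (process, log file) pairs. The environment is passed
    explicitly, so each instance is guaranteed to see OMP_NUM_THREADS.
    """
    env = single_thread_env() if env is None else env
    started = []
    for workdir in workdirs:
        workdir = Path(workdir)
        workdir.mkdir(parents=True, exist_ok=True)
        log = open(workdir / "output.log", "w")
        proc = subprocess.Popen(
            command, cwd=workdir, env=env,
            stdout=log, stderr=subprocess.STDOUT,
        )
        started.append((proc, log))
    return started


def wait_all(started):
    """Wait for every process, close its log, and return the exit codes."""
    codes = []
    for proc, log in started:
        codes.append(proc.wait())
        log.close()
    return codes
```

With eight folders prepared (say run_0 to run_7, each holding its own
input), something like wait_all(launch_serial(["dftb+"], folders)) would
run eight independent single-threaded jobs. Checking top with H enabled
while they run should then show whether any instance still has more than
one busy thread.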
Regards
Ben
On 22/06/2019 14:41, David Furman wrote:
> Hi all,
> I have a question concerning the running of DFTB+ with MPI. There
> seems to be a problem when running more than one DFTB+ instance (with
> MPI parallelization) on the same system.
>
> When I run 4 instances of dftb+ (in separate folders) with 2 cores each
> (i.e. 8 cores in total), each started with
> mpirun.openmpi -np 2 dftb+
> the %CPU usage of every process drops to 50%.
>
> This is the 'top' output:
> ========================================================
> 35426 user 20 0 423384 40404 18248 R 50.5 0.0 0:37.69 dftb+
> 35434 user 20 0 423080 39720 18248 R 50.2 0.0 0:05.70 dftb+
> 35396 user 20 0 422044 38660 17644 R 49.8 0.0 1:21.35 dftb+
> 35402 user 20 0 423384 40460 18304 R 49.8 0.0 1:09.54 dftb+
> 35403 user 20 0 422044 38872 17848 R 49.8 0.0 1:16.15 dftb+
> 35427 user 20 0 422044 39048 18032 R 49.8 0.0 0:32.93 dftb+
> 35435 user 20 0 421732 38164 17716 R 49.8 0.0 0:09.07 dftb+
> 35395 user 20 0 423384 40360 18204 R 49.5 0.0 1:13.18 dftb+
> =========================================================
>
> Whereas, when I run one instance with 8 cores, the efficiency is 100%
> as expected:
>
> =========================================================
> 35837 user 20 0 441868 35196 19112 R 100.3 0.0 0:04.90 dftb+
> 35828 user 20 0 442580 35676 19120 R 100.0 0.0 0:04.83 dftb+
> 35829 user 20 0 442544 36024 19432 R 100.0 0.0 0:04.91 dftb+
> 35830 user 20 0 441912 35356 19360 R 100.0 0.0 0:04.91 dftb+
> 35831 user 20 0 441688 35048 19304 R 100.0 0.0 0:04.90 dftb+
> 35833 user 20 0 441904 35656 19500 R 100.0 0.0 0:04.91 dftb+
> 35841 user 20 0 441516 34824 19076 R 100.0 0.0 0:04.89 dftb+
> 35843 user 20 0 441224 34456 18868 R 99.7 0.0 0:04.89 dftb+
> =========================================================
>
> I run both cases with OMP_NUM_THREADS=1,
> using open-mpi 1.10.2 and gcc 5.4.0.
>
> Could anyone give a hint about what is wrong?
>
>
> _______________________________________________
> DFTB-Plus-User mailing list
> DFTB-Plus-User at mailman.zfn.uni-bremen.de
> https://mailman.zfn.uni-bremen.de/cgi-bin/mailman/listinfo/dftb-plus-user
--
Dr. B. Hourahine, Senior lecturer
SUPA, Department of Physics,
University of Strathclyde,
John Anderson Building,
107 Rottenrow, Glasgow G4 0NG, UK.
+44 141 548 2325, benjamin.hourahine at strath.ac.uk
The Department is a partner in SUPA, the Scottish
Universities Physics Alliance
The University of Strathclyde is a charitable body,
registered in Scotland, number SC015263