[FLASH-USERS] MPI deadlock in amr_refine_derefine

Vishal Tiwari vtiwari at umassd.edu
Sun Feb 24 15:27:06 EST 2019


Hello,

I am facing issues with my simulations when running on stampede2, which gets stuck in the refinement part of the code. The code keeps refining until the number of blocks requested is smaller than the number of tasks, but hangs when no. of blocks >  ntasks. Looking at the trace of the code using ddt suggests that there is a MPI deadlock. (see the figure attached).

This issue occurs only on the stampede2 because it was refining fine on stampede1 and works fine on a local cluster on my campus.

Further, I found that people were facing the exact same issue in this thread [1]<http://flash.uchicago.edu/pipermail/flash-users/2017-September/002402.html>, but the thread wasn't concluded with a solution.

I would be grateful for any pointers with regards to this issue.

Thank you!

[1] http://flash.uchicago.edu/pipermail/flash-users/2017-September/002402.html

Regards,
Vishal Tiwari
Graduate Student, Physics
University of Massachusetts, Dartmouth
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://flash.rochester.edu/pipermail/flash-users/attachments/20190224/6cfa93d9/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: stack_trace.png
Type: image/png
Size: 30561 bytes
Desc: stack_trace.png
URL: <http://flash.rochester.edu/pipermail/flash-users/attachments/20190224/6cfa93d9/attachment.png>


More information about the flash-users mailing list