[FLASH-BUGS] MPI_REQUEST_MAX
Tomasz Plewa
tomek at flash.uchicago.edu
Thu Dec 2 09:31:44 CST 2004
Peter -
This is for the first time I see this kind of error.
I Can you associate this error with any part of the code? Having a
checkpoint file saved prior to suspected crash and restarting the code
would tell us whether such error is really due to accumulation of
requests. Writing some trace messages from master processor around
major modules in flash.F90 evolution loop would tell us where this
error might be occuring.
Tomek
--
On Thu, Dec 02, 2004 at 02:14:47PM +0100, Peter Woitke wrote:
> Dear developers,
>
> for long runs (more than 12 hours on 32 Processors) my flash-job
> crashes with error message
>
> *** MPI has run out of request entries.
> *** The current allocation level is:
> *** MPI_REQUEST_MAX = 16384
>
> Here is some advice from google:
> http://www.cray.com/craydoc/manuals/004-3687-001/html-004-3687-001/zfixe
> dlqtgeczn.htmlD
> 8.5. Why do I keep getting error messages about MPI_REQUEST_MAX being
> too small, no matter how large I set it?
> You are probably calling MPI_Isend(3) or MPI_Irecv(3) and not completing
> or freeing your request objects. You should use MPI_Request_free(3), as
> described in the previous question.
>
> I checked that, indeed, MPI_Isend and MPI_Irecv are called from the
> FLASH-code, but MPI_Request_free is never called.
>
> Have you thought about this?
>
> Thanx,
>
> Peter Woitke
--
Thu, 09:31 CST (15:31 GMT), Dec-02-2004
_______________________________________________________________________________
Tomasz Plewa www: flash.uchicago.edu
Computational Physics and Validation Group email: tomek at uchicago.edu
The ASC FLASH Center, The University of Chicago phone: 773.834.3227
5640 South Ellis Ave, RI 475, Chicago, IL 60637 fax: 773.834.3230
_______________________________________________________________________________
More information about the flash-bugs
mailing list