[FLASH-BUGS] MPI bug on SGI
Mike Zingale
zingale at flash.uchicago.edu
Thu Apr 17 14:54:47 CDT 2003
Markus, thanks for the bug report. I've added it to our bug tracker, so
we will have a record of it. I don't have any way of reproducing this at
the moment, but perhaps someone else can give it a try.
Mike
On Thu, 17 Apr 2003, Markus Gross wrote:
> Hi!
>
> I belive I have just encountered a bug in at least one FLASH routine.
>
> Our System:
>
> SGI origin IRIX64 6.5 07091542 IP35
> MPI 3.2.0.7 (MPT 1.4) -mpi Version 1266208220
> MIPSpro Compilers: Version 7.4
>
> Problem:
>
> 3-D hydro starting with 5x5x3 blocks, maxref = 3
> we get a:
>
> *** MPI has run out of request entries.
> *** The current allocation level is:
> *** MPI_REQUEST_MAX = 16384
> IOT Trap
> MPI: MPI_COMM_WORLD rank 0 has terminated without calling MPI_Finalize()
> MPI: aborting job
>
> when:
>
> [04-17-2003 18:48.39] <<< refined: tot_blocks = 2723 >>>
> [04-17-2003 18:48.59] *** plot file: gas_pre_hdf_plt_cnt_0009
> [04-17-2003 18:49.08] *** wrote to gas_pre_hdf_plt_cnt_0009
> [04-17-2003 18:49.41] step 91 t= 2.664610E-06 dt= 1.431909E-08
> [04-17-2003 18:50.12] step 92 t= 2.693248E-06 dt= 1.426362E-08
>
> MPI_REQUEST_MAX = 16384 is a hard limit on our machine and can only be
> decreased using the enivronment variable.
>
>
> I tracked that down to at least one non-blocking send to a blocking receive
> where the send reqest was not freed. This happens at least is:
>
> line 241 AMR_redist_blk.F90 FLASH 2.2 & 2.1
>
>
> according to the MPI standard that should be fixed by having a MPI_WAIT
> following the non-blocking send immediately.
>
> I attached a test program to this email which illustrates this point (at
> least on our system). Version as supplied should fails, if you uncomment the
> MPI_wait it should run until forever.
>
> Please let me know what you think about this and if I am right, if there are
> more if these problems in the code (I presume you would know from the top of
> your head). I try to get through it tonight, but Flash is a little bit
> bigger, so help would be appreciated.
>
> Regards,
>
> Markus.
>
>
>
> --
> _______________________________________________________________
>
> Markus Gross AMIMechE BEng (Hons.) Mechanical Engineering
>
> Heriot Watt University Edinburgh
> Department of Mechanical and Chemical Engineering
>
> Associate member of the Institution of Mechanical Engineers
> Member of the SPIE
> _______________________________________________________________
>
> further contact:
>
> Phone : +44 (0) 131 449 5111 etx. 4737
>
> UNiX talk: talk markus at lasersim.mce.hw.ac.uk
>
> _______________________________________________________________
>
> "Plans are a place to begin," Grove said. "They rarely deliver
> you to where you expect. Make your plans knowing you are going
> to throw them away."
>
> _______________________________________________________________
>
>
> -----BEGIN PGP PUBLIC KEY BLOCK-----
> Version: GnuPG v1.0.6 (GNU/Linux)
> Comment: For info see http://www.gnupg.org
>
> mQGiBDt30nARBACUL2XadcLU9YS+3ndnWUUpnrhAD3Ab/TqJP1Obtz7H356+vigu
> 2m3exw2P9fky08SBdTbjOCV5OTmVcRW5F0SNMTIl8gXtGdIcQHCcZxi1GcRJtTBj
> npfRCmZojHr8YgsKDdJ9xupNm+UYWcg1BpFX7xe65kXM4oVwVyuCcUT+VwCg6/Hd
> 6oVoRHm1eq1NdMbCfsc9x78D/2uGQSMJBKewQ/uSPpdEg+bkAQo5Y7MooEeYpLk9
> Iroo4rqxp12k0+bWHWVMdfywI80tivorTIgBTKpCtjLaiKen9E9zECo+2OR4+XRQ
> O6cBKXOm+V2lJOpFYOh500tJHPggshD21jksNPKJr9CM1byeZFw38fFJez+DyShZ
> Gmc6A/9L21jqbVjxvAPwaqe09GEmpT1EYOWtcPYSh2cuo1v6XsGFXfi8tnRSizhH
> xsyIfuOUuj0NKRIcXH60awrvABSa45fvQUefPw0hxSwdCnfc0yEqcD4x/03xTZJS
> 1hUK3T4oTlrXNugaCb59ezciEIh2y/d8pW1DR3RMcgXUSz39tLQhTWFya3VzIEdy
> b3NzIDxtLnMuZ3Jvc3NAaHcuYWMudWs+iFcEExECABcFAjt30nAFCwcKAwQDFQMC
> AxYCAQIXgAAKCRBTnmsr4a/0+e3tAJoDk08MfeqB83FXP6g8QlDyTlotSwCgiKwS
> PPkVAbopX8mahpkrphEaunW5Ag0EO3fSghAIAOeLTMjdFkl/xBvjgD2N0n4DCK4N
> EVfaXmkJK5dbTM3K6ZsJjILZ/TfC6R8oxYPsYKwkwLJJEe4E/h1TFg7ogDYbViW8
> DihOyeN8zKSymLTbbikEJX8aNeCdBRuy87X0pBazDq6//+oTIatS2EsvMyKk0tEZ
> Mh0vTmUJqoeorQeAKH66lPsH2A8QI6u0zv17t4S2aLNBXIbKgOJwsglZ1IwjwowR
> KEGsLsHInNkP4bnlBjd51RMiJT979QMPmNgLGEId9Vi0oijtF/v6J91GtH++G6WE
> HM08DsP/BKjRgO9ivWP7lOHSDCJpGYti5K6g38JAc1H2EoHGLD7eRgI/kR8AAwUH
> /j6NVGLxnFGDHGcBXEb2Y4Wwhf/eUeE8Wit0qqRoNypBuzvWl96GXSPj/saQA/Ri
> XcFWR1G3E4V/zdxYnU/xvvGADj9jKHpZjAs73fJJxLc85gzHLcx00tHcQVA07Ihr
> hao8WY+BtKhkZorYEJEvSDJ9xufjL/o1t40eCsHQ3b9Jh/1lqQ2KT1lhDXAaDo1P
> pE38E2uImC0umIW/UtuKJ1k52nXx2qulfcTjAHd35XidwGreH3aHy3fV+x/+wW1T
> CQBHy27dG7JL3ESzPiKW0w2miWjtd4k5jqJTfgKI4wFUhAShxTjp8M051ArzopRJ
> iX+s7RrHuEXHjh45V1zU5BWIRgQYEQIABgUCO3fSggAKCRBTnmsr4a/0+U/rAJ9o
> lOhPGltVnRncYJPzyIk00H0JDQCfWrlnYDh5cRQmhnHilnrVO5Ykr24=
> =i3eC
> -----END PGP PUBLIC KEY BLOCK-----
>
>
More information about the flash-bugs
mailing list