[FLASH-USERS] How to generate big checkpoint files in FLASH
Zheng Yuan
zhengyuan2014 at u.northwestern.edu
Wed Mar 2 12:47:25 EST 2016
Thanks Klaus! I will use 3D instead of 2D. Someone also suggested that
larger values of lrefine_min & lrefine_max result in larger checkpoint
file sizes. I will use 3D & larger lrefine_min & lrefine_max, while keeping
nxb*nyb*nzb = 8*8*8.
Best
Zheng
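
A rough way to pick lrefine_max for a target file size is to count blocks.
The sketch below assumes a uniformly refined 3D grid (lrefine_min =
lrefine_max), a single root block, about 10 double-precision variables per
cell, and counts leaf blocks only. The function name and these defaults are
illustrative assumptions, not values reported by FLASH; real files also
contain metadata and may store more than the leaf blocks, so treat the
result as a ballpark figure.

```python
def estimate_checkpoint_bytes(lrefine, nxb=8, nyb=8, nzb=8, nvars=10,
                              root_blocks=1, ndim=3, bytes_per_value=8):
    """Ballpark size of the cell data in one checkpoint file.

    Assumes every block is refined to level `lrefine`, so each root block
    is split into (2**ndim)**(lrefine - 1) leaf blocks. `nvars` and
    `bytes_per_value` are assumptions, not values read from FLASH.
    """
    leaf_blocks = root_blocks * (2 ** ndim) ** (lrefine - 1)
    cells_per_block = nxb * nyb * nzb
    return leaf_blocks * cells_per_block * nvars * bytes_per_value


if __name__ == "__main__":
    target = 500e9  # 500 GB target from the original question
    for level in range(1, 11):
        size = estimate_checkpoint_bytes(level)
        marker = "  <-- reaches target" if size >= target else ""
        print(f"lrefine = {level:2d}: {size / 1e9:12.3f} GB{marker}")
```

Under these assumptions, each extra refinement level multiplies the size by
8 in 3D, so the target is reached within a few levels of wherever the run
currently sits.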
On 3/2/16 11:23 AM, Klaus Weide wrote:
> On Tue, 1 Mar 2016, Zheng Yuan wrote:
>
>> Dear all,
>>
>> My goal is to run FLASH using 1K MPI processes and get large checkpoint files
>> (500 GB each). These large checkpoint files will be used as test data for a
>> parallel data compression algorithm proposed by our group. Does anybody know
>> how to set the parameters to run FLASH at large scale and get large checkpoint
>> files?
> Hello,
>
> If you really "just" want large checkpoint files, you should start with a 3D
> setup!
>
>
>> Currently, I am trying to run FLASH/Sedov to generate large checkpoint files.
>> However, FLASH reports an error when I increase the dimension of the grid.
>>
>> I configured FLASH using:
>>
>> ./setup Sedov -auto +pnetcdf -objdir=sedov_2d -parfile=sedov_io_69b_2d.par -2d
>> -nxb=64 -nyb=64 -maxblocks=200
>>
>> To run more iterations, I changed the 'nend' parameter to 10 in
>> sedov_2d/flash.par and make sedov_2d
>>
>> I run FLASH using the command:
>>
>> mpirun -n 4 ./flash4
>>
>>
>> FLASH terminates after the third iteration. The error is:
> *** Wrote plotfile to sedov_2d_6lev_ncmpi_plt_cnt_0001 ****
> n t dt ( x, y, z) | dt_hydro
> 1 2.0000E-10 2.5000E-05 ( 4.993E-01, 4.875E-01, 0.000E+00) | 1.265E-05
> *** Wrote checkpoint file to sedov_2d_6lev_ncmpi_chk_0001 ****
> *** Wrote plotfile to sedov_2d_6lev_ncmpi_plt_cnt_0002 ****
> 2 5.0000E-05 2.5000E-05 ( 5.134E-01, 5.017E-01, 0.000E+00) | 8.673E-06
> *** Wrote checkpoint file to sedov_2d_6lev_ncmpi_chk_0002 ****
> *** Wrote plotfile to sedov_2d_6lev_ncmpi_plt_cnt_0003 ****
> 3 1.0000E-04 2.5000E-05 ( 5.139E-01, 4.988E-01, 0.000E+00) | 6.407E-07
> *** Wrote checkpoint file to sedov_2d_6lev_ncmpi_chk_0003 ****
>
> Nonconvergence in subroutine rieman
> .....
>
>
> By increasing NXB and NYB, you have increased the resolution. This
> requires a smaller timestep for stability, see "dt_hydro" column. But
> your flash.par fixes dtmin and dtmax at 2.5e-5, the direct effect of
> this can be seen in the "dt" column. So you are forcing FLASH to do
> Hydro advances with a time step that is too large for stability.
> So it is not surprising that the simulation fails after a few steps.
>
> Basically, making your runtime parameter "dtmin" much smaller should
> get you a simulation that runs.
>
> Btw, a more common way to increase resolution (thus file sizes) would
> be to increase lrefine_min and/or lrefine_max and/or Nblock{X,Y,Z},
> rather than keep increasing the block size.
>
> Klaus
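
The time-step point above can be checked directly against the log. The
sketch below is only an illustration: the (n, dt, dt_hydro) values are
copied by hand from the quoted output, and the helper name is hypothetical.
It lists the steps where the forced "dt" exceeds the "dt_hydro" value that
the hydro solver reported as its limit.

```python
# (n, dt, dt_hydro) copied from the quoted log lines; not parsed from a file.
LOG_STEPS = [
    (1, 2.5000e-05, 1.265e-05),
    (2, 2.5000e-05, 8.673e-06),
    (3, 2.5000e-05, 6.407e-07),
]


def steps_exceeding_hydro_limit(steps):
    """Return (n, dt / dt_hydro) for every step where dt > dt_hydro."""
    return [(n, dt / dt_hydro) for n, dt, dt_hydro in steps if dt > dt_hydro]


if __name__ == "__main__":
    for n, ratio in steps_exceeding_hydro_limit(LOG_STEPS):
        print(f"step {n}: dt is {ratio:.1f}x larger than dt_hydro")
```

Every step in the quoted log is over the limit, and the gap widens sharply
by step 3, which is consistent with the solver failing shortly afterwards.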