[FLASH-USERS] Big Problem??

Seyit Hocuk seyit at astro.rug.nl
Wed Jul 30 05:35:49 EDT 2008


BTW... The results also differ when I am using multiple cpu's instead of 1.

mpiexec -np 4 ./flash2 (on pc 3 instead of running just ./flash2)
Note: k,kJ,c0 etc. are the same


 flash:  initializing for jeans problem.
 
 kx   = 7.8539816E-20 ky   = 1.5707963E-19 kz   = 0.0000000E+00
 
 k       =   1.756203682760181E-019
 kJ      =   6.508776532382562E-019
 c0      =    181807.294175918    
 omega^2 =  -1.298354260861123E-026
 velamp  =    64881.6252564676    
 
 perturbation is unstable with growth time    8776137041619.21    
 min_blocks          1 max_blocks          2 tot_blocks          5
 min_blocks          5 max_blocks          6 tot_blocks         21
 min_blocks         21 max_blocks         22 tot_blocks         85
 min_blocks         85 max_blocks         86 tot_blocks        341
  INITIAL TIMESTEP =    100000.000000000    
 [CHECKPOINT_WR] NOTE: will send          520  blocks per message.
 [CHECKPOINT_WR] Writing checkpoint file jeans_r13vB_hdf5_chk_0000
 Progress:  |...
 *** Wrote output to jeans_r13vB_hdf5_chk_0000 (         341 blocks ) ***
 number of variables found for plotfile storage =            3
 dens pres temp
 *** Wrote output to jeans_r13vB_hdf5_plt_cnt_0000 ***
       n          t         dt |  dt_hydro
       1 2.0000E+05 2.0000E+05 |  4.197E+11
       2 6.0000E+05 4.0000E+05 |  4.197E+11
       3 1.4000E+06 8.0000E+05 |  4.197E+11
       4 3.0000E+06 1.6000E+06 |  4.197E+11
       5 6.2000E+06 3.2000E+06 |  4.197E+11
       6 1.2600E+07 6.4000E+06 |  4.197E+11
       7 2.5400E+07 1.2800E+07 |  4.197E+11
       8 5.1000E+07 2.5600E+07 |  4.197E+11
       9 1.0220E+08 5.1200E+07 |  4.197E+11
      10 2.0460E+08 1.0240E+08 |  4.198E+11
      11 4.0940E+08 2.0480E+08 |  4.198E+11
      12 8.1900E+08 4.0960E+08 |  4.198E+11
      13 1.6382E+09 8.1920E+08 |  4.199E+11
      14 3.2766E+09 1.6384E+09 |  4.201E+11
      15 6.5534E+09 3.0000E+09 |  4.205E+11
      16 1.2553E+10 3.0000E+09 |  4.213E+11
      17 1.8553E+10 3.0000E+09 |  4.221E+11
      18 2.4553E+10 3.0000E+09 |  4.229E+11
      19 3.0553E+10 3.0000E+09 |  4.237E+11
      20 3.6553E+10 3.0000E+09 |  4.246E+11
      21 4.2553E+10 3.0000E+09 |  4.254E+11
      22 4.8553E+10 3.0000E+09 |  4.263E+11
      23 5.4553E+10 3.0000E+09 |  4.272E+11
      24 6.0553E+10 3.0000E+09 |  4.281E+11
      25 6.6553E+10 3.0000E+09 |  4.290E+11
      26 7.2553E+10 3.0000E+09 |  4.300E+11
      27 7.8553E+10 3.0000E+09 |  4.309E+11
      28 8.4553E+10 3.0000E+09 |  4.319E+11
      29 9.0553E+10 3.0000E+09 |  4.328E+11
      30 9.6553E+10 3.0000E+09 |  4.338E+11
      31 1.0255E+11 3.0000E+09 |  4.348E+11
      32 1.0855E+11 3.0000E+09 |  4.358E+11
      33 1.1455E+11 3.0000E+09 |  4.369E+11
      34 1.2055E+11 3.0000E+09 |  4.380E+11
      35 1.2655E+11 3.0000E+09 |  4.393E+11
      36 1.3255E+11 3.0000E+09 |  4.405E+11
      37 1.3855E+11 3.0000E+09 |  4.416E+11
      38 1.4455E+11 3.0000E+09 |  4.429E+11
      39 1.5055E+11 3.0000E+09 |  4.441E+11
      40 1.5655E+11 3.0000E+09 |  4.454E+11
      41 1.6255E+11 3.0000E+09 |  4.465E+11
      42 1.6855E+11 3.0000E+09 |  4.478E+11
      43 1.7455E+11 3.0000E+09 |  4.491E+11
      44 1.8055E+11 3.0000E+09 |  4.503E+11
      45 1.8655E+11 3.0000E+09 |  4.516E+11
      46 1.9255E+11 3.0000E+09 |  4.529E+11
      47 1.9855E+11 3.0000E+09 |  4.541E+11
      48 2.0455E+11 3.0000E+09 |  4.554E+11
      49 2.1055E+11 3.0000E+09 |  4.567E+11
      50 2.1655E+11 3.0000E+09 |  4.576E+11




Seyit Hocuk wrote:
> Hi Tomek good morning,
>
> Let me answer your first question.
>
> * /Any luck with a single block/single level runs? /
>
> Nope. Even single level (lref_min = lref_max) runs differ in dt_hydro. 
> That means I don't use refinement at all. I guess it is also affecting 
> the end results. I have to do a complete simulation again to be sure. 
> But the initial files show small differences already.
>
> Here the initial results.
> Note 1: the differences in precision in k, kJ, c0 etc...
> Note 2: pc 1 and 3 are not that much differen, but grow more different 
> later on.
>
>
> *pc 1*
>
> flash:  initializing for jeans problem.
>
> kx   = 7.8539816E-20 ky   = 1.5707963E-19 kz   = 0.0000000E+00
>
> k       =   1.756203682760181E-019
> kJ      =   6.508776532382562E-019
> c0      =    181807.294175918    omega^2 =  -1.298354260861123E-026
> velamp  =    64881.6252564676   
> perturbation is unstable with growth time    8776137041619.21    
> min_blocks          5 max_blocks          5 tot_blocks          5
> min_blocks         21 max_blocks         21 tot_blocks         21
> min_blocks         85 max_blocks         85 tot_blocks         85
> min_blocks        341 max_blocks        341 tot_blocks        341
>  INITIAL TIMESTEP =    100000.000000000    [CHECKPOINT_WR] NOTE: will 
> send          520  blocks per message.
> Warning! The HDF5 header files included by this application do not 
> match the
> version used by the HDF5 library to which this application is linked. 
> Data
> corruption or segmentation faults may occur if the application is
> allowed to continue.  You can, at your own risk, disable this check by 
> setting
> the environment variable 'HDF5_DISABLE_VERSION_CHECK' to a value of '1'.
> Setting it to 2 will suppress the warning totally.
> Headers are 1.6.6, library is 1.6.7
> Bye...
> forrtl: info: Fortran error message number is 76.
> forrtl: warning: Could not open message catalog: ifcore_msg.cat.
> forrtl: info: Check environment variable NLSPATH and protection of 
> /usr/lib/ifcore_msg.cat.
> Abort
> [1]  + Done                          emacs flash.par
> [seyit at celsius objectT3]$ setenv HDF5_DISABLE_VERSION_CHECK 1
> [seyit at celsius objectT3]$ ./flash2
>
> flash:  initializing for jeans problem.
>
> kx   = 7.8539816E-20 ky   = 1.5707963E-19 kz   = 0.0000000E+00
>
> k       =   1.756203682760181E-019
> kJ      =   6.508776532382562E-019
> c0      =    181807.294175918    omega^2 =  -1.298354260861123E-026
> velamp  =    64881.6252564676   
> perturbation is unstable with growth time    8776137041619.21    
> min_blocks          5 max_blocks          5 tot_blocks          5
> min_blocks         21 max_blocks         21 tot_blocks         21
> min_blocks         85 max_blocks         85 tot_blocks         85
> min_blocks        341 max_blocks        341 tot_blocks        341
>  INITIAL TIMESTEP =    100000.000000000    [CHECKPOINT_WR] NOTE: will 
> send          520  blocks per message.
> [CHECKPOINT_WR] Writing checkpoint file jeans_r13vA_hdf5_chk_0000
> Progress:  |
> *** Wrote output to jeans_r13vA_hdf5_chk_0000 (         341 blocks ) ***
> number of variables found for plotfile storage =            3
> dens pres temp
> *** Wrote output to jeans_r13vA_hdf5_plt_cnt_0000 ***
>       n          t         dt |  dt_hydro
>       1 2.0000E+05 2.0000E+05 |  4.240E+11
>       2 6.0000E+05 4.0000E+05 |  4.240E+11
>       3 1.4000E+06 8.0000E+05 |  4.240E+11
>       4 3.0000E+06 1.6000E+06 |  4.240E+11
>       5 6.2000E+06 3.2000E+06 |  4.240E+11
>       6 1.2600E+07 6.4000E+06 |  4.240E+11
>       7 2.5400E+07 1.2800E+07 |  4.240E+11
>       8 5.1000E+07 2.5600E+07 |  4.240E+11
>       9 1.0220E+08 5.1200E+07 |  4.240E+11
>      10 2.0460E+08 1.0240E+08 |  4.240E+11
>      11 4.0940E+08 2.0480E+08 |  4.241E+11
>      12 8.1900E+08 4.0960E+08 |  4.241E+11
>      13 1.6382E+09 8.1920E+08 |  4.241E+11
>      14 3.2766E+09 1.6384E+09 |  4.242E+11
>      15 6.5534E+09 3.0000E+09 |  4.243E+11
>      16 1.2553E+10 3.0000E+09 |  4.247E+11
>      17 1.8553E+10 3.0000E+09 |  4.250E+11
>      18 2.4553E+10 3.0000E+09 |  4.255E+11
>      19 3.0553E+10 3.0000E+09 |  4.259E+11
>      20 3.6553E+10 3.0000E+09 |  4.266E+11
>      21 4.2553E+10 3.0000E+09 |  4.273E+11
>      22 4.8553E+10 3.0000E+09 |  4.281E+11
>      23 5.4553E+10 3.0000E+09 |  4.288E+11
>      24 6.0553E+10 3.0000E+09 |  4.296E+11
>      25 6.6553E+10 3.0000E+09 |  4.304E+11
>      26 7.2553E+10 3.0000E+09 |  4.313E+11
>      27 7.8553E+10 3.0000E+09 |  4.321E+11
>      28 8.4553E+10 3.0000E+09 |  4.330E+11
>      29 9.0553E+10 3.0000E+09 |  4.339E+11
>      30 9.6553E+10 3.0000E+09 |  4.348E+11
>      31 1.0255E+11 3.0000E+09 |  4.357E+11
>      32 1.0855E+11 3.0000E+09 |  4.366E+11
>      33 1.1455E+11 3.0000E+09 |  4.301E+11
>      34 1.2055E+11 3.0000E+09 |  3.803E+11
>      35 1.2655E+11 3.0000E+09 |  3.379E+11
>      36 1.3255E+11 3.0000E+09 |  3.057E+11
>      37 1.3855E+11 3.0000E+09 |  2.868E+11
>      38 1.4455E+11 3.0000E+09 |  2.876E+11
>      39 1.5055E+11 3.0000E+09 |  2.880E+11
>      40 1.5655E+11 3.0000E+09 |  2.903E+11
>      41 1.6255E+11 3.0000E+09 |  2.936E+11
>      42 1.6855E+11 3.0000E+09 |  2.978E+11
>      43 1.7455E+11 3.0000E+09 |  4.475E+11
>      44 1.8055E+11 3.0000E+09 |  4.486E+11
>      45 1.8655E+11 3.0000E+09 |  4.496E+11
>      46 1.9255E+11 3.0000E+09 |  4.507E+11
>      47 1.9855E+11 3.0000E+09 |  4.517E+11
>      48 2.0455E+11 3.0000E+09 |  4.528E+11
>      49 2.1055E+11 3.0000E+09 |  4.538E+11
>      50 2.1655E+11 3.0000E+09 |  4.549E+11
>
>
>
> *pc 2*
>
> flash:  initializing for jeans problem.
>
> kx   = 7.8539816E-20 ky   = 1.5707963E-19 kz   = 0.0000000E+00
>
> k       =    1.7562036827601810E-019
> kJ      =    6.5087765323825604E-019
> c0      =     181807.2941759182    omega^2 =   -1.2983542608611234E-026
> velamp  =     64881.62525646757   
> perturbation is unstable with growth time     8776137041619.213    
> min_blocks          5 max_blocks          5 tot_blocks          5
> min_blocks         21 max_blocks         21 tot_blocks         21
> min_blocks         85 max_blocks         85 tot_blocks         85
> min_blocks        341 max_blocks        341 tot_blocks        341
>  INITIAL TIMESTEP =     100000.0000000000    [CHECKPOINT_WR] NOTE: 
> will send           520  blocks per message.
> [CHECKPOINT_WR] Writing checkpoint file jeans_r13vA_hdf5_chk_0000
> Progress:  |
> *** Wrote output to jeans_r13vA_hdf5_chk_0000 (          341 blocks ) ***
> number of variables found for plotfile storage =             3
> dens pres temp
> *** Wrote output to jeans_r13vA_hdf5_plt_cnt_0000 ***
>       n          t         dt |  dt_hydro
>       1 2.0000E+05 2.0000E+05 |  4.406E+11
>       2 6.0000E+05 4.0000E+05 |  4.406E+11
>       3 1.4000E+06 8.0000E+05 |  4.406E+11
>       4 3.0000E+06 1.6000E+06 |  4.406E+11
>       5 6.2000E+06 3.2000E+06 |  4.406E+11
>       6 1.2600E+07 6.4000E+06 |  4.406E+11
>       7 2.5400E+07 1.2800E+07 |  4.406E+11
>       8 5.1000E+07 2.5600E+07 |  4.406E+11
>       9 1.0220E+08 5.1200E+07 |  4.406E+11
>      10 2.0460E+08 1.0240E+08 |  4.405E+11
>      11 4.0940E+08 2.0480E+08 |  4.404E+11
>      12 8.1900E+08 4.0960E+08 |  4.403E+11
>      13 1.6382E+09 8.1920E+08 |  4.399E+11
>      14 3.2766E+09 1.6384E+09 |  4.393E+11
>      15 6.5534E+09 3.0000E+09 |  4.381E+11
>      16 1.2553E+10 3.0000E+09 |  4.365E+11
>      17 1.8553E+10 3.0000E+09 |  4.353E+11
>      18 2.4553E+10 3.0000E+09 |  4.345E+11
>      19 3.0553E+10 3.0000E+09 |  4.340E+11
>      20 3.6553E+10 3.0000E+09 |  4.337E+11
>      21 4.2553E+10 3.0000E+09 |  4.337E+11
>      22 4.8553E+10 3.0000E+09 |  4.339E+11
>      23 5.4553E+10 3.0000E+09 |  4.343E+11
>      24 6.0553E+10 3.0000E+09 |  4.348E+11
>      25 6.6553E+10 3.0000E+09 |  4.355E+11
>      26 7.2553E+10 3.0000E+09 |  4.363E+11
>      27 7.8553E+10 3.0000E+09 |  4.371E+11
>      28 8.4553E+10 3.0000E+09 |  4.382E+11
>      29 9.0553E+10 3.0000E+09 |  4.393E+11
>      30 9.6553E+10 3.0000E+09 |  4.405E+11
>      31 1.0255E+11 3.0000E+09 |  4.418E+11
>      32 1.0855E+11 3.0000E+09 |  4.431E+11
>      33 1.1455E+11 3.0000E+09 |  4.447E+11
>      34 1.2055E+11 3.0000E+09 |  4.463E+11
>      35 1.2655E+11 3.0000E+09 |  4.479E+11
>      36 1.3255E+11 3.0000E+09 |  4.496E+11
>      37 1.3855E+11 3.0000E+09 |  4.513E+11
>      38 1.4455E+11 3.0000E+09 |  4.531E+11
>      39 1.5055E+11 3.0000E+09 |  4.550E+11
>      40 1.5655E+11 3.0000E+09 |  4.570E+11
>      41 1.6255E+11 3.0000E+09 |  4.590E+11
>      42 1.6855E+11 3.0000E+09 |  4.611E+11
>      43 1.7455E+11 3.0000E+09 |  4.632E+11
>      44 1.8055E+11 3.0000E+09 |  4.653E+11
>      45 1.8655E+11 3.0000E+09 |  4.675E+11
>      46 1.9255E+11 3.0000E+09 |  4.697E+11
>      47 1.9855E+11 3.0000E+09 |  4.719E+11
>      48 2.0455E+11 3.0000E+09 |  4.733E+11
>      49 2.1055E+11 3.0000E+09 |  4.734E+11
>      50 2.1655E+11 3.0000E+09 |  4.736E+11
>
>
>
> *pc 3*
>
>
> flash:  initializing for jeans problem.
>
> kx   = 7.8539816E-20 ky   = 1.5707963E-19 kz   = 0.0000000E+00
>
> k       =   1.756203682760181E-019
> kJ      =   6.508776532382562E-019
> c0      =    181807.294175918    omega^2 =  -1.298354260861123E-026
> velamp  =    64881.6252564676   
> perturbation is unstable with growth time    8776137041619.21    
> min_blocks          5 max_blocks          5 tot_blocks          5
> min_blocks         21 max_blocks         21 tot_blocks         21
> min_blocks         85 max_blocks         85 tot_blocks         85
> min_blocks        341 max_blocks        341 tot_blocks        341
>  INITIAL TIMESTEP =    100000.000000000    [CHECKPOINT_WR] NOTE: will 
> send          520  blocks per message.
> [CHECKPOINT_WR] Writing checkpoint file jeans_r13vA_hdf5_chk_0000
> Progress:  |
> *** Wrote output to jeans_r13vA_hdf5_chk_0000 (         341 blocks ) ***
> number of variables found for plotfile storage =            3
> dens pres temp
> *** Wrote output to jeans_r13vA_hdf5_plt_cnt_0000 ***
>       n          t         dt |  dt_hydro
>       1 2.0000E+05 2.0000E+05 |  4.240E+11
>       2 6.0000E+05 4.0000E+05 |  4.240E+11
>       3 1.4000E+06 8.0000E+05 |  4.240E+11
>       4 3.0000E+06 1.6000E+06 |  4.240E+11
>       5 6.2000E+06 3.2000E+06 |  4.240E+11
>       6 1.2600E+07 6.4000E+06 |  4.240E+11
>       7 2.5400E+07 1.2800E+07 |  4.240E+11
>       8 5.1000E+07 2.5600E+07 |  4.240E+11
>       9 1.0220E+08 5.1200E+07 |  4.240E+11
>      10 2.0460E+08 1.0240E+08 |  4.240E+11
>      11 4.0940E+08 2.0480E+08 |  4.241E+11
>      12 8.1900E+08 4.0960E+08 |  4.241E+11
>      13 1.6382E+09 8.1920E+08 |  4.241E+11
>      14 3.2766E+09 1.6384E+09 |  4.242E+11
>      15 6.5534E+09 3.0000E+09 |  4.244E+11
>      16 1.2553E+10 3.0000E+09 |  4.247E+11
>      17 1.8553E+10 3.0000E+09 |  4.251E+11
>      18 2.4553E+10 3.0000E+09 |  4.255E+11
>      19 3.0553E+10 3.0000E+09 |  4.259E+11
>      20 3.6553E+10 3.0000E+09 |  4.266E+11
>      21 4.2553E+10 3.0000E+09 |  4.273E+11
>      22 4.8553E+10 3.0000E+09 |  4.281E+11
>      23 5.4553E+10 3.0000E+09 |  4.288E+11
>      24 6.0553E+10 3.0000E+09 |  4.296E+11
>      25 6.6553E+10 3.0000E+09 |  4.304E+11
>      26 7.2553E+10 3.0000E+09 |  4.313E+11
>      27 7.8553E+10 3.0000E+09 |  4.321E+11
>      28 8.4553E+10 3.0000E+09 |  4.330E+11
>      29 9.0553E+10 3.0000E+09 |  4.338E+11
>      30 9.6553E+10 3.0000E+09 |  4.347E+11
>      31 1.0255E+11 3.0000E+09 |  4.356E+11
>      32 1.0855E+11 3.0000E+09 |  4.366E+11
>      33 1.1455E+11 3.0000E+09 |  4.375E+11
>      34 1.2055E+11 3.0000E+09 |  4.384E+11
>      35 1.2655E+11 3.0000E+09 |  4.394E+11
>      36 1.3255E+11 3.0000E+09 |  4.404E+11
>      37 1.3855E+11 3.0000E+09 |  4.413E+11
>      38 1.4455E+11 3.0000E+09 |  4.423E+11
>      39 1.5055E+11 3.0000E+09 |  4.433E+11
>      40 1.5655E+11 3.0000E+09 |  4.444E+11
>      41 1.6255E+11 3.0000E+09 |  4.454E+11
>      42 1.6855E+11 3.0000E+09 |  4.464E+11
>      43 1.7455E+11 3.0000E+09 |  4.474E+11
>      44 1.8055E+11 3.0000E+09 |  4.484E+11
>      45 1.8655E+11 3.0000E+09 |  4.495E+11
>      46 1.9255E+11 3.0000E+09 |  4.505E+11
>      47 1.9855E+11 3.0000E+09 |  4.516E+11
>      48 2.0455E+11 3.0000E+09 |  4.526E+11
>      49 2.1055E+11 3.0000E+09 |  4.537E+11
>      50 2.1655E+11 3.0000E+09 |  4.547E+11
>
>
>
>
> *here again the 3 different pc's*
>
>
> 1)
> Centos5
> Linux celsius 2.6.18-53.1.19.el5 #1 SMP Wed May 7 08:20:19 EDT 2008 
> i686 i686 i386 GNU/Linux
> ifort intel compiler version 8.0
> hdf5 1.6.6 (serial) *(updated recently to 1.6.7)*
> mpich2 Version:  1.0.3
> gcc version 4.1.2 20070626 (Red Hat 4.1.2-14)
> FFLAGS_OPT   =  -c -r8 -i4 -fast -ipo -ipo_obj  -I $(MPI_PATH)/include
>
> 2)
> Debian (??)
> Linux hpcibm1 2.6.8-11-amd64-k8-smp #1 SMP Sun Oct 2 20:03:22 UTC 2005 
> x86_64 GNU/Linux
> pgf90 6.0-5 32-bit target on x86-64 Linux
> hdf5 1.4.5 (serial)
> mpich2 version 1.0.5
> gcc version 3.3.5 (Debian 1:3.3.5-13)
> FFLAGS_OPT = -c -fast -r8 -i4
>
> 3)
> Ubuntu
> Linux si01 2.6.24.3 #1 SMP Fri Jul 4 11:02:09 CEST 2008 x86_64 GNU/Linux
> ifort intel compiler version 10.1.012
> hdf5 1.6.7 (serial)
> mpich2 Version: 1.0.6
> gcc version 4.2.3 (Ubuntu 4.2.3-2ubuntu7)
> FFLAGS_OPT   = -c -r8 -i4  -xT -O3 -no-prec-div -static -I 
> $(MPI_PATH)/include
> or (only -ipo difference, result seemingly doesn't change)
> FFLAGS_OPT   = -c -r8 -i4  -fast -I $(MPI_PATH)/include
>
>
>
>
>
>
> Tomasz Plewa wrote:
>> Seyit -
>>
>> Avoiding aggressive optimization is certainly good idea when it comes 
>> to debugging implementation. Alas, given you experience problems with 
>> 3 different compilers it is rather unlilkely that compilers are at 
>> fault.
>>
>> If your grid does not resolve structures correctly (persumably due to 
>> lax refinement criteria), the code may quite easily produce an 
>> unphysical state. But that should not happen during the initial grid 
>> generation.
>>
>> Any luck with a single block/single level runs?
>>
>> Tomek
>> -- 
>> Seyit Hocuk wrote:
>>> Hi,
>>>
>>> It has been some time now, but to continue the discussion, I have 
>>> tested Jeans standard problem and Sedov standard problem including 
>>> changing the refinement levels. The result is exactly same for all 
>>> three different computers. Unfortunately my own setup shows 
>>> differences and it seems the end result changes by the coice of 
>>> computer/environment/compiler whatever. My own setup has a lot more 
>>> physics than the simpe test problems.
>>>
>>> One thing I am thinking about is that I adjusted the refinement 
>>> criteria to use a reference density instead of looking at the second 
>>> order derivative. I don't know if this could be the problem, but 
>>> dt_hydro should not depend on the refinement criteria so I guess 
>>> there is more to it.
>>>
>>> Unfortunately it is not easy to change compilers and some other 
>>> stuff (like cpu/kernel etc.) on the different computers because I 
>>> have limited rights on those machines. I don't want to find out I 
>>> have to choose one machine and stick with it. How do I know which 
>>> machine is the best. Anyway, I am going to try at least to get the 
>>> libraries the same (hdf5 / mpi) for all machines and see if that 
>>> helps. Btw the latest hdf5 (1.8.1) doesn't seem to work with Flash2.5.
>>>
>>> I will also play with compiler options. Try to make them exactly the 
>>> same and avoid aggressive optimizations. I hope this is where the 
>>> solution lies.
>>>
>>> Thanks,
>>> Seyit
>>>
>>>
>>>
>>>
>>> Tomasz Plewa wrote:
>>>> Although I prefer a low cost approach to debugging which avoids 
>>>> dealing with setup/system/compiler information at first and 
>>>> attempts to eliminate 99% of the source code, here are  quick 
>>>> comments on your systems.
>>>>
>>>> (1) pgi 6.0 is a quite antique compiler. It's so old that now I do 
>>>> not even remember what problems we may have had with it. I suspect 
>>>> quite a few and none of them might be relevant...
>>>>
>>>> (2) As you might noticed, intel has a newer compiler version. Your 
>>>> CentOS5 system needs upgrades on both mpich2 and compiler sides.
>>>>
>>>> (3) Your Ubuntu system seems relatively up-to-date although HDF5 
>>>> 1.8.x is out there and seems working fine.
>>>>
>>>> (4) Compilers are known to do strange things to perfect codes! So 
>>>> if you run into a trouble on a new system, one of the first things 
>>>> to try is to lower compiler optimization level. Avoiding -fast or 
>>>> -ip or -inline is usually a good idea. Simple -O0 may suffice for a 
>>>> couple of steps. As a bonus, the code compiles fast. Then the trick 
>>>> is to get it crashing quickly...
>>>>
>>>> Again, I suggest to simplify your application, isolate the problem, 
>>>> fix it, then find the best compiler, mpich, options, etc., to run 
>>>> your production code.
>>>>
>>>> Enjoy your weekend!
>>>>
>>>> Tomek
>>>> -- 
>>>> Seyit Hocuk wrote:
>>>>> Hi Nathan,
>>>>>
>>>>> Thats a good Idea, I'll start simple and test those first thing 
>>>>> monday. It's too late for me now.
>>>>>
>>>>> To answer your questions:
>>>>>
>>>>> The problem I am running is my own home made, actually it's a 
>>>>> modified version of Jeans. Basically including heat, cool, eos 
>>>>> gamma, compositions, adjusted refinement criteria similar to one 
>>>>> of the other problems (with delta_ref and delta_deref) and a scala 
>>>>> of other small things.
>>>>>
>>>>> 1)
>>>>> Centos5
>>>>> Linux celsius 2.6.18-53.1.19.el5 #1 SMP Wed May 7 08:20:19 EDT 
>>>>> 2008 i686 i686 i386 GNU/Linux
>>>>> ifort intel compiler version 8.0
>>>>> hdf5 1.6.6 (serial)
>>>>> mpich2 Version:  1.0.3
>>>>> gcc version 4.1.2 20070626 (Red Hat 4.1.2-14)
>>>>> FFLAGS_OPT   =  -c -r8 -i4 -fast -ipo -ipo_obj  -I 
>>>>> $(MPI_PATH)/include
>>>>>
>>>>> 2)
>>>>> Debian (??)
>>>>> Linux hpcibm1 2.6.8-11-amd64-k8-smp #1 SMP Sun Oct 2 20:03:22 UTC 
>>>>> 2005 x86_64 GNU/Linux
>>>>> pgf90 6.0-5 32-bit target on x86-64 Linux
>>>>> hdf5 1.4.5 (serial)
>>>>> mpich2 version 1.0.5
>>>>> gcc version 3.3.5 (Debian 1:3.3.5-13)
>>>>> FFLAGS_OPT = -c -fast -r8 -i4
>>>>>
>>>>> 3)
>>>>> Ubuntu
>>>>> Linux si01 2.6.24.3 #1 SMP Fri Jul 4 11:02:09 CEST 2008 x86_64 
>>>>> GNU/Linux
>>>>> ifort intel compiler version 10.1.012
>>>>> hdf5 1.6.7 (serial)
>>>>> mpich2 Version: 1.0.6
>>>>> gcc version 4.2.3 (Ubuntu 4.2.3-2ubuntu7)
>>>>> FFLAGS_OPT   = -c -r8 -i4  -xT -O3 -no-prec-div -static -I 
>>>>> $(MPI_PATH)/include
>>>>> or (only -ipo difference, result seemingly doesn't change)
>>>>> FFLAGS_OPT   = -c -r8 -i4  -fast -I $(MPI_PATH)/include
>>>>>
>>>>>
>>>>>
>>>>>
>>>>> Nathan Hearn wrote:
>>>>>> Hi Seyit,
>>>>>>
>>>>>>     Are you seeing these differences in the standard test problems
>>>>>> included in Flash (e.g., Noh, Sod, Sedov), or are they specific 
>>>>>> to the
>>>>>> problem that you are running.  (Really, this follows from Tomek's
>>>>>> advice to look at the simplest configurations where the differences
>>>>>> are seen.)
>>>>>>
>>>>>>     Also, it would be good to include some more information in your
>>>>>> messages, for instance: the problem that you are running, the 
>>>>>> specific
>>>>>> architectures and compilers, the optimization flags, and the library
>>>>>> versions for HDF5 and MPI that you are using.  Details like these 
>>>>>> can
>>>>>> be really helpful in evaluating and fixing the issues you are 
>>>>>> seeing.
>>>>>>
>>>>>>
>>>>>> - Nathan
>>>>>>
>>>>>>
>>>>>> On Fri, Jul 18, 2008 at 11:32 AM, Seyit Hocuk 
>>>>>> <seyit at astro.rug.nl> wrote:
>>>>>>  
>>>>>>> Oh man,
>>>>>>>
>>>>>>> I am using 3 different computers (with different architectures, 
>>>>>>> compilers,
>>>>>>> optimizations, environments). All three start differently 
>>>>>>> immediately from
>>>>>>> the beginning (difference in dt_hydro and refinement).
>>>>>>> Should I worry very very very much?
>>>>>>> I'm now doing a complete simulation and am going to check if my 
>>>>>>> end results
>>>>>>> also change.
>>>>>>>
>>>>>>> I'm trembling by the thought.....
>>>>>>>     
>>>>>
>>>>>
>>>>
>>>
>




More information about the flash-users mailing list