[FLASH-USERS] HDF5 IO problem
t_schlegel at t-online.de
Thu Jun 21 06:13:36 EDT 2018
Hello,
I have installed the following modules on our cluster:
GCC/7.2.0
OpenMPI/3.0.0-GCC-7.2.0
FLASH/4.5-OpenMPI-3.0.0-GCC-7.2.0
hypre/2.11.2-OpenMPI-3.0.0-GCC-7.2.0
HDF5/1.10.1-OpenMPI-3.0.0-GCC-7.2.0 (for parallel output)
HDF5/1.10.2-GCC-7.2.0 (alternatively, for serial output)
parallel_netcdf/1.9.0-OpenMPI-3.0.0-GCC-7.2.0
The Sedov example works as expected with serial HDF5.
However, when I use the HDF5/1.10.1-OpenMPI-3.0.0-GCC-7.2.0 module instead
of serial IO
(./setup Sedov -auto +parallelio, with the path to
HDF5/1.10.1-OpenMPI-3.0.0-GCC-7.2.0
set in Makefile.h),
the job aborts at the first write of a checkpoint file.
Part of the error output:
HDF5-DIAG: Error detected in HDF5 (1.10.1) MPI-process 3:
#000: H5Ddeprec.c line 187 in H5Dopen1(): not a location
major: Invalid arguments to routine
minor: Inappropriate type
#001: H5Gloc.c line 251 in H5G_loc(): invalid object ID
major: Invalid arguments to routine
minor: Bad value
flash4: io_h5_attribute.c:49: io_h5_attribute_create: Assertion `dsetID >= 0' failed.
Program received signal SIGABRT: Process abort signal.
I observed a similar failure with the more complex LaserSlab example:
HDF5-DIAG: Error detected in HDF5 (1.10.1) MPI-process 4:
#000: H5Dio.c line 269 in H5Dwrite(): can't prepare for writing data
major: Dataset
minor: Write failed
#001: H5Dio.c line 345 in H5D__pre_write(): can't write data
major: Dataset
minor: Write failed
#002: H5Dio.c line 791 in H5D__write(): can't write data
major: Dataset
minor: Write failed
#003: H5Dmpio.c line 527 in H5D__contig_collective_write(): couldn't finish shared collective MPI-IO
major: Low-level I/O
minor: Write failed
#004: H5Dmpio.c line 1398 in H5D__inter_collective_io(): couldn't finish collective MPI-IO
major: Low-level I/O
minor: Can't get value
#005: H5Dmpio.c line 1442 in H5D__final_collective_io(): optimized write failed
major: Dataset
minor: Write failed
#006: H5Dmpio.c line 295 in H5D__mpio_select_write(): can't finish collective parallel write
major: Low-level I/O
minor: Write failed
#007: H5Fio.c line 195 in H5F_block_write(): write through page buffer failed
major: Low-level I/O
minor: Write failed
#008: H5PB.c line 1041 in H5PB_write(): write through metadata accumulator failed
major: Page Buffering
minor: Write failed
#009: H5Faccum.c line 834 in H5F__accum_write(): file write failed
major: Low-level I/O
minor: Write failed
#010: H5FDint.c line 308 in H5FD_write(): driver write request failed
major: Virtual File Layer
minor: Write failed
#011: H5FDmpio.c line 1864 in H5FD_mpio_write(): file write failed
major: Low-level I/O
minor: Write failed
flash4: io_h5_xfer_mesh_dataset.c:210: io_h5_xfer_mesh_dataset: Assertion `err >= 0' failed.
Program received signal SIGABRT: Process abort signal.
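To make the long trace easier to read, here is a minimal sketch of a helper
that splits such an HDF5-DIAG error stack into frames and picks the innermost
one (the highest-numbered frame, here the MPI-IO driver write). It is not part
of FLASH or HDF5: the name parse_hdf5_diag and the regular expression are my
own assumptions, based only on the line layout of the output shown above, and
the sample is a shortened excerpt of that output.

```python
import re

# Assumed layout of a frame line, taken from the trace in this message, e.g.
#   #011: H5FDmpio.c line 1864 in H5FD_mpio_write(): file write failed
FRAME_RE = re.compile(
    r"#(?P<idx>\d+):\s+(?P<file>\S+)\s+line\s+(?P<line>\d+)\s+in\s+"
    r"(?P<func>\w+)\(\):\s*(?P<msg>.*)"
)


def parse_hdf5_diag(text):
    """Return the frames of an HDF5-DIAG error stack as a list of dicts."""
    frames = []
    for raw in text.splitlines():
        line = raw.strip()
        match = FRAME_RE.match(line)
        if match:
            frame = match.groupdict()
            frame["idx"] = int(frame["idx"])
            frame["line"] = int(frame["line"])
            frames.append(frame)
        elif frames and line.startswith(("major:", "minor:")):
            # Attach the major/minor error class to the current frame.
            key, _, value = line.partition(":")
            frames[-1][key] = value.strip()
        elif frames and line and "major" not in frames[-1]:
            # A message wrapped by the mail client continues on the next line.
            frames[-1]["msg"] += " " + line
    return frames


# Shortened excerpt of the LaserSlab trace above.
SAMPLE = """\
HDF5-DIAG: Error detected in HDF5 (1.10.1) MPI-process 4:
#000: H5Dio.c line 269 in H5Dwrite(): can't prepare for writing data
major: Dataset
minor: Write failed
#003: H5Dmpio.c line 527 in H5D__contig_collective_write(): couldn't finish
shared collective MPI-IO
major: Low-level I/O
minor: Write failed
#011: H5FDmpio.c line 1864 in H5FD_mpio_write(): file write failed
major: Low-level I/O
minor: Write failed
"""

frames = parse_hdf5_diag(SAMPLE)
# Frame #000 is the public API call; the highest index is the deepest failure.
innermost = max(frames, key=lambda f: f["idx"])
print(f"{innermost['file']}:{innermost['line']} "
      f"{innermost['func']}: {innermost['msg']}")
```

Read this way, the first trace (Sedov) fails because an invalid ID is passed to
H5Dopen1(), while the second (LaserSlab) fails at the bottom of the stack in
H5FD_mpio_write().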
How can I overcome these problems?
Thanks in advance,
Theo