Message Boards

Back

Delft3D running without parallel option but not running in parallel

BJ
Braulio Juarez, modified 11 Days ago.

Delft3D running without parallel option but not running in parallel

Youngling Posts: 1 Join Date: 8/23/18 Recent Posts

Dear community.

When I try to run my experiments on parallel mode I get the next error:

forrtl: No such file or directory
forrtl: No such file or directory
forrtl: No such file or directory
forrtl: No such file or directory
forrtl: severe (28): CLOSE error, unit 33, file "Unknown"
Image              PC                Routine            Line        Source             
libifcoremt.so.5   00002B562C4AD1C2  for__io_return        Unknown  Unknown
libifcoremt.so.5   00002B562C49C42A  for_close             Unknown  Unknown
libflow2d3d.so.0.  00002B562734022D  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B5626F69F6B  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B5626F7D05C  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B5626F7E181  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B5626F2B434  Unknown               Unknown  Unknown
d_hydro.exe        0000000000403A42  main                  Unknown  Unknown
libc-2.17.so       00002B562275B3D5  __libc_start_main     Unknown  Unknown
d_hydro.exe        0000000000403669  Unknown               Unknown  Unknown
forrtl: severe (28): CLOSE error, unit 33, file "Unknown"
Image              PC                Routine            Line        Source             
libifcoremt.so.5   00002B81B590B1C2  for__io_return        Unknown  Unknown
libifcoremt.so.5   00002B81B58FA42A  for_close             Unknown  Unknown
libflow2d3d.so.0.  00002B81B079E22D  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B81B03C7F6B  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B81B03DB05C  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B81B03DC181  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B81B0389434  Unknown               Unknown  Unknown
d_hydro.exe        0000000000403A42  main                  Unknown  Unknown
libc-2.17.so       00002B81ABBB93D5  __libc_start_main     Unknown  Unknown
d_hydro.exe        0000000000403669  Unknown               Unknown  Unknown
forrtl: severe (28): CLOSE error, unit 33, file "Unknown"
Image              PC                Routine            Line        Source             
libifcoremt.so.5   00002B3E6B25B1C2  for__io_return        Unknown  Unknown
libifcoremt.so.5   00002B3E6B24A42A  for_close             Unknown  Unknown
libflow2d3d.so.0.  00002B3E660EE22D  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B3E65D17F6B  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B3E65D2B05C  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B3E65D2C181  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002B3E65CD9434  Unknown               Unknown  Unknown
d_hydro.exe        0000000000403A42  main                  Unknown  Unknown
libc-2.17.so       00002B3E615093D5  __libc_start_main     Unknown  Unknown
d_hydro.exe        0000000000403669  Unknown               Unknown  Unknown
forrtl: severe (28): CLOSE error, unit 33, file "Unknown"
Image              PC                Routine            Line        Source             
libifcoremt.so.5   00002BA20183F1C2  for__io_return        Unknown  Unknown
libifcoremt.so.5   00002BA20182E42A  for_close             Unknown  Unknown
libflow2d3d.so.0.  00002BA1FC6D222D  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002BA1FC2FBF6B  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002BA1FC30F05C  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002BA1FC310181  Unknown               Unknown  Unknown
libflow2d3d.so.0.  00002BA1FC2BD434  Unknown               Unknown  Unknown
d_hydro.exe        0000000000403A42  main                  Unknown  Unknown
libc-2.17.so       00002BA1F7AED3D5  __libc_start_main     Unknown  Unknown
d_hydro.exe        0000000000403669  Unknown               Unknown  Unknown
slurmstepd: error: c29a-s21 [0] pmixp_client_v1.c:198 [_errhandler] mpi/pmix: ERROR: Error handler invoked: status = -25, nranges = 0: Interrupted system call (4)
slurmstepd: error: c29a-s21 [0] pmixp_client_v1.c:198 [_errhandler] mpi/pmix: ERROR: Error handler invoked: status = -25, nranges = 0: Success (0)
slurmstepd: error: *** STEP 27516479.0 ON c29a-s21 CANCELLED AT 2018-11-05T14:51:50 ***
srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
slurmstepd: error: c29a-s21 [0] pmixp_client_v1.c:198 [_errhandler] mpi/pmix: ERROR: Error handler invoked: status = -25, nranges = 0: Success (0)
slurmstepd: error: c29a-s21 [0] pmixp_client_v1.c:198 [_errhandler] mpi/pmix: ERROR: Error handler invoked: status = -25, nranges = 0: Success (0)
slurmstepd: error: c29a-s21 [0] pmixp_client_v1.c:198 [_errhandler] mpi/pmix: ERROR: Error handler invoked: status = -25, nranges = 0: Success (0)
srun: error: c29a-s21: task 0: Killed
srun: error: c29a-s21: tasks 1-4: Exited with exit code 28
 

This just happens running on parallel. The experiment works fine when is not on parallel. This happened just after the cluster administrator updated the openmpi, delft3d, and intel versions. I'll appreciate any help. Thanks