I hit this issue when trying to start EESSI from a container (note: this is not the EESSI container, just docker:rockylinux9; I'm experimenting with mounting a squashfs filesystem that provides EESSI into this container), but I expect it would happen on any system that has neither ssh nor rsh installed:
mpirun -np 2 gmx_mpi --version
--------------------------------------------------------------------------
The value of the MCA parameter "plm_rsh_agent" was set to a path
that could not be found:
plm_rsh_agent: ssh : rsh
Please either unset the parameter, or check that the path is correct
--------------------------------------------------------------------------
[tcn43.local.snellius.surf.nl:99269] [[INVALID],INVALID] FORCE-TERMINATE AT Not found:-13 - error plm_rsh_component.c(335)
Now, I'm not saying we should have ssh and rsh in the compat layer, but at this point we have software in the software layer (namely OpenMPI) that relies on OS dependencies (namely ssh and rsh) that are NOT present in the compatibility layer. That means that, on a system with a very minimal OS, a natively installed EESSI would not be able to launch MPI tasks.
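To make the failure condition concrete, here is a minimal sketch of a check one could run before calling mpirun. The helper name `have_agent` is hypothetical. The `OMPI_MCA_<name>` environment variable form is Open MPI's standard way of setting an MCA parameter, but setting `plm_rsh_agent` to an empty value so that the rsh launcher is skipped is an assumption on my part and should be verified against the Open MPI version shipped in EESSI.

```shell
# Hypothetical helper: succeed if at least one of the given commands is on PATH.
have_agent() {
    for agent in "$@"; do
        if command -v "$agent" >/dev/null 2>&1; then
            return 0
        fi
    done
    return 1
}

# "ssh : rsh" is the agent list shown in the error message above.
if ! have_agent ssh rsh; then
    # Assumption: an empty value stops Open MPI from looking for a launch agent.
    # Untested beyond this sketch; check it against your Open MPI version.
    export OMPI_MCA_plm_rsh_agent=''
fi
```

With that in place, a single-node run such as `mpirun -np 2 gmx_mpi --version` would be the obvious thing to retry. This only works around the missing agent on one node; launching across nodes would still need ssh, rsh, or a resource manager.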