-
You will need to launch your application using mpiexec/mpirun if you want to use the MPI parcelport. A good example of how to do this can be found here: https://github.com/STEllAR-GROUP/hpx/wiki/Running-HPX-on-Rostam#running-hpx-applications-on-compute-nodes. So you should probably use something like:
-
Try:
This all depends on how your SLURM/MPI integration was built and configured. We might have to try different things.
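As a rough sketch of the "different things" worth trying, the helper below only prints candidate launch lines so they can be reviewed before being pasted on the cluster. The function name `launch_variants` is hypothetical, the `pmi2` plugin is an assumption (check what `srun --mpi=list` reports on your system), and the partition, node count, and binary name are taken from the question. Which variant works, if any, depends on the local SLURM and MPI setup, and rank count or placement may need extra flags.

```shell
#!/bin/sh
# Print (do not run) candidate launch lines for an MPI-enabled HPX build.
# launch_variants is a hypothetical helper, not part of HPX or SLURM.
launch_variants() {
    app=$1     # application binary
    part=$2    # SLURM partition
    nodes=$3   # number of nodes

    # 1. Ask srun which MPI/PMI plugin types this installation supports.
    echo "srun --mpi=list"
    # 2. Let srun start the ranks through a plugin (pmi2 is an assumption).
    echo "srun --mpi=pmi2 -p $part -N $nodes $app"
    # 3. Or allocate the nodes first and let mpirun start the ranks.
    echo "salloc -p $part -N $nodes mpirun $app"
}

# Values from the question: two nodes on the "qdr" partition.
launch_variants hello_world_distributed qdr 2
```

If one of the `srun --mpi=...` variants starts the ranks correctly, it is usually the most convenient, since it needs no separate allocation step.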
-
Hello,
My university project group and I are having trouble running the “hello_world_distributed” program. We want to run it with SLURM on two nodes, in a partition called “qdr”. We launch it with the following command:
srun -p qdr -N 2 hello_world_distributed
Unfortunately, we get the following error message on each node:
Could not find usable bootstrap parcelport.
Preconfigured bootstrap parcelport: 'tcp'
List of available parcelports:
mpi, priority: -100, can bootstrap: 1
The following command prints the debug information:
srun -p qdr -N 2 hello_world_distributed --hpx:list-parcel-ports --hpx:debug-clp
The output is here:
output.txt
We have already searched for possible solutions and found a related issue, #4891. It suggests extending the srun command with the following options:
srun -p qdr -N 4 hello_world_distributed --hpx:ini=hpx.parcel.mpi.enable!=1 --hpx:ini=hpx.parcel.bootstrap!=mpi
Unfortunately, now we get the error message:
terminate called without an active exception
Could not find usable bootstrap parcelport.
Preconfigured bootstrap parcelport: 'mpi'
List of available parcelports:
mpi, priority: -100, can bootstrap: 1
Here are our CMakeLists.txt and load-env.sh:
CMakeLists.txt
load-env.txt
Do you have any ideas on how to fix these errors?