Intel® MPI Library Reference Manual for Linux* OS
Start Multipurpose daemon* (MPD).
mpd [ --help ] [ -V ] [ --version ] [ --host=<host> --port=<portnum> ] \
[ --noconsole ] [ --trace ] [ --echo ] [ --daemon ] [ --bulletproof ]\
[ --i fhn <interface/hostname> ] [ --listenport <listenport> ]
Multipurpose daemon* (MPD) is the Intel® MPI Library process management system for starting parallel jobs. Before running a job, start mpd daemons on each host and connect them into a ring. Long parameter names may be abbreviated to their first letters by using only one hyphen and no equal sign. For example,
$ mpd -h masterhost -p 4268 -n
is equivalent to
$ mpd --host=masterhost --port=4268 -noconsole
If a file named .mpd.conf is available in the user's home directory, only the user can have read and write privileges. The file must minimally contain a line with secretword=<secretword>. If you want to run MPD as root, create the mpd.conf file in the /etc directory instead of .mpd.conf in the root's home directory to run mpd as root. Avoid starting the MPD ring under the root account.
Multipurpose daemon* (MPD) has been deprecated since Intel® MPI Library 5.0 release. You can use scalable process management system (Hydra) to start parallel jobs.
Start mpd ring.
mpdboot [ -h ] [ -V ] [ -n <#nodes> ] [ -f <hostsfile> ] [ -r <rshcmd> ] \
[ -u <user> ] [ -m <mpdcmd> ] [ --loccons ] [ --remcons ] \
[ -s ] [ -d ] [ -v ] [ -1 ] [ --ncpus=<ncpus> ] [ -o ] \
[ -b <maxbranch> ] [ -p ]
or
mpdboot [ --help ] [ --version ] [ --totalnum=<#nodes> ] \
[ --file=<hostsfile> ] [ --rsh=<rshcmd> ] [ --user=<user> ] \
[ --mpd =<mpdcmd> ] [ --loccons ] [ --remcons ] [ --shell ] \
[ --debug ] [ --verbose ] [ -1 ] [ --ncpus=<ncpus> ] [ --ordered ]
[ --maxbranch=<maxbranch> ] [ --parallel-startup ]
Start the mpd daemons on the specified number of nodes by providing a list of node names in <mpd.hosts>.
The mpd daemons are started using the ssh command by default. If the ssh connectivity is not enabled, use the -r rsh option to switch over to rsh. Make sure that all nodes in the cluster can connect to each other through the ssh command without a password or, if the -r rsh option is used, through the rsh command without a password.
The mpdboot command spawns an MPD daemon on the host machine, even if the machine name is not listed in the mpd.hosts file.
Shut down a single mpd daemon.
mpdexit [ --help ] [ -V ] [--version ] <mpdid>
Display a help message |
|
-V | --version |
Display Intel® MPI Library version information |
<mpdid> |
Use this command to cause the single mpd daemon to exit. Use <mpdid> obtained through the mpdtrace -l command in the form <hostname>_<port number>.
Shut down all mpd daemons on all nodes.
Display a help message |
|
-V | --version |
Display Intel® MPI Library version information |
Use this command to shut down all MPD rings you own.
Clean up the environment after an mpd crash.
mpdcleanup [ -h ] [ -V ] [ -f <hostsfile> ] [ -r <rshcmd> ] [ -u <user> ]\
[ -c <cleancmd> ] [ -a]
or
mpdcleanup [ --help ] [ --version ] [ --file=<hostsfile> ] \
[ --rsh=<rshcmd> ] [ --user=<user> ] [ --clean=<cleancmd> ] \
[ --all]
Use this command to clean up the environment after an mpd crash. It removes the UNIX* socket on local and remote machines or kills all mpd daemons related to the current environment controlled by the I_MPI_JOB_CONTEXT environment variable.
For instance, use the following command to remove the UNIX sockets on machines specified in the hostsfile file:
$ mpdcleanup --file=hostsfile
Use the following command to kill the mpd daemons on the machines specified in the hostsfile file:
$ mpdcleanup --file=hostsfile --all
Determine whether mpd is running.
mpdtrace [ --help ] [ -V ] [ --version ] [ -l ]
Display a help message |
|
-V | --version |
Display Intel® MPI Library version information |
Show MPD identifiers instead of the hostnames |
Use this command to list the hostnames or identifiers of all mpds in the ring. The output identifiers have the form <hostname>_<port number>.
Check for configuration problems on the host or print configuration information about this host.
mpdcheck [ -v ] [ -l ] [ -h ] [ --help ] [ -V ] [ --version ]
mpdcheck -pc [ -v ] [ -l]
mpdcheck -f <host_file> [ -ssh ] [ -v ] [ -l]
mpdcheck -s [ -v ] [ -l]
mpdcheck -c < server_host> <server_port> [ -v ] [ -l]
Use this command to check configuration problems on the cluster nodes. Any output line that starts with *** indicates a potential problem.
If you have problems running parallel jobs through mpd on one or more hosts, try to run the script once on each of those hosts.
Test the MPD ring.
mpdringtest [ --help ] [ -V ] [ --version ] <number of loops>
Display a help message |
|
-V | --version |
Display Intel® MPI Library version information |
<number of loops> |
Number of loops |
Use this command to test how long it takes for a message to circle the mpd ring.
mpdlistjobs [ -h ] [ -V ] [ -u <username> ] [ -a <jobalias> ] [ -j <jobid> ]
or
mpdlistjobs [ --help ] [ --version ] [ --user=<username> ] \
[ --alias=<jobalias> ] [ --jobid=<jobid> ]
-h | --help |
Display a help message |
-V | --version |
Display Intel® MPI Library version information |
-u <username>| --user=<username> |
List jobs of a particular user |
-a <jobalias> | --alias=<jobalias> |
List information about the particular job specified by <jobalias> |
-j <jobid> | --jobid=<jobid> |
List information about the particular job specified by <jobid> |
Use this command to list the running processes for a set of MPI jobs. All jobs for the current machine are displayed by default.
Apply a signal to a process running an application.
mpdsigjob [ --help ] [ -V ] [ --version ] <sigtype> \
[-j <jobid> | -a <jobalias> ] [-s | -g ]
Display a help message |
|
-V | --version |
Display Intel® MPI Library version information |
<sigtype> |
Specify the signal type to send. Valid options are -j or -a. |
-a <jobalias> |
Send a signal to the job specified by <jobalias> |
-j <jobid> |
Send a signal to the job specified by <jobid> |
-s |
Deliver a signal to a single user process |
-g |
Deliver a signal to a group of processes. This is the default behavior. |
Use this command to deliver a specific signal to the processes of a running job. The desired signal is the first argument. Specify one of two options: -j or -a.
Terminate a job.
mpdkilljob [ --help ] [ -V ] [ --version ] [ <jobnum> ] [ -a <jobalias> ]
Display a help message |
|
-V | --version |
Display Intel® MPI Library version information |
<jobnum> |
Kill the job specified by <jobnum> |
-a <jobalias> |
Kill the job specified by <jobalias> |
Use this command to kill the job specified by <jobnum> or by <jobalias>. Obtain <jobnum> and <jobalias> from the mpdlistjobs command. The <jobid> field has the following format: <jobnum>@<mpdid>.
Print brief help concerning MPD commands.
mpdhelp [ -V ] [ --version ]
-V | --version |
Display Intel® MPI Library version information |
Description
Use this command to obtain a brief help message concerning MPD commands.