Timer-Driven Checkpoint

In the following example, a checkpoint is taken every 3600 seconds (1 hour). The checkpoints are stored in a directory called ckptdir (here /home/user/ckptdir). Each node generates one checkpoint, which is named after the node number and the number of that checkpoint.

user@head $ mpiexec.hydra -ckpoint on -ckpoint-prefix /home/user/ckptdir -ckpoint-interval 3600 -ckpointlib blcr -n 32 -f hosts /home/user/myapp
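
The interval is given in seconds, so a period expressed in hours has to be converted first. The sketch below shows that arithmetic and assembles the same command line as above. The helper names interval_seconds and ckpt_command are hypothetical and not part of any MPI distribution, and the script only prints the command rather than running it.

```shell
#!/bin/sh
# Hypothetical helper: convert a checkpoint period in whole hours to seconds.
interval_seconds() {
    echo $(( $1 * 3600 ))
}

# Hypothetical helper: print (do not run) the launch command for a given
# period in hours ($1) and checkpoint directory ($2). The process count,
# host file and application path are copied from the example above.
ckpt_command() {
    hours=$1
    prefix=$2
    echo "mpiexec.hydra -ckpoint on -ckpoint-prefix $prefix -ckpoint-interval $(interval_seconds "$hours") -ckpointlib blcr -n 32 -f hosts /home/user/myapp"
}

# One checkpoint per hour, stored under /home/user/ckptdir.
ckpt_command 1 /home/user/ckptdir
```

Calling ckpt_command with 2 instead of 1 would yield -ckpoint-interval 7200, that is, one checkpoint every two hours.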