Checkpoint a running sequential process using the Open PAL Checkpoint/Restart Service (CRS). NOTE: This should only be used if the application being checkpointed is an OPAL-only application. If it is an Open RTE or Open MPI program, their respective tools should be used.
opal-checkpoint [ options ] <PID>
opal-checkpoint attempts to notify a running process that it has been asked to checkpoint itself. A snapshot handle reference is then presented to the user; this reference is passed to opal_restart to restart the process.
<PID>
Process ID of the running target process.
-h | --help
Display help for this command.
--term
After checkpointing the running process, terminate it.
-v | --verbose
Enable verbose output for debugging.
-n | --name
Request a specific name for the local snapshot reference.
-w | --where
Request that the local snapshot reference be placed in a specific location.
-gmca | --gmca <key> <value>
Pass global MCA parameters that are applicable to all contexts. <key> is the parameter name; <value> is the parameter value.
-mca | --mca <key> <value>
Send arguments to various MCA modules.
opal-checkpoint can be invoked multiple times, provided the invocations do not overlap. This allows the user to take involuntary checkpoints of a running sequential process. See opal_crs(7) for more information about the CRS framework and components. Note that the user does not need to specify the checkpointer to be used here, as that is determined entirely by the running process being checkpointed.
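As an illustration of repeated, non-overlapping invocations, the following is a minimal sketch rather than a definitive recipe. It uses only the -v and --term options described above. The TARGET_PID variable, its placeholder value 12345, and the run helper are hypothetical names introduced for this example. The helper prints each command and executes it only if opal-checkpoint is found on the PATH, so the sketch can be tried safely on a machine without Open PAL installed.

```shell
#!/bin/sh
# Sketch: take one checkpoint, then take a second one and terminate the process.
# Assumption: TARGET_PID names a running OPAL-only application (12345 is a placeholder).

# Hypothetical helper: echo the command, then run it only if the tool is installed.
run() {
    echo "+ $*"
    if command -v "$1" >/dev/null 2>&1; then
        "$@"
    fi
}

TARGET_PID="${TARGET_PID:-12345}"

# First checkpoint, with verbose output; the process keeps running afterwards.
run opal-checkpoint -v "$TARGET_PID"

# The second invocation starts only after the first has returned, so the two
# do not overlap. --term terminates the process after it has been checkpointed.
run opal-checkpoint --term "$TARGET_PID"
```

Because the shell runs the two commands one after the other, the invocations cannot overlap. Each successful invocation presents its own snapshot handle reference, which is what a later restart would use.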