The script scripts/recovery.py
can be used to run recoveries for testing.
You should probably set up ssh master mode for each of the cluster nodes. Here is a shell script that you can run on rcmaster to do it:
#!/bin/sh # # This script sets up ssh master mode for all of the machines # in the RAMCloud cluster. if [ $(hostname) == "rcmaster.scs.stanford.edu" ]; then for host in rc{01..80}; do if [ -z "$(pgrep -u $USER -fx "ssh -fMN $host true")" ]; then ssh -fMN $host true 2>/dev/null & fi done fi |
scripts/recovery.py
. recover
method, which are specified at the very end of scripts/recovery.py
.recovery/latest
. If you run more recoveries, logs/latest
always refers to the most recent recovery, but log files from old recoveries are retained in other subdirectories of recovery
.scripts/recoverymetrics.py
, which will examine the logs in logs/latest
and produce summary information describing the recovery.