Recap

Assumptions

  • Must be just as durable as disk storage
  • Durable when write request returns
  • Almost always in a state of crash recovery
  • Recovery instantaneous and transparent
  • Records of a few thousand bytes, perhaps partial updates

Failures

  • Bit errors
  • Single machine
  • Group of machines
  • Power failure
  • Loss of datacenter
  • Network partitions
  • Byzantine failures
  • Thermal emergency

Possible Solution

  • Approach #4: temporarily commit to another server's RAM for speed, eventually to disk
    • some form of logging in RAM and batching writes to disk + checkpointing
    • need to "shard" data for each server so that many servers serve as a backup for a single master to speed recovery time
      • likely, backup shards will need to be able to temporarily become masters for the data while rebuilding the master
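The write path in approach #4 can be sketched as follows. This is a minimal simulation, not RAMCloud's actual protocol; all names (`Master`, `Backup`, `batch_size`) are illustrative, and disk writes are simulated with a list.

```python
# Sketch of approach #4: a write counts as durable once it sits in the
# RAM of the backups; disk writes happen later, in large batches.

class Backup:
    def __init__(self):
        self.ram_log = []   # volatile copy of recent log entries
        self.disk = []      # simulated durable storage

    def replicate(self, entry):
        self.ram_log.append(entry)      # fast: RAM write plus a network hop

    def flush(self):
        self.disk.extend(self.ram_log)  # one large sequential write
        self.ram_log.clear()

class Master:
    def __init__(self, backups, batch_size=4):
        self.backups = backups
        self.batch_size = batch_size
        self.pending = 0
        self.store = {}     # in-RAM primary copy

    def write(self, key, value):
        entry = (key, value)
        for b in self.backups:
            b.replicate(entry)          # "durable in RAM" when this returns
        self.store[key] = value
        self.pending += 1
        if self.pending >= self.batch_size:
            for b in self.backups:
                b.flush()               # batched, seek-free disk write
            self.pending = 0

backups = [Backup(), Backup(), Backup()]
m = Master(backups)
for i in range(5):
    m.write(f"k{i}", i)
# after 5 writes with batch_size=4, the first 4 entries reached disk
# and the 5th is still only in backup RAM
```

The key property is that the write request returns after the RAM replication loop, so durability does not wait on the batched disk flush.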

Issues

  • Disk write bandwidth
  • Best approach to achieve good write bandwidth on disk: dedicate at least 3 disks
    • one for write logging
    • one for archiving the most recent log data
    • one for compaction
  • This scheme completely eliminates seeks, which should give us about 100 MB/s of sequential write bandwidth
  • Unfortunately, we'll need RAID as well, so the total is more than 3 disks just to achieve the write bandwidth of a single disk
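Rough arithmetic shows why eliminating seeks is worth dedicating disks. The figures below are assumptions (roughly 100 MB/s sequential bandwidth, ~10 ms per seek, records of about 1 KB, per the assumptions above), not measurements:

```python
# Back-of-envelope: sequential logging vs. seek-per-record writes.
SEQ_BW = 100e6      # bytes/s of sequential write bandwidth (assumed)
SEEK_TIME = 0.01    # seconds per random seek (assumed)
RECORD = 1024       # bytes per record (a few KB per the assumptions)

sequential_writes_per_s = SEQ_BW / RECORD   # ~100,000 records/s
random_writes_per_s = 1 / SEEK_TIME         # ~100 records/s, seek-dominated

print(sequential_writes_per_s / random_writes_per_s)  # roughly a 1000x gap
```

So a single seek-free logging disk can absorb on the order of a thousand times more small writes than one doing a seek per record, which is the motivation for separating logging, archiving, and compaction onto different spindles.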

Alternative

Recovery

  • Keeping everything as soft state fixes a lot of consistency issues on recovery
  • Compare the backups after a new master is elected to check consistency
    • Version numbers make this possible: only the most recently written value can be inconsistent, so we only need to compare k values for k backup shards
    • This is much cheaper than running an agreement protocol among the backups
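The recovery check above can be sketched as a comparison of each backup's most recent entry. The function name and the `(version, value)` representation are hypothetical, assuming only the versioned-writes property stated above:

```python
# Sketch of the post-election consistency check: since only the most
# recently written value can differ across backups, comparing each
# backup's tail entry is enough -- no agreement protocol needed.

def reconcile(backup_tails):
    """backup_tails: list of (version, value) pairs, the last entry
    seen by each of the k backup shards. The entry with the highest
    version is the most recent write that reached any backup; shards
    with a lower version simply missed the final write."""
    return max(backup_tails, key=lambda t: t[0])

# backups 0 and 1 saw the write with version 8; backup 2 missed it
tails = [(8, "new"), (8, "new"), (7, "old")]
print(reconcile(tails))  # → (8, 'new')
```

This is O(k) work for k backup shards, which is why it is much cheaper than a general agreement protocol.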