This page documents the goals and results for a series of milestones for the first implementation of RAMCloud. Each milestone last 2 weeks, ending on a Monday.
Milestone 9: ends June 14, 2010
- Ryan:
- Benchmark planning
- Try out simulator for performance analysis
- Steve:
- 6 machines ordered (est: 1 day)
- Set machines up (est: 1 day)
- Cleanup log code (TBD)
- Understand existing drivers (w. Aravind)
- Aravind:
- Transfer knowledge
- Diego:
- Replicate Aravind 11 usec numbers (est: 1 day)
- Take over fast transport from Aravind (to be fleshed out ASAP)
- Integrate Buffer into code base
Milestone 8: ends May 31, 2010
- Ryan:
- Working on other projects/quals
- Steve:
- Working on other projects/quals
- Aravind:
- Finish testing/docs for RPC driver (est: 2 days; actual: 2 days, done)
- Write RPC transport class (est: 5 days; actual: 1 day, not done)
- GSRC talk (est: 0; actual: 2 days, done)
- POMI poster (est: 0; actual 0.5 day)
- Diego:
- Buffer chunk deallocation (est: 2 days; actual: 2.25 days, done)
- Finish TCPTransport cleanup (est: 1 day; actual: .625 days, done)
- Try out Google Mock framework (est: 1 day; actual: .875 days, done/discarded)
Milestone 7: ends May 17, 2010
- Ryan:
- Working on other projects/quals
- Steve:
- Working on other projects/quals
- Aravind:
- Fast RPC driver class (NIC, buffers) (est: 2 days; actual: 3 days, not done (no tests/doc))
- Fast RPC transport class (mux/de-mux) (est: 5 days; actual: 0.5 day, not done)
- Protocol design for retransmit/large packets (est: 1 day; actual: done, 1 day)
- Web site for code reviews (est: 0.5 day; actual: done, 0.5 day)
- Diego:
- ZooKeeper measurements: latency/bandwidth of get/set (est: 1 day; actual: done, 1 day)
- Buffer chunk deallocation (est: 1 day; actual: 1 day, coding not started)
- Update Hash Table after code review (est: 1 day; actual: done, 1 day)
- Try out Google Mock framework (est: 1 day; actual: not started)
- TCPTransport code review cleanup (est: 1 day; actual: 0.5 day, not done)
Milestone 6: ends May 3, 2010
- Ryan:
- Working on other projects
- Steve:
- Working on other projects
- Aravind:
- Documentation and tests for RPC code (est: 2 days, actual: done, 1 day)
- Integrate properly Buffer into the code - right now Buffer is only used just before sending an RPC. (est: 2 days, actual: done, 0.5 day)
- Computer forum poster (est: 2 days, actual: N/A)
- Diego:
- Poster (est: 3 days, actual: done, 2 days)
- Unit tests and docs for TCP transport (est: 2 days, actual: done, 2 days)
- ZooKeeper measurements: latency/bandwidth of get/set (est: 1 day, actual: not started)
Milestone 5: ends April 19, 2010
- Ryan:
- Mostly working on other projects
- Flesh out design of index recovery with partitions (est: 1 day, actual: no time to work)
- Steve:
- Working on other projects (actual: done!)
- Aravind:
- Full task list for TCP-based RPC (est: 1 day, actual: done, 1 day)
- Other RPC tasks TBD
- Port existing code to Buffer and RPC (est: was TBD, actual: done except for comments and tests, 2 days)
- Hook up RPC system to use Diego's TCP Transport (est: was TBD; actual: done except for comments and tests, 1 days)
- Mail Jeff Dean about ProtocolBuffer performance disconnect (est: 1 day, actual: done, 1 day)
- Diego:
- TCP transport for RPC (est: 3 days) (actual: not done, 2.5 days)
- Get ZooKeeper installed and running (est: 1 day) (actual: done, 0.5 day)
- ZooKeeper measurements: latency/bandwidth of get/set (est: 1 day) (actual: not started)
Milestone 4: ends March 29, 2010
- Ryan:
- Slides for review
- Steve:
- Slides for review
- Aravind:
- Slides for review
- RPC refactoring (no estimate)
- Diego:
- Slides for review
Milestone 3: ends March 15, 2010
- Aravind:
- Implement RPC API on TCP (est: 3 days) (actual: done, 2 days)
- Finish Protocol Buffer analysis (est: 2 days) (actual: "done", 1.5 days)
- Deallocate Buffer memory (est: 2 days) (actual: not done)
- Extra stuff: reworked Buffer class after code review
- Diego:
- RAM-33: Least-usable file system: small file ops (est: 2 days) (actual: done, 0.5 day)
- RAM-51: RAMCloud build attempt with TCP and post-mortem analysis (est: 1 day) (actual: done, except unit test binary too large to fit, 0.5 day)
- RAM-50: File-by-file opt in/out of "extra checks", add to pre-commit hooks (est: 2 days) (actual: done, 1 day)
- RAM-41: Hash table cleanup/code review (est: 2 days) (actual: done, 3 days)
- Ryan:
- Multi-host backup (est: 1 day) (actual: not done, 0.5 day)
- Save and collect segement location info for restore (est: 2 days) (actual: not done)
- RPC design (actual: done, 0.5 day)
- Steve:
- Log/Segment refactoring (est: 2 days) (actual: not done)
- Other code review issues (est: 2 days) (actual: not done)
- Threading design (actual: done, 1 day)
Milestone 2: ends March 1, 2010
- Aravind:
- Analyze protocol buffer performance (est: 0.5 day) (actual: not done, 0.5 day)
- Implement new RPC API on TCP (est: 4 days) (actual: 70% done?, 3 days)
- Code review of RCBUF (est: 0.5 day) (actual: done, 3 days)
- Quick and dirty user-level implementation of scatter-gather (est: 1 day) (actual: done, but not with Buffer, 1 day)
- Diego (modified 2010-02-18 with John's approval, and somewhat changed my mind 2010-02-19):
- RAM-36: Finish mini-transactions (est: 1 day)s (actual: done, 0.5 day)
- RAM-33: Least usable version of file system (est: 5 days) (actual: not done (only directories so far), 4 days)
- RAM-26: Table enumeration (est: 2 days) or maybe instead RAM-38: C/C++ extension for Python bindings (est: 2 days) (actual: not started)
- Ryan:
- Propagate GetSegMetaData through server (est: 0.5 day) (actual: done, 0.5 day)
- Boilerplate recovery routine (est: 0.5 day) (actual: in flux because of segment refactor, 1 day)
- Creating shadow objects in log (est: 2 days) (actual: not done, depends on segment refactor)
- Steve:
- Log/tombstone tests (est: 1 day) (actual: done, 2 days)
- Design for multi-threading (est: 2 days) (actual: not done, 1 day so far)
Milestone 1: ends February 15, 2010
- Aravind:
- RCBuf implementation (including docs and tests) (est: 3 days) (actual: done in 4 days, ~10 hours)
- Use RCBufs in current RPC mechanism (est: 1 day) (actual: done in 2 half-days)
- New RPC API implemented on TCP (est: 4 days) (actual: not done)
- Extras: protocol buffer tests
- Diego:
- Mini-transactions returning Booleans (est: 3 of "7") (actual: not done (90%?) spent "5")
- Mini-transactions returning results/error info (est: 4 of "7") (actual: not done (90%?), spent "2")
- Extras: transaction discussions, functional tests
- Ryan:
- Finish GetSegMetaData (est: 2 days) (actual: done in 16 actual hours)
- Support multiple segments in backup (est: 3 days) (actual: not done)
- Support multiple backups/master (est: 3 days) (actual: not done)
- Extras: cleanup (4 hours)
- Steve:
- New tombstone code (est: 1 day) (actual: done in 4 hours)
- Log/tombstone docs (est: 1 day) (actual: done in 2 hours)
- Log/tombstone tests (est: 3 days) (actual: not done; halfway?)
- Segment docs (est: 1 day) (actual: done in 1 hour)
- Segment tests (est: 1 day) (actual: done in 2-3 hours)