Contents)

  • Cluster Outline
  • User's Guide
    1. User setup
    2. Running RAMCloud
    3. Development and enhancement
  • Performance result
    • clusterperf.py
    • TBD) recovery.py
  • System administrator's guide
    1. System configuration
      1. System photograph
      2. Server module specification
      3. Network connection
    2. System administration guide
      1. Hardware maintenance
      2. Security solution

Cluster Outline)

Basically the procedure is similar to rccluster, except that:

  1. NOTE)  Please wait a while before using the ATOM cluster...
    mmres has been ported from rcres. We are now adding the resource management features needed for the Intel DPDK kernel-bypass driver.
  2. Use the local git repository located on 'mmatom'.
    1. The source code and scripts are pre-tested for the ATOM-based 'RAMCloud in a Box'.
  3. The cluster has 132 ATOM servers (total: 1,056 cores, 4.1 TB DRAM, 16.5 TB SSD) connected to the dedicated management server 'mmatom.scs.stanford.edu', which is directly connected to the Internet. See System configuration below for details.
  4. Broken or unstable servers are disconnected with a management tool, and contiguous IP-address/hostname numbering across the servers is always maintained; IP addresses and hostnames always correspond one-to-one. The physical slot and chassis number of each server can be looked up in a management file on the management server.

User's Guide)

User setup)

  1. ssh login to the management server 'mmatom.scs.stanford.edu' with public-key authentication.
  2. Install your key pair for cluster login in ~/.ssh . For existing RAMCloud users, the keys have already been copied from your home directory on rcnfs.
    • Add the cluster-login public key to ~/.ssh/authorized_keys .
      Note) 
      - Your home directory is shared with all the ATOM servers via NFS, so you can log in to all atom servers with public-key authentication.
      - Do not copy the private key you use for mmatom login. Please create a different key pair for ssh-ing to the ATOM cluster from mmatom (see the example after this list).

  3. Initialize known_hosts:
    1. You can use /usr/local/scripts/knownhost_init.sh
      1. Usage)  /usr/local/scripts/knownhost_init.sh <ServerNumberFrom> <ServerNumberTo>
        If a host has already been initialized, the result of running 'hostname' on the remote machine is displayed; otherwise you are prompted whether to add the host to the known_hosts database, where you should type 'yes'.
      2. Example)
        $ knownhost_init.sh 1 20
            atom001
            atom002
                : 
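
Example)  Creating a separate key pair for cluster login (a minimal sketch of step 2 above; it assumes standard OpenSSH tools, and the key file name 'id_rsa_atom' is only an example)
  $ ssh-keygen -t rsa -f ~/.ssh/id_rsa_atom                 # new key pair used only inside the cluster
  $ cat ~/.ssh/id_rsa_atom.pub >> ~/.ssh/authorized_keys    # homes are NFS-shared, so this enables login to every atom server
  $ ssh -i ~/.ssh/id_rsa_atom atom001 hostname              # quick check; should print 'atom001'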

Compile RAMCloud on the host 'mmatom')

  1. Clone from the local RAMCloud repository, which is pre-tested for the ATOM cluster. The directory structure is almost the same as Stanford RAMCloud.
      • $ git clone /var/git/ramcloud-dpdk.git
      • $ cd ramcloud-dpdk
      • $ git submodule update --init --recursive
  2. Compile
    1. Modify your PATH so that '/usr/bin/gcc (4.4.7)' is used instead of '/usr/local/bin/gcc (4.9.1)' for the RAMCloud compile (see the check after this list).
      • $ export PATH=/usr/bin:$PATH
    2. Compile RAMCloud: ARCH=atom is now the default in the GNUmakefile
      • $ make DEBUG=no
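
Example)  Quick check that the intended compiler is picked up before building (a sketch; the expected versions are simply the ones listed above)
  $ export PATH=/usr/bin:$PATH
  $ which gcc        # should print /usr/bin/gcc
  $ gcc --version    # should report 4.4.7
  $ make DEBUG=no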

Run clusterperf.py from 'mmatom')  - assumes you have the RAMCloud source compiled

  1. ATOM-cluster-specific settings are declared in localconfig.py and config.py, which are referenced by clusterperf.py (micro-benchmarks) and cluster.py (job spawning).
    1. Basic settings for running RAMCloud applications on the ATOM servers are provided in config.py . So far we have tested the following; we are going to test more commands:
    • clusterperf.py 
    • TBD: clusterperf.py  (running all of the standard performance tests)
  2. Reserve (lease) ATOM servers with /usr/local/bin/mmres
    1. mmres has been ported from rcres. We are now adding the resource management needed for DPDK (please wait a while before using the cluster...).
    2. Check usage with: $ mmres --help
         Example) 
      • $ mmres ls -l
      • $ mmres lease 14:00 atom10-35 -m 'testing clusterperf.py'
  3. Edit config.py to list the servers you reserved (see the sketch after this list).
  4. Run clusterperf.py (the default parameters are equivalent to rccluster: replicas=3, servers=4, backups=1):
      $ scripts/clusterperf.py
    • clusterperf.py uses cluster.py to spawn the needed processes on the cluster, which means cluster.py works for the ATOM cluster too.
  5. Other test / job-control scripts are still to be ported.
    1. TBD: sample application and its execution with cluster.py
    2. TBD: fix clusterperf.py to run all the tests. Currently the test fails at "readDistRandom" after "multiWrite_oneMaster".
    3. TBD: recovery.py
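
Example)  A sketch of step 3 (editing config.py for the servers you reserved). It assumes config.py follows the Stanford RAMCloud convention of a 'hosts' list of (hostname, IP address, id) tuples; the 192.168.x.x addresses below are placeholders only, not the cluster's real addressing plan.
  # config.py -- declare the servers leased with mmres (here atom010..atom035)
  hosts = []
  for i in range(10, 36):
      hosts.append(('atom%03d' % i,       # hostname, e.g. 'atom010'
                    '192.168.1.%d' % i,   # placeholder IP address
                    i))                   # server id used by cluster.py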

Performance result)

clusterperf.py

Note) We are working on moving the noisy DPDK log output from standard out to a log file.
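
Until that is done, one possible workaround is to capture the full output in a file and filter the console (a sketch only; it assumes the DPDK messages are the lines prefixed with 'EAL:' or 'PMD:'):
  $ scripts/clusterperf.py 2>&1 | tee clusterperf.log | grep -v -e '^EAL:' -e '^PMD:'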

Measured on Wed 25 Feb 2015 12:31:42 PM PST:

basic.read100      13.456 us  read single 100B object (30B key) median
basic.read100.min  12.423 us  read single 100B object (30B key) minimum
basic.read100.9    13.837 us  read single 100B object (30B key) 90%
basic.read100.99   17.698 us  read single 100B object (30B key) 99%
basic.read100.999  24.356 us  read single 100B object (30B key) 99.9%
basic.readBw100    4.7 MB/s   bandwidth reading 100B object (30B key)

basic.read1K 20.425 us read single 1KB object (30B key) median
basic.read1K.min 19.593 us read single 1KB object (30B key) minimum
basic.read1K.9 20.786 us read single 1KB object (30B key) 90%
basic.read1K.99 24.306 us read single 1KB object (30B key) 99%
basic.read1K.999 36.569 us read single 1KB object (30B key) 99.9%
basic.readBw1K 33.4 MB/s bandwidth reading 1KB object (30B key)
basic.read10K 52.461 us read single 10KB object (30B key) median
basic.readBw10K 125.6 MB/s bandwidth reading 10KB object (30B key)
basic.read100K 358.567 us read single 100KB object (30B key) median
basic.readBw100K 187.6 MB/s bandwidth reading 100KB object (30B key)
basic.read1M 3.449 ms read single 1MB object (30B key) median
basic.readBw1M 212.1 MB/s bandwidth reading 1MB object (30B key)

basic.write100      43.307 us  write single 100B object (30B key) median
basic.write100.min  41.031 us  write single 100B object (30B key) minimum
basic.write100.9    47.528 us  write single 100B object (30B key) 90%
basic.write100.99   86.543 us  write single 100B object (30B key) 99%
basic.write100.999  38.363 ms  write single 100B object (30B key) 99.9%
basic.writeBw100    542.6 KB/s bandwidth writing 100B object (30B key)

basic.write1K 63.481 us write single 1KB object (30B key) median
basic.write1K.min 60.453 us write single 1KB object (30B key) minimum
basic.write1K.9 66.720 us write single 1KB object (30B key) 90%
basic.write1K.99 126.391 us write single 1KB object (30B key) 99%
basic.write1K.999 41.500 ms write single 1KB object (30B key) 99.9%
basic.writeBw1K 4.9 MB/s bandwidth writing 1KB object (30B key)
basic.write10K 199.648 us write single 10KB object (30B key) median
basic.writeBw10K 12.0 MB/s bandwidth writing 10KB object (30B key)
basic.write100K 1.508 ms write single 100KB object (30B key) median
basic.writeBw100K 17.8 MB/s bandwidth writing 100KB object (30B key)
basic.write1M 41.949 ms write single 1MB object (30B key) median
basic.writeBw1M 21.8 MB/s bandwidth writing 1MB object (30B key)

# RAMCloud multiRead performance for 100 B objects with 30 byte keys
# located on a single master.
# Generated by 'clusterperf.py multiRead_oneMaster'
#
# Num Objs Num Masters Objs/Master Latency (us) Latency/Obj (us)
#----------------------------------------------------------------------------

1          1           1             23.0         22.99
2          1           2             28.3         14.15
3          1           3             33.2         11.07

9          1           9             49.5          5.50

50         1           50           168.7          3.37
60         1           60           235.5          3.93
70         1           70           209.1          2.99

recovery.py (TBD)

System administrator's guide)

System configuration)
1. System photograph)

2. ATOM server module specification)

  1. Three chassis are installed (the rack in the picture above holds 16 chassis).
  2. The installed ATOM modules use the C2730 (1.7 GHz, 8 cores/8 threads, 12 W).
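
Example)  Confirming the module type from any atom server (a sketch using standard Linux tools; the expected values are simply the ones listed above)
  $ grep -m1 'model name' /proc/cpuinfo    # should mention the Atom C2730
  $ nproc                                  # should print 8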

3. Network connection)

  1. For historical reasons, and to allow for future experiments, the VLAN configuration differs between chassis.
  2. The management server is directly connected to the Internet; the cluster is isolated from other Stanford servers.
    1. The management server works as the firewall, login server, NIS server, DHCP server, and PXE server for reconfiguring the ATOM servers.
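
Example)  Verifying from an atom server that these services are provided by mmatom (a sketch; it assumes the nodes are configured as NIS clients and route through the management server)
  $ ypwhich     # should print the NIS server, i.e. mmatom
  $ ip route    # the default route should point at the management server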

Hardware maintenance)

  1. Getting a module's slot/chassis information and MAC address for a hostname or IP address.
    1. You will find the information in /etc/hosts on mmatom:

          IP address           hostname              MAC address              Installed slot and connected host port
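
Example)  Looking up a server (the host name 'atom010' is only an example)
  $ getent hosts atom010          # IP address and hostname as resolved on mmatom
  $ grep -w atom010 /etc/hosts    # full entry, including the MAC address and installed slot/port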
       

Security solution)

 
