Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

MySQL related:

https://archive.fosdem.org/2014/schedule/event/spider_storage_engine/attachments/slides/408/export/events/attachments/spider_storage_engine/slides/408/spider_storage_engine.pdf

http://dev.mysql.com/doc/internals/en/support-for-indexing.html

...

Interesting blog about importance of and technical difficulty in implementing features like indexing (in context of FoundationDB, VoltDB):

...

Also note good quote from Google's F1 paper: "Features like indexes and ad hoc query are not just nice to have, but absolute requirements for our business.”

...

Scaling memcache at Facebook - https://www.usenix.org/conference/nsdi13/scaling-memcache-facebook

...

Sinfonia

...

Calvin

...

Spanner makes an argument that NoSQL is not enough. We need transactions and SQL like query language.

"Spanner exposes the following set of data features to applications: a data model based on schematized semi-relational tables, a query language, and general purpose transactions. The move towards supporting these features was driven by many factors. The need to support schematized semi-relational tables and synchronous replication is supported by the popularity of Megastore.

At least 300 applications within Google use Megastore (despite its relatively low performance) because its data model is simpler to manage than Bigtable’s, and because of its support for synchronous replication across datacenters. (Bigtable only supports eventually-consistent replication across datacenters.) Examples of well-known Google applications that use Megastore are Gmail, Picasa, Calendar, Android Market, and AppEngine.

The need to support a SQLlike query language in Spanner was also clear, given the popularity of Dremel as an interactive data analysis tool. Finally, the lack of cross-row transactions in Bigtable led to frequent complaints; Percolator was in part built to address this failing.

Some authors have claimed that general two-phase commit is too expensive to s upport, because of the performance or availability problems that it brings [9, 10, 19]. We believe it is better to have application programmers deal with performance problems due to overuse of transactions as bottlenecks arise, rather than always coding around the lack of transactions. Running two-phase commit over Paxos mitigates the availability problems."

...

Octopus DB which stores data in log structure and has concept of storage views where you can define your data models. It can mimic its working as row store or column store or like as you wish it to. Quite an interesting idea (patent pending): http://www.cidrdb.org/cidr2011/Papers/CIDR11_Paper25.pdf  

...

Wasp - Multi-key transactions

...

Marcus’s thesis

http://e-collection.library.ethz.ch/eserv/eth:5403/eth-5403-01.pdf

...

Cristian Tinnefeld's work

...

Column DB, Abadi - http://cs-www.cs.yale.edu/homes/dna/papers/abadiphd.pdf

Vertica, Stonebraker

...

Daniel Abadi advocates use of PACELC instead of CAP to judge a db system design. http://dbmsmusings.blogspot.in/2010/04/problems-with-cap-and-yahoos-little.html

...

CAP theorem by Eric Brewer

...

Survey papers on data models for nosql dbs:

Google datastore

...

kyoto cabinet - very interesting DB lib:

http://fallabs.com/kyotocabinet/