Thursday, September 24, 2009

Cassandra DB articles

A growing compilation of popular articles on Cassandra. (View a list of clients to access Cassandra)

Official wiki for Cassandra
http://wiki.apache.org/cassandra/
http://wiki.apache.org/cassandra/ArticlesAndPresentations

Most Popular articles
Cassandra (Bigtable+Dynamo) - Jonathan Ellis
What every developer should know about database scalability - Jonathan Ellis
Cassandra Project, Rackspace article - Jonathan Ellis
Up and running with cassandra - Evan Weaver
WTF is a SuperColumn? An Intro to the Cassandra Data Model - Digg's Arin Sarkissian
Looking to the future with Cassandra - Digg's Ian
BlueRunner: Building an Email Service in the Cloud - IBM's Jun Rao
Cassandra and Ruby: A Love Affair? - Engine Yard
Building Scalable Databases: Denormalization, the NoSQL Movement and Digg - Dare Obasanjo

Structured Storage System over a P2P Network - By Avinash Lakshman et al.
Data Presentations Cassandra Sigmod
Cassandra presentation at NoSQL (same, with more details)
Cassandra – A structured storage system on a P2P Network - Facebook Notes
Cassandra - E-Team Lecture, Facebook Video
NoSQL - Cassandra Video

Google BigTable paper by Fay Chang et al.
Bigtable: A Distributed Storage System for Structured Data

Amazon Dynamo paper by Giuseppe DeCandia et al. (co-authors include Avinash Lakshman)
Amazon Dynamo paper

The phi Accrual Failure Detector by Naohiro Hayashibara et al.
phi Accrual Failure Detector

Werner Vogels on distributed systems
Eventually Consistent - Revisited
Amazon Dynamo

Roadmap and interesting issues
Cassandra Roadmap
Proactive repair - Merkle trees?
Cassandra data model misconceptions, and their sources
Hadoop integration
Ingesting from Hadoop to Cassandra
Mailing list archives: cassandra-user@incubator.apache.org
org.apache.incubator.cassandra-user - Mark Mail
Cassandra User Mail Archive
fauna (Twitter's Ruby client) documentation

Some interesting NoSQL articles
NoSQL debrief
Anti-RDBMS: A list of distributed key-value stores
Needle in a Haystack: Efficient Storage of Billions of Photos
NoSQL: If Only It Was That Easy
Quick Reference to Alternative data storages
Some Notes on Distributed Key Stores
Key Value Store List
Cassandra Vs CouchDB
NoSQL and the Relational Model: don’t throw the baby out with the bathwater
Why we migrated from mysql to mongodb
No to SQL? Anti-database movement gains steam
Should you go Beyond Relational Databases?
Adventures with Cassandra Distributed Database