Sunday, February 12, 2012

Cassandra database - No SQL - Non relational database

Cassandra is a apache open source project. It is a non relational - non sql database. It promises linear scalability, low latency and in built caching of data on cheap commodity hardware or cloud infrastruture. The project is hosted at - http://cassandra.apache.org/

The comcept is simple and completely different from regular RDBMS like MySQL, Oracle, etc.

Cassandra stores data in this format:
KeySpace which corresponds to a schema. This will be like one schema for one project across multiple cluster of nodes.
Column Family like a table, each keyspace can have multiple column families.
And finally column or key / value which represents the column in the column family.

Cassandra is tightly integrated with Apache Hadoop for large scale distributed parallel data processing using Java Map Reduce.

1 comment: