Lesson 2: Building a Big Data Infrastructure Part 2
Structured Storage & Cassandra
Structured Data
- More like tables
- Fast write and query times
Cassandra
Modeled after Google's BigTable
Distributed
Column Oriented Database
Open source; originally developed at Facebook
Used at Twitter, LinkedIn, Netflix, etc.
Column Oriented Properties
Column names are not fixed in advance; each row can have its own set of columns
Wide rows
Rows occupy non-contiguous disk space
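The properties above can be illustrated with a minimal in-memory sketch. This is a toy model and not how Cassandra stores data on disk; the class name WideRowTable and its methods are hypothetical names chosen for this example. It shows only that rows do not share a fixed column list and that a single row can grow very wide.

```python
class WideRowTable:
    """Toy model of a wide-column table: row key -> {column name: value}.

    Hypothetical illustration only; not Cassandra's storage engine.
    """

    def __init__(self):
        self._rows = {}

    def insert(self, row_key, column, value):
        # Any column name is accepted; there is no table-wide column list.
        self._rows.setdefault(row_key, {})[column] = value

    def get_row(self, row_key):
        # Return the row's columns sorted by column name.
        return sorted(self._rows.get(row_key, {}).items())


table = WideRowTable()

# A narrow row with two columns.
table.insert("user:1", "email", "a@example.com")
table.insert("user:1", "name", "Ada")

# A wide row with a different, much larger set of columns.
for i in range(1000):
    table.insert("user:2", f"follower:{i:04d}", True)

print(len(table.get_row("user:1")))  # 2
print(len(table.get_row("user:2")))  # 1000
```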
Other Column Oriented Data Stores
BigTable
HBase
DynamoDB (usually classed as a key-value store)
CAP Theorem
Consistency
Availability
Partition Tolerance
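Cassandra relaxes consistency in favor of availability and partition tolerance (consistency is eventual by default and tunable per query)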
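One way to reason about how far consistency is relaxed is the replica overlap rule used by Dynamo-style systems: with N replicas, a read that waits for R replicas is guaranteed to see a write acknowledged by W replicas when R + W > N. The sketch below is plain arithmetic, not a driver call; the function names are hypothetical.

```python
def quorum(replication_factor):
    """Smallest majority of replicas: floor(N / 2) + 1."""
    return replication_factor // 2 + 1


def reads_see_latest_write(replication_factor, write_acks, read_acks):
    """True when every read set must overlap every write set (R + W > N)."""
    return read_acks + write_acks > replication_factor


n = 3  # assumed replication factor for this example

# One replica for writes and one for reads: fast, but a read may be stale.
print(reads_see_latest_write(n, write_acks=1, read_acks=1))  # False

# Majority writes and majority reads always overlap in at least one replica.
q = quorum(n)
print(q)  # 2
print(reads_see_latest_write(n, write_acks=q, read_acks=q))  # True
```

Choosing small R and W favors latency and availability; choosing values that satisfy R + W > N trades some of that back for stronger reads.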
Cassandra is Good For
Time Series Data
Event Data
Timelines
High Volume
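Time series and event data fit this model because each source's readings can share one wide row, kept in time order, so a time-range query becomes a contiguous slice. The sketch below imitates that layout in memory, assuming a partition per (sensor, day) with entries sorted by timestamp. TimeSeriesStore, sensor_id, and the daily bucket are hypothetical choices for illustration, not Cassandra APIs.

```python
import bisect
from collections import defaultdict
from datetime import datetime, timezone


class TimeSeriesStore:
    """Toy layout: one partition per (sensor, day), rows sorted by time."""

    def __init__(self):
        self._partitions = defaultdict(list)

    def write(self, sensor_id, ts, value):
        # Bucketing by day keeps any single partition from growing forever.
        key = (sensor_id, ts.date().isoformat())
        # insort keeps the partition ordered by timestamp on every write.
        bisect.insort(self._partitions[key], (ts, value))

    def read_range(self, sensor_id, day, start, end):
        # Return readings with start <= timestamp < end from one partition.
        rows = self._partitions.get((sensor_id, day), [])
        lo = bisect.bisect_left(rows, (start,))
        hi = bisect.bisect_left(rows, (end,))
        return rows[lo:hi]


utc = timezone.utc
store = TimeSeriesStore()
store.write("s1", datetime(2024, 1, 1, 12, 0, tzinfo=utc), 20.5)
store.write("s1", datetime(2024, 1, 1, 11, 0, tzinfo=utc), 19.0)
store.write("s1", datetime(2024, 1, 2, 0, 30, tzinfo=utc), 18.0)

readings = store.read_range(
    "s1",
    "2024-01-01",
    datetime(2024, 1, 1, 11, 0, tzinfo=utc),
    datetime(2024, 1, 1, 12, 30, tzinfo=utc),
)
print([value for _, value in readings])  # [19.0, 20.5]
```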