The Polymathic Engineer

The Polymathic Engineer

Share this post

The Polymathic Engineer
The Polymathic Engineer
Cassandra
Copy link
Facebook
Email
Notes
More

Cassandra

A deep dive into the Apache Cassandra database.

Franco Fernando's avatar
Franco Fernando
Nov 25, 2023
∙ Paid
10

Share this post

The Polymathic Engineer
The Polymathic Engineer
Cassandra
Copy link
Facebook
Email
Notes
More
1
Share

Hi Friends,

Welcome to the 48th edition of the Polymathic Engineer newsletter, the first reserved for paid subscribers. Thanks for your trust and support. I hope you’ll enjoy the reading.

This time, we will focus on one of the most popular NoSQL data stores: Apache Cassandra.

The outline will be as follow:

  • introduction

  • architecture

  • data model

  • partition and replication

  • automation and scalability

  • trade-offs

  • how to set up and use a cluster

Introduction

Cassandra is a popular NoSQL data store that was developed by Facebook and incorporates architectural ideas from Bigtable and Dynamo DB. It is a data store built for scale, and some of its features only work on a multi-node Cassandra cluster.

The largest Cassandra clusters have tens of thousands of nodes and store petabytes of data. Users of Cassandra include many big tech companies like Apple, Netflix, Uber, Meta and so on.

Architecture

The first thing to keep in mind is that Cassandra has a has a decentralized architecture where all nodes in a cluster perform the same functions. Clients can connect to any node, and when they do, that node becomes the session coordinator for the client.

This post is for paid subscribers

Already a paid subscriber? Sign in
© 2025 Franco Fernando
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More