B Tree Basics - TalkPoints

Listen to the twitter space!

Simple walkthrough/guide

Recap storage engines at the heart B-Tree. Data structures contiguous vs pointer based.

disk considerations data set cannot fit in memory, we must cache and page in and out of memory.

“we have to perform balancing, relocate nodes, and update pointers rather frequently. Increased maintenance costs make BSTs impractical as on-disk data structuree”

HDDs vs SSDs how they work important - block device abstraction locality of reference

goals:

B Trees: we actually mean Bplus Tree. sorted data structure, pointer based = binary search. why do we want search trees? balancing and order

B-Trees are useful for:

  1. In-memory indexes (also used here!)
  2. persisted on disk storage organisation. <– we’re here.

“Since B-Trees are a page organization technique (i.e., they are used to organize and navigate fixed-size pages), we often use terms node and page interchangeably.”

terminology: seperator key root internal node contains seperator key leaf node degree/branching factor occupancy

desired properties:

operations

internal operations:

not covered:

closing - implementation is on chapter four & six, we shall revisit again!