Heap Sort

Heap sort is the only sort that gives you both things you can’t get from quicksort or merge sort at once: a hard O(n log n) worst case (no pathological inputs) and O(1) extra space (no recursion stack, no merge buffer). It feels like cheating — until you learn the price it quietly pays in cache misses. Here’s how it pulls off both guarantees.

Heap Sort gives you everything you wish quicksort had: O(n log n) worst case and O(1) extra space, all in one algorithm.

The catch? Worse constant factors than quicksort, no cache locality, and unstable. It’s the “no surprises” sort — perfect when worst-case guarantees matter, less common in everyday code.

The big idea

A heap is a binary tree (stored as an array) where every parent is >= its children (a max-heap). The maximum element is always at the root.

Heap Sort exploits this in two phases:

Build a max-heap from the input array in O(n).
Repeatedly swap the root with the last element, shrink the heap by one, and sift down to restore the heap property. After n - 1 iterations, the array is sorted in ascending order.

We’ll spend Day 7 in detail on heaps. For now, here’s the minimum you need to follow Heap Sort.

Heap as array (the trick)

A binary heap is stored as a flat array using these index relations:

parent(i) = (i - 1) / 2
left(i)   = 2 * i + 1
right(i)  = 2 * i + 2

No pointers, no nodes — just arithmetic. This is what makes heap sort in-place: the “tree” lives entirely in the same array we’re sorting.

Heap [16, 14, 10, 8, 7, 9, 3, 2, 4, 1] as a tree:

              16
           /      \
         14       10
        /  \     /  \
       8    7   9    3
      / \  /
     2  4 1

The max is at index 0. Children of index 0 are at indices 1 and 2. Children of index 1 are at 3 and 4. And so on.

The “sift down” subroutine

To restore the heap property at index i after the root is replaced:

Find the larger of a[i]’s two children.
If that child is larger than a[i], swap them.
Recursively repeat at the child’s position.

This takes O(log n) per call — the height of the heap.

The full algorithm

Predict first

Heap sort builds a MAX-heap (largest at the root). Phase 2 repeatedly swaps the root to the end and shrinks the heap. Why does a *max*-heap produce *ascending* order?

Recall the two phases — fill in the line that lifts each max out to the tail:

python · fill in the blanks0/1 hints

def heap_sort(a):
  n = len(a)
  # Phase 1: build a max-heap (O(n))
  for i in range(n // 2 - 1, -1, -1):
      sift_down(a, i, n)
  # Phase 2: repeatedly extract the max
  for i in range(n - 1, 0, -1):
      # ??? move the current max to the shrinking tail
      sift_down(a, 0, i)         # restore the heap on the prefix of length i

The sneaky build-heap O(n)

You’d think building a heap is O(n log n) — n calls to siftDown, each O(log n). But it’s actually O(n).

The reason: most nodes are near the leaves, and sifting down from a leaf is O(1). Only the root pays the full O(log n). The total work is bounded by:

∑ (n / 2^(h+1)) · h    ≤    2n

This is one of the most surprising “tight bound” results in basic algorithms — and a common interview question.

The full algorithm is O(n) build + O(n log n) extractions = O(n log n).

Properties

Time: O(n log n) — best, average, and worst case. No bad pivots, no surprises.
Space: O(1) — fully in-place. The “tree” is just the input array.
Stable: No. Swaps during heapify can reorder equal elements.
Adaptive: No.

When to actually use heap sort

In raw speed, quicksort beats heap sort by ~2× because of cache behavior — heap operations jump around the array in a way that defeats CPU prefetching. But heap sort wins in three specific scenarios:

Worst-case guarantees matter. Real-time systems, security-sensitive code (you don’t want an adversary picking inputs that trigger quicksort’s O(n²)), embedded systems with tight deadlines.
Tight memory. Heap sort is O(1) extra space. Merge sort is O(n). On a microcontroller with 8 KB of RAM, that gap is everything.
The fallback for hybrid sorts. Introsort uses heap sort as the fallback when quicksort recursion gets too deep. You don’t have to write this — but knowing why is useful.

The bigger picture

Heap sort isn’t usually the right answer for “sort this array.” But the heap data structure itself — independent of sorting — is one of the most important structures in computer science. We use it for:

Priority queues (the canonical use)
Dijkstra’s algorithm (Day 21)
A* search
Top-K problems (covered in Top K Frequent Elements)
Median maintenance (two heaps trick)
Event scheduling, Huffman coding, and more

You met heaps in full on Day 7; here they give you one of the cleanest O(n log n) sorts in your toolbox.

Quick check

Quicksort usually beats heap sort by ~2× in wall-clock time. So why does C++'s introsort fall back to heap sort when quicksort's recursion goes too deep?

Quicksort, merge, and heap sort are all O(n log n) — the comparison-sort ceiling. Next: linear sorts — counting, radix, and bucket sort, which break that ceiling by not comparing elements at all.

Quick Sort Linear-Time Sorts

Finished this page?