Quick Sort

Quicksort is the fastest comparison sort in practice — and it has a trap so cruel it’s almost funny: feed the textbook version an already-sorted array (one of the most common inputs in the real world) and it slows to a crawling O(n²). The whole art of quicksort is one decision — how you pick the pivot — and getting it right is what separates a 2× speedup from a production outage.

The most-used comparison sort in the world. Quick Sort is divide-and-conquer with a twist: instead of splitting cleanly in half (the way merge sort does), it picks a pivot and partitions the array so all smaller elements are on the left and all larger ones are on the right. Then it recurses on both sides.

The idea in three sentences

Pick a pivot element.
Partition the array so everything <= pivot is on the left and everything > is on the right; the pivot lands in its final position.
Recursively quicksort both sides of the pivot.

That’s it. The whole algorithm is “partition, recurse.” Once the pivot is in its correct sorted position, it never moves again.

Watch it run

quick_sort([5, 1, 4, 2, 8, 3]) — Lomuto partition

Step 1 / 21 — Start quicksort (Lomuto partition).

comparingswappingsortedpivotcurrent min

Each pass picks a purple pivot, scans the slice, and ends with that pivot in its permanent green slot.

Two famous partition schemes

The partition step is the soul of quicksort. There are two classic ways to do it, and they trade clarity for cleverness.

Lomuto partition (simpler, the one most schools teach)

Pick the last element as pivot. Walk a single pointer i that tracks “boundary between <= pivot and > pivot” zones. For each element a[j]:

If a[j] <= pivot, advance the boundary by swapping a[j] into the left zone.
Otherwise, leave it alone.

At the end, swap the pivot into its rightful position.

Recall the boundary move — the line that grows the ”≤ pivot” zone:

python · fill in the blanks0/1 hints

def partition(a, lo, hi):
  pivot = a[hi]                     # last element as pivot
  i = lo - 1                        # boundary of the <= pivot zone
  for j in range(lo, hi):
      if a[j] <= pivot:
          i += 1
          # ??? grow the left (<= pivot) zone
  a[i + 1], a[hi] = a[hi], a[i + 1] # drop the pivot into its final slot
  return i + 1

Hoare partition (faster in practice)

Hoare’s original scheme uses two pointers walking toward each other, swapping out-of-place elements along the way. It does roughly three times fewer swaps than Lomuto on average — that constant factor is why most production code uses Hoare-style partitioning.

def hoare_partition(a, lo, hi):
    pivot = a[(lo + hi) // 2]
    i, j = lo - 1, hi + 1
    while True:
        i += 1
        while a[i] < pivot: i += 1
        j -= 1
        while a[j] > pivot: j -= 1
        if i >= j:
            return j
        a[i], a[j] = a[j], a[i]

The Hoare partition returns an index j such that everything left of and at j is <= pivot and everything right is >= pivot. The recursion shape is slightly different — quickSort(a, lo, p) and quickSort(a, p+1, hi) — but everything else is the same.

Don’t memorize both. Lomuto is easier to remember and right for interviews; Hoare is what you’d write for performance in production. When asked “which would you use?”, the honest answer is “I’d call the standard library, but I can explain both partitions.”

The O(n²) worst case (and how to avoid it)

Predict first

You run the textbook 'pivot = last element' (Lomuto) quicksort on an array that's ALREADY sorted. What's the running time — and why is that ironic?

Quicksort’s beauty is also its danger. If you always pick the smallest (or largest) remaining element as the pivot, every partition produces a 0-element side and an (n-1)-element side. The recursion looks like a degenerate stick:

n elements → partition → 0 left, n-1 right
n-1 → 0 left, n-2 right
n-2 → 0 left, n-3 right
...

That’s n + (n-1) + (n-2) + ... = O(n²) total work. And here’s the catch: with the naive “pick the last element” Lomuto strategy above, this happens on already-sorted input — the most common real-world case.

Three standard pivot fixes:

Random pivot. Pick a uniformly random index in [lo, hi]. Expected runtime O(n log n) regardless of input.
Median-of-three. Look at a[lo], a[mid], a[hi], pick the median. Fast in practice; breaks the sorted-input trap.
Introsort. Use random/median pivots, but track the recursion depth — if it exceeds 2·log₂(n) (a sign of bad pivots), switch to heap sort for a hard O(n log n) guarantee. This is exactly what C++‘s std::sort does.

Random pivot in two lines

int randomPivot(int lo, int hi) {
    return lo + rand() % (hi - lo + 1);
}
 
int partition(vector<int>& a, int lo, int hi) {
    int r = randomPivot(lo, hi);
    swap(a[r], a[hi]);   // move random pivot to the end, then run Lomuto as usual
    return /* … original Lomuto code … */;
}

One swap upfront, problem solved. Random pivots take quicksort from “O(n²) worst case” to “O(n²) only if you’re cosmically unlucky.”

Properties

Time: O(n log n) average, O(n²) worst case (with bad pivots). Best case also O(n log n).
Space: O(log n) for the recursion stack (with good pivots). O(n) worst-case stack if you’re truly unlucky.
Stable: No. Swaps during partitioning can leap equal elements over each other.
Adaptive: No — runtime depends on pivot choice, not input order. Already-sorted input is bad, not good, unless you randomize pivots.

Why quicksort wins in practice

On paper, merge sort matches quicksort: both are O(n log n) average. But on real hardware, quicksort consistently wins by a factor of 2–3× because:

In-place. No O(n) merge buffer means no allocation, no cache pressure from a parallel array.
Cache-friendly. Each partition reads contiguous memory, sequentially. CPU prefetchers love this.
Tiny constant factor. The inner loop is just a comparison and a swap — both ~1 cycle on modern CPUs.

Merge sort wins when stability or worst-case guarantees matter. Otherwise, quicksort wins.

3-way partitioning (the Dutch flag generalization)

When the input has lots of duplicates, standard quicksort wastes time recursing on regions of equal values. 3-way quicksort partitions into three regions: <, ==, and > the pivot. The middle region is already sorted (it’s all equal to the pivot) and doesn’t need recursing on.

You already saw the structure in Sort Colors — Dutch National Flag is the 3-way partition. Combined with quicksort, it makes the worst case for duplicates-heavy input go from O(n²) to O(n).

Quickselect — finding the Kth element in O(n)

A killer side-effect of partitioning: if you only need to find the Kth smallest element (not sort the whole array), you only need to recurse on one side of each partition — the side that contains index K.

That changes the recurrence from T(n) = 2T(n/2) + O(n) (which solves to O(n log n)) to T(n) = T(n/2) + O(n) (which solves to O(n)).

We’ll do quickselect in the practice section as Kth Largest Element.

Quick check

Quickselect finds the Kth smallest element by partitioning and recursing on only ONE side. Why does that make it O(n) average instead of O(n log n)?

Quicksort is fast on average but has no worst-case guarantee. Next: heap sort — which delivers a hard O(n log n) worst case and O(1) space (no recursion, no merge buffer), using the heap you built on Day 7.

Overview Heap Sort

Finished this page?