Big O Notation: The Language of Algorithmic Napkin Math
You’re in a coffee shop, sketching on a napkin, trying to convince your friend that your new algorithm is actually practical. You don’t need precise measurements or detailed benchmarks — you need to quickly estimate whether your approach will work with a million users, or ten million, or a billion.
This is where Big O notation shines. It’s essentially the napkin math of computer science — a way to quickly reason about how algorithms scale without getting bogged down in implementation details or hardware specifics.
Much like Fermi estimation in physics, where you make order-of-magnitude calculations about seemingly impossible questions (“How many piano tuners are in Chicago?”), Big O analysis helps you make rapid, order-of-magnitude assessments about algorithmic scalability. Both techniques share the same philosophy: rough reasoning that gets you in the right ballpark is often more valuable than precise calculation that takes too long to compute.
The Art of “Good Enough” Analysis
Big O notation embodies a fundamentally practical approach to algorithmic analysis. Instead of asking “exactly how long will this take?” it asks “roughly how does the runtime grow as the input gets bigger?”
This shift in perspective is liberating. You don’t need to count every single operation or worry about whether one implementation uses 100 operations versus 120 operations. You’re looking at the shape of the performance curve.
Consider these growth patterns:
| Input Size (n) | O(1) | O(log n) | O(n)   | O(n log n) | O(n²)       |
|----------------|------|----------|--------|------------|-------------|
| 10             | 1    | 3        | 10     | 33         | 100         |
| 100            | 1    | 7        | 100    | 664        | 10,000      |
| 1,000          | 1    | 10       | 1,000  | 9,966      | 1,000,000   |
| 10,000         | 1    | 13       | 10,000 | 132,877    | 100,000,000 |
The beauty of this table isn’t in the exact numbers — it’s in the patterns. You can immediately see that O(n²) algorithms become completely impractical for large inputs, while O(log n) algorithms barely care about input size at all.
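If you want to reproduce these numbers yourself, a few lines of Python are enough. The sketch below simply evaluates each growth function at the same input sizes; the logarithmic terms are rounded, which is why the table shows 3 rather than 3.32 for n = 10.

import math

# Evaluate each growth function at a few input sizes to compare the shapes of the curves.
growth = {
    "O(1)":       lambda n: 1,
    "O(log n)":   lambda n: math.log2(n),
    "O(n)":       lambda n: n,
    "O(n log n)": lambda n: n * math.log2(n),
    "O(n^2)":     lambda n: n ** 2,
}

for n in (10, 100, 1_000, 10_000):
    row = "  ".join(f"{name}={round(f(n)):,}" for name, f in growth.items())
    print(f"n={n:>6,}  {row}")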
Napkin Math in Action: Real-World Estimates
Let’s put this to work with some back-of-the-envelope calculations that any developer might face:
Scenario 1: Social Media Feed
You’re building a social media app. Each user follows about 200 other users on average, and you need to generate their personalized feed.
Naive approach: For each of the user’s 200 followed accounts, fetch their recent posts and merge them chronologically.
- Time complexity: O(f × p) where f = follows, p = posts per person
- With 200 follows and 50 recent posts each: ~10,000 operations per feed
- With 1 million active users: ~10 billion operations at peak
Napkin math verdict: This doesn’t scale. You need a better approach.
Smarter approach: Pre-compute each user's feed as posts are created and store it in a per-user cache (both approaches are sketched in code after this list).
- Time complexity: O(1) for reading a feed, O(followers) for fan-out when someone posts
- Reading becomes instant regardless of user count
- Writing cost is distributed across all posting events
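Here is a minimal sketch of the two strategies, using in-memory dictionaries as stand-ins for the post store, the follow graph, and the per-user feed cache. The function names and data structures are illustrative, not a production design.

from collections import defaultdict
import heapq

# Illustrative in-memory stores; in practice these would be databases and caches.
posts_by_user = defaultdict(list)   # user_id -> list of (timestamp, post), oldest first
follows = defaultdict(set)          # user_id -> set of followed user_ids
followers = defaultdict(set)        # user_id -> set of follower user_ids
feed_cache = defaultdict(list)      # user_id -> precomputed feed as a heap of (timestamp, post)

def build_feed_on_read(user_id, recent=50):
    """Naive approach: O(f * p) merge of followed users' recent posts."""
    candidates = []
    for followed in follows[user_id]:
        candidates.extend(posts_by_user[followed][-recent:])
    return heapq.nlargest(recent, candidates)  # newest first by timestamp

def publish(author_id, timestamp, post, recent=50):
    """Fan-out on write: O(followers) work now, O(1) feed reads later."""
    posts_by_user[author_id].append((timestamp, post))
    for follower in followers[author_id]:
        feed = feed_cache[follower]
        heapq.heappush(feed, (timestamp, post))
        if len(feed) > recent:
            heapq.heappop(feed)  # drop the oldest entry to keep the cache bounded

def read_feed(user_id):
    """O(1) with respect to follow count: just read the precomputed cache."""
    return sorted(feed_cache[user_id], reverse=True)

The trade-off is the classic one: publish does O(followers) work up front so that read_feed never has to touch the follow graph at all.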
Scenario 2: Search Functionality
Your e-commerce site has 1 million products and needs fast search.
Linear search: Check every product title against the search query
- Time complexity: O(n)
- For 1 million products: 1 million comparisons per search
- With 1,000 searches per second: 1 billion operations per second
Napkin math verdict: Your servers will melt.
Indexed search: Build an inverted index from keywords to products (a minimal sketch follows this list)
- Time complexity: O(log n) for lookup, O(k) for k results
- For 1 million products: ~20 comparisons per search
- With 1,000 searches per second: ~20,000 operations per second
Napkin math verdict: Totally manageable.
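Here is a rough sketch of an inverted index, assuming a simple whitespace tokenizer and an in-memory dictionary. Note that a hash-based dictionary gives O(1) average lookups; the O(log n) figure above corresponds to a sorted or tree-based index. A real search engine would add stemming, ranking, and persistence.

from collections import defaultdict

def build_inverted_index(products):
    """Map each keyword to the set of product ids whose title contains it."""
    index = defaultdict(set)
    for product_id, title in products.items():
        for word in title.lower().split():
            index[word].add(product_id)
    return index

def search(index, query):
    """Intersect the posting sets for each query word instead of scanning all n titles."""
    words = query.lower().split()
    if not words:
        return set()
    result = index.get(words[0], set()).copy()
    for word in words[1:]:
        result &= index.get(word, set())
    return result

products = {1: "Blue Running Shoes", 2: "Red Running Jacket", 3: "Blue Rain Jacket"}
index = build_inverted_index(products)
print(search(index, "blue jacket"))  # {3}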
Building Algorithmic Intuition
Big O notation trains you to think algorithmically about problems, not just programmatically.
When you internalize Big O, you start seeing optimization opportunities everywhere:
# Naive password validation - O(n) for each check
def is_weak_password(password):
    weak_passwords = ["123456", "password", "qwerty", ...]  # 1000 entries
    return password in weak_passwords  # Linear search!

# Better approach - O(1) average for each check
def is_weak_password(password):
    weak_passwords = {"123456", "password", "qwerty", ...}  # Set lookup
    return password in weak_passwords  # Hash table lookup!
The difference? With a list of 1,000 weak passwords, the first version averages about 500 comparisons per check (and a full 1,000 for a password that isn't in the list). The second version averages ~1 hash lookup per check. That's roughly a 500x improvement from swapping the square brackets for curly braces.
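You can check that gap with a quick micro-benchmark. Exact numbers vary by machine and Python version, but the order of magnitude holds; the password list here is made up purely for illustration.

import timeit

# Build a list and a set with the same 1,000 synthetic "weak passwords".
weak_list = [f"password{i}" for i in range(1000)]
weak_set = set(weak_list)

probe = "not-a-weak-password"  # worst case for the list: every entry gets checked

list_time = timeit.timeit(lambda: probe in weak_list, number=100_000)
set_time = timeit.timeit(lambda: probe in weak_set, number=100_000)

print(f"list membership: {list_time:.3f}s, set membership: {set_time:.3f}s")
print(f"speedup: ~{list_time / set_time:.0f}x")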
Beyond Worst-Case: Practical Napkin Math
While Big O traditionally focuses on worst-case scenarios, practical napkin math often considers average and best cases too:
Example: Binary Search vs Hash Tables
- Binary search: O(log n) — predictable, no extra memory
- Hash table: O(1) average, O(n) worst-case — fast but needs more memory
For a phone book app:
- Binary search on 1 million entries: ~20 lookups maximum
- Hash table on 1 million entries: ~1 lookup average, but 1 million in worst case
Napkin math decision: Hash table for speed, with safeguards against worst-case behavior (good hash function, load factor management).
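A minimal sketch of the two options on a synthetic phone book (the names and numbers are generated for illustration; scale N down if memory is tight):

import bisect

N = 1_000_000  # number of synthetic entries
entries = sorted((f"person{i:07d}", f"555-{i:07d}") for i in range(N))
names = [name for name, _ in entries]
by_name = dict(entries)

def lookup_binary(name):
    """O(log n): about 20 comparisons for a million names, no structure beyond the sorted list."""
    i = bisect.bisect_left(names, name)
    if i < len(names) and names[i] == name:
        return entries[i][1]
    return None

def lookup_hash(name):
    """O(1) on average, at the cost of building and storing the hash table."""
    return by_name.get(name)

print(lookup_binary("person0123456"), lookup_hash("person0123456"))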
The Art of Approximation: Learning from Fermi
The real skill in algorithmic napkin math isn’t perfect analysis — it’s useful approximation. This mirrors the Fermi estimation process, where you break complex problems into manageable, estimatable pieces.
Fermi process applied to algorithms (a worked estimate follows the list):
- Identify the core operation (like “How many comparisons?” in a sorting algorithm)
- Estimate the frequency (How many times does this operation happen per input item?)
- Scale up (What happens with 10x, 100x, 1000x more input?)
- Cross-check with known patterns (Does this match O(n), O(n²), etc.?)
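As a worked example, here is that process applied to one question: how long would an O(n log n) sort of 10 million records take? The comparison-throughput figure is a rough assumption, which is exactly the point of a Fermi estimate.

import math

# Fermi-style estimate: comparisons needed to sort 10 million records with an
# O(n log n) sort, and how long that takes at ~100 million comparisons per second.
# Both the cost model and the throughput figure are rough assumptions.
n = 10_000_000
comparisons = n * math.log2(n)                     # core operation x frequency per item
throughput = 1e8                                   # assumed simple comparisons per second
print(f"~{comparisons:.1e} comparisons")           # ~2.3e+08
print(f"~{comparisons / throughput:.1f} seconds")  # ~2.3 — cross-checks as plausible for n log n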
You develop intuition for when different complexities matter:
- O(1) vs O(log n): Usually irrelevant for practical purposes
- O(log n) vs O(n): Matters when n > 1,000
- O(n) vs O(n log n): Matters when n > 10,000
- O(n log n) vs O(n²): Matters when n > 100
- O(n²) vs O(n³): Avoid both for large n!
This lets you make rapid design decisions:
- “We’ll have ~10,000 users? O(n²) might be acceptable.”
- “We expect millions of records? Anything worse than O(n log n) is out.”
Constants Matter (Sometimes)
Here’s where napkin math gets nuanced. Big O ignores constants, but sometimes they’re crucial:
# Both are O(n), but very different in practice
def algorithm_a(data):
    result = []
    for item in data:
        result.append(expensive_operation(item))  # 10ms per call
    return result

def algorithm_b(data):
    result = []
    for item in data:
        result.append(cheap_operation(item))  # 0.01ms per call
    return result
For 1,000 items:
- Algorithm A: ~10 seconds
- Algorithm B: ~0.01 seconds
Key insight: When constants differ by orders of magnitude, they can trump complexity class differences for practical input sizes.
The Ultimate Napkin Math Question
When evaluating any algorithm, ask yourself: “What happens when I 10x the input size?”
- If performance stays roughly the same: O(1) or O(log n) — you’re golden
- If performance increases proportionally: O(n) — usually fine
- If performance increases more than proportionally but manageably: O(n log n) — proceed with caution
- If performance explodes: O(n²) or worse — redesign time
This simple mental model helps you quickly assess scalability without detailed analysis.
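You can even run this mental model as an experiment: time a function at n and at 10·n and look at the ratio. This is a rough sketch (a real benchmark would add warm-up and repetition), with count_pairs as a deliberately quadratic stand-in.

import random
import time

def growth_ratio(fn, n, factor=10):
    """Time fn on shuffled inputs of size n and factor*n; return the runtime ratio."""
    def timed(size):
        data = random.sample(range(size), size)  # input built before the timer starts
        start = time.perf_counter()
        fn(data)
        return time.perf_counter() - start
    return timed(factor * n) / timed(n)

def count_pairs(data):
    """Deliberately quadratic: looks at every pair of elements."""
    return sum(1 for x in data for y in data if x < y)

print(f"sorted:      ~{growth_ratio(sorted, 100_000):.0f}x slower at 10x the input")  # O(n log n): a bit over 10
print(f"count_pairs: ~{growth_ratio(count_pairs, 500):.0f}x slower at 10x the input")  # O(n^2): roughly 100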
Conclusion: The Value of Rough Reasoning
Big O notation teaches us that rough reasoning often beats precise calculation when it comes to algorithmic design. Perfect accuracy is less important than directional correctness.
In a world where software needs to handle everything from 10 users to 10 million users, the ability to quickly estimate scalability is invaluable. Big O provides the mathematical vocabulary to think clearly about these trade-offs and communicate them effectively to others.
The next time you’re designing an algorithm, grab that metaphorical napkin and ask: “How does this scale?” Your future self — and your users — will benefit from thinking about scalability from the start.
Further Reading
Essential Resources for Big O Analysis
- Introduction to Algorithms (CLRS) - The definitive algorithms textbook with rigorous complexity analysis
- Algorithm Design Manual by Skiena - Practical approach to algorithmic problem-solving with real-world examples
- Big O Cheat Sheet - Quick reference for common data structures and algorithms
- Interactive Big O Visualizations - See how different algorithms behave with various input sizes
For the Curious: Advanced Topics
Once you’ve mastered basic Big O analysis, these concepts add sophisticated nuance to your algorithmic thinking:
- Amortized Analysis - Understanding algorithms where occasional expensive operations are balanced by many cheap ones (dynamic arrays, hash tables)
- Cache-Oblivious Algorithms - Algorithms that perform well across different memory hierarchies without knowing cache parameters
- Parallel Complexity Theory - How Big O changes when you can use multiple processors (P vs NC complexity classes)
- Approximation Algorithms - When exact solutions are too expensive, how close can you get efficiently?
- Online vs Offline Algorithms - Complexity analysis when you don’t know the future input versus when you have all data upfront
Fermi Estimation Resources
- The Art and Craft of Problem Solving by Paul Zeitz - Excellent coverage of estimation techniques
- Street-Fighting Mathematics by Sanjoy Mahajan - MIT’s approach to educated guessing and approximation methods
- Fermi Problems for STEM Education - Practice problems to develop estimation intuition
The connection between Fermi estimation and algorithmic analysis isn’t just academic — both skills train you to think systematically about scale, make reasonable assumptions, and prioritize the most impactful factors in complex systems.